metaseq / Issues / #220
Issue created Jul 13, 2022 by Administrator (@root, Owner)

Create generation benchmarks on different model sizes

Created by: punitkoura

🚀 Feature Request

The goal of this task is to track generation speed, in words per second (WPS), across the various OPT model sizes. We can also add profiling to identify potential bottlenecks in the generation process. The results of this investigation would help us speed up generation.

Motivation

To improve the speed of text generation from OPT models.

Pitch

Create a generation benchmark that takes any of our models, runs a fixed generation, and reports the time per token. The model name should be configurable.
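
A minimal sketch of what such a benchmark could look like. It is not metaseq code: `MODEL_REGISTRY`, `make_dummy_generator`, `benchmark_generation`, and `run_benchmark` are hypothetical names, and the dummy generators only stand in for real OPT checkpoints. The sketch assumes a model can be wrapped as a callable that takes a prompt and a token budget and returns the generated tokens.

```python
import time
from typing import Callable, Dict, List

# Hypothetical signature: (prompt, max_tokens) -> list of generated tokens.
GenerateFn = Callable[[str, int], List[str]]


def make_dummy_generator(seconds_per_token: float) -> GenerateFn:
    """Build a stand-in generator that sleeps to mimic decoding cost."""
    def generate(prompt: str, max_tokens: int) -> List[str]:
        time.sleep(seconds_per_token * max_tokens)
        return ["tok"] * max_tokens
    return generate


# Hypothetical registry; a real one would map names to loaded OPT models.
MODEL_REGISTRY: Dict[str, GenerateFn] = {
    "dummy-small": make_dummy_generator(0.0005),
    "dummy-large": make_dummy_generator(0.002),
}


def benchmark_generation(
    generate_fn: GenerateFn,
    prompt: str,
    max_tokens: int,
    warmup_runs: int = 1,
    timed_runs: int = 3,
) -> Dict[str, float]:
    """Run a fixed generation several times and report per-token timing."""
    # Warm-up runs are excluded from the measurement.
    for _ in range(warmup_runs):
        generate_fn(prompt, max_tokens)

    total_tokens = 0
    total_seconds = 0.0
    for _ in range(timed_runs):
        # For a GPU model, synchronize the device before reading the clock,
        # since kernels launch asynchronously.
        start = time.perf_counter()
        tokens = generate_fn(prompt, max_tokens)
        total_seconds += time.perf_counter() - start
        total_tokens += len(tokens)

    if total_tokens == 0:
        raise ValueError("generate_fn produced no tokens")
    return {
        "tokens": float(total_tokens),
        "seconds_per_token": total_seconds / total_tokens,
        "tokens_per_second": (
            total_tokens / total_seconds if total_seconds > 0 else float("inf")
        ),
    }


def run_benchmark(
    model_name: str,
    prompt: str = "The quick brown fox",
    max_tokens: int = 32,
    timed_runs: int = 3,
) -> Dict[str, float]:
    """Look up a model by its configurable name and benchmark it."""
    if model_name not in MODEL_REGISTRY:
        raise KeyError(f"unknown model name: {model_name}")
    return benchmark_generation(
        MODEL_REGISTRY[model_name], prompt, max_tokens, timed_runs=timed_runs
    )
```

Calling `run_benchmark("dummy-large")` returns a dictionary with the total token count, seconds per token, and tokens per second. Keeping the prompt and token budget fixed across model sizes is what makes the numbers comparable.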

Alternatives

N/A

Additional context

N/A
