Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • M metaseq
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 95
    • Issues 95
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 41
    • Merge requests 41
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Infrastructure Registry
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Administrator
  • metaseq
  • Merge requests
  • !395

Add GLU activations [v2]

  • Review changes

  • Download
  • Email patches
  • Plain diff
Merged Administrator requested to merge gluact into main Oct 07, 2022
  • Overview 8
  • Commits 8
  • Pipelines 0
  • Changes 8

Created by: ruanslv

Patch Description v2 for https://github.com/facebookresearch/metaseq/pull/343, with an attempt to decouple gate logic from decoder.

Also consolidated our GeLU implementation to a new version of gelu_accurate that explicitly defines the multiplying constants + relies on JIT for better performance.

Testing steps Running ablations to compare perfomance against previous runs and making sure PPL matches

Assignee
Assign to
Reviewers
Request review from
Time tracking
Source branch: gluact