Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • M metaseq
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 95
    • Issues 95
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 41
    • Merge requests 41
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Infrastructure Registry
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Administrator
  • metaseq
  • Merge requests
  • !403

[checkpoint] Change logic around is_checkpoint_sharded

  • Review changes

  • Download
  • Email patches
  • Plain diff
Merged Administrator requested to merge is_checkpoint_sharded into main Oct 11, 2022
  • Overview 6
  • Commits 3
  • Pipelines 0
  • Changes 1

Created by: stephenroller

Patch Description Recent changes to dataloaders have resulted in us creating checkpoint files that are more unequal in size, as shard0 contains the data state. This breaks this older heuristic we used to determine if a checkpoint was Zero2 or Zero3.

This patch changes the heuristic to just read from the config of the file itself.

This patch also gives a better error message for a common mistake.

Testing steps Internal evaluations.

Assignee
Assign to
Reviewers
Request review from
Time tracking
Source branch: is_checkpoint_sharded