1. 03 Jul, 2022 1 commit
  2. 30 Jun, 2022 1 commit
    • Stephen Roller's avatar
      [api] Remove beam search (#187) · 42884515
      Stephen Roller authored
      * Greedy is implemented.
      
      * Add early stopping, sort beams by probability
      
      * Pass on temperature from UI. Early stopping.
      
      * Kill some more dead code.
      
      * Typo
      
      * Fix lint.
      42884515
  3. 27 Jun, 2022 3 commits
  4. 24 Jun, 2022 2 commits
  5. 23 Jun, 2022 4 commits
  6. 22 Jun, 2022 4 commits
  7. 21 Jun, 2022 4 commits
  8. 20 Jun, 2022 4 commits
  9. 18 Jun, 2022 1 commit
  10. 17 Jun, 2022 2 commits
  11. 13 Jun, 2022 1 commit
  12. 06 Jun, 2022 1 commit
  13. 04 Jun, 2022 1 commit
  14. 02 Jun, 2022 2 commits
  15. 01 Jun, 2022 1 commit
  16. 31 May, 2022 2 commits
    • ngoyal2707's avatar
      BF16 support (#30) · 1639c607
      ngoyal2707 authored
      1639c607
    • Susan Zhang's avatar
      Progress bar cleanup (#96) · b972b949
      Susan Zhang authored
      * remove unused build_progress_bar method
      
      * remove noop/none logging as a log format option
      
      * make log_format default to json and remove default_log_format arg
      
      * move progress_bar to submodule
      
      * rename to base_progress_bar
      
      * module rename
      
      * move out json progress bar
      
      * move out tensorboard progress bar
      
      * move out wandb progress wrapper
      
      * split out helpers to utils, avoid circular import
      
      * fix broken init
      
      * remove module rename for progress bar
      
      * move get_precise_epoch to higher level utils
      
      * lint
      
      * remove no progress bar flag
      
      * fix more imports
      
      * cleanup
      
      * remove utils
      
      * add license
      b972b949
  17. 30 May, 2022 1 commit
  18. 24 May, 2022 2 commits
  19. 23 May, 2022 1 commit
  20. 22 May, 2022 1 commit
    • Susan Zhang's avatar
      Dynamic loss scaler changes for 66b (#115) · 0ba5657a
      Susan Zhang authored
      * set defaults for scale window, and init scale
      
      * remove more default logic for scale window
      
      * also decrease scale window with loss scale
      
      * remove floating point error for hitting min loss scale
      
      * also log out scale_window
      
      * 0.03125 -> 2 ** -5
      0ba5657a
  21. 20 May, 2022 1 commit