Automatically evaluate checkpoints after copying to NFS (#550)
* add async eval with dummy
* full testing setup, add configs
* fix naming
* only eval at frequency
* remove logging used for testing
* change to real frequencies
* remove logging
* added improvements
* remove eval last checkpoint
* flake8 lint
* change naming, add comment, always evaluate at end of training
* black lint
* rename to training_finished
Co-authored-by:
Peter Albert <peteralbert@fb.com>
Showing
+99 -43
Please register or sign in to comment