checkpoint manager

Dobromir Popov
2025-06-24 21:41:50 +03:00
parent c9d1e029c5
commit 706eb13912
7 changed files with 978 additions and 1 deletions

_dev/notes.md Normal file

@@ -0,0 +1,6 @@
How do we manage our training W&B checkpoints? We need to clean up old checkpoints. For every model we keep a maximum of 5 checkpoints and rotate them. By default we always load the best one, and during training, when we save a new one, we discard the 6th as ordered by performance.
Add integration of the checkpoint manager to all training pipelines.
We stopped showing executed trades on the chart. Let's add them back.
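
The rotation policy in the first note (keep at most 5 checkpoints per model, load the best by default, drop whichever falls to 6th place by performance) could look roughly like the sketch below. This is not the checkpoint manager from this commit: `Checkpoint`, `CheckpointRotator`, `save`, and `best` are hypothetical names, the bookkeeping is in-memory only, and it assumes a higher score means better performance.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Checkpoint:
    path: str
    score: float  # assumption: higher score means better performance


@dataclass
class CheckpointRotator:
    """Hypothetical in-memory bookkeeping for per-model checkpoint rotation."""

    max_keep: int = 5  # the notes ask for 5 checkpoints per model
    _by_model: Dict[str, List[Checkpoint]] = field(default_factory=dict)

    def save(self, model: str, path: str, score: float) -> List[Checkpoint]:
        """Register a new checkpoint and return the ones pushed out of the top max_keep."""
        ranked = self._by_model.setdefault(model, [])
        ranked.append(Checkpoint(path, score))
        # Best first; the sort is stable, so on a tie the older checkpoint stays ahead.
        ranked.sort(key=lambda c: c.score, reverse=True)
        evicted = ranked[self.max_keep:]
        del ranked[self.max_keep:]
        return evicted  # the caller is responsible for deleting these files

    def best(self, model: str) -> Optional[Checkpoint]:
        """Return the best-scoring checkpoint for a model, or None if there is none."""
        ranked = self._by_model.get(model, [])
        return ranked[0] if ranked else None
```

Returning the evicted entries instead of deleting files inside `save` keeps the ranking logic separate from disk I/O, so each training pipeline can decide how to remove the files. Note that a new checkpoint that scores worse than all five kept ones is itself the one returned for deletion.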