{'train_runtime': 515.6494, 'train_samples_per_second': 1.765, 'train_steps_per_second': 0.019, 'train_tokens_per_second': 47.164, 'train_loss': 45420.82977294922, 'epoch': 0, 'num_input_tokens_seen': 44128} 0%| | 0/10 [08:20<?, ?it/s] [INFO|trainer.py:3910] 2025-02-20 ...