DeepSeek-Math/train
2025-06-19 17:46:50 +05:30
model.py | Add commented options for output attentions and hidden states in DeepSeekMathConfig | 2025-06-19 17:46:50 +05:30
train.py | Implement DeepSeek-Math model and training pipeline with dataset handling and distributed training support | 2025-06-19 17:27:05 +05:30