DeepSeek-V3/inference
Pedro Dessanti 9bf00671cf
Update model.py
Enabling mixed precision training to reduce memory usage and potentially speed up training.
2025-01-27 09:04:34 -03:00
..
configs Release DeepSeek-V3 2024-12-26 19:01:57 +08:00
convert.py Enhance documentation and update .gitignore for model conversion scripts 2025-01-05 18:18:18 +00:00
fp8_cast_bf16.py Enhance documentation and update .gitignore for model conversion scripts 2025-01-05 18:18:18 +00:00
generate.py Enhance documentation and update .gitignore for model conversion scripts 2025-01-05 18:18:18 +00:00
kernel.py Enhance documentation and update .gitignore for model conversion scripts 2025-01-05 18:18:18 +00:00
model.py Update model.py 2025-01-27 09:04:34 -03:00
requirements.txt Release DeepSeek-V3 2024-12-26 19:01:57 +08:00