Mirror of https://github.com/deepseek-ai/DeepSeek-Coder.git, synced 2025-02-22 21:59:11 -05:00
Update finetune_deepseekcoder.py
Using torch.float16 or torch.cuda.amp can significantly reduce memory usage and speed up training by performing computations with lower precision.
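For context, here is a minimal sketch of what a hand-written mixed-precision loop with torch.cuda.amp looks like; the tiny linear model and random data are hypothetical stand-ins, not objects from finetune_deepseekcoder.py. GradScaler compensates for float16's narrow exponent range with dynamic loss scaling, which is why pure float16 (unlike bfloat16) usually needs it.

import torch

# Toy stand-ins (hypothetical, not from this script): a small linear model
# and random batches, just to make the AMP loop self-contained and runnable.
device = "cuda"
model = torch.nn.Linear(16, 16).to(device)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
data = [(torch.randn(4, 16, device=device), torch.randn(4, 16, device=device))
        for _ in range(10)]

# GradScaler applies dynamic loss scaling so small float16 gradients
# do not underflow to zero during backprop.
scaler = torch.cuda.amp.GradScaler()

for inputs, targets in data:
    optimizer.zero_grad()
    # autocast runs eligible ops in float16 and keeps precision-sensitive
    # ops in float32.
    with torch.cuda.amp.autocast(dtype=torch.float16):
        loss = torch.nn.functional.mse_loss(model(inputs), targets)
    scaler.scale(loss).backward()  # backprop on the scaled loss
    scaler.step(optimizer)         # unscales gradients, then steps
    scaler.update()                # adjusts the scale factor for the next step

If the script trains through the HuggingFace Trainer (it does construct TrainingArguments), setting fp16=True there enables the same loss-scaling machinery without a hand-written loop; bfloat16, by contrast, shares float32's exponent range and generally does not need a scaler.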
This commit is contained in:
parent b7ba565956
commit 78d0fd332a
@@ -143,7 +143,7 @@ def train():
     model = transformers.AutoModelForCausalLM.from_pretrained(
         model_args.model_name_or_path,
-        torch_dtype=torch.bfloat16
+        torch_dtype=torch.float16
     )
 
     if training_args.local_rank == 0: