Commit Graph

12 Commits

Author SHA1 Message Date
Nripesh Niketan
487f2395aa Merge a5336884cf into 4cc6253d5c 2025-04-09 19:02:29 +08:00
huxuedan
d29a967601 modify the explanation of MLA 2025-02-26 17:07:39 +08:00
Xingkai Yu
1398800ebf fix scores mask 2025-02-14 20:26:45 +08:00
Xingkai Yu
5ee97a83f0 fix comment 2025-02-07 16:42:55 +08:00
Xingkai Yu
87a01053e4 Merge pull request #556 from XxAlonexX/main (Fix Linear Layer Bias Initialization) 2025-02-05 16:23:02 +08:00
XxAlonexX
6a30b43249 Fix Linear Layer Bias Initialization 2025-02-04 10:38:45 +05:30
Nripesh Niketan
b6e3910fd0 Fix small error 2025-01-30 16:04:00 +00:00
Nripesh Niketan
e75ce46245 feat: Enhance device compatibility and update PyTorch version 2025-01-30 00:06:55 +00:00
Roman Fitzjalen
2756e130c2 clarify assertion error 2025-01-28 13:16:54 +01:00
enoch kan
bc77f22afc Updated model.py docstrings 2025-01-05 18:24:31 +00:00
GeeeekExplorer
fd011c11aa torch rmsnorm 2025-01-05 14:33:48 +08:00
stack-heap-overflow
4c2fdb8f55 Release DeepSeek-V3 2024-12-26 19:01:57 +08:00