Commit Graph

79 Commits

Author SHA1 Message Date
Triex
90627c6d54 docs: Tidy spacing/line breaks 2025-06-05 04:11:55 +10:00
Triex
a7f0a5391b docs: Tidy 2025-06-05 04:10:56 +10:00
Triex
9c25e23773 docs: Improved README, add additional references 2025-06-05 04:08:59 +10:00
Triex
9aedaae1d5 docs: Tidy list items @ README 2025-06-04 11:41:41 +10:00
Triex
69c1bab49e docs: Further tidy initial proposal idea 2025-06-04 11:38:26 +10:00
Triex
e480e15e5f docs: Tidy README 2025-06-04 11:36:38 +10:00
Triex
5c57ab1f8c docs: Move why section up to top of root README proposal/architecture notes - more cohesive flow 2025-05-23 04:23:39 +10:00
Alex Zarov
21654d7511
docs: Tidy title 2025-05-23 04:09:48 +10:00
Triex
ef39c76b2d docs: Tidy title 2025-05-23 04:08:57 +10:00
Triex
59c80bf948 docs: Enhanced draft code, table of contents + copy 2025-05-23 03:54:47 +10:00
Triex
715d0d2e6d docs: Tidy introduction 2025-05-23 03:33:03 +10:00
Triex
3af7848785 docs: Initial architecture notes for Zig implementation 2025-05-23 03:29:53 +10:00
Triex
a1895012dd feat: Initial MacBook optimisation draft for DeepSeek V3 inference > moving to Zig instead 2025-05-23 01:53:02 +10:00
Xingkai Yu
4cc6253d5c
Merge pull request #666 from codinglover222/deepseek-doc-fix
fix an args description.
2025-04-09 09:50:40 +08:00
Huang Panpan
57d7bd45df
Merge pull request #736 from shihaobai/main
Docs: add LightLLM as supported engine
2025-04-08 22:18:33 +08:00
Xingkai Yu
88d6547df2
Merge pull request #816 from KPCOFGS/main
Update README.md
2025-04-08 17:27:09 +08:00
Xingkai Yu
741b06ebca
Merge pull request #720 from xiaokongkong/main
modify the explanation of MLA
2025-04-08 17:20:37 +08:00
Shixian Sheng
a5d2ad229e
Update README.md 2025-03-26 08:58:35 -04:00
DeepSeekDDM
a878eada08
Delete DeepSeek_V3.pdf 2025-03-16 23:42:21 +08:00
DeepSeekDDM
98e67a71f4
Update paper link 2025-03-16 23:41:52 +08:00
shihaobai
408e6e188a
Update README.md
polish
2025-03-03 20:16:37 +08:00
shihaobai
73f2954fa8 polish 2025-03-03 20:10:18 +08:00
shihaobai
1ab09c8780 Docs: add LightLLM as supported engine 2025-03-03 19:23:08 +08:00
huxuedan
d29a967601 modify the explanation of MLA 2025-02-26 17:07:39 +08:00
DeepSeekDDM
592fd5daf8
Delete CITATION.cff 2025-02-24 11:50:20 +08:00
DeepSeekDDM
c9353aba6c
Update bib info 2025-02-24 11:25:44 +08:00
Huang Panpan
f09f5fa321
Merge pull request #616 from Konano/chore-readme
chore: update README.md to improve layout
2025-02-18 18:04:06 +08:00
oyzh
4a65fd9221 fix an args description. 2025-02-15 11:02:28 +08:00
Xingkai Yu
1398800ebf
fix scores mask 2025-02-14 20:26:45 +08:00
Konano
f07bccc49e
fix: resolve center alignment issue in preview 2025-02-14 12:12:16 +08:00
Konano
0866cab5f9
chore: update README.md to improve layout and image attributes 2025-02-14 12:02:10 +08:00
Konano
e15f67af1c
chore: update README.md to improve layout and image attributes 2025-02-08 18:28:40 +08:00
Huang Panpan
2f7b80eece
Merge pull request #611 from Konano/chore-stale
chore: add stale issue management configuration
2025-02-08 16:10:06 +08:00
Konano
76d8d39560
chore: add stale issue management configuration 2025-02-08 15:12:09 +08:00
Xingkai Yu
5ee97a83f0
fix comment 2025-02-07 16:42:55 +08:00
Xingkai Yu
1d7d440461
Merge pull request #432 from luislh-dev/main
remove redundant asterisks in README
2025-02-05 16:53:53 +08:00
Xingkai Yu
09d108620a
Merge pull request #440 from spenserblack/main
Add syntax highlighting to requirements code block
2025-02-05 16:50:03 +08:00
Xingkai Yu
d0f8c4fca3
Merge pull request #528 from WSL0809/main
Fix table bold formatting in TriviaQA EM comparison
2025-02-05 16:33:18 +08:00
Xingkai Yu
87a01053e4
Merge pull request #556 from XxAlonexX/main
Fix Linear Layer Bias Initialization
2025-02-05 16:23:02 +08:00
Huang Panpan
a157077c61
Merge pull request #408 from fitzjalen/refactor
Clarify assertion errors
2025-02-05 12:03:02 +08:00
Huang Panpan
c32c957fb0
Merge pull request #364 from Dhie-boop/feature/table-of-content
Add table of contents to README for better navigation
2025-02-05 11:39:08 +08:00
XxAlonexX
6a30b43249 Fix Linear Layer Bias Initialization 2025-02-04 10:38:45 +05:30
luislopez-developer
97b35f1fca docs: remove redundant asterisks in note 2025-02-03 15:02:04 -05:00
wangsl
d5c08b384b
Update README.md
fix(table): correct bold formatting for TriviaQA EM comparison

- Remove redundant bolding on LLaMA3.1 405B (82.7)
- Retain single bold style for DeepSeek-V3's highest score (82.9)
- Aligns with evaluation convention of highlighting only the best performance
2025-02-02 02:34:59 +08:00
Spenser Black
760d22821f
Add syntax highlighting to requirements code block 2025-01-28 18:07:15 -05:00
Dhieu
6784e1976d Fix TOC links to correctly link to headings in Markdown 2025-01-28 17:14:35 +03:00
Roman Fitzjalen
2756e130c2 clarify assertion error 2025-01-28 13:16:54 +01:00
Dhieu
ddc501b80e Add table of contents to README 2025-01-27 14:18:17 +03:00
Huang Panpan
b5d872ead0
Merge pull request #341 from enochkan/main
docs: Add system requirements for DeepSeek-Infer demo
2025-01-26 09:29:50 +08:00
enoch kan
53d8dc9966 docs: Update system requirements with GitHub Markdown callout 2025-01-25 22:29:54 +00:00