Commit Graph

  • 84e7789ef8
    Merge branch 'deepseek-ai:main' into fix-bf16-weight-metadata #749 Yanyue Xie 2025-03-27 15:13:01 -0400
  • dcc938bc09 Performance upgrade #818 HyptexPvP 2025-03-26 14:37:43 -0400
  • a5d2ad229e
    Update README.md #816 Shixian Sheng 2025-03-26 08:58:35 -0400
  • 263f44a907 resumen.md german 2025-03-25 17:29:28 -0300
  • 9271ce77b2 fix readme es german 2025-03-25 17:23:39 -0300
  • 4dc6248aaa readme es german 2025-03-25 15:43:29 -0300
  • 951881dfe7 readmeweihtgs_es.md german 2025-03-25 15:40:03 -0300
  • 4efaf49eef Bump transformers in /inference in the pip group across 1 directory dependabot[bot] 2025-02-11 16:36:07 +0000
  • 8b1908b7da
    Merge pull request #6 from nodoubtz/nodoubtz-patch-5 Nodoubtz 2025-03-25 12:45:06 -0400
  • 22247feb37
    Merge pull request #9 from nodoubtz/nodoubtz-patch-7 Nodoubtz 2025-03-25 12:44:43 -0400
  • d8b69589f5
    Merge pull request #13 from nodoubtz/nodoubtz-patch-11 Nodoubtz 2025-03-25 12:44:12 -0400
  • ebe6bf0f56
    Merge pull request #10 from nodoubtz/nodoubtz-patch-8 Nodoubtz 2025-03-25 12:43:41 -0400
  • 0623163343 add bf16 to fp8 yzlnew 2025-03-25 19:38:45 +0800
  • b691bc09b4 Mejora: actualicé el README #804 #803 aquinovo 2025-03-24 15:58:17 -0600
  • 5cc66fcdeb
    Merge pull request #21 from nodoubtz/nodoubtz-patch-19 Nodoubtz 2025-03-23 17:32:34 -0400
  • 491f14a6be
    Create LICENSE Nodoubtz 2025-03-23 17:31:20 -0400
  • b3e0f2b005 fork #798 Rupali mangla 2025-03-23 14:53:40 +0530
  • 82a134bee0
    Merge pull request #20 from nodoubtz/nodoubtz-patch-18 #799 Nodoubtz 2025-03-22 14:27:17 -0400
  • 06b92077ac
    Create greetings.yml Nodoubtz 2025-03-22 14:26:34 -0400
  • d040eae355
    Merge pull request #19 from nodoubtz/nodoubtz-patch-17 Nodoubtz 2025-03-22 14:24:11 -0400
  • 7d23978b98
    Create msbuild.yml Nodoubtz 2025-03-22 14:23:25 -0400
  • 732e4c50c0
    Merge pull request #18 from nodoubtz/nodoubtz-patch-16 Nodoubtz 2025-03-22 14:22:53 -0400
  • 84e905fbd3
    Create go.yml Nodoubtz 2025-03-22 14:22:14 -0400
  • 9a994893da
    Merge pull request #17 from nodoubtz/nodoubtz-patch-15 Nodoubtz 2025-03-22 14:20:03 -0400
  • bfce3bd699
    Create python-app.yml Nodoubtz 2025-03-22 14:19:15 -0400
  • 07777c5c9c
    Merge pull request #16 from nodoubtz/nodoubtz-patch-14 Nodoubtz 2025-03-22 14:17:31 -0400
  • ee6f790d7a
    Create alibabacloud.yml Nodoubtz 2025-03-22 14:16:44 -0400
  • a3fb930ee1
    Merge pull request #15 from nodoubtz/nodoubtz-patch-13 Nodoubtz 2025-03-22 14:15:01 -0400
  • 69fd4cb792
    Create google.yml Nodoubtz 2025-03-22 14:14:10 -0400
  • 032096f568
    Merge pull request #14 from nodoubtz/nodoubtz-patch-12 Nodoubtz 2025-03-22 14:12:54 -0400
  • 7982f14f23
    Create azure-webapps-python.yml Nodoubtz 2025-03-22 14:12:04 -0400
  • bccac3e3dc
    Create azure-functions-app-python.yml Nodoubtz 2025-03-22 14:10:04 -0400
  • 5f238ec41a
    Merge pull request #12 from nodoubtz/nodoubtz-patch-10 Nodoubtz 2025-03-22 14:07:32 -0400
  • b20e008f84
    Create c-cpp.yml Nodoubtz 2025-03-22 14:06:32 -0400
  • 020b89c640
    Merge pull request #11 from nodoubtz/nodoubtz-patch-9 Nodoubtz 2025-03-22 14:04:14 -0400
  • b10ecfe48f
    Create codeql.yml Nodoubtz 2025-03-22 14:03:03 -0400
  • 47993f9318
    Create docker-publish.yml Nodoubtz 2025-03-22 14:01:33 -0400
  • 2f21798107
    Create manual.yml Nodoubtz 2025-03-22 13:55:20 -0400
  • 3e1d4cfa25
    Merge pull request #7 from nodoubtz/nodoubtz-patch-6 Nodoubtz 2025-03-22 13:51:33 -0400
  • f0ccd7666e
    Create pylint.yml Nodoubtz 2025-03-22 13:50:30 -0400
  • aa72c158bf my intro #796 ritikajindal1127 2025-03-22 22:52:05 +0530
  • e6cbe50ea7
    Create static.yml Nodoubtz 2025-03-21 22:48:32 -0400
  • 45ba56b1a9
    Merge branch 'deepseek-ai:main' into patch-2 #793 Nodoubtz 2025-03-21 22:08:27 -0400
  • cf5db96939
    Create ibm.yml #791 Nodoubtz 2025-03-21 21:59:29 -0400
  • d659f51464
    Merge branch 'deepseek-ai:main' into main Nodoubtz 2025-03-21 21:53:41 -0400
  • 0fbd7b02d2
    Merge 8941732439 into a878eada08 #172 Jafar Saad 2025-03-22 04:54:19 +0700
  • 0fbb04d6be
    DeepSeek Commit #786 Mohamed Elgendy 2025-03-20 17:44:07 +0200
  • 08b00b7bdc
    DeepSeek Commit2 Mohamed Elgendy 2025-03-20 17:41:21 +0200
  • 33e384c101
    DeepSeek Commit Mohamed Elgendy 2025-03-20 17:37:41 +0200
  • 31c621793d
    abstract #785 musvaage 2025-03-20 10:06:38 -0500
  • a878eada08
    Delete DeepSeek_V3.pdf DeepSeekDDM 2025-03-16 23:42:21 +0800
  • 98e67a71f4
    Update paper link DeepSeekDDM 2025-03-16 23:41:52 +0800
  • 79a8026ae8
    Create devcontainer.json Nodoubtz 2025-03-15 13:39:16 -0400
  • e4f555ca6f intro #729 musvaage 2025-03-14 13:23:44 -0500
  • 100a5aca1e
    Merge branch 'deepseek-ai:main' into patch-2 #769 #768 Nodoubtz 2025-03-14 07:40:03 -0400
  • 2988c05daa
    Update generate.py #764 Antonio Gallardo García 2025-03-13 13:38:21 +0100
  • 3eefc21c1c
    Add files via upload #762 sana329 2025-03-11 14:27:35 +0530
  • 9844d3f642
    Add files via upload sana329 2025-03-11 14:26:22 +0530
  • d3d00f45be
    deepseek file uploded sana329 2025-03-11 14:25:23 +0530
  • 0cd41b2a5b
    Merge branch 'deepseek-ai:main' into main #770 #756 Nodoubtz 2025-03-08 19:11:07 -0500
  • ac83e3fb95
    Update fp8_cast_bf16.py #753 SS7896 2025-03-09 05:25:34 +0700
  • 7fbf99b32b Add zh version of README #747 windsonsea 2025-03-05 16:24:01 +0800
  • 3421621d7b
    NoneType check #751 A-transformer 2025-03-06 19:33:10 +0400
  • be411d69f4 Fix: add metadata to bf16 safetensors for loading using transformers root 2025-03-06 14:25:47 +0800
  • 408e6e188a
    Update README.md #736 shihaobai 2025-03-03 20:16:37 +0800
  • 73f2954fa8 polish shihaobai 2025-03-03 20:10:18 +0800
  • ebd889518d
    Update kernel.py #735 sunndy 2025-03-03 19:38:53 +0800
  • 1ab09c8780 Docs: add LightLLM as supported engine shihaobai 2025-03-03 19:23:08 +0800
  • d29a967601 modify the explanation of MLA #720 huxuedan 2025-02-26 17:06:54 +0800
  • 9539eba28d
    Rename DeepSeek_V3.pdf to DeepSeekv3pdf #718 krackn88 2025-02-25 17:07:37 -0500
  • 6db27b90e0
    Merge branch 'deepseek-ai:main' into main #687 Can Deliktaş 2025-02-25 19:01:24 +0300
  • d257cbe733
    Critical Improvements for Model Correctness, Efficiency, and Robustness #717 Abdur Rahman 2025-02-25 21:58:34 +0600
  • 3432a23b65
    Merge daba5c1f78 into 592fd5daf8 #488 Ivan Lloyd Roquero 2025-02-25 21:11:38 +0800
  • b156e5450b
    Update README.md #437 Dhieu 2025-02-25 12:26:30 +0300
  • 00da010bd1
    Merge 687f06b004 into 592fd5daf8 #564 sudopacman 2025-02-24 11:44:08 +0600
  • 592fd5daf8
    Delete CITATION.cff #732 DeepSeekDDM 2025-02-24 11:50:20 +0800
  • c9353aba6c
    Update bib info DeepSeekDDM 2025-02-24 11:25:44 +0800
  • 57582c60f4
    Absorb w_uk into wo #702 Xu Song 2025-02-22 14:50:32 +0800
  • 973f949e94
    Update feature_request.md #701 Adugna Gizaw 2025-02-22 04:11:12 +0300
  • f23598fb22
    Create CODE_OF_CONDUCT.md Yen Huynh 2025-02-21 16:21:39 -0500
  • 39cc27e8f0
    Merge 79f733dda5 into f09f5fa321 #699 Yen Huynh 2025-02-21 16:18:01 -0500
  • 79f733dda5
    Create CODE_OF_CONDUCT.md #699 Yen Huynh 2025-02-21 16:17:45 -0500
  • a3d882baf8
    Merge 1766d255bc into f09f5fa321 #674 NKCSRairdrop NFT Finance Guardian 2025-02-21 18:33:37 +0800
  • e1d0b2ad64 update by rr #690 roshan1727 2025-02-20 19:45:53 +0530
  • cb8d1f72e6
    Update README_Turkish.md Can Deliktaş 2025-02-19 18:55:04 +0300
  • c909a3b3d5
    Delete languages/turkish directory Can Deliktaş 2025-02-19 18:28:30 +0300
  • 3bca5239dc
    Create README_WEIGHTS_Turkish.md Can Deliktaş 2025-02-19 18:26:46 +0300
  • 6b1cd5993a
    Create README_Turkish.md Can Deliktaş 2025-02-19 18:26:21 +0300
  • 556c115fff
    Delete languages/turkish directory Can Deliktaş 2025-02-19 18:24:07 +0300
  • 3fa7464346
    README_Turkish Can Deliktaş 2025-02-19 18:23:37 +0300
  • 4aca6bd241
    Delete languages /Turkish directory Can Deliktaş 2025-02-19 18:20:45 +0300
  • 4cbd0ab179
    Create t Can Deliktaş 2025-02-19 18:20:24 +0300
  • 06ab453160
    Merge branch 'deepseek-ai:main' into main Can Deliktaş 2025-02-19 18:10:35 +0300
  • 5bb008364b
    Add files via upload Can Deliktaş 2025-02-19 18:10:06 +0300
  • 213bbf5ecf
    Rename README_WEIGHTS_Turkish.mdmd to README_WEIGHTS_Turkish.md Can Deliktaş 2025-02-19 18:08:32 +0300
  • 76fd958ed4
    Rename README_turkish.md to README_Turkish.md Can Deliktaş 2025-02-19 18:08:10 +0300
  • 085ed17781
    Rename README_WEIGHTS.md to README_WEIGHTS_Turkish.mdmd Can Deliktaş 2025-02-19 18:07:49 +0300
  • 14fddd7cb7
    Rename README.md to README_turkish.md Can Deliktaş 2025-02-19 18:07:09 +0300
  • 79d72ecd8d Optimize Multi-head Latent Attention (MLA) with Fast Path for Short Sequences #684 XxAlonexX 2025-02-19 10:35:28 +0530
  • f8b7c3b6e7 Merge branch 'main' of github.com:XxAlonexX/DeepSeek-V3 XxAlonexX 2025-02-19 10:32:29 +0530