Commit Graph

  • 53d8dc9966 docs: Update system requirements with GitHub Markdown callout #341 enoch kan 2025-01-25 22:29:54 +0000
  • 722e6885ef docs: Improve system requirements section formatting enoch kan 2025-01-25 22:26:48 +0000
  • 53b055bc1e docs: Add system requirements for DeepSeek-Infer demo enoch kan 2025-01-25 22:21:51 +0000
  • a1ca60c540
    Update requirements.txt #333 @Infer 2025-01-24 22:50:58 +0630
  • 8a45dade7b
    simplify loop in convert.py #331 qpwo 2025-01-24 01:18:42 -0800
  • 8c40067fb2
    make convert.py use multiple processes qpwo 2025-01-24 01:12:19 -0800
  • 7117f260e9 mask标记声明 #271 wanglei 2025-01-14 15:29:06 +0800
  • 9cb0b3ca37
    Merge 8941732439 into ee4c4ea32b #172 Jafar Saad 2025-01-13 02:06:14 +0700
  • ee4c4ea32b
    Merge pull request #234 from wangfuchun-fc/patch-1 Xingkai Yu 2025-01-07 17:53:28 +0800
  • 25109d2ccd
    Merge pull request #230 from jacksonpradolima/main Huang Panpan 2025-01-07 14:05:15 +0800
  • fdbd5be754
    Merge pull request #193 from enochkan/main Huang Panpan 2025-01-07 14:02:11 +0800
  • 3779a89770
    fix: fix readme doc typo. #234 wangfuchun-fc 2025-01-06 22:00:32 +0800
  • cd28bb1bf4
    Merge pull request #1 from twlhitesh/improve-accessibility #229 Hitesh Yadav 2025-01-06 15:54:11 +0530
  • 7d95c3e5ec Improve accessibility for CAPTCHA challenges in DeepSeek v3 #232 Hitesh Yadav 2025-01-06 15:53:37 +0530
  • c070549279 Add CITATION.cff to provide citation metadata #230 Jackson Antonio do Prado Lima 2025-01-05 21:46:37 -0300
  • bc77f22afc Updated model.py docstrings #193 enoch kan 2025-01-05 18:24:31 +0000
  • a1296f099e Enhance documentation and update .gitignore for model conversion scripts enoch kan 2025-01-05 18:18:18 +0000
  • bc9459df40 refactor(inference): modularize model architecture for improved maintainability Hitesh Yadav 2025-01-05 16:28:10 +0530
  • fd011c11aa torch rmsnorm GeeeekExplorer 2025-01-05 14:33:48 +0800
  • dbd2938f1b
    Create azure-webapps-python.yml #225 shhmfa 2025-01-04 10:07:01 -0500
  • 9b288b86cc
    Update README.md Xingkai Yu 2025-01-03 15:30:48 +0800
  • 0d16ea24c8
    Merge pull request #206 from kutt/patch-1 Huang Panpan 2025-01-03 09:48:03 +0800
  • 21bc231f32
    use alert formatting for notes in readme #206 kutt 2025-01-02 15:02:52 +0100
  • f5fb13ee14
    Created fuzz_target.py #168 Shivam7-1 2024-12-31 16:10:31 +0530
  • 8710ec2ecb
    require model-parallel in convert.py Xingkai Yu 2024-12-31 18:05:55 +0800
  • 8941732439
    Create python-app.yml #172 #163 Jafar Saad 2024-12-31 14:35:29 +0700
  • 7c2466b310
    Update issue templates Huang Panpan 2024-12-31 14:49:05 +0800
  • dd6882bc3d
    Delete LICENSE-MODEL #113 gogo67xxxggg 2024-12-30 15:20:45 -0600
  • ba602e3800
    Delete .github/workflows directory #112 gogo67xxxggg 2024-12-30 15:07:59 -0600
  • 641ed202ec
    Create python-package-conda.yml gogo67xxxggg 2024-12-30 15:07:35 -0600
  • 1b8e18cc29
    Merge pull request #21 from eltociear/patch-1 Huang Panpan 2024-12-30 15:03:30 +0800
  • 94410f8d58
    Merge pull request #33 from zhyncs/main Haswell Iris 2024-12-30 14:37:38 +0800
  • 68d0061937 upd #33 zhyncs 2024-12-30 14:25:28 +0800
  • 2fc98d1cdf upd zhyncs 2024-12-30 14:21:00 +0800
  • a1edf4138e upd zhyncs 2024-12-30 14:18:00 +0800
  • 8638950ec2 docs: update SGLang usage zhyncs 2024-12-30 14:13:27 +0800
  • 83dd18eda4
    Update README.md DeepSeekDDM 2024-12-30 11:04:14 +0800
  • 710c8b8b6e
    docs: update README.md #21 Ikko Eltociear Ashimine 2024-12-29 00:43:11 +0900
  • 8f1c9488b5
    handle missing scale_inv_name (#2) Yang Wang 2024-12-27 09:34:38 +0800
  • c8087bd8b8
    Merge pull request #9 from simon-mo/vllm Huang Panpan 2024-12-27 09:16:09 +0800
  • e2c15caf04 add version #9 simon-mo 2024-12-26 17:11:31 -0800
  • cf47874d8e Docs: add vLLM as supported engine simon-mo 2024-12-26 17:10:33 -0800
  • 6e7c5ee471 remove interface #5 AK391 2024-12-26 18:09:56 +0100
  • 2ab082f2f5 add gradio app AK391 2024-12-26 17:25:34 +0100
  • 65d8f5f1e9
    Add CUDA cache clearing in memory management #2 Yang Wang 2024-12-26 23:18:39 +0800
  • e6e66fd23f
    sort filename to reduce memory costs Yang Wang 2024-12-26 23:14:44 +0800
  • 1e3a83629e
    handle missing scale_inv_name Yang Wang 2024-12-26 23:09:17 +0800
  • 4c2fdb8f55 Release DeepSeek-V3 stack-heap-overflow 2024-12-26 19:01:57 +0800
  • 4b58dc6bfc
    Initial commit stack-heap-overflow 2024-12-26 17:52:41 +0800