deepseekmirror
Updated 2025-02-21 00:11:33 -05:00
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Updated 2025-02-20 01:39:09 -05:00
Updated 2025-02-18 10:02:26 -05:00
I have a feeling this might need to be mirrored
Updated 2025-02-18 05:04:06 -05:00
Janus-Series: Unified Multimodal Understanding and Generation Models
Updated 2025-02-01 02:58:14 -05:00
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Updated 2024-09-25 06:23:55 -04:00
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Updated 2024-09-24 08:09:45 -04:00
Expert Specialized Fine-Tuning
Updated 2024-09-22 11:46:39 -04:00
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Updated 2024-08-20 22:45:40 -04:00
Updated 2024-08-15 23:33:21 -04:00
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Updated 2024-04-24 01:01:06 -04:00
DeepSeek Coder: Let the Code Write Itself
Updated 2024-04-15 22:16:07 -04:00
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Updated 2024-04-15 03:55:36 -04:00
A curated list of open-source projects related to DeepSeek Coder
Updated 2024-04-02 23:20:36 -04:00
DeepSeek LLM: Let there be answers
Updated 2024-02-04 07:22:15 -05:00