DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Updated 2025-02-20 01:39:09 -05:00
i have a feeling this might need to be mirrored
Updated 2025-02-18 05:04:06 -05:00
Janus-Series: Unified Multimodal Understanding and Generation Models
Updated 2025-02-01 02:58:14 -05:00
Expert Specialized Fine-Tuning
Updated 2024-09-22 11:46:39 -04:00
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Updated 2024-08-20 22:45:40 -04:00
Updated 2024-08-15 23:33:21 -04:00
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Updated 2024-04-24 01:01:06 -04:00
DeepSeek Coder: Let the Code Write Itself
Updated 2024-04-15 22:16:07 -04:00
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Updated 2024-04-15 03:55:36 -04:00
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Updated 2024-01-16 07:17:59 -05:00