DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
README.md Initial commit 2024-01-02 11:32:22 +08:00

DeepSeek-MoE