Update README.md

2025-06-20 16:44:02 -04:00 · 2024-05-06 22:50:03 +08:00 · 2024-05-06 22:50:03 +08:00 · e23eeb51a8
commit e23eeb51a8
parent 9c7aa9ce01
1 changed files with 1 additions and 1 deletions
--- a/README.md
+++ b/README.md
@ -165,7 +165,7 @@ We evaluate our model on LiveCodeBench (0901-0401), a benchmark designed for liv

 ## 4. Model Architecture
 DeepSeek-V2 adopts innovative architectures to guarantee economical training and efficient inference： 
- For attention, we design IEAttn, which utilizes low-rank key-value union compression to eliminate the bottleneck of inference-time key-value cache, thus supporting efficient inference. 
+- For attention, we design MLA (Multi-head Latent Attention), which utilizes low-rank key-value union compression to eliminate the bottleneck of inference-time key-value cache, thus supporting efficient inference. 
 - For Feed-Forward Networks (FFNs), we adopt DeepSeekMoE architecture, a high-performance MoE architecture that enables training stronger models at lower costs. 

 <p align="center">