From b5caaa40dd6a8d3387b96aef801a47137564c24a Mon Sep 17 00:00:00 2001 From: ZHU QIHAO <18811325956@163.com> Date: Fri, 3 Nov 2023 09:27:13 +0800 Subject: [PATCH] Update README.md --- Evaluation/HumanEval/README.md | 1 - 1 file changed, 1 deletion(-) diff --git a/Evaluation/HumanEval/README.md b/Evaluation/HumanEval/README.md index 9c9a299..b672e23 100644 --- a/Evaluation/HumanEval/README.md +++ b/Evaluation/HumanEval/README.md @@ -58,7 +58,6 @@ We report experimental results here for 8 main-stream programming languages, **p | GPT-4 | - | **84.1%** | **76.4%** | **81.6%**| **77.2%**| **77.4%**| **79.1%**| **58.2%**| **78.0%**| **76.5%**| | | | | | | | | | | | | | DeepSeek-Coder-Instruct | 1.3B | 65.2% | 45.3% | 51.9% | 45.3% | 59.7% |55.1% | 12.7% | 52.2% | 48.4% | -| DeepSeek-Coder-Instruct | 5.7B | 76.2% | 61.5% | 68.4% | 65.2%| 62.9%| 67.1%| 34.2%| 69.6%| 63.1%| | DeepSeek-Coder-Instruct | 6.7B | 78.9% | 63.4% | 68.4% | 68.9%| 67.2%| 72.8%| 36.7%| 72.7%| 66.1%| | DeepSeek-Coder-Instruct | 33B | **79.3%** | **68.9%** | **73.4%** | **72.7%**| **67.9%**| **74.1%**| **43.0%**| **73.9%**| **69.2%**|