From 618ecfb0c95837a5a568a238a38906c1505cbf4d Mon Sep 17 00:00:00 2001 From: Alex Zarov Date: Wed, 11 Jun 2025 19:45:50 +1000 Subject: [PATCH] docs: Update `README.md` --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index b5ac65c..3137cb7 100644 --- a/README.md +++ b/README.md @@ -30,10 +30,10 @@ A **DRAFT proposal & foundation** for implementing DeepSeek V3 in Zig to create - ✅ **Apple Silicon M-series detection** (hardware detection via sysctl) - ✅ Comprehensive build system draft - ✅ **BLAS integration working** (Apple Accelerate backend functional) -- ✅ **Improved matrix operations** (1000+ GFLOPS performance) +- ✅ **Improved matrix operations** (1000+ GFLOPS performance on an M1 Macbook) - ⚠️ **NOT PRODUCTION READY** - Draft implementation for research/development -**Performance Update**: ~~Current naive algorithms are ~1000x slower than optimized BLAS~~ **BLAS integration now functional.** Matrix multiplication: **2.1ms for 1024×1024** at **1000+ GFLOPS**. This represents significant improvement over our initial naive implementation. See [experimental benchmarks](experimental/README.md#benchmarks) for detailed performance data. +**Performance Update**: ~~Current naive algorithms are ~1000x slower than optimized BLAS~~ **BLAS integration now functional.** Matrix multiplication: **2.1ms for 1024×1024** at **1000+ GFLOPS** on an M1 Macbook. This represents significant improvement over our initial naive implementation. See [experimental benchmarks](experimental/README.md#benchmarks) for detailed performance data. ## Why This Matters