# DeepZig V3: A High-Performance LLM Architecture

## Overview
A DRAFT proposal & foundation for implementing DeepSeek V3 in Zig to create a high-performance, web-ready LLM inference engine. This leverages Zig's unique advantages for systems programming while targeting modern deployment scenarios.
⚠️ Status: EXPERIMENTAL DRAFT ✅ Foundation compiles with Zig 0.15.0-dev, including:
- ✅ HTTP server framework (basic structure)
- ✅ SIMD-optimized tensor operations (draft implementation)
- ✅ Cross-platform backend architecture
- ✅ Initial memory management
- ✅ Apple Silicon M-series detection (real hardware detection via sysctl)
- ✅ Comprehensive build system draft
- ⚠️ NOT PRODUCTION READY - Draft implementation for research/development
**Performance Note**: Current naive algorithms are ~1000x slower than optimized BLAS; matrix multiplication takes 6418ms for 1024×1024 (see the sketch below). This is expected for a foundational draft implementation. See the experimental benchmarks for detailed performance data.
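To make that gap concrete, the baseline is essentially the textbook triple loop. A minimal sketch of such a naive kernel (illustrative only, not the repository's actual code):

```zig
/// Naive O(n^3) matrix multiply over row-major n×n matrices.
/// No tiling, no SIMD, no threading: the inner loop strides through
/// `b` column-wise, missing cache on nearly every access, which is
/// why it lands ~1000x behind an optimized BLAS kernel.
fn matmulNaive(n: usize, a: []const f32, b: []const f32, out: []f32) void {
    for (0..n) |i| {
        for (0..n) |j| {
            var sum: f32 = 0.0;
            for (0..n) |k| {
                sum += a[i * n + k] * b[k * n + j];
            }
            out[i * n + j] = sum;
        }
    }
}
```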
## Why This Matters
Current LLM inference is dominated by Python/PyTorch, which introduces:
- Garbage collection pauses during generation
- Runtime overhead from dynamic dispatch
- Complex deployment with heavy runtimes
- Platform lock-in due to dependency complexity
## Expected Benefits vs Current Reality

| Aspect | Current (PyTorch) | Target (Zig) | Current Draft |
|---|---|---|---|
| Cold start | 10-30s | < 2s | Not measured |
| Memory usage | 20-40GB | < 16GB | 16GB+ for basic ops |
| Dependencies | ~2GB runtime | Single binary | ✅ Single binary |
| Deployment | Complex | Copy & run | ✅ Copy & run |
| Matrix mul (1024×1024) | ~1ms (optimized) | < 1ms | 6418ms (naive) |
See experimental benchmarks for current performance measurements.
## Why Zig?

**Performance**: Zero-cost abstractions, compile-time optimization, direct hardware access

**Simplicity**: Single static binary, no runtime dependencies, cross-compilation built-in

**Web-First**: Native HTTP server, WebAssembly compilation, efficient memory management
## Proposed Architecture

```
┌─────────────────┐ ┌──────────────────┐ ┌─────────────────┐
│ Web Layer │ │ Core Engine │ │ Backends │
│ │ │ │ │ │
│ ├─ HTTP API │◄──►│ ├─ Transformer │◄──►│ ├─ CPU (SIMD) │
│ ├─ WebSocket │ │ ├─ Attention │ │ ├─ Metal (macOS)│
│ ├─ Rate Limit │ │ ├─ MoE Routing │ │ ├─ CUDA (Linux) │
│ └─ Auth │ │ └─ Tokenizer │ │ └─ WebGPU │
└─────────────────┘ └──────────────────┘ └─────────────────┘
```
## Draft Web API Framework

### Planned Endpoints (Basic Structure Implemented)
- `POST /v1/chat/completions` - OpenAI-compatible chat API
- `POST /v1/completions` - Text completion
- `GET /v1/models` - List available models
- `GET /health` - Service health check
- `WebSocket /ws` - Streaming inference (planned)
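For illustration, the chat endpoint's request body could be modeled as a plain struct and decoded with `std.json`. This is a hypothetical sketch (`ChatMessage`, `ChatCompletionRequest`, and `parseRequest` are invented names; the field names follow the public OpenAI wire format, not code from this repository):

```zig
const std = @import("std");

// Hypothetical request shape; field names mirror the public
// OpenAI chat-completions format, not this repository's code.
const ChatMessage = struct {
    role: []const u8,
    content: []const u8,
};

const ChatCompletionRequest = struct {
    model: []const u8,
    messages: []const ChatMessage,
    stream: bool = false,
};

fn parseRequest(
    allocator: std.mem.Allocator,
    body: []const u8,
) !std.json.Parsed(ChatCompletionRequest) {
    // Caller owns the result and must call .deinit() on it.
    return std.json.parseFromSlice(ChatCompletionRequest, allocator, body, .{
        .ignore_unknown_fields = true,
    });
}
```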
## Deployment Vision
- Static binaries - Single file deployment, no dependencies
- Direct VPS deployment - Copy binary and run with systemd
- Edge devices - ARM/RISC-V cross-compilation
- Serverless functions - Minimal cold start with static linking
- WebAssembly - Browser inference without additional runtime
## Implementation Plan Status

### Phase 1: Foundation ✅ DRAFT COMPLETE

- Set up Zig project structure
- Implement basic tensor operations with SIMD
- Create memory management system (arena allocators)
- Build HTTP server framework
- Apple Silicon detection via sysctl calls
- Updated to Zig 0.15.0-dev - compiles cleanly
- Benchmark suite showing current performance
📈 Performance baseline established - see benchmarks
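The baseline numbers come from simple wall-clock timing. A minimal sketch of that pattern using `std.time.Timer` (the actual suite lives under `experimental/` and may differ):

```zig
const std = @import("std");

pub fn main() !void {
    var timer = try std.time.Timer.start();
    // ... run the kernel under test here, e.g. a 1024×1024 matmul ...
    const elapsed_ns = timer.read();
    std.debug.print("elapsed: {d:.1} ms\n", .{
        @as(f64, @floatFromInt(elapsed_ns)) / std.time.ns_per_ms,
    });
}
```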
### Phase 2: Core Model (IN PROGRESS)

- Implement transformer layers
- Add Multi-Head Latent Attention (MLA)
- Build Mixture of Experts (MoE) routing
- Create tokenizer integration
### Phase 3: Backends (PLANNED)

- Optimize CPU backend with AVX/NEON
- Integrate Metal for Apple Silicon
- Add CUDA support for NVIDIA GPUs
- Implement WebGPU for browsers
### Phase 4: Web Integration (DRAFT STRUCTURE)

- Complete HTTP API implementation (basic structure)
- Add WebSocket streaming
- Build authentication/rate limiting
- Create deployment tooling
## Technical Challenges
- Model Complexity: DeepSeek V3's MoE architecture requires careful memory management
- Backend Integration: Need efficient FFI to CUDA/Metal while maintaining performance
- Web Scale: Handle concurrent requests without blocking inference
- Accuracy: Match PyTorch numerical precision
- Performance: Current implementation is ~1000x slower than optimized BLAS; major optimization needed
## Platform-Specific Opportunities

### Apple Silicon (M-Series) ✅ Draft Detection Implemented
- Metal Performance Shaders integration for matrix operations
- AMX instruction set access for accelerated linear algebra
- Unified memory architecture exploitation for zero-copy transfers
- Power efficiency tuning across P and E cores
- ✅ Proper M1/M2/M3/M4 detection via system calls
Current status: Hardware detection working, GPU acceleration not yet implemented.
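For reference, the detection pattern boils down to a `sysctlbyname` call through Zig's C interop. A simplified sketch (`readHwModel` is an invented name; link with `-lc`, and the actual implementation in `src/` may differ in detail):

```zig
const c = @cImport({
    @cInclude("sys/types.h");
    @cInclude("sys/sysctl.h");
});

/// Read `hw.model` (e.g. "MacBookPro17,1") into `buf`.
/// The CPU brand string ("Apple M1", ...) comes from the analogous
/// `machdep.cpu.brand_string` key.
fn readHwModel(buf: []u8) ?[]const u8 {
    var len: usize = buf.len;
    if (c.sysctlbyname("hw.model", buf.ptr, &len, null, 0) != 0) return null;
    if (len == 0) return null;
    return buf[0 .. len - 1]; // reported length includes the trailing NUL
}
```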
### x86_64 Architecture
- AVX-512 vectorization with masked operations
- Cache-friendly memory layouts for L1/L2/L3 optimization
- NUMA-aware allocation and thread assignment
- Dynamic dispatch based on runtime CPU feature detection
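As a sketch of the dispatch idea: Zig exposes the target's CPU feature set at compile time, and a fully portable binary would pair this with a runtime `cpuid` probe to pick a kernel at startup:

```zig
const std = @import("std");
const builtin = @import("builtin");

// Compile-time check against the build target's feature set. A portable
// binary would additionally probe cpuid at startup and select a kernel.
const has_avx512 = builtin.cpu.arch == .x86_64 and
    std.Target.x86.featureSetHas(builtin.cpu.features, .avx512f);

pub fn main() void {
    std.debug.print("AVX-512F in target features: {}\n", .{has_avx512});
}
```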
### NVIDIA GPUs
- CUDA integration via efficient FFI bindings
- Tensor Core utilization for mixed-precision operations
- Custom kernels for attention mechanisms
- Memory pooling for reduced allocation overhead
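The FFI surface can stay small: Zig declares driver-API entry points directly, with no binding generator. A hypothetical sketch against the public CUDA driver API (`cudaDeviceCount` is an invented wrapper; link with `-lcuda`):

```zig
// Signatures match the public CUDA driver API; everything beyond
// this point would be project-specific kernel and memory management.
extern fn cuInit(flags: c_uint) c_int;
extern fn cuDeviceGetCount(count: *c_int) c_int;

/// Returns the number of CUDA devices, or null if the driver
/// fails to initialize (CUDA_SUCCESS == 0).
pub fn cudaDeviceCount() ?c_int {
    if (cuInit(0) != 0) return null;
    var n: c_int = 0;
    if (cuDeviceGetCount(&n) != 0) return null;
    return n;
}
```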
## Getting Started

**Current Status**: This repository contains a DRAFT EXPERIMENTAL Zig implementation foundation.

**For the Current Zig Implementation:**
```bash
# Clone this repository
git clone https://github.com/[current-repo-path]
cd DeepSeek-V3-Zig/experimental

# Build and test the foundation
zig build

# Run the HTTP server (basic structure)
zig build run -- --port 8080

# Run benchmarks (see actual performance)
zig build bench

# Test Apple Silicon detection
zig build-exe src/test_m_series.zig -I src -lc -framework Metal -framework Foundation
./test_m_series
```
📊 **Performance Reality Check**: See `experimental/README.md` for actual benchmark results showing current performance limitations and optimization opportunities.
## Development Approach
Following established Zig patterns:
- Arena allocators for request-scoped memory
- Error unions for explicit error handling
- Comptime generics for zero-cost abstractions
- SIMD vectors for numerical computation
Reference: Zig Cookbook for implementation patterns.
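A compact sketch of the first and last patterns together, assuming nothing beyond the standard library:

```zig
const std = @import("std");

pub fn main() !void {
    // Arena allocator: every allocation tied to one request is
    // released in a single deinit, with no per-object bookkeeping.
    var arena = std.heap.ArenaAllocator.init(std.heap.page_allocator);
    defer arena.deinit();
    const alloc = arena.allocator();

    const values = try alloc.alloc(f32, 8);
    @memset(values, 1.5);

    // SIMD vectors: @Vector lowers to native vector registers
    // (NEON, AVX, ...) where the target supports them.
    const Vec = @Vector(8, f32);
    const v: Vec = values[0..8].*;
    const doubled = v + v;
    std.debug.print("{any}\n", .{doubled});
}
```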
## Seeking Contributors
This is an ambitious DRAFT project that would benefit from expertise in:
- Performance optimization (current bottleneck: naive matrix operations)
- Zig systems programming
- GPU kernel optimization (CUDA/Metal)
- ML model implementation
- Web server development
- Hardware-software co-design
- Novel inference techniques (speculative decoding, quantization)
## Current Limitations & Next Steps
🚧 What's Working: Compiles, runs, measures performance
⚠️ What's Missing: Optimized algorithms, robust flows, actual DeepSeek V3 model
📊 Performance Gap: 1000x slower than production systems
🎯 Next Priority: BLAS integration and GPU acceleration
See experimental implementation for technical details and current benchmarks.
## References
- DeepZig V3 (Experimental Implementation) - Current working code
- DeepSeek V3 Paper - Original model architecture
- Zig Language - Language documentation
- Awesome Zig - Community resources
- Zig Patterns - Common idioms
- ZML - Zig Inference Stack
- LLaMA.cpp - C++ Inference Engine
- DeepZig Consciousness - Research goal/end game
Status: 🎯 EXPERIMENTAL DRAFT - Foundation compiles and runs basic operations (see benchmarks)
Vision: Foundation for advanced AI reasoning research
⚠️ Important: This is a research/development foundation with draft/base implementations. Not ready for production use.