mirror of
https://github.com/deepseek-ai/DeepSeek-V3.git
synced 2025-04-19 18:18:57 -04:00
Type Hints & Path Management: Added comprehensive type annotations Used pathlib.Path for safer path handling Enhanced Error Handling: Structured exception handling throughout Clear error messages with context Safe resource cleanup Memory Management: LRU cache implementation with OrderedDict Configurable cache size Explicit GPU memory cleanup Logging System: Configurable logging levels Detailed progress tracking Structured error reporting Code Organization: Split into focused, testable functions Clear separation of concerns Documented public methods Validation & Safety: Input path validation Weight type checking Clone tensors to prevent reference issues Performance: Optimized file loading with LRU cache Batched tensor processing Asynchronous CUDA operations Metadata & Traceability: Added conversion metadata to output files Preserved original index structure Enhanced output index information Configuration: Centralized constants Device-aware execution (CUDA/CPU) Progress Tracking: Nested progress bars Detailed file processing status |
||
---|---|---|
.. | ||
configs | ||
convert.py | ||
fp8_cast_bf16.py | ||
generate.py | ||
kernel.py | ||
model.py | ||
requirements.txt |