mirror of https://github.com/deepseek-ai/DeepSeek-LLM.git (synced 2025-02-23 06:09:06 -05:00)
Update README.md
add notes on ChatLLM.cpp.
parent 81cc42c689
commit 21ffed3f33
@@ -346,6 +346,9 @@ python convert-hf-to-gguf.py <MODEL_PATH> --outfile <GGUF_PATH> --model-name dee
 ./quantize <GGUF_PATH> <OUTPUT_PATH> q4_0
 ./main -m <OUTPUT_PATH> -n 128 -p <PROMPT>
 ```
+
+You can also try out other [ggml](https://github.com/ggerganov/ggml)-based inferencers, such as [ChatLLM.cpp](https://github.com/foldl/chatllm.cpp), as well.
+
 #### GPTQ(exllamav2)
 
 `UPDATE:`[exllamav2](https://github.com/turboderp/exllamav2) has been able to support HuggingFace Tokenizer. Please pull the latest version and try out.