Update README.md

add notes on ChatLLM.cpp.
2025-06-20 16:33:50 -04:00 · 2023-12-09 18:10:38 +08:00 · 2023-12-09 18:10:38 +08:00 · 21ffed3f33
commit 21ffed3f33
parent 81cc42c689
1 changed files with 3 additions and 0 deletions
--- a/README.md
+++ b/README.md
@ -346,6 +346,9 @@ python convert-hf-to-gguf.py <MODEL_PATH> --outfile <GGUF_PATH> --model-name dee
 ./quantize <GGUF_PATH> <OUTPUT_PATH> q4_0
 ./main -m <OUTPUT_PATH> -n 128 -p <PROMPT>
 ```
 You can also try out other [ggml](https://github.com/ggerganov/ggml)-based inferencers, such as [ChatLLM.cpp](https://github.com/foldl/chatllm.cpp), as well.
 #### GPTQ(exllamav2)
 `UPDATE:`[exllamav2](https://github.com/turboderp/exllamav2) has been able to support HuggingFace Tokenizer. Please pull the latest version and try out.