docs: update README for LMDeploy support

2025-07-02 22:41:52 -04:00 · 2024-06-26 16:22:55 +08:00 · 2024-06-26 16:22:55 +08:00 · 0d928f89a4
commit 0d928f89a4
parent c15bff20d9
1 changed files with 8 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -321,6 +321,14 @@ generated_text = [output.outputs[0].text for output in outputs]
 print(generated_text)
 ```

+### Inference with LMDeploy (recommended)
+To use [LMDeploy](https://github.com/InternLM/lmdeploy) for model inference, install LMDeploy v0.5.0 or later. The LMDeploy PyTorch engine already supports the MLA [implementation](https://github.com/InternLM/lmdeploy/pull/1621) for DeepSeek-V2. To use the pipeline API instead, see [pipeline](https://lmdeploy.readthedocs.io/en/latest/api/pipeline.html?highlight=pipeline#pipeline).
+
+```bash
+# Serve DeepSeek-V2 through an OpenAI-compatible server using the LMDeploy PyTorch engine
+python3 -m lmdeploy serve api_server DeepSeek-V2 --backend pytorch
+```
+
 ### LangChain Support
 Since our API is compatible with OpenAI, you can easily use it in [langchain](https://www.langchain.com/).
 Here is an example: