docs: local serving with ollama documented (#8807)

Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
Alexander 2025-06-17 14:18:18 +03:00 committed by GitHub
parent ddaa186971
commit d81d2f62cb


@@ -126,6 +126,18 @@ vllm serve all-hands/openhands-lm-32b-v0.1 \
--enable-prefix-caching
```
### Create an OpenAI-Compatible Endpoint with Ollama
- Install Ollama following [the official documentation](https://ollama.com/download).
- For the Ollama configuration, use `ollama/<modelname>` as the custom model in the web UI. The API key can also be set to `ollama`.
- Example launch commands for Devstral LM 24B:
```bash
OLLAMA_CONTEXT_LENGTH=32768 OLLAMA_HOST=0.0.0.0:11434 OLLAMA_KEEP_ALIVE=-1 nohup ollama serve &
# The minimum context size is ~8192 tokens; even the system prompt won't fit in anything smaller
ollama pull devstral:latest
```
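
Once the server is running, you can confirm that the OpenAI-compatible endpoint responds before pointing OpenHands at it. A minimal check, assuming the default port `11434` and the `devstral:latest` model pulled above:

```bash
# List the models available locally
ollama list

# Query Ollama's OpenAI-compatible chat endpoint; the API key is not validated,
# so any placeholder value (e.g. "ollama") works
curl http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer ollama" \
  -d '{"model": "devstral:latest", "messages": [{"role": "user", "content": "Say hello"}]}'
```

If this returns a completion, use `ollama/devstral:latest` as the custom model in the web UI as described above.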
## Advanced: Run and Configure OpenHands
### Run OpenHands