chore: add chatbox demo

This commit is contained in:
Han Xiao
2025-02-09 20:08:54 +08:00
parent 589a4f507a
commit 18f0312c38

View File

@@ -289,7 +289,7 @@ I kept the evaluation simple, LLM-as-a-judge and collect some [ego questions](./
I mainly look at 3 things: total steps, total tokens, and the correctness of the final answer.
```bash
npm run eval ./src/evals/ego-questions.json
npm run eval ./src/evals/ego-questions
```
Here's the table comparing plain `gemini-2.0-flash` and `gemini-2.0-flash + node-deepresearch` on the ego set.