update gaia

This commit is contained in:
lazychih114
2025-03-10 07:52:07 +08:00
parent a31f4de05b
commit 49979497d7
2 changed files with 15 additions and 4 deletions

View File

@@ -172,9 +172,14 @@ Example tasks you can try:
# 🧪 Experiments
We provided a script to reproduce the results on GAIA.
You can check the `run_gaia_roleplaying.py` file and run the following command:
To reproduce OWL's GAIA benchmark score of 58.18:
1. Switch to the `gaia58.18` branch:
```bash
git checkout gaia58.18
```
1. Run the evaluation script:
```bash
python run_gaia_roleplaying.py
```