Files
OpenHands/evaluation/benchmarks/tau_bench/README.md
2025-12-31 03:12:50 +00:00

23 lines
413 B
Markdown

# Tau-Bench Evaluation
This directory contains the evaluation scripts for Tau-Bench.
## Setup
First, make sure you have installed the `tau-bench` package:
```bash
pip install tau-bench
```
## Running Evaluation
To run the evaluation, use the following command:
```bash
python evaluation/benchmarks/tau_bench/run_infer.py \
--agent-cls CodeActAgent \
--llm-config <your_llm_config> \
--env retail
```