Files
OpenHands/evaluation/benchmarks/tau_bench/README.md
2025-12-31 03:12:50 +00:00

413 B

Tau-Bench Evaluation

This directory contains the evaluation scripts for Tau-Bench.

Setup

First, make sure you have installed the tau-bench package:

pip install tau-bench

Running Evaluation

To run the evaluation, use the following command:

python evaluation/benchmarks/tau_bench/run_infer.py \
  --agent-cls CodeActAgent \
  --llm-config <your_llm_config> \
  --env retail