mirror of
https://github.com/OpenHands/OpenHands.git
synced 2026-03-22 05:37:20 +08:00
413 B
413 B
Tau-Bench Evaluation
This directory contains the evaluation scripts for Tau-Bench.
Setup
First, make sure you have installed the tau-bench package:
pip install tau-bench
Running Evaluation
To run the evaluation, use the following command:
python evaluation/benchmarks/tau_bench/run_infer.py \
--agent-cls CodeActAgent \
--llm-config <your_llm_config> \
--env retail