6 Commits

Author SHA1 Message Date
Mateusz Kwiatkowski
6562297615
Replace shebang with /usr/bin/env bash for improved portability (#6876)
Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>
2025-02-24 18:07:28 +00:00
Boxuan Li
4443417c75
A few fixes for TAC evaluation harness (#6586) 2025-02-14 21:01:57 -08:00
Boxuan Li
ef12bc5381
Evaluation harness: Add agent config option (#6662) 2025-02-13 15:05:03 -05:00
Boxuan Li
62402cd617
The-Agent-Company evaluation harness: Support splits (#6577) 2025-02-02 13:12:01 +08:00
Boxuan Li
6a4442e590
[Evaluation] Add summarise_results script for TheAgentCompany benchmark (#5811) 2024-12-27 20:33:41 -08:00
Boxuan Li
b1719bb3db
Add TheAgentCompany evaluation harness (#5731) 2024-12-22 14:12:30 -05:00