OpenHands

mirror of https://github.com/OpenHands/OpenHands.git synced 2025-12-26 13:52:43 +08:00

Author	SHA1	Message	Date
Xingyao Wang	a5195b0e65	chore: clean up sandbox and ssh related configs (#3301 ) * clean up sandbox and ssh related stuff * remove ssh hostname * remove ssh hostname * remove ssh password * update config * fix typo that breaks the test	2024-08-08 22:15:40 +00:00
Xingyao Wang	90d0a62469	(arch) Switch default runtime to EventStream Runtime (#3271 ) * switch default to eventstream runtime * remove pull docker from makefile * fix unittest * fix file store path * try deprecate server runtime * remove persist sandbox * move file utils * remove server runtime related workflow * remove unused method * attempt to remove the reliance on filestore for BE * fix async for list file * fix list_files to post * fix list files * add suffix to directory * make sure list file returns abs path; make sure other backend endpoints accpets abs path * remove server runtime test workflow * set git config in runtime	2024-08-08 10:11:49 +08:00
Kaushik Deka	415843476c	Feat: Add Vision Input Support for LLM with Vision Capabilities (#2848 ) * add image feature * fix-linting * check model support for images * add comment * Add image support to other models * Add images to chat * fix linting * fix test issues * refactor variable names and import * fix tests * fix chat message tests * fix linting * add pydantic class message * use message * remove redundant comments * remove redundant comments * change Message class * remove unintended change * fix integration tests using regenerate.sh * rename image_bas64 to images_url, fix tests * rename Message.py to message, change reminder append logic, add unit tests * remove comment, fix error to merge * codeact_swe_agent * fix f string * update eventstream integration tests * add missing if check in codeact_swe_agent * update integration tests * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatMessage.tsx --------- Co-authored-by: tobitege <tobitege@gmx.de> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com>	2024-08-04 02:26:22 +08:00
Xingyao Wang	2e60d25eae	[Agent, LLM] Make sure codeact agent produce message in u/a/u/a order (#3193 ) * make sure codeact agent produce message in u/a/u/a order * integration tests * sync message changes to codeact swe * fix integration tests --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-08-02 00:17:53 +08:00
Engel Nyst	3328669b89	fix Finish action to sent its 'thoughts' in the prompt (#3149 )	2024-07-27 17:37:44 +00:00
tobitege	c0adca1e30	fix DummyAgent (#3137 )	2024-07-26 18:59:25 +00:00
Graham Neubig	98276cf733	Change doc title of agent hub (#3100 ) This PR changes the title of the `agenthub` doc from "Agent Framework Research" to "Agent Hub".	2024-07-25 01:28:40 +00:00
linshaoxin-maker	800e25eac1	Modify codeAct paper link (#3076 )	2024-07-23 20:25:54 +00:00
Xingyao Wang	6b16a5da0b	[Eval,Arch] Update GPTQ eval and add `headless_mode` for Controller (#2994 ) * update and polish gptq eval * fix typo * Update evaluation/gpqa/README.md Co-authored-by: Graham Neubig <neubig@gmail.com> * Update evaluation/gpqa/run_infer.py Co-authored-by: Graham Neubig <neubig@gmail.com> * add headless mode to all appropriate agent controller call * delegate set to error when in headless mode * try to deduplicate a bit * make headless_mode default to True and only change it to false for AgentSession --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-20 03:35:48 +00:00
Graham Neubig	3a21198424	Remove monologue agent (#3036 ) * Remove monologue agent * Fixes	2024-07-19 19:25:05 +00:00
Graham Neubig	c897791024	Refactor LLM config (#2953 ) * Add max_message_chars to LLM * Refactor LLM config * Fix tests * Made some functions class functions * Fix regression * Fixed comments	2024-07-17 09:16:04 -04:00
Anush Kumar V	8f76587e5c	docs: updated docstrings using ruff's autofix feature (#2923 ) * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated docstrings using ruff's autfix feature * Deleted opendevin/runtime/utils/soource.py, Keeping in sync with main --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-16 01:35:33 +00:00
Raj Maheshwari	64be2cb466	[Fix] Minor bug in parse_response of CodeActResponseParser (#2912 )	2024-07-13 14:36:27 +00:00
Xingyao Wang	e45ddeb2a2	arch: deprecating recall action and `search_memory` (#2900 ) * deprecating recall action * fix integration tests * fix integration tests * remove search memory	2024-07-12 19:23:21 +00:00
Xingyao Wang	1b54800a29	[Agent] Improve edits by adding back `edit_file_by_line` (#2722 ) * add replace-based block edit & preliminary test case fix * further fix the insert behavior * make edit only work on first occurence * bump codeact version since we now use new edit agentskills * update prompt for new agentskills * update integration tests * make run_infer.sh executable * remove code block for edit_file * update integration test for prompt changes * default to not use hint for eval * fix insert emptyfile bug * throw value error when `to_replace` is empty * make `_edit_or_insert_file` return string so we can try to fix some linter errors (best attempt) * add todo * update integration test * fix sandbox test for this PR * fix inserting with additional newline * rename to edit_file_by_replace * add back `edit_file_by_line` * update prompt for new editing tool * fix integration tests * bump codeact version since there are more changes * add back append file * fix current line for append * fix append unit tests * change the location where we show edited line no to agent and fix tests * update integration tests * fix global window size affect by open_file bug * fix global window size affect by open_file bug * increase window size to 300 * add file beginning and ending marker to avoid looping * expand the editor window to better display edit error for model * refractor to breakdown edit to internal functions * reduce window to 200 * move window to 100 * refractor to cleanup some logic into _calculate_window_bounds * fix integration tests * fix sandbox test on new prompt * update demonstration with new changes * fix integration * initialize llm inside process_instance to circumvent "AttributeError: Can't pickle local object" * update kwargs * retry for internal server error * fix max iteration * override max iter from config * fix integration tests * remove edit file by line * fix integration tests * add instruction to avoid hanging * Revert "add instruction to avoid hanging" This reverts commit 06fd2c59387c1c2348bc95cb487af1eb913c6ddd. * handle content policy violation error * fix integration tests * fix typo in prompt - the window is 100 * update all integration tests --------- Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Xingyao Wang <xingyao@all-hands.dev>	2024-07-11 15:30:20 +00:00
Boxuan Li	c68478f470	Customize LLM config per agent (#2756 ) Currently, OpenDevin uses a global singleton LLM config and a global singleton agent config. This PR allows customers to configure an LLM config for each agent. A hypothetically useful scenario is to use a cheaper LLM for repo exploration / code search, and a more powerful LLM to actually do the problem solving (CodeActAgent). Partially solves #2075 (web GUI improvement is not the goal of this PR)	2024-07-09 22:05:54 -07:00
Engel Nyst	d37b2973b2	Refactoring: event stream based agent history (#2709 ) * add to event stream sync * remove async from tests * small logging spam fix * remove swe agent * arch refactoring: use history from the event stream * refactor agents * monologue agent * ruff * planner agent * micro-agents * refactor history in evaluations * evals history refactoring * adapt evals and tests * unit testing stuck * testing micro agents, event stream * fix planner agent * fix tests * fix stuck after rename * fix test * small clean up * fix merge * fix merge issue * fix integration tests * Update agenthub/dummy_agent/agent.py * fix tests * rename more clearly; add todo; clean up	2024-07-07 21:04:23 +00:00
Shimada666	d22ff73905	Make the sandbox Python runtime completely transparent (#2796 ) * Make the sandbox Python runtime completely independent * fix source bashrc * add pip install instruction for ipython to fix intergration tests for codeact swe * update integration tests * change flake8 command to (maybe) fix sandbox tests? * make lint support both unittest & sandbox * fix agnostic image build error * refactor build script --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: tobitege <tobitege@gmx.de>	2024-07-06 15:22:44 +02:00
Xingyao Wang	a47713ecb0	[Arch] Remove supports for Background Commands (#2803 ) * depracting docker exec box * remove doc exec from workflow and docs * remove background commands * Update tests/unit/test_sandbox.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * replace for-loop with assignment * fix integration tests * fix integration tests for shell script * fix integration tests * increase max iter to fix some monologue agent issue * fix integration test again * fix integration tests (seems related to run_user issue) --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-06 03:38:05 +08:00
மனோஜ்குமார் பழனிச்சாமி	143f38d25a	Refactored sandbox config and added fast boot (#2455 ) * Refactored sandbox config and added fastboot * added tests * fixed tests * fixed tests * intimate user about breaking change * remove default config from eval * check for lowercase env * add test * Revert Migration * migrate old sandbox configs * resolve merge conflict * revert migration 2 * Revert "remove default config from eval" This reverts commit de57c588dbf29a3327798ce68976e2d2277b8bb1. * change type to box_type * fix var name * linted * lint * lint comments * fix tests * fix tests * fix typo * fix box_type, remove fast_boot * add tests for sandbox config * fix test * update eval docs * small removal comments * adapt toml template * old fields shouldn't be in the app dataclass * fix old keys in app config * clean up exec box --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-05 03:30:21 +00:00
Xingyao Wang	0d3b3ffbf8	[Arch] Removing docker exec box (#2802 ) * depracting docker exec box * remove doc exec from workflow and docs	2024-07-04 23:15:25 +00:00
sven	1b10e2b9d5	Make CodeAct finish task (#2673 ) * Added feature to CodeAct agent to finish action instead of waiting for user input. * Minor change * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com> * updated integration tests with claude-sonnet-3.5 * Update agenthub/codeact_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * updated tests to remove typo in prompt * resolve merge conflicts II * revert unintended change of regenerate script * re-regenerating prompts to resolve merge conflicts --------- Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-07-04 11:21:46 -07:00
Leo	c2f557edde	refactor: multiple code improvements (#2771 )	2024-07-04 18:51:22 +08:00
Xingyao Wang	41ddba84bd	[Agent] (Potentially) improve Editing using `diff` (#2685 ) * add replace-based block edit & preliminary test case fix * further fix the insert behavior * make edit only work on first occurence * bump codeact version since we now use new edit agentskills * update prompt for new agentskills * update integration tests * make run_infer.sh executable * remove code block for edit_file * update integration test for prompt changes * default to not use hint for eval * fix insert emptyfile bug * throw value error when `to_replace` is empty * make `_edit_or_insert_file` return string so we can try to fix some linter errors (best attempt) * add todo * update integration test * fix sandbox test for this PR	2024-07-02 11:50:15 +09:00
Engel Nyst	1975689cd4	remove swe agent (#2708 )	2024-07-01 12:27:14 +09:00
Engel Nyst	e24c52d060	Small refactoring of obs truncation (#2701 ) * refactor truncate_content a bit to be usable by all agents * adjust doc	2024-06-30 12:12:08 +02:00
Boxuan Li	e45b311c35	Remove MAX_CHARS traffic control (#2694 ) * Remove MAX_CHARS limiting * More cleanup	2024-06-29 12:59:41 -07:00
Boxuan Li	7766a3283e	CodeActAgent: Fix delegate history (#2672 )	2024-06-28 16:37:23 +09:00
Boxuan Li	8bce806dce	Tweak prompts of ManagerAgent and CommitWriterAgent (#2609 ) * Tweak prompts of ManagerAgent and CommitWriterAgent * Fix prompts	2024-06-24 00:14:28 -07:00
Engel Nyst	80fe13f4be	rename our completion as a drop-in replacement of litellm completion (#2509 )	2024-06-19 05:25:25 +02:00
tobitege	823298e0d0	fix: Agentskills enhancements (#2384 ) * avoid repeat logging of unneeded messages * refactored append/edit_file (tests next) * agentskills and unit test fixes * testing * more changes and test prompts * smaller changes * final test fixes * remove dead code from test_agent.py * reverting unneeded changes * updated tests, more tweaks to skills * refactor (#2442) * chores: fix DelegatorAgent description (#2446) * change * change comments * fix * stopped container to prevent port issues. (#2447) * chore: remove useless browsing code in CodeActSWEAgent (#2438) * remove useless * fix integration test * Regenerate test_ipython_module artifacts for CodeActSWEAgent --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Merge remote-tracking branch 'upstream/main' into agent-fileops * unneeded tweak * * fix edit_file to not introduce extra newline * updated docstrings with more details for LLM * fix legacy typo in prompts causing ]] instead of ] * several mock files regenerated * Regen'ed CodeActSWEAgent integration tests * fix _print_window signature; explicit exception type in _is_valid_path * splitlines with named param --------- Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-16 15:06:46 -04:00
Yufan Song	426e429b18	chore: remove useless browsing code in CodeActSWEAgent (#2438 ) * remove useless * fix integration test * Regenerate test_ipython_module artifacts for CodeActSWEAgent --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-15 10:53:03 +08:00
Yufan Song	c82666fea3	chores: fix DelegatorAgent description (#2446 ) * change * change comments * fix	2024-06-15 00:04:43 +00:00
Yufan Song	fd29b8faa8	refactor (#2442 )	2024-06-14 19:02:25 -04:00
Yufan Song	7f8c324d3a	Refactor CodeActSWEAgent, add response parser (#2368 ) * refactor code * fix * seperate code * Update agenthub/codeact_swe_agent/response_parser.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/codeact_swe_agent/response_parser.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/codeact_swe_agent/action_parser.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * remove browsing --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-13 04:14:05 +00:00
Yufan Song	0c92144220	Refactor MonologueAgent, PlannerAgent add response parser (#2400 ) * refactor monologue * refactor planner_agent * fix bug * add back code * add back code	2024-06-13 12:00:27 +08:00
Yufan Song	c6951eb6c1	refactor browsing agent response parse (#2366 ) * refactor browsing * add comments * change file name * Rename resposne_parser.py to response_parser.py * Fixed typos * Typo fix --------- Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-11 03:46:33 +00:00
tobitege	9605106e72	feat: append_file incl. all tests [agentskills] (#2346 ) * new skill: append_file incl. all tests * more tests needed caring * file_name for append_file/edit_file; updated tests	2024-06-10 17:18:40 +00:00
Yufan Song	f7491bd2fa	Refactor response to action in agent step (#2350 ) * refactor action parser * Fix typos * fix typo --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-10 10:17:30 +00:00
Temo	e925cefeef	Refactored prompt.py to reduce token usage (#1996 ) * Refactored prompt.py to reduce token usage * Reverted some destructive changes * Update agenthub/codeact_agent/prompt.py * Update agenthub/codeact_agent/prompt.py * Update agenthub/codeact_agent/prompt.py * Update agenthub/codeact_agent/prompt.py * Update agenthub/codeact_agent/prompt.py * Update agenthub/codeact_agent/prompt.py * Update agenthub/codeact_agent/prompt.py * Apply suggestions from code review * Apply suggestions from code review * Update agenthub/codeact_agent/prompt.py * fix integration test * make lint * feat: support ToolQA benchmark (#2263) * Add files via upload * Update README.md * Update run_infer.py * Update utils.py * make lint * Update evaluation/toolqa/run_infer.py --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: yufansong <yufan@risingwave-labs.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * feat: revert hiden special paths change in file action (#2328) * revert change in file action * remove useless code * make lint * Support gpqa benchmark evaluation (#2080) * feat: add gpqa benchmark evaluation * add metrics * reset configs in final block * make lint --------- Co-authored-by: yufansong <yufan@risingwave-labs.com> * fix(frontend): prevent API key from resetting after modal change (#2329) * remove bottom chatbox fade * Modal wider; fix lint error * settings: attempt to not clear api key for same provider * prevent api key from resetting after changing the model * revert other changes and fix post test tear down error --------- Co-authored-by: amanape <83104063+amanape@users.noreply.github.com> * fix: codeact bug [If running a command that never returns, it gets stuck #1895] (#2034) * fix: codeact bug https://github.com/OpenDevin/OpenDevin/issues/1895 * fix: add CmdRunAction timeout hint. * Update agenthub/codeact_agent/prompt.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * regenerate integration test --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: yufansong <yufan@risingwave-labs.com> * Feat: Support Gorilla APIBench (#2081) * removed unused files from gorilla * Update run_infer.py, removed unused imports * Update utils.py * Update ast_eval_hf.py * Update ast_eval_tf.py * Update ast_eval_th.py * Create README.md * Update run_infer.py * make lint * Update run_infer.py * fix lint --------- Co-authored-by: yufansong <yufan@risingwave-labs.com> * remote useless (#2332) * fix integration test * Update agenthub/codeact_agent/prompt.py * Update agenthub/codeact_agent/prompt.py * fix integration test --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Frank Xu <frankxu2004@gmail.com> Co-authored-by: yufansong <yufan@risingwave-labs.com> Co-authored-by: yueqis <141804823+yueqis@users.noreply.github.com> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Yufan Song <33971064+yufansong@users.noreply.github.com> Co-authored-by: Jaskirat Singh <1.jaskiratsingh@gmail.com> Co-authored-by: tobitege <tobitege@gmx.de> Co-authored-by: amanape <83104063+amanape@users.noreply.github.com> Co-authored-by: Aaron Xia <zhhuaxia@gmail.com> Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-06-09 10:19:05 -07:00
Frank Xu	bd00f0f049	Restore previous browsing agent behavior when evaluating on WebArena and miniwob++ only (#2341 ) * restore eval mode * fix	2024-06-09 04:10:02 -04:00
Boxuan Li	a9a2f10170	Revamp AgentRejectAction and allow ManagerAgent to handle rejection (#1735 ) * Fix AgentRejectAction handling * Add ManagerAgent to integration tests * Fix regenerate.sh * Fix merge * Update README for micro-agents * Add test reject to regenerate.sh * regenerate.sh: Add support for running a specific test and/or agent * Refine reject schema, and allow ManagerAgent to handle reject * Add test artifacts for test_simple_task_rejection * Fix manager agent tests * Fix README * test_simple_task_rejection: check final agent state * Integration test: exit if mock prompt not found * Update test_simple_task_rejection tests * Fix test_edits test artifacts after prompt update * Fix ManagerAgent test_edits * WIP * Fix tests * update test_edits for ManagerAgent * Skip local sandbox for reject test * Fix test comparison	2024-06-08 23:12:30 -07:00
Aaron Xia	b5a17efc45	fix: codeact bug [If running a command that never returns, it gets stuck #1895 ] (#2034 ) * fix: codeact bug https://github.com/OpenDevin/OpenDevin/issues/1895 * fix: add CmdRunAction timeout hint. * Update agenthub/codeact_agent/prompt.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * regenerate integration test --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: yufansong <yufan@risingwave-labs.com>	2024-06-08 16:40:23 +00:00
tobitege	b431fce938	tests: more Agentskills tests; updated .gitignore (#2307 ) * added tests related to backticks * updated .gitignore * added extra linter test for #2210 * hotfix for integration test --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-06-07 16:29:03 +00:00
Boxuan Li	45ce09d70e	CodeActAgent: Delegate to BrowsingAgent for browsing tasks (#2103 )	2024-06-07 00:53:47 -07:00
Aleksandar	b0b19e6c25	Update AgentHubREADME.md (#2290 ) Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com>	2024-06-06 11:14:41 +00:00
Frank Xu	48151bdbb0	[feat] WebArena benchmark, MiniWoB++ benchmark and related arch changes (#2170 ) * add webarena, and revamp messaging for webarena eval * add changes for browsergym * update infer script * fix unit tests * update * add multiple run for miniwob * update instruction, remove personal path * update * add code for getting final reward, fix integration, add results * add avg cost calculation	2024-06-06 09:01:20 +08:00
RainRat	3b0e1361a4	fix typos (#2267 ) * fix typos no functional change * fix typos * fix typos * fix integration test --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Leo <ifuryst@gmail.com> Co-authored-by: yufansong <yufan@risingwave-labs.com>	2024-06-05 23:06:40 +08:00
Leo	9ada36e30b	fix: restore python linting. (#2228 ) * fix: restore python linting. Signed-off-by: ifuryst <ifuryst@gmail.com> * update: extend the Python lint check to evaluation. Signed-off-by: ifuryst <ifuryst@gmail.com> * Update evaluation/logic_reasoning/instruction.txt --------- Signed-off-by: ifuryst <ifuryst@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-06-04 06:36:19 +00:00
RainRat	ed6dcc8381	fix typos (#2187 ) * fix typos no functional change * fix typos	2024-06-01 20:40:30 +00:00

200 Commits