OpenHands

mirror of https://github.com/OpenHands/OpenHands.git synced 2025-12-26 05:48:36 +08:00

Author	SHA1	Message	Date
Robert Brennan	01ae22ef57	Rename OpenDevin to OpenHands (#3472 ) * Replace OpenDevin with OpenHands * Update CONTRIBUTING.md * Update README.md * Update README.md * update poetry lock; move opendevin folder to openhands * fix env var * revert image references in docs * revert permissions * revert permissions --------- Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-08-20 00:44:54 +08:00
Engel Nyst	92b1a2da5c	Refactor agent to accept agent config (#3430 ) * refactor agents to receive their agent config * add unit test * fix test * fix tests	2024-08-17 18:11:30 +02:00
Xingyao Wang	a5195b0e65	chore: clean up sandbox and ssh related configs (#3301 ) * clean up sandbox and ssh related stuff * remove ssh hostname * remove ssh hostname * remove ssh password * update config * fix typo that breaks the test	2024-08-08 22:15:40 +00:00
Kaushik Deka	415843476c	Feat: Add Vision Input Support for LLM with Vision Capabilities (#2848 ) * add image feature * fix-linting * check model support for images * add comment * Add image support to other models * Add images to chat * fix linting * fix test issues * refactor variable names and import * fix tests * fix chat message tests * fix linting * add pydantic class message * use message * remove redundant comments * remove redundant comments * change Message class * remove unintended change * fix integration tests using regenerate.sh * rename image_bas64 to images_url, fix tests * rename Message.py to message, change reminder append logic, add unit tests * remove comment, fix error to merge * codeact_swe_agent * fix f string * update eventstream integration tests * add missing if check in codeact_swe_agent * update integration tests * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatInput.tsx * Update frontend/src/components/chat/ChatMessage.tsx --------- Co-authored-by: tobitege <tobitege@gmx.de> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com>	2024-08-04 02:26:22 +08:00
Graham Neubig	c897791024	Refactor LLM config (#2953 ) * Add max_message_chars to LLM * Refactor LLM config * Fix tests * Made some functions class functions * Fix regression * Fixed comments	2024-07-17 09:16:04 -04:00
Anush Kumar V	8f76587e5c	docs: updated docstrings using ruff's autofix feature (#2923 ) * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated documentation using ruff's autofix feature * Updated pyproject.toml to include docstring validations * Updated docstrings using ruff's autfix feature * Deleted opendevin/runtime/utils/soource.py, Keeping in sync with main --------- Co-authored-by: Graham Neubig <neubig@gmail.com>	2024-07-16 01:35:33 +00:00
Xingyao Wang	e45ddeb2a2	arch: deprecating recall action and `search_memory` (#2900 ) * deprecating recall action * fix integration tests * fix integration tests * remove search memory	2024-07-12 19:23:21 +00:00
Boxuan Li	c68478f470	Customize LLM config per agent (#2756 ) Currently, OpenDevin uses a global singleton LLM config and a global singleton agent config. This PR allows customers to configure an LLM config for each agent. A hypothetically useful scenario is to use a cheaper LLM for repo exploration / code search, and a more powerful LLM to actually do the problem solving (CodeActAgent). Partially solves #2075 (web GUI improvement is not the goal of this PR)	2024-07-09 22:05:54 -07:00
Engel Nyst	d37b2973b2	Refactoring: event stream based agent history (#2709 ) * add to event stream sync * remove async from tests * small logging spam fix * remove swe agent * arch refactoring: use history from the event stream * refactor agents * monologue agent * ruff * planner agent * micro-agents * refactor history in evaluations * evals history refactoring * adapt evals and tests * unit testing stuck * testing micro agents, event stream * fix planner agent * fix tests * fix stuck after rename * fix test * small clean up * fix merge * fix merge issue * fix integration tests * Update agenthub/dummy_agent/agent.py * fix tests * rename more clearly; add todo; clean up	2024-07-07 21:04:23 +00:00
Xingyao Wang	a47713ecb0	[Arch] Remove supports for Background Commands (#2803 ) * depracting docker exec box * remove doc exec from workflow and docs * remove background commands * Update tests/unit/test_sandbox.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * replace for-loop with assignment * fix integration tests * fix integration tests for shell script * fix integration tests * increase max iter to fix some monologue agent issue * fix integration test again * fix integration tests (seems related to run_user issue) --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-06 03:38:05 +08:00
மனோஜ்குமார் பழனிச்சாமி	143f38d25a	Refactored sandbox config and added fast boot (#2455 ) * Refactored sandbox config and added fastboot * added tests * fixed tests * fixed tests * intimate user about breaking change * remove default config from eval * check for lowercase env * add test * Revert Migration * migrate old sandbox configs * resolve merge conflict * revert migration 2 * Revert "remove default config from eval" This reverts commit de57c588dbf29a3327798ce68976e2d2277b8bb1. * change type to box_type * fix var name * linted * lint * lint comments * fix tests * fix tests * fix typo * fix box_type, remove fast_boot * add tests for sandbox config * fix test * update eval docs * small removal comments * adapt toml template * old fields shouldn't be in the app dataclass * fix old keys in app config * clean up exec box --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-07-05 03:30:21 +00:00
Xingyao Wang	0d3b3ffbf8	[Arch] Removing docker exec box (#2802 ) * depracting docker exec box * remove doc exec from workflow and docs	2024-07-04 23:15:25 +00:00
Boxuan Li	e45b311c35	Remove MAX_CHARS traffic control (#2694 ) * Remove MAX_CHARS limiting * More cleanup	2024-06-29 12:59:41 -07:00
Boxuan Li	8bce806dce	Tweak prompts of ManagerAgent and CommitWriterAgent (#2609 ) * Tweak prompts of ManagerAgent and CommitWriterAgent * Fix prompts	2024-06-24 00:14:28 -07:00
Engel Nyst	80fe13f4be	rename our completion as a drop-in replacement of litellm completion (#2509 )	2024-06-19 05:25:25 +02:00
Boxuan Li	a9a2f10170	Revamp AgentRejectAction and allow ManagerAgent to handle rejection (#1735 ) * Fix AgentRejectAction handling * Add ManagerAgent to integration tests * Fix regenerate.sh * Fix merge * Update README for micro-agents * Add test reject to regenerate.sh * regenerate.sh: Add support for running a specific test and/or agent * Refine reject schema, and allow ManagerAgent to handle reject * Add test artifacts for test_simple_task_rejection * Fix manager agent tests * Fix README * test_simple_task_rejection: check final agent state * Integration test: exit if mock prompt not found * Update test_simple_task_rejection tests * Fix test_edits test artifacts after prompt update * Fix ManagerAgent test_edits * WIP * Fix tests * update test_edits for ManagerAgent * Skip local sandbox for reject test * Fix test comparison	2024-06-08 23:12:30 -07:00
RainRat	3b0e1361a4	fix typos (#2267 ) * fix typos no functional change * fix typos * fix typos * fix integration test --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Leo <ifuryst@gmail.com> Co-authored-by: yufansong <yufan@risingwave-labs.com>	2024-06-05 23:06:40 +08:00
Boxuan Li	9b371b1b5f	Refactor agent delegation and tweak micro agents (#1910 ) This PR fixes #1897. In addition, this PR fixes and tweaks a few micro-agents. For the first time, I am able to use ManagerAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds ManagerAgent as part of integration tests. test_write_simple_script involves delegation to CoderAgent while test_edits involves delegation to TypoFixerAgent. Also for the first time, I am able to use DelegateAgent to complete test_write_simple_script and test_edits tasks in integration tests, so this PR also adds DelegateAgent as part of integration tests. It involves delegation to StudyRepoForTaskAgent, CoderAgent and VerifierAgent. This PR is a blocker for #1735 and likely #1945.	2024-05-28 20:01:16 -07:00
Engel Nyst	b9a5be2569	Add ruff for shared mutable defaults (B) (#1938 ) * Add ruff for shared mutable defaults (B) * Apply B006, B008 on current files, except fast API * Update agenthub/SWE_agent/prompts.py Co-authored-by: Graham Neubig <neubig@gmail.com> * fix unintended behavior change * this is correct, tell Ruff to leave it alone --------- Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-22 20:06:00 -07:00
Yufan Song	d18e6c85a0	feat: add metrics related to cost for better observability (#1944 ) * add metrics for total_cost * make lint * refact codeact * change metrics into llm * add costs list, add into state * refactor log completion * refactor and test others * make lint * Update opendevin/core/metrics.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update opendevin/llm/llm.py Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> * refactor * add code --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>	2024-05-22 08:53:31 +00:00
Boxuan Li	b845a38169	Small improvements & fixes to SWE-Bench (#1874 ) I was able to run a few benchmark instances from SWE-Bench by myself following the documentation - it was great! In general the experience was smooth, thanks to @xingyaoww, @libowen2121 and the team! I made a few small enhancements and fixes to further improve the developer experience. Always use poetry run python (using python from poetry's virtual environment) over python or python3 in scripts to make sure the behavior is consistent. Make AGENT configurable. One can use an argument to control which agent they would like to benchmark. To facilitate this, I removed hardcoded CodeActAgent from run_infer.sh, and also added VERSION attribute to all agents, as the benchmark needs to record the agent version. Make EVAL_LIMIT configurable. One can use an argument to control how many instances they'd like to benchmark. Useful for debugging & development purposes. Fix 'eval_output_dir' not defined error in run_infer.py. Other enhancements to the README file and logs. I also notice that a lot of code from run_infer.py could be shared by other benchmarks, but since we only have one benchmark now, I think we could avoid over-engineering. A refactor and code dedup would be useful in the future once we have more benchmarks, though.	2024-05-20 08:03:30 +00:00
Xia Zhenhua	76abca361c	feat: simplify state.history with to_memory call in micro-agent. Or the call to LLM may exceed the token limit. (#1806 ) * feat: simplify state.history with to_memory call in micro-agent. * feat: merge master and replace to_memory with event_to_memory. --------- Co-authored-by: aaren.xzh <aaren.xzh@antfin.com>	2024-05-15 14:47:37 +02:00
Xia Zhenhua	bf14b47890	feat: make other agents support asking user input in MessageAction. (#1777 ) * feat: make other agents support asking user input in MessageAction. * Update agenthub/micro/_instructions/actions/message.md Co-authored-by: Robert Brennan <accounts@rbren.io> * Update agenthub/micro/_instructions/actions/message.md Co-authored-by: Robert Brennan <accounts@rbren.io> * feat: make other agents support asking user input in MessageAction. * Regenerate test artifacts --------- Co-authored-by: aaren.xzh <aaren.xzh@antfin.com> Co-authored-by: Robert Brennan <accounts@rbren.io> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-15 00:44:45 -07:00
Robert Brennan	dcb5d1ce0a	Add permanent storage option for EventStream (#1697 ) * add storage classes * add minio * add event stream storage * storage test working * use fixture * event stream test passing * better serialization * factor out serialization pkg * move more serialization * fix tests * fix test * remove __all__ * add rehydration test * add more rehydration test * fix fixture * fix dict init * update tests * lock * regenerate tests * Update opendevin/events/stream.py * revert tests * revert old integration tests * only add fields if present * regen tests * pin pyarrow * fix unit tests * remove cause from memories * revert tests * regen tests	2024-05-14 11:09:45 -04:00
Robert Brennan	beb74a19f6	Use event stream for the runtime (#1776 ) * rebuild PR from scratch * fix max_iter * regenerate tests * cut down on history * Update opendevin/controller/agent_controller.py * regenerate tests * revert swe agent * revert some codeact chagnes * regenerate tests * add source to dict * only add source if not none * try to fix coverage issue * lock * add gevent	2024-05-14 13:35:25 +00:00
Robert Brennan	b028bd46bb	Use messages to drive tasks (#1688 ) * finish is working * start reworking main_goal * remove main_goal from microagents * remove main_goal from other agents * fix issues * revert codeact line * make plan a subclass of task * fix frontend for new plan setup * lint * fix type * more lint * fix build issues * fix codeact mgs * fix edge case in regen script * fix task validation errors * regenerate integration tests * fix up tests * fix sweagent * revert codeact prompt * update integration tests * update integration tests * handle loading state * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/agent_controller.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update agenthub/codeact_agent/codeact_agent.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * Update opendevin/controller/state/plan.py Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> * update docs * regenerate tests * remove none from state type * revert test files * update integration tests * rename plan to root_task * revert plugin perms * regen integration tests * tweak integration script * prettier * fix test * set workspace up for regeneration * regenerate tests * Change directory of copy * Updated tests * Disable PlannerAgent test * Fix listen * Updated prompts * Disable planner again * Make codecov more lenient * Update agenthub/README.md * Update opendevin/server/README.md * re-enable planner tests * finish top level tasks * regen planner * fix root task factory --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com> Co-authored-by: Xingyao Wang <xingyao6@illinois.edu> Co-authored-by: Graham Neubig <neubig@gmail.com> Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>	2024-05-13 23:14:15 +00:00
Engel Nyst	e5f1dbf5e7	Move json utility to the custom json parsing; apply it to the monologue-like agents (#1740 )	2024-05-12 13:39:38 -04:00
Engel Nyst	98adbf54ec	Small refactoring (#1614 ) * move MemoryCondenser, LongTermMemory, json, out of the monologue * PlannerAgent and Microagents use the custom json.loads/dumps * Move short term history out of monologue agent... * move memory in their package * add __init__	2024-05-11 17:15:19 +02:00
Jim Su	f8d4b1ab0d	Use generic types (#1680 )	2024-05-10 04:21:22 +02:00
Xia Zhenhua	4a72e83938	fix: AgentThinkAction deleted caused bug. (#1662 ) * fix: AgentThinkAction deleted caused bug. * fix: AgentThinkAction deleted caused bug in plannerAgent. * fix: plan content-not-changed caused frontend crash bug. --------- Co-authored-by: aaren.xzh <aaren.xzh@antfin.com>	2024-05-09 09:04:02 -04:00
Boxuan Li	af5bdf67aa	Add AgentRejectAction across multiple modules (#1615 ) * Add AgentRejectAction across multiple modules This commit introduces the AgentRejectAction class and integrates it across various modules and actions. It includes updates to READMEs, action definitions, and agent controllers to handle the new 'reject' action. This functionality will allow agents to properly signal task rejection. * Fix unit test * Remove wrong generates attributes from a few micro-agents	2024-05-08 10:03:14 -07:00
Aleksandar	4bf4119259	Introduce TypoFixerAgent for in-place typo corrections in agenthub/micro (#1613 ) * Add TypoFixerAgent micro-agent to fix typos * Improve parse_response to accurately extract the first complete JSON object * Add tests for parse_response function handling complex scenarios * Fix tests and logic to use action_from_dict * Fix small formatting issues	2024-05-07 13:25:35 +02:00
Frank Xu	26dcf4fd7c	remove screenshot in browser observation (#1588 ) * remove screenshot in browser observation * refactor utils * allow only dict * fix screenshot not showing up in frontend --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-05-06 13:56:28 -04:00
Xia Zhenhua	2d24521222	fix: file in _instructions directory not set correctly. (#1602 ) Co-authored-by: aaren.xzh <aaren.xzh@antfin.com>	2024-05-06 08:09:03 -04:00
Graham Neubig	74e159add6	Remove screenshot from microagent prompt (#1550 ) * Remove screenshot from microagent prompt * Update recursive search * Update to handle various data types	2024-05-03 09:34:39 -04:00
RainRat	3cdff79173	fix typos (#1537 )	2024-05-03 09:41:32 +03:00
Robert Brennan	fadcdc117e	Migrate to new folder structure in preparation for refactor (#1531 ) * fix up folder structure * update docs * fix imports * fix imports * fix imoprt * fix imports * fix imports * fix imports * fix test import * fix tests * fix main import	2024-05-02 17:01:54 +00:00
Robert Brennan	ce7c7eaae4	Refactor actions and observations (#1479 ) * refactor actions and events * remove type_key * remove stream * move import * move import * fix NullObs * reorder imports * fix lint * fix dataclasses * remove blank fields * fix nullobs * fix sidebar labels * fix test compilation * switch to asdict * lint * fix whitespace * fix executable * delint * fix run * remove NotImplementeds * fix path prefix * remove null files * add debug * add more debug info * fix dataclass on null * remove debug * revert sandbox * fix merge issues * fix tyeps * Update opendevin/events/action/browse.py	2024-05-02 15:44:54 +00:00
Boxuan Li	11d8253215	Add new CommitWriterAgent to auto-generate commit messages from staged diffs (#1484 ) * Add new CommitWriterAgent to auto-generate commit messages from staged diffs This commit introduces the CommitWriterAgent along with its configuration and detailed task description. The agent is designed to analyze git diffs staged for commit and automatically generate succinct and relevant commit messages. * Remove devnote section from yaml and add README	2024-05-02 09:42:55 -04:00
Boxuan Li	30e10e7a4a	Micro-agent: Remove planner specific actions (#1515 )	2024-05-02 08:01:58 -04:00
Boxuan Li	c7dd443fa2	CoderAgent: Render summary prompt conditionally (#1461 ) * CoderAgent: Render repo summary conditionally * Add unittests --------- Co-authored-by: Robert Brennan <accounts@rbren.io>	2024-05-01 15:40:20 +00:00
Jirka Borovec	0c2ebfd6e1	Ruff: use I rule for isort (#1410 ) Ruff: use I rule for isort	2024-04-29 15:41:58 -07:00
Boxuan Li	319b9ac0f3	Fix micro-agents schema bug (#1424 ) * Fix micro agents definitions * Add tests for micro agents * Add to CI * Revert "Add to CI" This reverts commit 94f3b4e7c8408a1b0267f3847cbaefdcd995db05. * Remove test artifacts for ManagerAgent --------- Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-04-29 10:34:03 -04:00
Robert Brennan	5543fe2c3e	fix up prompts (#1345 )	2024-04-24 18:30:18 -04:00
Robert Brennan	1e95fa435d	Microagents and Delegation (#1238 ) * basic microagent structure * start on jinja * add instructions parser * add action instructions * add history instructions * fix a few issues * fix a few issues * fix issues * fix agent encoding * fix up anon class * prompt to fix errors * less debug info when errors happen * add another traceback * add output to finish * fix math prompt * fix pg prompt * fix up json prompt * fix math prompt * fix math prompt * fix repo prompt * fix up repo explorer * update lock * revert changes to agent_controller * refactor microagent registration a bit * create delegate action * delegation working * add finish action to manager * fix tests * rename microagents registry * rename fn * logspam * add metadata to manager agent * fix message * move repo_explorer * add delegator agent * rename agent_definition * fix up input-output plumbing * fix tests * Update agenthub/micro/math_agent/agent.yaml Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/delegator_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/delegator_agent/prompt.py Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * remove prompt.py * fix lint * Update agenthub/micro/postgres_agent/agent.yaml Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * Update agenthub/micro/postgres_agent/agent.yaml Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> * fix error --------- Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>	2024-04-24 17:46:14 -04:00

45 Commits