Robert Brennan
01ae22ef57
Rename OpenDevin to OpenHands ( #3472 )
...
* Replace OpenDevin with OpenHands
* Update CONTRIBUTING.md
* Update README.md
* Update README.md
* update poetry lock; move opendevin folder to openhands
* fix env var
* revert image references in docs
* revert permissions
* revert permissions
---------
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
2024-08-20 00:44:54 +08:00
Xingyao Wang
a2ea17909d
chore: remove deprecated RuntimeTool ( #3443 )
2024-08-18 09:45:45 +08:00
Engel Nyst
92b1a2da5c
Refactor agent to accept agent config ( #3430 )
...
* refactor agents to receive their agent config
* add unit test
* fix test
* fix tests
2024-08-17 18:11:30 +02:00
Kaushik Deka
415843476c
Feat: Add Vision Input Support for LLM with Vision Capabilities ( #2848 )
...
* add image feature
* fix-linting
* check model support for images
* add comment
* Add image support to other models
* Add images to chat
* fix linting
* fix test issues
* refactor variable names and import
* fix tests
* fix chat message tests
* fix linting
* add pydantic class message
* use message
* remove redundant comments
* remove redundant comments
* change Message class
* remove unintended change
* fix integration tests using regenerate.sh
* rename image_bas64 to images_url, fix tests
* rename Message.py to message, change reminder append logic, add unit tests
* remove comment, fix error to merge
* codeact_swe_agent
* fix f string
* update eventstream integration tests
* add missing if check in codeact_swe_agent
* update integration tests
* Update frontend/src/components/chat/ChatInput.tsx
* Update frontend/src/components/chat/ChatInput.tsx
* Update frontend/src/components/chat/ChatInput.tsx
* Update frontend/src/components/chat/ChatInput.tsx
* Update frontend/src/components/chat/ChatMessage.tsx
---------
Co-authored-by: tobitege <tobitege@gmx.de>
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com>
2024-08-04 02:26:22 +08:00
Graham Neubig
3a21198424
Remove monologue agent ( #3036 )
...
* Remove monologue agent
* Fixes
2024-07-19 19:25:05 +00:00
Graham Neubig
c897791024
Refactor LLM config ( #2953 )
...
* Add max_message_chars to LLM
* Refactor LLM config
* Fix tests
* Made some functions class functions
* Fix regression
* Fixed comments
2024-07-17 09:16:04 -04:00
Anush Kumar V
8f76587e5c
docs: updated docstrings using ruff's autofix feature ( #2923 )
...
* Updated documentation using ruff's autofix feature
* Updated pyproject.toml to include docstring validations
* Updated documentation using ruff's autofix feature
* Updated pyproject.toml to include docstring validations
* Updated docstrings using ruff's autfix feature
* Deleted opendevin/runtime/utils/soource.py, Keeping in sync with main
---------
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-07-16 01:35:33 +00:00
Xingyao Wang
e45ddeb2a2
arch: deprecating recall action and search_memory ( #2900 )
...
* deprecating recall action
* fix integration tests
* fix integration tests
* remove search memory
2024-07-12 19:23:21 +00:00
Boxuan Li
e45b311c35
Remove MAX_CHARS traffic control ( #2694 )
...
* Remove MAX_CHARS limiting
* More cleanup
2024-06-29 12:59:41 -07:00
Engel Nyst
80fe13f4be
rename our completion as a drop-in replacement of litellm completion ( #2509 )
2024-06-19 05:25:25 +02:00
Yufan Song
0c92144220
Refactor MonologueAgent, PlannerAgent add response parser ( #2400 )
...
* refactor monologue
* refactor planner_agent
* fix bug
* add back code
* add back code
2024-06-13 12:00:27 +08:00
Aaron Xia
42c6b506b5
Lazy launching BrowseEnv / making BrowseEnv optional ( #2155 )
...
* feat: lazy launching browser; browser optional for diffrent agents.
* style: lint
* fix: integration test fail due to browser not started.
* fix: run by cli and integration test failed.
* fix: lint
* fix: lint
---------
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-05-31 16:40:42 -04:00
Yufan Song
d18e6c85a0
feat: add metrics related to cost for better observability ( #1944 )
...
* add metrics for total_cost
* make lint
* refact codeact
* change metrics into llm
* add costs list, add into state
* refactor log completion
* refactor and test others
* make lint
* Update opendevin/core/metrics.py
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
* Update opendevin/llm/llm.py
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
* refactor
* add code
---------
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
2024-05-22 08:53:31 +00:00
Boxuan Li
b845a38169
Small improvements & fixes to SWE-Bench ( #1874 )
...
I was able to run a few benchmark instances from SWE-Bench by myself following the documentation - it was great! In general the experience was smooth, thanks to @xingyaoww, @libowen2121 and the team! I made a few small enhancements and fixes to further improve the developer experience.
Always use poetry run python (using python from poetry's virtual environment) over python or python3 in scripts to make sure the behavior is consistent.
Make AGENT configurable. One can use an argument to control which agent they would like to benchmark. To facilitate this, I removed hardcoded CodeActAgent from run_infer.sh, and also added VERSION attribute to all agents, as the benchmark needs to record the agent version.
Make EVAL_LIMIT configurable. One can use an argument to control how many instances they'd like to benchmark. Useful for debugging & development purposes.
Fix 'eval_output_dir' not defined error in run_infer.py.
Other enhancements to the README file and logs.
I also notice that a lot of code from run_infer.py could be shared by other benchmarks, but since we only have one benchmark now, I think we could avoid over-engineering. A refactor and code dedup would be useful in the future once we have more benchmarks, though.
2024-05-20 08:03:30 +00:00
Robert Brennan
b028bd46bb
Use messages to drive tasks ( #1688 )
...
* finish is working
* start reworking main_goal
* remove main_goal from microagents
* remove main_goal from other agents
* fix issues
* revert codeact line
* make plan a subclass of task
* fix frontend for new plan setup
* lint
* fix type
* more lint
* fix build issues
* fix codeact mgs
* fix edge case in regen script
* fix task validation errors
* regenerate integration tests
* fix up tests
* fix sweagent
* revert codeact prompt
* update integration tests
* update integration tests
* handle loading state
* Update agenthub/codeact_agent/codeact_agent.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
* Update opendevin/controller/agent_controller.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
* Update agenthub/codeact_agent/codeact_agent.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
* Update opendevin/controller/state/plan.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
* update docs
* regenerate tests
* remove none from state type
* revert test files
* update integration tests
* rename plan to root_task
* revert plugin perms
* regen integration tests
* tweak integration script
* prettier
* fix test
* set workspace up for regeneration
* regenerate tests
* Change directory of copy
* Updated tests
* Disable PlannerAgent test
* Fix listen
* Updated prompts
* Disable planner again
* Make codecov more lenient
* Update agenthub/README.md
* Update opendevin/server/README.md
* re-enable planner tests
* finish top level tasks
* regen planner
* fix root task factory
---------
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
Co-authored-by: Graham Neubig <neubig@gmail.com>
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
2024-05-13 23:14:15 +00:00
Jim Su
f8d4b1ab0d
Use generic types ( #1680 )
2024-05-10 04:21:22 +02:00
Robert Brennan
fadcdc117e
Migrate to new folder structure in preparation for refactor ( #1531 )
...
* fix up folder structure
* update docs
* fix imports
* fix imports
* fix imoprt
* fix imports
* fix imports
* fix imports
* fix test import
* fix tests
* fix main import
2024-05-02 17:01:54 +00:00
Robert Brennan
ce7c7eaae4
Refactor actions and observations ( #1479 )
...
* refactor actions and events
* remove type_key
* remove stream
* move import
* move import
* fix NullObs
* reorder imports
* fix lint
* fix dataclasses
* remove blank fields
* fix nullobs
* fix sidebar labels
* fix test compilation
* switch to asdict
* lint
* fix whitespace
* fix executable
* delint
* fix run
* remove NotImplementeds
* fix path prefix
* remove null files
* add debug
* add more debug info
* fix dataclass on null
* remove debug
* revert sandbox
* fix merge issues
* fix tyeps
* Update opendevin/events/action/browse.py
2024-05-02 15:44:54 +00:00
Jirka Borovec
0c2ebfd6e1
Ruff: use I rule for isort ( #1410 )
...
Ruff: use I rule for isort
2024-04-29 15:41:58 -07:00
Boxuan Li
e0c7492609
Traffic Control: Add new config MAX_CHARS ( #1015 )
...
* Add new config MAX_CHARS
* Fix mypy linting issues
2024-04-12 19:01:52 +00:00
Jack Quimby
d6128941b7
Doc: Document difference between agents ( #722 )
...
* doc: Guide for using local LLM with Ollama
* forgot to delete print statement
* typos
* Updated guide - new working method
* Move to docs folder
* Fixed front end overwrite local model name
* Update llm.py
* Delete docs/examples/images/example.png
deleted example.png
* Documentation of agent differences
* rename examples to documentation
* Docstrings for all agents
* typo fix
* typo fixes
* Typo fixes
* more typo fixes
* typo fix
* typo fixes
* typos fixed
* Typo fixes
* top 10 list
* typo fix
* typo fix
* typos to the moon
* typos fixed
* typo fix
* typo fix
* anotha one
* The rest of the typos
* Corrected agent descriptions
* Agents markdown updated
---------
Co-authored-by: Robert Brennan <accounts@rbren.io>
2024-04-05 12:51:25 -05:00
Robert Brennan
a6f0c066b5
Implement Planning ( #267 )
...
* add outline of agent
* add plan class
* add initial prompt
* plumb plan through a bit
* refactor state management
* move task into state
* fix errors
* add prompt parsing
* add task actions
* better serialization
* more serialization hacks
* fix fn
* fix recursion error
* refine prompt
* better description of run
* update prompt
* tighter planning mechanism
* prompt tweaks
* fix merge
* fix lint issues
* add error handling for tasks
* add graphic for plans
* remove base_path from file actions
* rename subtask to task
* better planning
* prompt updates for verification
* remove verify field
* ruff
* mypy
* fix actions
2024-03-29 11:47:29 -04:00