Engel Nyst
65a39ba716
fix obs
2024-07-08 05:27:43 +02:00
Engel Nyst
493fafa86d
fix delegates summary
2024-07-08 05:27:09 +02:00
Engel Nyst
82b35ef425
quick fix history and state
2024-07-08 05:25:05 +02:00
Engel Nyst
0c9e7d5737
WIP restore
2024-07-08 05:20:12 +02:00
OpenDevin
3c0d206bef
Refactor Agent class to manage its own history and update CodeActAgent to use self.history
2024-07-08 05:17:38 +02:00
Engel Nyst
5707ad3cd7
WIP
2024-07-08 05:16:23 +02:00
Engel Nyst
642764ef2c
[WIP] try to use condenser internally in history
2024-07-08 05:12:14 +02:00
Engel Nyst
1df3c9ca02
add delegate summaries
2024-07-08 04:59:02 +02:00
மனோஜ்குமார் பழனிச்சாமி
c6aa50779d
Update regenerate.sh ( #2832 )
2024-07-07 23:52:03 +02:00
Ralf D. Müller
ba0f57c279
added netcat to the requirements ( #2822 )
2024-07-07 21:32:56 +00:00
Engel Nyst
d37b2973b2
Refactoring: event stream based agent history ( #2709 )
...
* add to event stream sync
* remove async from tests
* small logging spam fix
* remove swe agent
* arch refactoring: use history from the event stream
* refactor agents
* monologue agent
* ruff
* planner agent
* micro-agents
* refactor history in evaluations
* evals history refactoring
* adapt evals and tests
* unit testing stuck
* testing micro agents, event stream
* fix planner agent
* fix tests
* fix stuck after rename
* fix test
* small clean up
* fix merge
* fix merge issue
* fix integration tests
* Update agenthub/dummy_agent/agent.py
* fix tests
* rename more clearly; add todo; clean up
2024-07-07 21:04:23 +00:00
மனோஜ்குமார் பழனிச்சாமி
9dc2d2c80f
Refactor: Remove extra log ( #2687 )
2024-07-08 05:37:13 +09:00
Shimada666
e35c1ff74a
Display real-time build logs for the agnostic image ( #2830 )
...
* Display real-time build logs for the agnostic image and improve wget's output.
* remove unused code
2024-07-08 04:35:16 +08:00
மனோஜ்குமார் பழனிச்சாமி
34c765688b
Streamline Logging Events ( #2532 )
...
* Skip duplicate log
* log user actions
* fix tests
* log all action _step
* refactor log
* revert test
* refactor log
* visual diff
* disable overriding event source
* Revert "disable overriding event source"
This reverts commit b0047cc0cdead215044ad8ce73f5bad47df99e08.
* Refactor logic
* refactored runtime on_event
* fix merge conflict
in Web UI, it shows as red color (seems deletion but added)
* linted
---------
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
2024-07-08 05:34:47 +09:00
மனோஜ்குமார் பழனிச்சாமி
adf1a0d556
Bugfix: add missing f-string for logging debug message in task creation ( #2836 )
2024-07-07 17:36:19 +02:00
மனோஜ்குமார் பழனிச்சாமி
85a817304e
Check exit code ( #2834 )
2024-07-07 17:35:22 +02:00
Graham Neubig
d0384cafdd
Two fixes to swe bench eval ( #2831 )
...
* Two fixes to swe bench eval
* Add error message
* Change dumping of metadata
2024-07-07 07:21:50 +00:00
மனோஜ்குமார் பழனிச்சாமி
3a3694ca17
doc: Mention negative feedback feature in bug report. ( #2827 )
...
* doc: Mention feedback feature in bug report.
* Update .github/ISSUE_TEMPLATE/bug_template.yml
Co-authored-by: Graham Neubig <neubig@gmail.com>
---------
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-07-07 06:56:12 +00:00
Bin Lei
c8e5848add
fix git diff TIMEOUT problem in swe_bench evaluation ( #2828 )
...
* fix git diff TIMEOUT problem in swe_bench evaluation
* fix git diff TIMEOUT problem in swe_bench evaluation
* Update evaluation/swe_bench/swe_env_box.py
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com>
---------
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com>
2024-07-07 06:30:59 +00:00
Shimada666
0973e31f00
Update custom sandbox usage guide ( #2829 )
2024-07-07 05:33:35 +02:00
Shimada666
82f256be96
trim the sandbox image and install plugin dependencies in agnostic image ( #2792 )
...
* trim the sandbox image
* remove wrong code
* readd python
* readd python
* fix script
* readd nano
2024-07-06 17:38:37 +02:00
மனோஜ்குமார் பழனிச்சாமி
d6570bd572
Fix gemini-1.5-flash crash due to missing 'vertexai' module ( #2826 )
...
* Fix gemini-1.5-flash crash due to missing 'vertexai' module
* Update poetry.lock
2024-07-06 16:27:59 +02:00
Shimada666
d22ff73905
Make the sandbox Python runtime completely transparent ( #2796 )
...
* Make the sandbox Python runtime completely independent
* fix source bashrc
* add pip install instruction for ipython to fix intergration tests for codeact swe
* update integration tests
* change flake8 command to (maybe) fix sandbox tests?
* make lint support both unittest & sandbox
* fix agnostic image build error
* refactor build script
---------
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
Co-authored-by: tobitege <tobitege@gmx.de>
2024-07-06 15:22:44 +02:00
Leo
9b0ff117ab
CI: Support uploading frontend unit test coverage. ( #2772 )
...
* CI: Support uploading frontend unit test coverage.
* Add make-i18n before test.
* Update vitest configuration to include only .ts and .tsx files in coverage.
* remove .only in test and fix the failed tests.
* Add text summary.
* Move vite-tsconfig-paths to dev dep. Adjust UTs.
---------
Signed-off-by: ifuryst <ifuryst@gmail.com>
Co-authored-by: sp.wack <83104063+amanape@users.noreply.github.com>
2024-07-06 12:16:30 +08:00
Xingyao Wang
f6dc89b41a
[Evaluation] Simplify eval & and multi-processing related fixes ( #2810 )
...
* initialize agent inside process_instance_fn;
* remove dependency on `config.max_iterations`
* switch back to only include llm config to metadata
2024-07-06 07:18:46 +08:00
Xingyao Wang
a47713ecb0
[Arch] Remove supports for Background Commands ( #2803 )
...
* depracting docker exec box
* remove doc exec from workflow and docs
* remove background commands
* Update tests/unit/test_sandbox.py
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
* replace for-loop with assignment
* fix integration tests
* fix integration tests for shell script
* fix integration tests
* increase max iter to fix some monologue agent issue
* fix integration test again
* fix integration tests (seems related to run_user issue)
---------
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
2024-07-06 03:38:05 +08:00
mamoodi
99284da476
Use README as the only place for getting started instructions ( #2815 )
...
Co-authored-by: Mahmoud Work <mahmoudwork@mahmouds-mini.home>
2024-07-05 19:44:29 +02:00
mamoodi
9ccc64fa7e
Update README ( #2814 )
...
Co-authored-by: Mahmoud Work <mahmoudwork@mahmouds-mini.home>
2024-07-05 12:54:34 -04:00
dependabot[bot]
eda582335a
chore(deps): bump tenacity from 8.4.2 to 8.5.0 ( #2813 )
...
Bumps [tenacity](https://github.com/jd/tenacity ) from 8.4.2 to 8.5.0.
- [Release notes](https://github.com/jd/tenacity/releases )
- [Commits](https://github.com/jd/tenacity/compare/8.4.2...8.5.0 )
---
updated-dependencies:
- dependency-name: tenacity
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: yufansong <yufan@risingwave-labs.com>
2024-07-05 16:26:08 +00:00
மனோஜ்குமார் பழனிச்சாமி
ed45a9e7b1
delete colima profile ( #2807 )
2024-07-05 08:55:26 -07:00
dependabot[bot]
1bdfbedccc
chore(deps-dev): bump ruff from 0.5.0 to 0.5.1 ( #2811 )
...
Bumps [ruff](https://github.com/astral-sh/ruff ) from 0.5.0 to 0.5.1.
- [Release notes](https://github.com/astral-sh/ruff/releases )
- [Changelog](https://github.com/astral-sh/ruff/blob/main/CHANGELOG.md )
- [Commits](https://github.com/astral-sh/ruff/compare/0.5.0...0.5.1 )
---
updated-dependencies:
- dependency-name: ruff
dependency-type: direct:development
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-07-05 08:46:42 -07:00
dependabot[bot]
274464101f
chore(deps): bump litellm from 1.41.6 to 1.41.7 ( #2812 )
...
Bumps [litellm](https://github.com/BerriAI/litellm ) from 1.41.6 to 1.41.7.
- [Release notes](https://github.com/BerriAI/litellm/releases )
- [Commits](https://github.com/BerriAI/litellm/compare/v1.41.6...v1.41.7 )
---
updated-dependencies:
- dependency-name: litellm
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-07-05 08:46:34 -07:00
Graham Neubig
a081935fd8
Simplify eval code ( #2775 )
...
* Start simplifying eval code
* Update
* Add EDA
* Updated GAIA
* Update gpqa
* Add humanevalfix
* Fix logic_reasoning
* Add miniwob
* Add mint and ml_bench
* toolqa
* Added swe-bench
* Fixed webarena
* Refactor parameters
0.7.1
2024-07-05 19:33:08 +09:00
r.e.e.c.h.e.e
038e8f8caa
docs: update docker run command to pull default 'latest' tag ( #2804 )
...
- Ensure users get the most recent stable release version when pulling default image.
- Explains the main tag for those who want the most recent updates.
2024-07-05 04:13:35 +00:00
மனோஜ்குமார் பழனிச்சாமி
143f38d25a
Refactored sandbox config and added fast boot ( #2455 )
...
* Refactored sandbox config and added fastboot
* added tests
* fixed tests
* fixed tests
* intimate user about breaking change
* remove default config from eval
* check for lowercase env
* add test
* Revert Migration
* migrate old sandbox configs
* resolve merge conflict
* revert migration 2
* Revert "remove default config from eval"
This reverts commit de57c588dbf29a3327798ce68976e2d2277b8bb1.
* change type to box_type
* fix var name
* linted
* lint
* lint comments
* fix tests
* fix tests
* fix typo
* fix box_type, remove fast_boot
* add tests for sandbox config
* fix test
* update eval docs
* small removal comments
* adapt toml template
* old fields shouldn't be in the app dataclass
* fix old keys in app config
* clean up exec box
---------
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
2024-07-05 03:30:21 +00:00
Xingyao Wang
82f4860470
retry for internal server error ( #2806 )
2024-07-05 01:47:55 +00:00
Xingyao Wang
298956c78a
[Eval] initialize llm inside process_instance to circumvent "AttributeError:… ( #2805 )
...
* initialize llm inside process_instance to circumvent "AttributeError: Can't pickle local object"
* update kwargs
2024-07-05 01:26:03 +00:00
Xingyao Wang
0d3b3ffbf8
[Arch] Removing docker exec box ( #2802 )
...
* depracting docker exec box
* remove doc exec from workflow and docs
2024-07-04 23:15:25 +00:00
Xingyao Wang
e6cdf18d3b
[Evaluation] Log empty patch stats for SWE-Bench ( #2776 )
...
* bump swebench version since the fix PR is merged
* add empy generation stats from latest pr
* delete eval_outputs if it already exists
* handle non string patch
2024-07-05 07:03:27 +08:00
Engel Nyst
0b8d357bef
Add event synchronously ( #2700 )
...
* add to event stream sync
* remove async from tests
2024-07-05 00:15:51 +02:00
sven
1b10e2b9d5
Make CodeAct finish task ( #2673 )
...
* Added feature to CodeAct agent to finish action instead of waiting for user input.
* Minor change
* Update agenthub/codeact_agent/codeact_agent.py
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com>
* updated integration tests with claude-sonnet-3.5
* Update agenthub/codeact_agent/prompt.py
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
* updated tests to remove typo in prompt
* resolve merge conflicts II
* revert unintended change of regenerate script
* re-regenerating prompts to resolve merge conflicts
---------
Co-authored-by: மனோஜ்குமார் பழனிச்சாமி <smartmanoj42857@gmail.com>
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
2024-07-04 11:21:46 -07:00
மனோஜ்குமார் பழனிச்சாமி
688bd2a8fc
Added local ollama models ( #2433 )
...
* added local ollama models
* add ollama_base_url config
* Update listen.py
* add docs
* Update opendevin/server/listen.py
Co-authored-by: Graham Neubig <neubig@gmail.com>
* lint
---------
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-07-04 15:56:26 +00:00
dependabot[bot]
6853cbb4f6
chore(deps): bump framer-motion from 11.2.12 to 11.2.13 in /frontend ( #2793 )
...
Bumps [framer-motion](https://github.com/framer/motion ) from 11.2.12 to 11.2.13.
- [Changelog](https://github.com/framer/motion/blob/main/CHANGELOG.md )
- [Commits](https://github.com/framer/motion/compare/v11.2.12...v11.2.13 )
---
updated-dependencies:
- dependency-name: framer-motion
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-07-04 23:41:47 +08:00
dependabot[bot]
14b4213acf
chore(deps-dev): bump openai from 1.35.9 to 1.35.10 ( #2789 )
2024-07-04 12:37:08 +00:00
dependabot[bot]
d145dd78a3
chore(deps-dev): bump typescript from 5.2.2 to 5.5.3 in /docs ( #2785 )
2024-07-04 20:02:11 +08:00
dependabot[bot]
c8270013ad
chore(deps): bump react-dom from 18.2.0 to 18.3.1 in /docs ( #2787 )
2024-07-04 11:36:57 +00:00
dependabot[bot]
dd1bd9caf3
chore(deps): bump react from 18.2.0 to 18.3.1 in /docs ( #2786 )
2024-07-04 19:03:46 +08:00
dependabot[bot]
c77480bb55
chore(deps): bump litellm from 1.41.3 to 1.41.6 ( #2790 )
...
Bumps [litellm](https://github.com/BerriAI/litellm ) from 1.41.3 to 1.41.6.
- [Release notes](https://github.com/BerriAI/litellm/releases )
- [Commits](https://github.com/BerriAI/litellm/compare/v1.41.3...v1.41.6 )
---
updated-dependencies:
- dependency-name: litellm
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-07-04 11:01:48 +00:00
dependabot[bot]
7291c320fc
chore(deps): bump boto3 from 1.34.138 to 1.34.139 ( #2788 )
...
Bumps [boto3](https://github.com/boto/boto3 ) from 1.34.138 to 1.34.139.
- [Release notes](https://github.com/boto/boto3/releases )
- [Commits](https://github.com/boto/boto3/compare/1.34.138...1.34.139 )
---
updated-dependencies:
- dependency-name: boto3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-07-04 10:58:40 +00:00
Leo
c2f557edde
refactor: multiple code improvements ( #2771 )
2024-07-04 18:51:22 +08:00