Alex Bäuerle
cd58194d2a
docs(docs): start implementing docs website ( #1372 )
...
* docs(docs): start implementing docs website
* update video url
* add autogenerated codebase docs for backend
* precommit
* update links
* fix config and video
* gh actions
* rename
* workdirs
* path
* path
* fix doc1
* redo markdown
* docs
* change main folder name
* simplify readme
* add back architecture
* Fix lint errors
* lint
* update poetry lock
---------
Co-authored-by: Jim Su <jimsu@protonmail.com>
2024-04-29 10:00:51 -07:00
Jirka Borovec
e32d95cb1a
lint: simplify hooks already covered by Ruff ( #1204 )
...
* lint: simplify hooks already covered by Ruff
* prune dev dependency
* setting E, W, F
* poetry?
* autopep8
* quote-style = "single"
* double-quote-string-fixer
* --all-files
* apply
* Q
* drop double-quote-string-fixer
* --all-files
* apply pre-commit
* python3.11 -m poetry lock --no-update
---------
Co-authored-by: Robert Brennan <accounts@rbren.io>
2024-04-27 11:32:14 +00:00
Xia Zhenhua
747ac23cd0
fix: conftest.py comment bug. ( #1303 )
...
Co-authored-by: aaren.xzh <aaren.xzh@antfin.com>
2024-04-23 07:51:33 -04:00
Robert Brennan
516c9bf1e0
Revamp docker build process ( #1121 )
...
* refactor docker building
* change to buildx
* disable branch filter
* disable tags
* matrix for building
* fix branch filter
* rename workflow
* sanitize ref name
* fix sanitization
* fix source command
* fix source command
* add push arg
* enable for all branches
* logs
* empty commit
* try freeing disk space
* try disk clean again
* try alpine
* Update ghcr.yml
* Update ghcr.yml
* move checkout
* ignore .git
* add disk space debug
* add df h to build script
* remove pull
* try another failure bypass
* remove maximize build space step
* remove df -h debug
* add no-root
* multi-stage python build
* add ssh
* update readme
* remove references to config.toml
2024-04-15 19:10:38 -04:00
hugehope
9cd4ad3298
chore: fix some typos in comments ( #1013 )
...
Signed-off-by: hugehope <cmm7@sina.cn>
2024-04-11 23:21:46 +02:00
libowen2121
e256329e5e
Update SWE-bench eval results ( #978 )
2024-04-10 21:09:49 +08:00
Engel Nyst
99a8dc4ff9
Fallback to less expensive model ( #475 )
2024-04-07 05:45:37 +02:00
Alex Bäuerle
a82e065f56
feat: add commands for swebench ( #682 )
...
* feat: add commands for swebench
* restructure
2024-04-05 12:47:32 -05:00
Yufan Song
5e87c79838
refactor ( #543 )
2024-04-02 08:13:38 -04:00
Robert Brennan
511afa12fe
fix old references to langchains ( #513 )
2024-04-01 13:33:20 -04:00
Aravind Somaraj
26c9ce132b
style: Moved argument parsing statements into a separate function ( #503 )
...
* style: moved argument parsing into a separate function
* commito
* Update evaluation/regression/conftest.py
---------
Co-authored-by: Robert Brennan <accounts@rbren.io>
2024-04-01 10:47:58 -04:00
Tess
8796a690d5
doc - Added code documentation for clarity ( #434 )
...
* doc - Added code documentation to 'plan.py'
* doc - Added code documentation to 'session.py'
* doc - added code documentation for clarity
* doc - added documentation to 'conftest.py'
* doc - added code documentation to 'run_tests.py'
* Update evaluation/regression/conftest.py
---------
Co-authored-by: Robert Brennan <accounts@rbren.io>
2024-04-01 10:22:09 -04:00
iFurySt
a08c82d35e
ci: check if the image exists in ghcr.io to avoid repeat building and pushing ( #283 )
...
* ci: check if the image exists in ghcr.io to avoid repeat building and pushing.
* feat: add push MAJOR and MINOR version to ghcr.io
2024-03-31 11:30:13 -04:00
iFurySt
2286e73912
fix: change to use the latest docker image. ( #290 )
...
Co-authored-by: Robert Brennan <accounts@rbren.io>
2024-03-29 15:59:52 -04:00
Jim Su
b1b96df8a8
Replace environment variables with configuration file ( #339 )
...
* Replace environment variables with configuration file
* Add config.toml to .gitignore
* Remove unused os imports
* Update README.md
* Update README.md
* Update README.md
* Fix merge conflict
* Fallback to environment variables
* Use template file for config.toml
* Update config.toml.template
* Update config.toml.template
---------
Co-authored-by: Robert Brennan <accounts@rbren.io>
2024-03-29 15:26:20 -04:00
Anas DORBANI
7c27e59918
feat: Ad/regression tests using pytest ( #329 )
...
* Remove all the unnecessary files
* Finalize the regression testing framework and add a hello world test case
* Update requirements.txt
* Update the test function to execute the generate script
2024-03-28 23:40:30 -04:00
iFurySt
89abc5e253
fix: move the makefile to correct path. ( #252 )
2024-03-27 23:53:40 +08:00
iFurySt
8b9fc3df28
feat: add workflow to ghcr ( #237 )
...
Co-authored-by: Xingyao Wang <xingyao6@illinois.edu>
2024-03-27 23:10:34 +08:00
zch-cc
e5a28cba2f
Evaluation: Fix bug on python path on run.sh ( #98 )
...
* Move regression tests to evaluation/
* use python instead of docker in the script
* add model para
* change python to python3
* bug fix
* add python path
* add readme
2024-03-23 00:01:48 +08:00
zch-cc
cfefc47439
Move regression tests to evaluation/ ( #86 )
...
* Move regression tests to evaluation/
* use python instead of docker in the script
* add model para
* change python to python3
* bug fix
2024-03-22 23:26:37 +08:00
libowen2121
40a3614e80
Add a roadmap for eval ( #92 )
2024-03-22 20:27:30 +08:00
Xingyao Wang
2d5c8f1060
change to OpenDevin fork ( #89 )
2024-03-22 18:30:12 +08:00
Xingyao Wang
5ff96111f0
A starting point for SWE-Bench Evaluation with docker ( #60 )
...
* a starting point for SWE-Bench evaluation with docker
* fix the swe-bench uid issue
* typo fixed
* fix conda missing issue
* move files based on new PR
* Update doc and gitignore using devin prediction file from #81
* fix typo
* add a sentence
* fix typo in path
* fix path
---------
Co-authored-by: Binyuan Hui <binyuan.hby@alibaba-inc.com>
2024-03-22 12:43:49 +08:00
Jiaxin Pei
dc88dac296
adding a script to fetch and convert devin's output for evaluation ( #81 )
...
* adding code to fetch and convert devin's output for evaluation
* update README.md
* update code for fetching and processing devin's outputs
* update code for fetching and processing devin's outputs
2024-03-22 01:33:01 +08:00
Binyuan Hui
f99f4ebdaa
fix: typo in the evaluation folder name. ( #66 )
2024-03-20 23:00:09 +08:00