Commit Graph

2081 Commits

Author SHA1 Message Date
Engel Nyst
7da6e06da6 Retry on litellm's APIError, which includes 502 (#4167) 2024-10-03 03:00:58 +00:00
Xingyao Wang
c2223a0fe4 upgrade litellm 2024-10-02 19:49:22 +00:00
Xingyao Wang
f2a48a870c fix wrong import 2024-10-02 19:49:12 +00:00
Xingyao Wang
61d99e9e37 add few seconds to properly receive timeout error from client 2024-10-02 04:08:02 +00:00
Xingyao Wang
9af6399a90 make target_image_tag optional 2024-10-02 01:01:16 +00:00
Xingyao Wang
ac1459b0c9 Update instruction for new version of eval runtime-api (#4128) 2024-10-01 19:15:37 +00:00
Xingyao Wang
e5c5e1c4e5 bump to new runtime w/o parallel 2024-10-01 17:03:57 +00:00
Xingyao Wang
cc03b59238 fix eval_infer.sh 2024-09-29 21:02:49 +00:00
Xingyao Wang
6999d969bb [eval] log evaluating warnings directly to console (#4026) 2024-09-28 05:35:57 +00:00
tobitege
f446237081 revert #3871 dockerfile template: don't write to .bashrc file (#4095) 2024-09-28 05:22:36 +00:00
Xingyao Wang
891b02d1ce [runtime] do not keep rebuilding from generic image (#4072) 2024-09-27 21:15:57 +00:00
Xingyao Wang
78cbd90df0 parser fix for deepseek 2024-09-27 21:10:14 +00:00
Xingyao Wang
4ae0a3c887 change to imap_unordered 2024-09-24 20:32:33 +00:00
Xingyao Wang
6d9385baa2 try fix mp again 2024-09-24 20:32:30 +00:00
Xingyao Wang
7eb44cdeff use mp Pool instead ProcessPoolExecutor 2024-09-24 14:03:24 +00:00
Xingyao Wang
5a64cf2bca fix log copy failure 2024-09-20 19:33:29 +00:00
Xingyao Wang
b24a7821ec [eval] fix evaluation git patch post-processing (#3979) 2024-09-20 22:55:43 +08:00
Xingyao Wang
caa0f03c7b Merge commit 'e0f91f2aef053e8ae5c8f78539f086a01346c10e' into eval/24-sep 2024-09-18 16:01:49 +00:00
Xingyao Wang
e0f91f2aef Update evaluation/swe_bench/eval_infer.py
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-18 22:36:57 +08:00
Xingyao Wang
5d1355ffa0 Update evaluation/swe_bench/README.md
Co-authored-by: Graham Neubig <neubig@gmail.com>
2024-09-18 22:36:50 +08:00
Xingyao Wang
4c3068c711 Merge branch 'main' into xw/eval-swebench 2024-09-18 08:40:07 -05:00
Xingyao Wang
68b2152942 update output 2024-09-18 13:34:51 +00:00
Xingyao Wang
b7416a4723 print retry time as well 2024-09-18 01:46:43 +00:00
Xingyao Wang
770af8d74b Revert "bump timeout"
This reverts commit c92cbbb201.
2024-09-17 22:29:15 +00:00
Xingyao Wang
090f0df452 only increase timeout for /alive 2024-09-17 22:29:01 +00:00
Xingyao Wang
c92cbbb201 bump timeout 2024-09-17 22:25:51 +00:00
Qiang Li
f7ebc1cf1f chore: Add docker files for developing inside container. (#3911) 2024-09-17 23:19:01 +02:00
Xingyao Wang
ee37af93a1 sleep longer for eval retry 2024-09-17 20:42:11 +00:00
Xingyao Wang
e09e8b4ebf improve runtime cleanup script 2024-09-17 19:26:41 +00:00
Xingyao Wang
b96d798efa fix reset logger for n-p=1 2024-09-17 19:18:58 +00:00
mamoodi
8a419b5c45 Reorder sidebar doc by bringing LLMs higher up (#3922) 2024-09-17 14:26:13 -04:00
dependabot[bot]
9787a31ba1 chore(deps-dev): bump @types/react from 18.3.6 to 18.3.7 in /frontend (#3920)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-17 11:57:33 -04:00
dependabot[bot]
31296624d1 chore(deps): bump vite from 5.4.5 to 5.4.6 in /frontend (#3919)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-17 11:57:23 -04:00
Xingyao Wang
9a9d376772 save infer logs as well 2024-09-17 15:46:50 +00:00
Xingyao Wang
9e2a693ed4 save relavant info; remove extra logging 2024-09-17 15:43:30 +00:00
Xingyao Wang
cc3c34c90a fix eval 2024-09-17 15:40:07 +00:00
dependabot[bot]
656222f416 chore(deps): bump boto3 from 1.35.19 to 1.35.20 (#3915)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-17 11:29:15 -04:00
dependabot[bot]
a4e61faf56 chore(deps-dev): bump llama-index from 0.11.9 to 0.11.10 (#3916)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-17 11:29:06 -04:00
dependabot[bot]
f4657edc48 chore(deps): bump litellm from 1.46.0 to 1.46.1 (#3917)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-09-17 15:21:32 +00:00
Xingyao Wang
279443a563 fix missing log path 2024-09-17 15:06:31 +00:00
Xingyao Wang
8a9d9576a9 use polling to get updates to avoid timeout 2024-09-17 15:03:26 +00:00
Xingyao Wang
79867629db Merge commit '963f0db6ab7b24a2f45a2692aa948f190d49cac6' into xw/eval-swebench 2024-09-17 14:50:42 +00:00
Engel Nyst
ef09f0fe37 Small fix in readme (#3912) 2024-09-17 14:33:25 +00:00
Xingyao Wang
f996b31d64 [eval] Fix multi-processing bug (again^3) & allow set EXP_NAME for each run_infer (#3907)
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
2024-09-17 14:07:58 +00:00
Xingyao Wang
963f0db6ab Update evaluation/utils/shared.py
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
2024-09-17 21:42:28 +08:00
Xingyao Wang
4e93a24e44 Update evaluation/utils/shared.py
Co-authored-by: Boxuan Li <liboxuan@connect.hku.hk>
2024-09-17 21:42:20 +08:00
mamoodi
fa0d9cfa42 Add providers section in documentation and put current docs under it (#3905) 2024-09-17 08:52:13 -04:00
niliy01
07a094e701 (enh) Update Docker pull data in place (#3910)
Signed-off-by: Yi Lin <teroincn@gmail.com>
2024-09-17 10:22:07 +02:00
tobitege
52c5abccbf (enh) Dockerfile.j2: improve env vars for bash and activate in .bashrc (#3871) 2024-09-17 08:49:04 +02:00
dependabot[bot]
29b0e62cd7 chore(deps): bump react-i18next from 15.0.1 to 15.0.2 in /frontend (#3889) 2024-09-17 11:04:48 +08:00