27 Commits

Author SHA1 Message Date
Han Xiao
aac0db67e4 feat: add hostnames bw filter 2025-03-18 11:24:53 +08:00
Han Xiao
4ca7804e58 feat: add hostnames bw filter 2025-03-18 10:43:46 +08:00
Han Xiao
1ac80e4d20 fix: url sanitization 2025-03-17 18:09:01 +08:00
Han Xiao
3930f8b863 fix: url sanitization 2025-03-17 15:41:54 +08:00
Han Xiao
01705291c4 fix: normalize url 2025-03-17 14:44:28 +08:00
Han Xiao
5c36410b54 fix: normalize url 2025-03-17 14:23:02 +08:00
Han Xiao
90b1d39cc6 fix: normalize url 2025-03-17 12:07:19 +08:00
Han Xiao
f9cb542dd0 fix: normalize url 2025-03-15 13:55:59 +08:00
Han Xiao
f5d6bf75f5 feat: add num urls 2025-03-14 15:18:50 +08:00
Han Xiao
c9a51bb403 fix: fallback genobj 2025-03-14 13:22:17 +08:00
Han Xiao
59b2daf66b fix: updated time 2025-03-13 10:30:35 +08:00
Han Xiao
4d76f146d0 feat: late chunking 2025-03-12 15:13:32 +08:00
Han Xiao
013056f218 feat: late chunking 2025-03-12 14:07:11 +08:00
Han Xiao
c8fc259dff refactor: pull url out 2025-03-11 21:30:59 +08:00
Han Xiao
c30043e119 fix: eval 2025-03-11 17:56:39 +08:00
Han Xiao
ea42af3101 fix: eval 2025-03-11 17:09:45 +08:00
Han Xiao
05ddb30d80 refactor: query rewriter 2025-03-11 15:34:00 +08:00
Han Xiao
d947973a68 refactor: query rewriter 2025-03-11 15:10:08 +08:00
Han Xiao
1e097a9ecc fix: url datetime guessing 2025-03-07 14:32:47 +08:00
Han Xiao
8b836431af fix: url datetime guessing 2025-03-07 13:43:14 +08:00
Han Xiao
1604013788 fix: url datetime guessing 2025-03-06 17:24:25 +08:00
Han Xiao
dbeee0c8f5 fix: url datetime guessing 2025-03-06 17:15:46 +08:00
Han Xiao
d9bfc2fd1f feat: improve url ranking, fix eval bugs 2025-03-06 14:17:56 +08:00
Han Xiao
5df8d8a9c6 fix: weighted urls and hostnames 2025-03-05 10:58:52 +08:00
Han Xiao
51ad77d302 feat: add url ranking 2025-03-04 16:29:22 +08:00
Han Xiao
ad7e524554 fix: multi-aspect 2025-02-25 11:12:33 +08:00
Han Xiao
c02588a92c feat: optimize prompt, coding, reflect 2025-02-24 13:16:18 +08:00