Commit 35771a4
chore(release): 2.3.0 [skip ci]
# [2.3.0](v2.2.0...v2.3.0) (2026-02-28)
### Bug Fixes
* auto-install deps, python3 symlink, detect full commands in fail_to_pass, language-aware test scripts ([a38497f](a38497f))
* config test race condition with env var mutex ([2963325](2963325))
* expose agent_output and agent_patch in TaskResult and API responses ([348c251](348c251))
* extract_agent_only for /evaluate - no tasks/ dir required ([2b90ee1](2b90ee1))
* filter out apt-get/system commands from install (Basilica blocks syscalls), keep project-level installs ([e5365da](e5365da))
* handle null test_patch from HuggingFace API (deserialize null as empty string) ([492d068](492d068))
* install corepack/yarn/pnpm globally via npm in Dockerfile ([b7183e8](b7183e8))
* normalize repo URL in parse_task (add github.com prefix) ([398a6fd](398a6fd))
* run as root (Basilica blocks sudo), remove sudo prefix logic ([477a433](477a433))
* sudo for apt-get in install commands, add golang/corepack/sudo to Dockerfile ([1aceb88](1aceb88))
* use :id path params for Axum 0.7 (not {id} which is 0.8) ([5dfa0c1](5dfa0c1))
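
The repo URL normalization fix listed above (398a6fd) can be illustrated with a minimal sketch. This is not the project's actual `parse_task` code: the helper name `normalize_repo_url` and its handling of trailing slashes, scheme-less hosts, and SSH remotes are assumptions made for illustration only.

```rust
/// Hypothetical helper (not taken from the commit): expand a bare
/// `owner/repo` slug into a full GitHub URL.
fn normalize_repo_url(repo: &str) -> String {
    // Drop surrounding whitespace and any trailing slash.
    let trimmed = repo.trim().trim_end_matches('/');

    if trimmed.contains("://") || trimmed.starts_with("git@") {
        // Already a full URL or an SSH remote: pass it through unchanged.
        trimmed.to_string()
    } else if trimmed.starts_with("github.com/") {
        // Host is present but the scheme is missing.
        format!("https://{}", trimmed)
    } else {
        // Bare slug such as `owner/repo`: add the github.com prefix.
        format!("https://github.com/{}", trimmed)
    }
}
```

Under these assumptions, `normalize_repo_url("owner/repo")` yields `https://github.com/owner/repo`, while an input that already starts with `https://` is returned as-is.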
### Features
* /evaluate endpoint using stored agent + TRUSTED_VALIDATORS whitelist ([b6aee7a](b6aee7a))
* add POST /submit_tasks endpoint + fix HuggingFace dataset compat ([d92444c](d92444c))
* agent user with sudo for apt-install, run all commands as non-root agent ([1f7ce52](1f7ce52))
* agent ZIP upload frontend with env vars + SUDO_PASSWORD auth ([3aa5184](3aa5184))
* fat Docker image with all language runtimes (java, rust, pnpm, unzip, etc.) ([3855f2d](3855f2d))
* fetch task definitions from HF repo (workspace.yaml + tests/), remove auto_install hack ([7162a39](7162a39))
* swe-bench/swe-forge integration ([814259e](814259e))
  - extend WorkspaceConfig with fail_to_pass/pass_to_pass/install_config/difficulty fields
  - parse swe-forge workspace.yaml native fields as test script fallback
  - capture git diff (agent patch) after agent execution
  - add /dataset endpoint to fetch from HuggingFace CortexLM/swe-forge
  - wire fail_to_pass/pass_to_pass in dataset entry conversion

1 parent 1f7ce52 commit 35771a4
3 files changed: +30 -2 lines changed