Pull requests: OpenHands/benchmarks
Pull requests list
Strengthen restriction against accessing installed package versions
#691
opened Apr 23, 2026 by
juanmichelini
Collaborator
fix(swebench): release disk during full image assembly
#690
opened Apr 23, 2026 by
simonrosenberg
Collaborator
DO_NOT_MERGE_FOR_TESTING_ONLY - Simulate eval_infer error
#680
opened Apr 20, 2026 by
juanmichelini
Collaborator
Draft
DO_NOT_MERGE_FOR_TESTING_ONLY - Simulate run_infer error
#679
opened Apr 20, 2026 by
juanmichelini
Collaborator
Draft
DO_NOT_MERGE_FOR_TESTING_ONLY - Simulate build images error
#678
opened Apr 20, 2026 by
juanmichelini
Collaborator
Draft
[DO NOT MERGE] ci(swtbench): revert to Blacksmith runner (fallback for #495)
#673
opened Apr 20, 2026 by
simonrosenberg
Collaborator
Draft
fix(swtbench): rmi after push + free runner disk (fixes evaluation#495)
#672
opened Apr 19, 2026 by
simonrosenberg
Collaborator
fix(llm_config): disable reasoning_effort for Opus 4.7
#670
opened Apr 17, 2026 by
juanmichelini
Collaborator
Fix GAIA build target docs: binary-minimal -> binary
#667
opened Apr 15, 2026 by
simonrosenberg
Collaborator
Draft
Remove redundant 'Apply workflow_dispatch overrides' step from build workflows
#660
opened Apr 13, 2026 by
simonrosenberg
Collaborator
Draft
Remove binary diffs from agent patches in SWE-bench evaluation
#656
opened Apr 9, 2026 by
juanmichelini
Collaborator
Draft
Filter SWE-Bench Multimodal image builds to curated subset
#644
opened Apr 6, 2026 by
juanmichelini
Collaborator
fix: reset BuildKit cache between retries for base/assembly builds
#631
opened Apr 4, 2026 by
simonrosenberg
Collaborator
3 tasks
Update Claude ACP package references
#629
opened Apr 3, 2026 by
simonrosenberg
Collaborator
build(deps): bump the version-all group across 1 directory with 21 updates
dependencies
Pull requests that update a dependency file
python:uv
Pull requests that update python:uv code
#596
opened Mar 31, 2026 by
dependabot
Bot
build(deps): bump the version-all group across 1 directory with 5 updates
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#492
opened Mar 9, 2026 by
dependabot
Bot