-
Notifications
You must be signed in to change notification settings - Fork 16
Pull requests: OpenHands/benchmarks
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add detection and reporting for incomplete evaluation runs
#148
opened Dec 10, 2025 by
simonrosenberg
•
Draft
Fix Browser action deserialization by using OpenHandsModel
#136
opened Dec 6, 2025 by
simonrosenberg
Loading…
Add GAIA eval_infer for unified evaluation workflow
#125
opened Dec 2, 2025 by
simonrosenberg
Loading…
API-based Critic implementation
build-swebench-200
Build 200 SWE-Bench Verified Image based on SDK version on this PR.
#117
opened Nov 26, 2025 by
xingyaoww
Loading…
build(deps): bump the version-all group with 2 updates
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#113
opened Nov 24, 2025 by
dependabot
bot
Loading…
ProTip!
Adding no:label will show everything without a label.