OpenHands

community

https://github.com/All-Hands-AI/OpenHands

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

pengchao authored a paper 26 days ago

SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code

pengchao authored a paper about 1 month ago

DevBench: A Comprehensive Benchmark for Software Development

pengchao authored a paper about 1 month ago

CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering

View all activity

spaces 1

OpenHands Evaluation Benchmark

models 1

OpenHands/CodeQwen1.5-7B-OpenDevin

Text Generation • Updated May 25, 2024 • 13 • 17

datasets 7

OpenHands/eval-output-webarena

Updated Jul 20, 2024 • 7

OpenHands/eval-browsing-instructions

Viewer • Updated Jul 15, 2024 • 933 • 7

OpenHands/eval-output-miniwob

Updated Jun 10, 2024 • 6

OpenHands/SWE-bench-devin-passed

Viewer • Updated Apr 9, 2024 • 79 • 6

OpenHands/SWE-bench-devin-full-filtered

Viewer • Updated Apr 9, 2024 • 450 • 1 • 1

OpenHands/SWE-bench-devin-full

Viewer • Updated Apr 9, 2024 • 570 • 4

OpenHands/Devin-SWE-bench-output

Viewer • Updated Mar 21, 2024 • 1.14k • 13