Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
YerbaPageΒ 
posted an update Aug 1
Post
2883
Latest work on SWE-Bench πŸ›

Our two new papers from the SJTU & Huawei: Powered by DeepSeek-V3, we've achieved a new SOTA on the SWE-Bench benchmark!

We introduce two innovative approaches:
βš”οΈ SWE-Debate: AI agents compete and "debate" to generate the best code fix.
🧠 SWE-Exp: An AI agent learns from past repair "experience" to solve new issues more efficiently.

πŸ‘‡ Explore the future of software development:

SWE-Debate
πŸ“„ Paper: https://arxiv.org/abs/2507.23348
πŸ’» Code: https://github.com/YerbaPage/SWE-Debate

SWE-Exp
πŸ“„ Paper: https://arxiv.org/abs/2507.23361
πŸ’» Code: https://github.com/YerbaPage/SWE-Exp
In this post