SWE-agent is a system designed to empower language model (LM) agents to autonomously perform software engineering tasks. It features a custom agent-computer interface (ACI) that enhances the agent's ability to navigate repositories, create and edit code, and execute programs, achieving state-of-the-art results on the SWE-bench and HumanEvalFix benchmarks. [2, 5, 8]
Language model (LM) agents are increasingly used for automating complex digital tasks. [5] Just as human developers rely on Integrated Development Environments (IDEs), this paper posits that LM agents also need specialized interfaces to effectively tackle complex challenges like software engineering. [5, 8]
To address this, the paper introduces SWE-agent, a system that provides a tailored agent-computer interface (ACI). [2, 3] This interface is specifically designed to improve an agent's ability to perform key software development actions, including navigating repositories, creating and editing code, and executing programs.
By using this specialized ACI, SWE-agent achieved state-of-the-art performance, solving 12.5% of issues on the SWE-bench benchmark and 87.7% on HumanEvalFix, significantly outperforming previous non-interactive models. [2, 5, 9]
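To make the idea of an ACI more concrete, the following is a minimal sketch of what such an interface might look like: a small set of commands for navigating a repository, editing files, and executing programs, each returning short textual feedback an agent could read. The class name `ToyACI`, its method names, and the `window` parameter are all hypothetical and chosen for illustration; this is not SWE-agent's actual command set or implementation.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


class ToyACI:
    """Hypothetical, minimal agent-computer interface (not SWE-agent's real API)."""

    def __init__(self, repo_root: str, window: int = 5) -> None:
        self.root = Path(repo_root)
        self.window = window  # number of lines shown per view (assumed default)

    def find_file(self, name: str) -> list[str]:
        """Navigate: list repository-relative paths whose filename matches `name`."""
        return sorted(p.relative_to(self.root).as_posix() for p in self.root.rglob(name))

    def view(self, path: str, start: int = 1) -> str:
        """Navigate: show a numbered window of lines starting at `start` (1-indexed)."""
        lines = (self.root / path).read_text().splitlines()
        chunk = lines[start - 1 : start - 1 + self.window]
        return "\n".join(f"{start + i}: {line}" for i, line in enumerate(chunk))

    def edit(self, path: str, start: int, end: int, new_text: str) -> str:
        """Edit: replace lines start..end (inclusive), creating the file if needed."""
        target = self.root / path
        lines = target.read_text().splitlines() if target.exists() else []
        lines[start - 1 : end] = new_text.splitlines()
        target.write_text("\n".join(lines) + "\n")
        return self.view(path, start)  # show the edited region as immediate feedback

    def run(self, path: str) -> str:
        """Execute: run a Python file inside the repository and return its output."""
        result = subprocess.run(
            [sys.executable, path],
            cwd=self.root, capture_output=True, text=True, timeout=30,
        )
        return result.stdout + result.stderr


if __name__ == "__main__":
    # Usage: create a file, locate it, then run it, all through the interface.
    with tempfile.TemporaryDirectory() as repo:
        aci = ToyACI(repo)
        print(aci.edit("hello.py", 1, 1, "print('hello')"))
        print(aci.find_file("hello.py"))
        print(aci.run("hello.py"))
```

The design choice illustrated here is that every command returns a compact, numbered view rather than raw shell output, which is one plausible way an interface could be shaped around what an LM agent can easily consume.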