LogoAIAny
Icon for item

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

SWE-agent is a system designed to empower language model (LM) agents to autonomously perform software engineering tasks. It features a custom agent-computer interface (ACI) that enhances the agent's ability to navigate repositories, create and edit code, and execute programs, achieving state-of-the-art results on the SWE-bench and HumanEvalFix benchmarks. [2, 5, 8]

Introduction

Language model (LM) agents are increasingly used for automating complex digital tasks. [5] Just as human developers rely on Integrated Development Environments (IDEs), this paper posits that LM agents also need specialized interfaces to effectively tackle complex challenges like software engineering. [5, 8]

To address this, the paper introduces SWE-agent, a system that provides a tailored agent-computer interface (ACI). [2, 3] This interface is specifically designed to improve an agent's ability to perform key software development actions, including:

  • Creating and editing code files [2]
  • Navigating entire code repositories [2]
  • Executing tests and other programs [2]

By using this specialized ACI, SWE-agent achieved state-of-the-art performance, solving 12.5% of issues on the SWE-bench benchmark and 87.7% on HumanEvalFix, significantly outperforming previous non-interactive models. [2, 5, 9]

Information

  • Websitearxiv.org
  • AuthorsJohn Yang, Carlos E. Jimenez, Alexander Wettig, Kilian Lieret, Shunyu Yao, Karthik Narasimhan, Ofir Press
  • Published date2024/05/06