Discover the Best AI Resources
Curated essentials, no noise — just what matters
Interleaves chain-of-thought reasoning with tool-using actions in one LLM loop: the model plans, queries a source like Wikipedia, then revises from results. Cuts hallucination versus reasoning-only prompting and beats trained agents on interactive tasks.
Treats the interface between an LM agent and a computer as a design variable. A custom agent-computer interface (ACI) with concise file-edit, repo-navigation, and test commands plus compact feedback reaches 12.5% pass@1 on SWE-bench, 87.7% on HumanEvalFix.
A graph-based RAG framework pairing a knowledge graph with vector retrieval and a dual-level (low/high) query mode. New documents merge into the graph via set operations instead of triggering a rebuild, cutting the cost of keeping the index current.