AIAny - Agents-A1: Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

Most current scaling focuses on parameters; this project demonstrates an alternative: scale the agent "horizon" (long, structured trajectories) to achieve frontier-level agentic performance with a ~35B MoE model. Agents-A1 is trained on domain-grounded knowledge–action trajectories (average length ~45K tokens) and unified across six heterogeneous domains using a three-stage recipe (full-domain supervised fine-tuning, domain teacher models, and multi-teacher on-policy distillation). The result is a deployable agent that narrows the gap with much larger models on multi-step, tool-using, and research-oriented tasks.

Key Capabilities

Long-horizon agentic reasoning: decomposes complex goals into executable substeps and adapts strategies based on intermediate observations, enabled by training on very long agentic trajectories.
Tool and function integration: natively supports function calling and tool parsers, enabling API, search, and code interpreter interaction in-chain.
Multi-domain distillation: combines specialized domain teachers into one student model, unifying six domains (e.g., long-horizon search, engineering, scientific research, instruction following).
Reproducible evaluation: open-sourced an evaluation framework and reports SOTA/competitive numbers on many benchmarks, making comparisons and reproduction straightforward.

Who it's for and tradeoffs

Great fit if you need an open reproducible agentic model that handles very long contexts and tool-enabled workflows (research labs, agent developers, MLops teams using vLLM/SGLang). Look elsewhere if you need a lightweight on-device assistant or minimal-resource inference: Agents-A1 expects substantial memory and serving infrastructure to realize its 262K+ context and MoE runtime advantages. Also note practical dependencies—best results reuse the provided serving stacks (vLLM, SGLang) and quantized variants for constrained hardware. Operational caveats include increased complexity around tool chains (errors in external tools can cascade) and standard model risks like hallucination when verifier/tool signals are noisy.

Agents-A1: Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

Introduction

Information

Categories

Tags

More Items

AFTER

BugTraceAI-CORE-Ultra-27B-Q6

Qwopus-3.6-35B-A3B-Coder-MTP-GGUF

Key Capabilities

Who it's for and tradeoffs