Provides 4,659 agentic single-turn SFT training pairs extracted from Claude Fable‑5, formatted as a single-column parquet for Qwen-style fine-tuning. Includes explicit chain-of-thought (<think>) blocks, XML-serialized <tool_use> calls, PII redaction, and AGPL-3.0 licensing.
Provides a lightweight repository-exploration subagent for LLM coding agents: invoked on demand to run parallel read-only READ/GLOB/GREP calls and return compact file-path plus line-range citations so the main solver gets focused evidence instead of noisy reads.
Benchmark for evaluating procedural skill evolution in LLM agents: isolates reusable skill bodies, role-specific work surfaces, and hidden oracle assets to measure whether skill refinements transfer across tasks, roles, and model backbones. Includes 382 workplace tasks, 22 skills, and a controlled evaluation protocol.
Provides GGUF-quantized weights and runtime assets for running the Qwythos-9B reasoning LLM locally via llama.cpp and compatible runtimes. Key features include 1,048,576-token YaRN long-context, native function-calling, multimodal image input (requires mmproj), and multiple quantization/MTP variants tuned for different size/quality tradeoffs.