Provides a GGUF-quantized local build of Ornith-1.0's 9B dense model for offline inference and terminal-focused coding agents. Supports OpenAI-compatible tool-calling, a 256K context window, and runs via llama.cpp or Ollama on a single high-memory GPU.