Provides GGUF-quantized weights and runtime assets for running the Qwythos-9B reasoning LLM locally via llama.cpp and compatible runtimes. Key features include 1,048,576-token YaRN long-context, native function-calling, multimodal image input (requires mmproj), and multiple quantization/MTP variants tuned for different size/quality tradeoffs.