Provides a GGUF-quantized build of NVIDIA's Nemotron 3 Nano Omni 30B (Reasoning) for local inference — enables multimodal (video/audio/image/text) reasoning, transcription, and document understanding on compatible runtimes such as llama.cpp, Ollama, vLLM, and TensorRT-LLM.