Unifies video, audio, image and text understanding for enterprise Q&A, summarization, transcription and document intelligence. The NVFP4 quantized variant reduces footprint to ~20.9GB for more efficient single‑GPU deployment and is tuned for NVIDIA runtimes (vLLM, TensorRT).