LogoAIAny
Icon for item

MLC-LLM

Universal LLM deployment engine that compiles models with TVM Unity for native execution across GPUs, CPUs, mobile and WebGPU.

Introduction

Overview

MLC-LLM turns any HuggingFace checkpoint into a highly-optimized library with int4/int8 kernels, delivering OpenAI-style APIs on desktop, mobile and browser.

Key Capabilities
  • Ahead-of-time compilation via TVM Unity
  • Unified engine with Metal, Vulkan, CUDA back-ends
  • REST, Python, JS, iOS & Android SDKs
  • WebLLM for client-side web inference

Information

  • Websitellm.mlc.ai
  • AuthorsMLC AI Lab
  • Published date2023/04/29

Categories

More Items