LogoAIAny
Icon for item

Xinference

Xorbits’ universal inference layer (library name `xinference`) that deploys and serves LLMs and multimodal models from laptop to cluster.

Introduction

Overview

Xinference makes it one-command to host heterogeneous generative models behind REST/RPC endpoints compatible with OpenAI and LangChain.

Key Capabilities
  • One-click local or distributed launch
  • Built-in model registry with automatic download
  • Web UI & CLI with live monitoring
  • CPU/GPU adaptive scheduling and elastic scaling

Information

  • Websitegithub.com
  • AuthorsXprobe Inc.
  • Published date2023/07/11

Categories