ONNX Runtime

Microsoft’s high-performance, cross-platform inference engine for ONNX and GenAI models.

Visit Website

Introduction

Overview

ONNX Runtime accelerates >20 frameworks via graph optimizers and hardware EPs (CUDA, DNNL, DirectML, ROCm, CoreML).

Key Capabilities

Training & inference APIs (Python/C#/C++)
ORT-GenAI with flash-attn kernels
Mobile AOT & WebAssembly targets

Back

Information

Websiteonnxruntime.ai
AuthorsMicrosoft
Published date2018/12/04

More Items

ms-swift (SWIFT: Scalable lightWeight Infrastructure for Fine-Tuning)

2023

ModelScope community, Yuze Zhao +11

ms-swift (SWIFT) is an extensible, lightweight infrastructure from the ModelScope community for fine-tuning, evaluating, quantizing and deploying large language models (LLMs) and multimodal LLMs. It supports hundreds of text and multimodal models, many low-cost fine-tuning and quantized training techniques, Megatron-style model parallelism, RL/GRPO family algorithms for alignment, and multiple inference/deployment backends such as vLLM and LMDeploy. ms-swift provides CLI, Python APIs and a Web UI for end-to-end model workflows.

llm ai-train ai-inference ai-serving github+3

MiroThinker

2025

MiroMindAI, MiroMind Team

MiroThinker is an open-source search agent and research-agent project from MiroMind that advances tool-augmented reasoning and information-seeking. It includes agent models, an agent framework (MiroFlow), a training dataset (MiroVerse), and training/serving tools designed for long-context, multi-step tool usage and benchmark-driven research.

ai-agent ai-framework mcp-server mcp-client github+3

OM1 (OpenMind)

2025

OpenMind

OM1 is a modular AI runtime by OpenMind for building and deploying multimodal AI agents across digital environments and physical robots. Written in Python, OM1 ingests diverse sensor and web inputs, integrates multiple LLMs and VLMs, provides TTS/ASR endpoints, and connects to robot hardware via plugins (ROS2, Zenoh, CycloneDDS). It includes a web-based visual debugger (WebSim), example agents, documentation, and a technical paper, enabling developers to create configurable, upgradeable robot agents for humanoids, quadrupeds, mobile apps, and educational robots.

ai-agent ai-development mlops ai-framework ai-inference+2

ONNX Runtime

Introduction

Overview

Key Capabilities

Information

Categories

Tags

More Items

ms-swift (SWIFT: Scalable lightWeight Infrastructure for Fine-Tuning)

MiroThinker

OM1 (OpenMind)