Ray

Ray is an open-source distributed compute engine that lets you scale Python and AI workloads—from data processing to model training and serving—without deep distributed-systems expertise.

Introduction

Overview

Ray is an open-source AI compute engine that originated at UC Berkeley’s RISELab and is now developed by Anyscale. It provides a unified task- and actor-based runtime that scales from a single laptop to heterogeneous CPU/GPU clusters with thousands of GPUs. Developers write distributed applications in pure Python while Ray handles scheduling, failure recovery, and resource management under the hood.

Key Capabilities

Ray Core – task & actor primitives for parallel and distributed Python.
Ray Data – distributed data preprocessing pipeline for structured & unstructured data.
Ray Train – simple APIs to run distributed training for deep-learning frameworks.
Ray Tune – scalable hyper-parameter tuning with many built-in search algorithms.
Ray Serve – production-grade model-serving layer with autoscaling and fractional GPU sharing.
RLlib – high-performance reinforcement-learning library.

Together, these components unify data ingest, model training, hyper-parameter search, inference, and reinforcement learning on a single, elastic runtime, making Ray a full-stack solution for modern AI workloads.

Information

Website: www.ray.io
Authors: RISELab (UC Berkeley), Anyscale Inc.
Published date: 2017/09/30

More Items

ms-swift (SWIFT: Scalable lightWeight Infrastructure for Fine-Tuning)

2023

ModelScope community, Yuze Zhao +11

ms-swift (SWIFT) is an extensible, lightweight infrastructure from the ModelScope community for fine-tuning, evaluating, quantizing and deploying large language models (LLMs) and multimodal LLMs. It supports hundreds of text and multimodal models, many low-cost fine-tuning and quantized training techniques, Megatron-style model parallelism, RL/GRPO family algorithms for alignment, and multiple inference/deployment backends such as vLLM and LMDeploy. ms-swift provides CLI, Python APIs and a Web UI for end-to-end model workflows.

llm ai-train ai-inference ai-serving github (+3 more)

MLX LM

2025

ml-explore (GitHub organization)

MLX LM is a Python package to run, generate with, and fine-tune large language models on Apple Silicon using MLX. It integrates with the Hugging Face Hub, supports quantization and uploading of models, low-rank and full-model fine-tuning (including for quantized models), distributed inference and training, streaming generation, sampling/custom logits processors, prompt caching, and a convenient CLI and Python API.

llm huggingface github ai-library ai-inference (+4 more)

MiroThinker

2025

MiroMindAI, MiroMind Team

MiroThinker is an open-source search agent and research-agent project from MiroMind that advances tool-augmented reasoning and information-seeking. It includes agent models, an agent framework (MiroFlow), a training dataset (MiroVerse), and training/serving tools designed for long-context, multi-step tool usage and benchmark-driven research.

ai-agent ai-framework mcp-server mcp-client github (+3 more)
