Distributed KV-cache store & transfer engine that decouples prefilling from decoding to scale vLLM serving clusters.
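The core idea behind this kind of disaggregation is that prefill workers publish the KV cache they compute for a prompt to a shared store, and decode workers pull it back to continue generation without redoing the prompt pass. Below is a minimal conceptual sketch of that handoff; the `KVCacheStore`, worker classes, and cache-key scheme are illustrative assumptions, not the project's actual API.

```python
# Conceptual sketch of prefill/decode disaggregation via a shared KV-cache store.
# All class and method names here are illustrative, not the project's real API.
from __future__ import annotations

import hashlib
from dataclasses import dataclass

import numpy as np


@dataclass
class KVBlock:
    keys: np.ndarray    # [layers, tokens, heads, head_dim]
    values: np.ndarray  # same shape as keys


class KVCacheStore:
    """Distributed KV-cache store, modeled here as a plain in-process dict."""

    def __init__(self):
        self._blocks: dict[str, KVBlock] = {}

    @staticmethod
    def cache_key(prompt_tokens: list[int]) -> str:
        # Key on a hash of the prompt so identical prefixes can be reused.
        return hashlib.sha256(str(prompt_tokens).encode()).hexdigest()

    def put(self, key: str, block: KVBlock) -> None:
        self._blocks[key] = block

    def get(self, key: str) -> KVBlock | None:
        return self._blocks.get(key)


class PrefillWorker:
    """Runs the compute-heavy prompt pass and publishes the resulting KV cache."""

    def __init__(self, store: KVCacheStore, layers=2, heads=4, head_dim=8):
        self.store = store
        self.shape = (layers, heads, head_dim)

    def prefill(self, prompt_tokens: list[int]) -> str:
        layers, heads, head_dim = self.shape
        n = len(prompt_tokens)
        # Stand-in for the real attention computation over the prompt.
        keys = np.random.randn(layers, n, heads, head_dim)
        values = np.random.randn(layers, n, heads, head_dim)
        key = self.store.cache_key(prompt_tokens)
        self.store.put(key, KVBlock(keys, values))
        return key  # handle passed to the decode side


class DecodeWorker:
    """Fetches the prefilled KV cache and continues token-by-token decoding."""

    def __init__(self, store: KVCacheStore):
        self.store = store

    def decode(self, cache_key: str, max_new_tokens: int = 4) -> list[int]:
        block = self.store.get(cache_key)
        if block is None:
            raise KeyError("KV cache not found; prefill must run first")
        out = []
        for _ in range(max_new_tokens):
            # Stand-in for one decode step attending over the cached keys/values.
            score = block.keys.mean() + np.random.randn()
            out.append(int(abs(score) * 1000) % 32000)
        return out


if __name__ == "__main__":
    store = KVCacheStore()
    handle = PrefillWorker(store).prefill(prompt_tokens=[101, 2023, 2003, 102])
    print(DecodeWorker(store).decode(handle))
```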
Volcano Engine Reinforcement Learning library for efficient LLM post-training; it is the open-source implementation of the HybridFlow framework.
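RL post-training of this kind boils down to a loop of generating rollouts with the current policy, scoring them with a reward signal, and applying a policy-gradient update. The sketch below shows that loop as bare-bones REINFORCE over a toy policy; it assumes PyTorch, and none of the names correspond to verl's or HybridFlow's real APIs.

```python
# Bare-bones REINFORCE-style post-training loop on a toy policy.
# Purely illustrative: none of these classes correspond to verl/HybridFlow APIs.
import torch
import torch.nn as nn

VOCAB, SEQ_LEN = 16, 8


class ToyPolicy(nn.Module):
    """Stand-in for an LLM: maps a position embedding to next-token logits."""

    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(SEQ_LEN, 32)
        self.head = nn.Linear(32, VOCAB)

    def forward(self, positions):
        return self.head(self.emb(positions))  # [batch, seq, vocab]


def reward_fn(tokens: torch.Tensor) -> torch.Tensor:
    # Toy reward: prefer sequences with many even token ids.
    return (tokens % 2 == 0).float().mean(dim=-1)


def train(steps=50, batch=32, lr=1e-2):
    policy = ToyPolicy()
    opt = torch.optim.Adam(policy.parameters(), lr=lr)
    positions = torch.arange(SEQ_LEN).expand(batch, SEQ_LEN)

    for step in range(steps):
        # 1) Rollout: sample token sequences from the current policy.
        logits = policy(positions)
        dist = torch.distributions.Categorical(logits=logits)
        tokens = dist.sample()                      # [batch, seq]
        logprobs = dist.log_prob(tokens).sum(-1)    # [batch]

        # 2) Score rollouts with the reward signal (reward model / verifier).
        rewards = reward_fn(tokens)

        # 3) Policy-gradient update with a mean-reward baseline.
        advantage = rewards - rewards.mean()
        loss = -(advantage.detach() * logprobs).mean()

        opt.zero_grad()
        loss.backward()
        opt.step()

        if step % 10 == 0:
            print(f"step {step}: mean reward {rewards.mean():.3f}")


if __name__ == "__main__":
    train()
```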
RAGFlow is InfiniFlow’s open-source Retrieval-Augmented Generation (RAG) engine focused on deep document understanding and scalable multi-format ingestion.
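A RAG engine of this kind reduces to ingesting documents as chunks, embedding them into an index, retrieving the chunks most similar to a query, and grounding the prompt in them. The sketch below walks that pipeline with a toy bag-of-words embedding; it is a generic illustration, not RAGFlow's ingestion or retrieval API.

```python
# Generic RAG pipeline sketch: chunk -> embed -> index -> retrieve -> prompt.
# The bag-of-words "embedding" is a deliberate stand-in for a real embedding
# model; nothing here mirrors RAGFlow's actual interfaces.
import math
import re
from collections import Counter


def chunk(text: str, max_words: int = 40) -> list[str]:
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text: str) -> Counter:
    # Toy embedding: term-frequency vector over lowercase word tokens.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


class VectorIndex:
    def __init__(self):
        self.entries = []  # list of (embedding, chunk_text) pairs

    def ingest(self, document: str) -> None:
        for c in chunk(document):
            self.entries.append((embed(c), c))

    def retrieve(self, query: str, k: int = 2) -> list[str]:
        q = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[0]), reverse=True)
        return [text for _, text in ranked[:k]]


if __name__ == "__main__":
    index = VectorIndex()
    index.ingest("The prefill phase processes the prompt and builds the KV cache. "
                 "The decode phase generates one token at a time.")
    index.ingest("Retrieval-augmented generation grounds model answers in retrieved documents.")

    question = "What does the decode phase do?"
    context = "\n".join(index.retrieve(question))
    print(f"Answer using the context.\n\nContext:\n{context}\n\nQuestion: {question}")
```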
The vLLM project’s control plane for orchestrating cost-efficient, plug-and-play LLM inference infrastructure.
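A control plane like this largely comes down to keeping a registry of engine replicas and routing each incoming request to a suitable backend. The sketch below uses a least-outstanding-requests policy; the replica registry, router, and policy are assumptions made for illustration, not the project's actual components.

```python
# Illustrative control-plane routing sketch: pick the engine replica with the
# fewest in-flight requests. Names and policy are assumptions, not a real API.
from dataclasses import dataclass, field


@dataclass
class EngineReplica:
    url: str
    model: str
    in_flight: int = 0


@dataclass
class Router:
    replicas: list = field(default_factory=list)

    def register(self, replica: EngineReplica) -> None:
        self.replicas.append(replica)

    def route(self, model: str) -> EngineReplica:
        candidates = [r for r in self.replicas if r.model == model]
        if not candidates:
            raise LookupError(f"no replica serves model {model!r}")
        # Least-outstanding-requests policy.
        chosen = min(candidates, key=lambda r: r.in_flight)
        chosen.in_flight += 1
        return chosen

    def complete(self, replica: EngineReplica) -> None:
        replica.in_flight -= 1


if __name__ == "__main__":
    router = Router()
    router.register(EngineReplica("http://10.0.0.1:8000", "llama-3-8b"))
    router.register(EngineReplica("http://10.0.0.2:8000", "llama-3-8b"))

    first = router.route("llama-3-8b")
    second = router.route("llama-3-8b")   # goes to the other, less-loaded replica
    print(first.url, second.url)
    router.complete(first)
```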
NVIDIA Dynamo is an open-source, high-throughput, low-latency inference framework that scales generative-AI and reasoning models across large, multi-node GPU clusters.
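One common way a single model is scaled across many GPUs is to shard its weight matrices so that each device computes a slice of the output. The sketch below simulates column-parallel sharding of one linear layer, with NumPy arrays standing in for per-device tensors; it illustrates the general idea only and is not Dynamo's implementation.

```python
# Column-parallel linear layer sketch: each "GPU" holds a slice of the weight
# matrix, computes its partial output, and the partials are concatenated.
# NumPy arrays stand in for per-device tensors; this is not Dynamo's code.
import numpy as np


def column_parallel_linear(x: np.ndarray, weight: np.ndarray, world_size: int) -> np.ndarray:
    """x: [batch, d_in]; weight: [d_in, d_out], split column-wise across devices."""
    shards = np.array_split(weight, world_size, axis=1)    # one slice per "GPU"
    partial_outputs = [x @ w_shard for w_shard in shards]  # local matmuls
    return np.concatenate(partial_outputs, axis=1)         # gather along features


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 64))
    weight = rng.standard_normal((64, 256))

    sharded = column_parallel_linear(x, weight, world_size=8)
    reference = x @ weight
    print("max error vs single-device matmul:", np.abs(sharded - reference).max())
```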