Toolkit from InternLM for compressing, quantizing and serving LLMs with INT4/INT8 kernels on GPUs.
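A minimal sketch of the offline-inference flow, assuming this entry refers to LMDeploy and its `pipeline` API; the model ID and prompt are placeholders.

```python
from lmdeploy import pipeline

# Loads the model and selects an inference backend; the model ID is a placeholder.
pipe = pipeline("internlm/internlm2_5-7b-chat")

responses = pipe(["Explain INT4 weight-only quantization in one sentence."])
print(responses[0].text)

# For a pre-quantized AWQ INT4 checkpoint, pass
# backend_config=TurbomindEngineConfig(model_format="awq") (per the LMDeploy docs).
```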
Open-source platform for building and operating AI-native apps with agentic workflows, RAG pipelines, model management and observability.
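Apps built on the platform are typically consumed over a small REST API; a hedged sketch, assuming this entry refers to Dify and its documented `chat-messages` endpoint (the URL and app key are placeholders for your deployment).

```python
import requests

API_KEY = "app-..."  # app-level key from the Dify console (placeholder)

resp = requests.post(
    "https://api.dify.ai/v1/chat-messages",  # or your self-hosted instance's base URL
    headers={"Authorization": f"Bearer {API_KEY}"},
    json={
        "inputs": {},                  # variables defined in the app's workflow
        "query": "What does our refund policy say?",
        "response_mode": "blocking",   # "streaming" returns server-sent events instead
        "user": "user-123",            # opaque end-user ID for analytics
    },
    timeout=60,
)
print(resp.json()["answer"])
```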
TypeScript toolkit from Vercel for building streaming, multi-provider AI applications across React, Next.js, Vue, Svelte, and more.
One API is a self-hosted key-management and distribution gateway that unifies OpenAI-style access to dozens of LLM providers, enabling centralized quota, billing and user management through a single binary or Docker image.
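Because the gateway speaks the OpenAI wire protocol, any OpenAI SDK can point at it unchanged; a sketch assuming a local deployment on One API's default port 3000, with a placeholder token and model.

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:3000/v1",  # your One API gateway (3000 is the default port)
    api_key="sk-...",                     # a token issued by One API, not a provider key
)

resp = client.chat.completions.create(
    model="gpt-4o-mini",  # One API routes this to whichever upstream channel serves it
    messages=[{"role": "user", "content": "ping"}],
)
print(resp.choices[0].message.content)
```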
Xorbits’ universal inference layer (published as `xinference`) that deploys and serves LLMs and multimodal models from laptop to cluster.
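Xinference also exposes an OpenAI-compatible endpoint, so standard client code works against a local supervisor; a sketch assuming the default port 9997 and a model already started with `xinference launch` (model name is a placeholder).

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:9997/v1",  # Xinference's default endpoint
    api_key="not-needed",                 # local deployments without auth ignore the key
)

resp = client.chat.completions.create(
    model="qwen2.5-instruct",  # placeholder: whichever model you launched
    messages=[{"role": "user", "content": "Hello"}],
)
print(resp.choices[0].message.content)
```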
LiteLLM is an open-source LLM gateway and Python SDK that lets developers call more than 100 commercial and open-source models through a single OpenAI-compatible interface, complete with cost tracking, rate-limiting, load-balancing and guardrails.
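The SDK's core call mirrors `openai.chat.completions.create` but takes a provider-prefixed model string; a minimal sketch (model names are examples only, with API keys supplied via environment variables).

```python
from litellm import completion

messages = [{"role": "user", "content": "Summarize LiteLLM in one line."}]

# The call shape is identical for every provider; only the model string changes.
resp = completion(model="gpt-4o-mini", messages=messages)
print(resp.choices[0].message.content)

# e.g. switch to Anthropic by swapping the model string:
# resp = completion(model="anthropic/claude-3-5-sonnet-20240620", messages=messages)
```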
NVIDIA’s open-source library that compiles Transformer blocks into highly optimized TensorRT engines for low-latency, high-throughput LLM inference on NVIDIA GPUs.
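Recent releases ship a high-level `LLM` API on top of the engine-building pipeline; a hedged sketch assuming that API (the model ID is a placeholder, and the first run compiles an engine before generating).

```python
from tensorrt_llm import LLM, SamplingParams

# First instantiation builds a TensorRT engine for the model (placeholder ID).
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")

outputs = llm.generate(
    ["The fastest way to serve a transformer is"],
    SamplingParams(max_tokens=32),
)
for out in outputs:
    print(out.outputs[0].text)
```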
CUDA kernel library that brings FlashAttention-style optimizations to any LLM serving stack.
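Assuming this entry refers to FlashInfer, its Python bindings expose single-request kernels directly; a sketch of one decode-attention step over a KV cache, with shapes chosen purely for illustration.

```python
import torch
import flashinfer

num_qo_heads, num_kv_heads, head_dim, kv_len = 32, 8, 128, 2048

# One decode step: a single query token attends over the whole KV cache (GQA layout).
q = torch.randn(num_qo_heads, head_dim, dtype=torch.float16, device="cuda")
k = torch.randn(kv_len, num_kv_heads, head_dim, dtype=torch.float16, device="cuda")
v = torch.randn(kv_len, num_kv_heads, head_dim, dtype=torch.float16, device="cuda")

out = flashinfer.single_decode_with_kv_cache(q, k, v)  # -> [num_qo_heads, head_dim]
print(out.shape)
```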
High-performance Python framework and platform for orchestrating collaborative agent “crews”.
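A minimal two-agent crew showing the shape of the API; the roles, goals, and tasks are illustrative, and an LLM API key is expected via environment variables.

```python
from crewai import Agent, Task, Crew

researcher = Agent(
    role="Researcher",
    goal="Collect key facts about a topic",
    backstory="A meticulous analyst.",
)
writer = Agent(
    role="Writer",
    goal="Turn research notes into a crisp summary",
    backstory="A concise technical writer.",
)

research = Task(
    description="List three facts about retrieval-augmented generation.",
    expected_output="Three bullet points.",
    agent=researcher,
)
summarize = Task(
    description="Write a two-sentence summary from the notes.",
    expected_output="A short paragraph.",
    agent=writer,
)

# Tasks run in order by default, each handled by its assigned agent.
crew = Crew(agents=[researcher, writer], tasks=[research, summarize])
print(crew.kickoff())
```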
FastGPT is an open-source AI knowledge-base platform that combines RAG retrieval, visual workflows and multi-model support to build domain-specific chatbots quickly.
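Published FastGPT apps can reportedly be called through an OpenAI-compatible endpoint keyed by a per-app credential; a heavily hedged sketch against a self-hosted instance (the URL, key, and nominal model field are assumptions drawn from its docs).

```python
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:3000/api/v1",  # placeholder self-hosted FastGPT URL
    api_key="fastgpt-...",                    # per-app key; it selects which app runs
)

resp = client.chat.completions.create(
    model="fastgpt",  # assumption: the app is chosen by the key, so this is nominal
    messages=[{"role": "user", "content": "What does the knowledge base say about pricing?"}],
)
print(resp.choices[0].message.content)
```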
Zero-code CLI & WebUI to fine-tune 100+ LLMs/VLMs with LoRA, QLoRA, PPO, DPO and more.
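Assuming this entry is LLaMA-Factory, training runs are driven by a YAML config handed to `llamafactory-cli`; a sketch that writes a minimal LoRA SFT config and launches it (model, dataset, and hyperparameters are placeholders modeled on the repo's example configs).

```python
import subprocess
import yaml

# Keys follow the example configs shipped in the LLaMA-Factory repo.
config = {
    "model_name_or_path": "meta-llama/Meta-Llama-3-8B-Instruct",  # placeholder
    "stage": "sft",
    "do_train": True,
    "finetuning_type": "lora",
    "lora_target": "all",
    "dataset": "alpaca_en_demo",   # demo dataset bundled with the repo
    "template": "llama3",
    "output_dir": "saves/llama3-8b-lora-sft",
    "per_device_train_batch_size": 1,
    "num_train_epochs": 1.0,
}
with open("lora_sft.yaml", "w") as f:
    yaml.safe_dump(config, f)

subprocess.run(["llamafactory-cli", "train", "lora_sft.yaml"], check=True)
```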
Microsoft Research approach that enriches RAG with knowledge-graph structure and community summaries.
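The reference implementation works from a workspace directory: an indexing pass extracts entities into a graph and writes community summaries, and queries then run in "global" (community-level) or "local" (entity-level) mode. A hedged sketch of that flow via the CLI; command names follow recent `graphrag` releases, so check your installed version.

```python
import subprocess

root = "./ragtest"  # workspace with settings.yaml and an input/ folder of documents
# (a prior `graphrag init --root ./ragtest` scaffolds settings.yaml)

# Build the knowledge graph and community summaries from the input corpus.
subprocess.run(["graphrag", "index", "--root", root], check=True)

# "Global" search reasons over community summaries; "local" drills into entities.
subprocess.run(
    ["graphrag", "query", "--root", root, "--method", "global",
     "--query", "What are the main themes across this corpus?"],
    check=True,
)
```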