AIAny - AI Train

AI Image2024

MiniMind-V

Trains a 65M-parameter vision-language model from scratch in ~2 hours on one RTX 3090, about 3 RMB (~$0.40) of GPU rental. Connects a frozen SigLIP2 encoder to a small MiniMind LLM via a two-layer MLP projector; full PyTorch code for pretraining and SFT.

vision pytorch github llm ai-train+2

AI Infra2024

cuTile Python

NVIDIA CORPORATIONNVIDIA

Lets Python developers write tile-based parallel kernels for NVIDIA GPUs, generating CUDA Tile IR while staying close to Python syntax for custom GPU operations.

nvidia ai-development ai-library ai-train

AI Train2024

DataFlow

OpenDCAIPeking University

Parses, generates, and filters training data from noisy sources like PDFs and weak QA, then feeds it into LLM pre-training, SFT, RL, or RAG cleaning. Ships 100+ operators and ready-made pipelines for text, reasoning, Text2SQL, and agentic data.

github mlops ai-development ai-library python+3

AI Train2024

verl: Volcano Engine Reinforcement Learning for LLMs

ByteDance Seed Team, Volcengine +1ByteDance Seed, The University of Hong Kong +1

Open-source HybridFlow implementation for RL post-training of LLMs. Decouples control flow from compute so PPO, GRPO, GSPO and DAPO share one dataflow; pairs FSDP/Megatron with vLLM/SGLang rollout and reports 1.5-20x throughput over prior RLHF stacks.

RL LLM vllm pytorch huggingface+3

AI Train2024

Protenix

ByteDance AI4Science (AML) Team

High-accuracy biomolecular structure prediction suite: open-source models (protenix-v2/v1), a benchmark/evaluation toolkit, and a web server for inference. Targets protein/antibody–antigen and ligand-aware predictions with inference-time sampling and constraint support.

bytedance github foundation-model genomics drug-discovery+2

AI Train2024

Boltz

Saro Passaro, Gabriele Corso +10MIT Jameel Clinic, Recursion

Predicts 3D structures of proteins, nucleic acids, and small-molecule complexes, the first fully open-source model to approach AlphaFold3 accuracy. Boltz-2 adds binding-affinity prediction that nears FEP simulation accuracy at ~1000x the speed.

foundation-model github science ai-train ai-inference+1

AI Train2024

DeepSeek-V3

DeepSeek-AI

A 671B-parameter Mixture-of-Experts language model (37B activated) trained on 14.8T tokens with 128K context, FP8-first training, a Multi-Token Prediction module, and Hugging Face weights—focused on efficient MoE training and long-context use cases.

deepseek foundation-model LLM huggingface vllm+5

Large Language Model Tutorials2025

Train LLM From Scratch

Fareed Khan

Provides end-to-end PyTorch scripts to download/prepare data, implement a transformer from scratch, train LLMs (13M→billion-scale) and generate text. Emphasizes educational clarity and single‑GPU experiments; useful for researchers or hobbyists, but large-scale training still requires substantial compute and engineering.

pytorch LLM python nlp ai-train+3

AI Train2025

Verifiers: Environments for LLM Reinforcement Learning

William Brown (willccbb), Prime IntellectPrime Intellect

Bundles a dataset, an interaction harness, and rubric-based reward functions into one RL environment for training and evaluating LLMs — also usable as an eval, synthetic-data pipeline, or agent harness for any OpenAI-compatible endpoint.

RL LLM ai-library python opencode+3

AI Train2025

PRIME-RL

Prime Intellect

An asynchronous, high-throughput framework for large-scale reinforcement learning and agentic training that scales to 1T+ MoE models and 1000+ GPUs, with native verifiers integration, end-to-end SFT/RL/evals, and Slurm/Kubernetes deployment; requires NVIDIA GPUs.

RL agent-skills mLOps ai-train pytorch+3

AI Train2025

AReaL

inclusionAI (AReaL Team), Institute for Interdisciplinary Information Sciences, Tsinghua University +1Ant Group, Institute for Interdisciplinary Information Sciences, Tsinghua University

Trains LLM reasoning and agentic models with fully asynchronous reinforcement learning, decoupling rollout generation from policy updates for a 2.77x speedup over synchronous RL. Covers GRPO, PPO and DAPO across Megatron, FSDP, vLLM and SGLang backends.

RL LLM ai-train ai-agent pytorch+4

Embodied AI2025

NVIDIA Isaac GR00T

NVIDIA

A vision-language-action foundation model and reference stack for generalized humanoid and cross-embodiment robot manipulation. Provides pretrained checkpoints, demo datasets, and tooling for fine-tuning, evaluation, and deployment (ONNX/TensorRT); released as Early Access.

nvidia robotics foundation-model vision pytorch+5

Category

Explore by categories

All Categories

AI Leaderboard

AI Agent Tutorials

AI Coding Tutorials

AI Model

AI Agent Papers

Chatbot

AI Dataset

Machine Learning Foundation Books

AI Train

AI Deploy

AI Client

Machine Learning Foundation Papers

Machine Learning Foundation Tutorials

AI Image Demos

AI Agent

Large Language Model Tutorials

Large Language Model Papers

Machine Learning Engineering Papers

Computer Vision Tutorials

Computer Vision Papers

Natural Language Processing Papers

Reinforcement Learning Papers

Speech Technology Papers

AI API

AI Coding

AI Image

AI Video

MLOps

MCP Client

MCP Server

AI Video Papers

AI Audio

AI Others

AI Infra

Embodied AI

MiniMind-V

cuTile Python

DataFlow

verl: Volcano Engine Reinforcement Learning for LLMs

Protenix

Boltz

DeepSeek-V3

Train LLM From Scratch

Verifiers: Environments for LLM Reinforcement Learning

PRIME-RL

AReaL

NVIDIA Isaac GR00T