LLaVA is an open-source large language and vision assistant project that introduces visual instruction tuning to teach large language models to understand and follow multimodal (image + text) instructions. The repository provides the papers, model checkpoints (Model Zoo), training and evaluation scripts, Gradio demos, and tooling for fine-tuning, quantized inference, and deployment. LLaVA aims to bring LLM-level conversational capability to vision tasks and has continued to evolve through LLaVA-1.5, LLaVA-NeXT, and video/interactive variants.
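
To give a concrete sense of how the released checkpoints are consumed, the sketch below runs 4-bit quantized inference through the Hugging Face Transformers port of LLaVA-1.5 (the `llava-hf/llava-1.5-7b-hf` checkpoint) rather than the repository's own CLI or Gradio demo; the model ID, prompt template, and image path are illustrative assumptions, not prescribed by the project.

```python
# Minimal sketch: 4-bit quantized LLaVA-1.5 inference via the Hugging Face
# Transformers port (an assumption; the repo also ships its own CLI and serving stack).
import torch
from PIL import Image
from transformers import AutoProcessor, BitsAndBytesConfig, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # assumed Transformers port of LLaVA-1.5-7B

# Quantize weights to 4-bit at load time so the 7B model fits on a consumer GPU.
quant_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)

model = LlavaForConditionalGeneration.from_pretrained(
    model_id, quantization_config=quant_config, device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

# LLaVA-1.5 conversation format: the <image> token marks where visual features are inserted.
prompt = "USER: <image>\nDescribe this image in one sentence. ASSISTANT:"
image = Image.open("example.jpg")  # placeholder path

inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=128)
print(processor.decode(output_ids[0], skip_special_tokens=True))
```

The same checkpoint can also be served through the repository's own demo and evaluation scripts; the snippet above is only one convenient path to quantized inference.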