Tag

Explore by tags

AI Dataset2026

Open-MM-RL

Multimodal STEM problem set for verifiable, answer-supervised training and RL: contains single-image, multi-panel, and multi-image PhD-level questions across physics, math, chemistry and biology. Each example has a deterministic ground-truth answer, enabling reward modeling and automated evaluation.

multimodal RL science physics math+5

AI Dataset2026

CUA-Gym

xlangai, CUA-Gym Team

Pairs natural-language instructions with executable setup artifacts and Python reward functions to create verifiable computer-use agent tasks. Provides a Parquet task table for fast filtering plus a compressed archive of runnable task bundles; several web task endpoints are placeholders that require a local CUA-Gym-Hub deployment.

huggingface RL agent-skills pandas python+1

AI Dataset2026

ITBench-AA

Artificial Analysis, IBM

Provides 40 public Kubernetes incident scenarios (SRE subset) with ground-truth root-cause entities and offline cluster snapshots in JSONL format; designed to evaluate agentic root-cause diagnosis on alerts, events, traces and topology.

evaluation huggingface agent-skills json pandas+4

AI Dataset2026

Jackrong/claude-opus-4.6-traceInversion-9000x

Jackrong

Provides 9,000 reconstructed chain-of-thought (CoT) SFT examples produced by trace inversion from Claude Opus 4.6 outputs for fine-tuning reasoning-capable LLMs. Multilingual, packaged as .jsonl.gz and SFT/DPO-ready; verify numeric/code cases before training.

huggingface llm nlp multilingual pandas+4

AI Dataset2026

tran-vi-teacher

ngocdang83

Parallel Chinese→Vietnamese dataset of webnovel (xianxia) text provided in JSON for NMT training and teacher-student distillation. In-domain, ~100K–1M examples with CC-BY-4.0 license — useful for fine-tuning or distillation experiments but limited by narrow genre and small download footprint.

translation huggingface nlp multilingual pandas+1

AI Dataset2026

Nemotron-Pretraining-Code-v3

NVIDIA Corporation

Metadata-only corpus of 146.3M new GitHub source-code files (commit_id, rel_path, language) intended as an incremental update to Nemotron v1/v2 for LLM code pretraining; CC-BY-4.0 licensed and designed to be used jointly with older versions.

nvidia huggingface code github llm+4

AI Dataset2026

ResearchMath-14k

amphora

A collection of 14,056 self-contained research-level mathematical problems extracted from papers and open-problem lists, each rewritten with taxonomy labels and open-status metadata for training or evaluating models on research-grade math reasoning.

paper huggingface pandas python science+1

AI Dataset2026

ClawHub Security Signals

OpenClaw

Provides a sanitized, MIT‑licensed dataset of scanner evidence and registry verdicts for public ClawHub agent skills — 67k+ latest skill versions with redacted artifacts and structured VirusTotal, static-analysis, and SkillSpector outputs to study scanner disagreement and agent-skill risk governance.

security agent-skills LLM huggingface pandas+2

AI Dataset2026

Open Spatial Reasoning (Driving 3D Spatial Reasoning)

Anurag Ganguli, Anshuman Lall +5

Evaluates metric 3D spatial reasoning from single driving images via multiple-choice questions that require reconstructing scene geometry rather than relying on image-layout shortcuts. Each sample pairs a numbered-bbox image with a question, four choices, and the correct answer; images come from PlusAI and the dataset is CC BY 4.0.

vision image huggingface pandas multimodal+1

AI Dataset2026

xlangai/osworld_v2_tasks

xlangai

Provides the gated, official OSWorld 2.0 Python task class files (task_*.py) required to run the benchmark; distributed via a Hugging Face gated dataset to reduce benchmark leakage. Download requires accepting gated access on Hugging Face.

huggingface evaluation agent-skills ai-agent json+2

AI Dataset2026

Neko_Audio-80K_Short

liumindmind

Around 80K short audio clips paired with transcripts in JSON, organized for easy loading with the Hugging Face datasets ecosystem—designed for short-form speech tasks (ASR, TTS, fine-tuning) and quick prototyping with common Python data tools.

audio speech ASR tts huggingface+3

AI Dataset2026

Nemotron-Personas-Vietnam

NVIDIA Corporation, FPT Smart Cloud +1

Provides 600,000 synthetic Vietnamese persona texts (100,000 records, 6 personas per record) aligned to Vietnam's 2024 census and surveys for training and evaluating NLP / text-generation models; includes 21 demographic and persona fields, CC BY 4.0, single train split.

huggingface nvidia nlp multilingual llm+1

Tag

Explore by tags

Tag

Explore by tags

All

30u30

ASR

ChatGPT

GNN

IDE

RAG

agent-skills

ai

ai-agent

ai-api

ai-api-management

ai-client

ai-coding

ai-demos

ai-deploy

ai-development

ai-framework

ai-image

ai-image-demos

ai-inference

ai-leaderboard

ai-library

ai-rank

ai-serving

ai-tools

ai-train

ai-video

ai-workflow

AIGC

algorithms

alibaba

amazon

android

anthropic

audio

aws

benchmark

biology

blog

book

bytedance

chatbot

chatgpt

chemistry

claude

claude-code

cli

code

codex

coding

copilot

course

cuda

cursor

deepmind

deepseek

depth

devops

diffusers

distillation

docker

drug-discovery

electron

embeddings

engineering

evaluation

facebook

finance

flow-matching

foundation

foundation-model

gcode

gemini

gemini-cli

gemma