Provides 4,659 agentic single-turn SFT training pairs extracted from Claude Fable‑5, formatted as a single-column parquet for Qwen-style fine-tuning. Includes explicit chain-of-thought (<think>) blocks, XML-serialized <tool_use> calls, PII redaction, and AGPL-3.0 licensing.
Provides 1.8M synthetic Belgian personas (1.2M records; 300k per language) in Dutch/French/German/English, grounded in Belgian census distributions to improve representativeness for LLM training and evaluation. Includes 23 persona and contextual fields, CC BY 4.0 license, produced with NeMo Data Designer.
Provides de-identified MEG and EEG recordings of 35 native Spanish speakers typing memorized sentences, with synchronized behavioral logs and standardized event tables. Includes raw .fif and BrainVision files plus MATLAB logs (≈262 GB total); released under CC BY-NC 4.0 for non-commercial research on brain-to-text decoding.
Benchmark for evaluating procedural skill evolution in LLM agents: isolates reusable skill bodies, role-specific work surfaces, and hidden oracle assets to measure whether skill refinements transfer across tasks, roles, and model backbones. Includes 382 workplace tasks, 22 skills, and a controlled evaluation protocol.