Provides 4,659 agentic single-turn SFT training pairs extracted from Claude Fable‑5, stored as a single-column Parquet file for Qwen-style fine-tuning. Samples include explicit chain-of-thought (<think>) blocks and XML-serialized <tool_use> calls, with PII redacted. Licensed under AGPL-3.0.
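As a rough illustration of working with samples in this shape, the sketch below separates the <think> and <tool_use> spans from the remaining text of one sample. It assumes each sample is a plain string with non-nested tags. The sample text, the tag contents (`<name>`, `<path>`), and the helper name `split_sample` are hypothetical and are not taken from the dataset.

```python
import re

# Non-greedy, DOTALL so multi-line blocks are captured whole.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)
TOOL_RE = re.compile(r"<tool_use>(.*?)</tool_use>", re.DOTALL)


def split_sample(text):
    """Split one sample into reasoning blocks, tool calls, and visible text.

    Assumes tags are well-formed and not nested (an assumption, not a
    documented property of the dataset).
    """
    thoughts = [m.strip() for m in THINK_RE.findall(text)]
    tool_calls = [m.strip() for m in TOOL_RE.findall(text)]
    # Whatever remains after removing both tag types is the visible reply.
    visible = TOOL_RE.sub("", THINK_RE.sub("", text)).strip()
    return {"thoughts": thoughts, "tool_calls": tool_calls, "visible": visible}


# Hypothetical sample; the inner XML layout of the tool call is invented.
sample = (
    "<think>The user wants the file list.</think>\n"
    "<tool_use><name>list_files</name><path>.</path></tool_use>\n"
    "Listing the directory now."
)
parts = split_sample(sample)
```

The tool-call payload is kept as an opaque string here, since its actual schema is not described above; a real pipeline would parse it only after confirming the format from the data.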
Provides 1.8M synthetic Belgian personas (1.2M records; 300k per language) in Dutch, French, German, and English, grounded in Belgian census distributions to improve representativeness for LLM training and evaluation. Includes 23 persona and contextual fields. Produced with NeMo Data Designer and released under CC BY 4.0.