AI Dataset2026

KSAFE-MM

Benchmark for evaluating multimodal LLM safety in Korean cultural contexts — includes KSAFE-MM-G which localizes global safety queries into Korean scenarios and KSAFE-MM-C which targets culture-specific visual-textual vulnerabilities. Provides curated image–text pairs and jailbreak-style prompts to reveal both unsafe behaviors and over-refusal.

Visit Website

Introduction

Most multimodal safety benchmarks are English-centric or focus on generic hazards; KSAFE-MM flips that assumption by centering Korean cultural and institutional contexts so evaluations reflect real-world local risks. The dataset stresses that model safety failures often arise not from raw toxicity but from missing local knowledge, culturally grounded cues, and visual-contextual interplay that enable bypasses or harmful outputs.

Key Findings

Two-part design: KSAFE-MM-G transforms globally shared safety queries into Korean-grounded multimodal samples; KSAFE-MM-C uses in-the-wild images and localized visual cues combined with jailbreak-style textual intents to probe culture-dependent vulnerabilities.
Reveals asymmetric failure modes: some models show high attack success rates on culturally tailored prompts while others exhibit excessive refusal on benign inputs — indicating a tradeoff between vulnerability and over-sensitivity.
Dataset construction emphasizes semantic alignment and privacy filtering: image–query pairs were selected from diverse web sources, de-duplicated, and filtered to avoid references to identifiable individuals or companies.

Who it's for and tradeoffs

Great fit if you evaluate multimodal LLM safety for non-English markets, build culturally robust moderation or alignment layers, or research localized attack vectors. Look elsewhere if you only need generic, English-only toxicity benchmarks or lightweight synthetic tests — KSAFE-MM is designed for contextual, in-the-wild evaluation and thus requires handling image hosting, language-specific annotation, and culturally informed judgment during interpretation.

Back

Information

Websitehuggingface.co
OrganizationsK-intelligence
Published date2026/06/11

More Items

Computer Vision Papers2026

CLBench-V: Evaluating Multimodal Context Learning from Grounding to Knowledge Acquisition

Lai Wei, Chengqi Li +4

Evaluates multimodal context learning across grounding, new information application, and knowledge acquisition using a 3,443-instance benchmark spanning science, finance, long documents, spatial reasoning, and web VQA; finds current multimodal models perform poorly (best score 0.2847) and analyzes failure modes.

multimodal benchmark vision evaluation paper+4

AI Dataset2026

HiFi-UMI-2K

Yuteng Wei, Jinming Ma +15Simple AI

Provides 2,000 hours of synchronized, high‑fidelity robot‑free bimanual manipulation demonstrations with multi‑view video, calibrated end‑effector trajectories, gripper states, and language annotations. Curated from a 20,000+ hour corpus; features 6 camera views, ~3 mm pose accuracy, <40 µs cross‑sensor sync, and LeRobot v3‑style Parquet+MP4 export under CC BY 4.0.

robotics video multimodal parquet huggingface+3

AI Dataset2026

Anthropic/BioMysteryBench-full

Anthropic, Hugging Face

A collection of biology-focused 'mystery' tasks for benchmarking model performance on biomedical reasoning, evidence synthesis, and problem solving; curated by Anthropic and hosted on Hugging Face, designed for granular evaluation of scientific decision-making.

anthropic huggingface evaluation benchmarks reasoning+1

KSAFE-MM

Introduction

Key Findings

Who it's for and tradeoffs

Information

Categories

Tags

More Items

CLBench-V: Evaluating Multimodal Context Learning from Grounding to Knowledge Acquisition

HiFi-UMI-2K

Anthropic/BioMysteryBench-full