Tag
Explore by tags
2026
Baidu Inc.
Youyang Yin, Huanhuan Liu +15
Performs one-shot, long-horizon OCR and document parsing by using Reference Sliding Window Attention (R-SWA) to keep the decoder KV cache constant, enabling single-pass multi-page transcription; code, model weights and an accompanying arXiv report are provided.
