Decoder-hybrid-decoder architecture for efficient reasoning with long generation

Publication
NeurIPS 2025
