Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models

Related