Cross-Sentence Transformations in Text Simplification

August 1, 2019·
Fernando Alva-Manchego
Fernando Alva-Manchego
,
Carolina Scarton
,
Lucia Specia
· 0 min read
Abstract
Current approaches to Text Simplification focus on simplifying sentences individually. However, certain simplification transformations span beyond single sentences (e.g. joining and re-ordering sentences). In this paper, we motivate the need for modelling the simplification task at the document level, and assess the performance of sequence-to-sequence neural models in this setup. We analyse parallel original-simplified documents created by professional editors and show that there are frequent rewriting transformations that are not restricted to sentence boundaries. We also propose strategies to automatically evaluate the performance of a simplification model on these cross-sentence transformations. Our experiments show the inability of standard sequence-to-sequence neural models to learn these transformations, and suggest directions towards document-level simplification.
Type
Publication
WiNLP 2019
publication
Fernando Alva-Manchego
Authors
Researcher in Natural Language Processing
My research interests include text simplification, readability assessment, multilingual NLP, Welsh language technology, and NLP for education and social care.