MASSAlign: Alignment and Annotation of Comparable Documents

Abstract

We introduce MASSAlign: a Python library for the alignment and annotation of monolingual comparable documents. MASSAlign offers easy-to-use access to state of the art algorithms for paragraph and sentence-level alignment, as well as novel algorithms for word-level annotation of transformation operations between aligned sentences. In addition, MASSAlign provides a visualization module to display and analyze the alignments and annotations performed.

Publication
IJCNLP 2017: System Demonstrations
Fernando Alva-Manchego
Fernando Alva-Manchego
Research Associate

My research interests include text simplification, readability assessment, evaluation of natural language generation, and writing assistance.