MASSAlign: Alignment and Annotation of Comparable Documents
Nov 1, 2017·
,·
0 min read
Gustavo Paetzold

Fernando Alva-Manchego
Lucia Specia
Abstract
We introduce MASSAlign: a Python library for the alignment and annotation of monolingual comparable documents. MASSAlign offers easy-to-use access to state of the art algorithms for paragraph and sentence-level alignment, as well as novel algorithms for word-level annotation of transformation operations between aligned sentences. In addition, MASSAlign provides a visualization module to display and analyze the alignments and annotations performed.
Type
Publication
IJCNLP 2017: System Demonstrations