📚 Publications
(2025). Analysing Zero-Shot Readability-Controlled Sentence Simplification. COLING 2025.
(2024). The BEA 2024 Shared Task on the Multilingual Lexical Simplification Pipeline. BEA 2024.
(2024). An Extensible Massively Multilingual Lexical Simplification Pipeline Dataset using the MultiLS Framework. LREC-COLING 2024.
(2023). BLESS: Benchmarking Large Language Models on Sentence Simplification. EMNLP 2023.
(2023). Comparing Generic and Expert Models for Genre-Specific Text Simplification. TSAR 2023.
(2023). A Practical Toolkit for Multilingual Question and Answer Generation. ACL 2023: System Demonstrations.
(2023). An Empirical Comparison of LM-based Question and Answer Generation Methods. Findings of ACL 2023.
(2022). Generative Language Models for Paragraph-Level Question Generation. EMNLP 2022.
(2022). Improving Embeddings Representations for Comparing Higher Education Curricula: A Use Case in Computing. EMNLP 2022.
(2022). A Benchmark for Neural Readability Assessment of Texts in Spanish. TSAR 2022.
(2022). Neural Readability Pairwise Ranking for Sentences in Italian Administrative Language. AACL-IJCNLP 2022.
(2022). PeruSIL: A Framework to Build a Continuous Peruvian Sign Language Interpretation Dataset. SignLang 2022.
(2021). deepQuest-py: Large and Distilled Models for Quality Estimation. EMNLP 2021: System Demonstrations.
(2021). Validating Quality Estimation in a Computer-Aided Translation Workflow: Speed, Cost and Quality Trade-off. MT Summit XVIII: Users and Providers Track.
(2021). Knowledge Distillation for Quality Estimation. Findings of ACL-IJCNLP 2021.
(2020). ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations. ACL 2020.
(2020). Data-Driven Sentence Simplification: Survey and Benchmark. Computational Linguistics.
(2019). EASSE: Easier Automatic Sentence Simplification Evaluation. EMNLP-IJCNLP 2019: System Demonstrations.
(2019). Cross-Sentence Transformations in Text Simplification. WiNLP 2019.
(2019). Strong Baselines for Complex Word Identification across Multiple Languages. NAACL 2019.
(2017). Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs. IJCNLP 2017.
(2017). MASSAlign: Alignment and Annotation of Comparable Documents. IJCNLP 2017: System Demonstrations.
(2016). Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish. LREC 2016.
(2016). SciEsp: Structural Analysis of Abstracts Written in Spanish. Computación y Sistemas.
(2012). Semantic Role Labeling for Brazilian Portuguese: A Benchmark. IBERAMIA 2012.
(2012). Towards Semi-supervised Brazilian Portuguese Semantic Role Labeling: Building a Benchmark. PROPOR 2012.
(2011). Comparação de Grades Curriculares de Cursos de Computação Baseada em Agrupamento Hierárquico de Textos. WEI 2011.