Publications

(2023). BLESS: Benchmarking Large Language Models on Sentence Simplification. EMNLP 2023.

(2023). Comparing Generic and Expert Models for Genre-Specific Text Simplification. TSAR 2023.

(2023). An Empirical Comparison of LM-based Question and Answer Generation Methods. Findings of ACL 2023.

(2023). A Practical Toolkit for Multilingual Question and Answer Generation. ACL 2023: System Demonstrations.

(2022). Improving Embeddings Representations for Comparing Higher Education Curricula: A Use Case in Computing. EMNLP 2022.

(2022). Generative Language Models for Paragraph-Level Question Generation. EMNLP 2022.

(2022). A Benchmark for Neural Readability Assessment of Texts in Spanish. TSAR 2022.

(2022). Neural Readability Pairwise Ranking for Sentences in Italian Administrative Language. AACL-IJCNLP 2022.

(2022). PeruSIL: A Framework to Build a Continuous Peruvian Sign Language Interpretation Dataset. SignLang 2022.

(2021). The (Un)Suitability of Automatic Evaluation Metrics for Text Simplification. Computational Linguistics.

(2021). deepQuest-py: Large and Distilled Models for Quality Estimation. EMNLP 2021: System Demonstrations.

(2021). Validating Quality Estimation in a Computer-Aided Translation Workflow: Speed, Cost and Quality Trade-off. MT Summit XVIII: Users and Providers Track.

(2021). Knowledge Distillation for Quality Estimation. Findings of ACL-IJCNLP 2021.

(2021). Controllable Text Simplification with Explicit Paraphrasing. NAACL 2021.

(2020). ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations. ACL 2020.

(2020). Data-Driven Sentence Simplification: Survey and Benchmark. Computational Linguistics.

(2019). EASSE: Easier Automatic Sentence Simplification Evaluation. EMNLP-IJCNLP 2019: System Demonstrations.

(2019). Cross-Sentence Transformations in Text Simplification. WiNLP 2019.

(2019). Strong Baselines for Complex Word Identification across Multiple Languages. NAACL 2019.

(2017). MASSAlign: Alignment and Annotation of Comparable Documents. IJCNLP 2017: System Demonstrations.

(2017). Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs. IJCNLP 2017.

(2016). Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish. LREC 2016.

(2016). SciEsp: Structural Analysis of Abstracts Written in Spanish. Computación y Sistemas.

(2012). Towards Semi-supervised Brazilian Portuguese Semantic Role Labeling: Building a Benchmark. PROPOR 2012.

(2012). Semantic Role Labeling for Brazilian Portuguese: A Benchmark. IBERAMIA 2012.

(2011). Comparação de Grades Curriculares de Cursos de Computação Baseada em Agrupamento Hierárquico de Textos [Comparison of Computing Course Curricula Based on Hierarchical Text Clustering]. WEI 2011.
