Fernando Alva-Manchego
Fernando Alva-Manchego
Home
Publications
Team
Talks
Teaching
Contact
Light
Dark
Automatic
1
A Practical Toolkit for Multilingual Question and Answer Generation
We introduce AutoQG, an online service for multilingual question and answer generation (QAG) along with lmqg, an all-in-one python package for model fine-tuning, generation, and evaluation. We also release QAG models in eight languages fine-tuned on a few variants of pre-trained encoder-decoder language models, which can be used online via AutoQG or locally via lmqg.
Asahi Ushio
,
Fernando Alva-Manchego
,
Jose Camacho-Collados
PDF
Cite
Code
DOI
Demo
ACL Anthology
An Empirical Comparison of LM-based Question and Answer Generation Methods
We establish baselines with three different question and answer generation methodologies (pipeline, multitask, end-to-end) that leverage sequence-to-sequence language model fine-tuning.
Asahi Ushio
,
Fernando Alva-Manchego
,
Jose Camacho-Collados
PDF
Cite
Code
DOI
ACL Anthology
Generative Language Models for Paragraph-Level Question Generation
We introduce QG-Bench, a multilingual and multidomain benchmark for question generation, which we use to evaluate the performance of robust baselines based on fine-tuning generative language models, as well as to assess the reliability of automatic metrics commonly-used used for the task.
Asahi Ushio
,
Fernando Alva-Manchego
,
Jose Camacho-Collados
PDF
Cite
Code
DOI
Demo
ACL Anthology
Improving Embeddings Representations for Comparing Higher Education Curricula: A Use Case in Computing
We propose an approach for obtaining representations of courses in a curriculum based on a novel course-guided attention mechanism and metric learning, and test it in a new dataset with curricula of computing programs from the USA and LATAM.
Jeffri Murrugarra-Llerena
,
Fernando Alva-Manchego
,
Nils Murrugarra-Llerena
PDF
Cite
Code
DOI
ACL Anthology
A Benchmark for Neural Readability Assessment of Texts in Spanish
We compile a new benchmark for automated readability assessments of texts in Spanish, and fine-tune pre-trained language models to perform the task at both sentence and paragraph levels.
Laura Vásquez-Rodríguez
,
Pedro-Manuel Cuenca-Jiménez
,
Sergio Esteban Morales-Esquivel
,
Fernando Alva-Manchego
PDF
Cite
Code
ACL Anthology
Neural Readability Pairwise Ranking for Sentences in Italian Administrative Language
We introduce Admin-It, a new dataset for sentence-level readability assessment of Italian administrative texts, and evaluate the performance of Neural Pairwise Ranking models in this new data.
Martina Miliani
,
Serena Auriemma
,
Fernando Alva-Manchego
,
Alessandro Lenci
PDF
Cite
Dataset
ACL Anthology
PeruSIL: A Framework to Build a Continuous Peruvian Sign Language Interpretation Dataset
We present a framework for creating a multi-modal Peruvian sign language interpretation dataset based on videos.
Gissella Bejarano
,
Joe Huamani-Malca
,
Francisco Cerna-Herrera
,
Fernando Alva-Manchego
,
Pablo Rivas
PDF
Cite
Code
Dataset
ACL Anthology
Simple TICO-19: A Dataset for Joint Translation and Simplification of COVID-19 Texts
We introduce Simple TICO-19, a new language resource containing manual simplifications of the English and Spanish portions of the TICO-19 corpus for Machine Translation of COVID-19 literature.
Matthew Shardlow
,
Fernando Alva-Manchego
PDF
Cite
Dataset
ACL Anthology
Towards Readability-Controlled Machine Translation of COVID-19 Texts
This project proposes to investigate the capabilities of machine translation models for generating translations at varying levels of readability, focusing on texts about COVID-19.
Fernando Alva-Manchego
,
Matthew Shardlow
PDF
Cite
Poster
ACL Anthology
deepQuest-py: Large and Distilled Models for Quality Estimation
We introduce deepQuest-py, a framework for training and evaluation of large and light-weight models for Quality Estimation
Fernando Alva-Manchego
,
Abiola Obamuyide
,
Amit Gajbhiye
,
Frédéric Blain
,
Marina Fomicheva
,
Lucia Specia
PDF
Cite
Code
Poster
Slides
DOI
ACL Anthology
»
Cite
×