TY - GEN
T1 - Towards Text Simplification in Spanish
T2 - 4th IEEE International Conference on BioInspired Processing, BIP 2022
AU - Romero, Mario
AU - Calderon-Ramirez, Saul
AU - Solis, Martin
AU - Perez-Rojas, Nelson
AU - Chacon-Rivas, Mario
AU - Saggion, Horacio
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Text simplification refers to the transformation of a specific source text into a target text aiming to increase understanding and readability for one or more specific audiences. This task demands large human efforts and specialized knowledge, which makes the usage of automated or semi-automated computational approaches appealing. The rise of deep learning as an unifying paradigm between seemingly different fields as image analysis, sound processing and natural language processing has considerably influenced the current state of the art approaches for automatic text simplification. Therefore, in this work, we focus on the study of deep learning based state of the art methods for automatic text simplification in the Spanish language. For this end, we first disentangle the different tasks which can be addressed in order to yield a simplified text in general. Later we review the latest deep learning-based approaches, along with the main datasets and performance metrics used in the field. We also describe approaches to deal with small datasets and technical words. Finally, we describe some lessons to build accurate automatic text simplification systems in Spanish, as in this language there is a noticeable shortage of work for text simplification.
AB - Text simplification refers to the transformation of a specific source text into a target text aiming to increase understanding and readability for one or more specific audiences. This task demands large human efforts and specialized knowledge, which makes the usage of automated or semi-automated computational approaches appealing. The rise of deep learning as an unifying paradigm between seemingly different fields as image analysis, sound processing and natural language processing has considerably influenced the current state of the art approaches for automatic text simplification. Therefore, in this work, we focus on the study of deep learning based state of the art methods for automatic text simplification in the Spanish language. For this end, we first disentangle the different tasks which can be addressed in order to yield a simplified text in general. Later we review the latest deep learning-based approaches, along with the main datasets and performance metrics used in the field. We also describe approaches to deal with small datasets and technical words. Finally, we describe some lessons to build accurate automatic text simplification systems in Spanish, as in this language there is a noticeable shortage of work for text simplification.
KW - deep learning
KW - literature review
KW - machine learning
KW - text simplification
UR - http://www.scopus.com/inward/record.url?scp=85148476856&partnerID=8YFLogxK
U2 - 10.1109/BIP56202.2022.10032482
DO - 10.1109/BIP56202.2022.10032482
M3 - Contribución a la conferencia
AN - SCOPUS:85148476856
T3 - 2022 IEEE 4th International Conference on BioInspired Processing, BIP 2022
BT - 2022 IEEE 4th International Conference on BioInspired Processing, BIP 2022
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 November 2022 through 17 November 2022
ER -