Logo Repositorio Institucional

Por favor, use este identificador para citar o enlazar este ítem: http://dspace.ucuenca.edu.ec/handle/123456789/34490
Título : A Comparative evaluation of preprocessing techniques for short texts in spanish
Otros títulos : A comparative evaluation of preprocessing techniques for short texts in spanish
Autor: Orellana Cordero, Marcos Patricio
Trujillo, Andrea
Cedillo Orellana, Irene Priscila
Correspondencia: Cedillo Orellana, Irene Priscila, priscila.cedillo@ucuenca.edu.ec
Palabras clave : Natural language processing
Preprocessing
Twitter
Sentiment analysis
Text mining
Area de conocimiento FRASCATI amplio: 5. Ciencias Sociales
Area de conocimiento FRASCATI detallado: 5.1.2 Psicología Especial(Terapia de Aprendizaje, Habla
Area de conocimiento FRASCATI específico: 5.1 Psicología y Ciencias Cognitivas
Area de conocimiento UNESCO amplio: 03 - Ciencias Sociales, Periodismo e Información
Area de conocimiento UNESCO detallado: 0313 - Psicología
Area de conocimiento UNESCO específico: 031 - Ciencias Sociales y Ciencias del Comportamiento
Fecha de publicación : 2020
Fecha de fin de embargo: 12-ene-2050
Volumen: Volumen 1130
Fuente: Advances in Intelligent Systems and Computing
metadata.dc.identifier.doi: 10.1007/978-3-030-39442-4_10
Editor: Springer
Ciudad: 
San Francisco
Tipo: ARTÍCULO DE CONFERENCIA
Abstract: 
Natural Language Processing (NLP) is used to identify key information, generating predictive models, and explaining global events or trends. Also, NLP is supported during the process to create knowledge. Therefore, it is important to apply refinement techniques in major stages such as preprocessing, when data is frequently produced and processed with poor results. This document analyzes and measures the impact of combinations of preprocessing techniques and libraries for short texts that have been written in Spanish. These techniques were applied in tweets for analysis of sentiments considering evaluation parameters in its analysis, the processing time and characteristics of the techniques for each library. The performed experimentation provides readers insights for choosing the appropriate combination of techniques during preprocessing. The results show improvement of up to 5% to 9% in the performance of the classification.
Resumen : 
Natural Language Processing (NLP) is used to identify key information, generating predictive models, and explaining global events or trends. Also, NLP is supported during the process to create knowledge. Therefore, it is important to apply refinement techniques in major stages such as preprocessing, when data is frequently produced and processed with poor results. This document analyzes and measures the impact of combinations of preprocessing techniques and libraries for short texts that have been written in Spanish. These techniques were applied in tweets for analysis of sentiments considering evaluation parameters in its analysis, the processing time and characteristics of the techniques for each library. The performed experimentation provides readers insights for choosing the appropriate combination of techniques during preprocessing. The results show improvement of up to 5% to 9% in the performance of the classification.
URI : https://link.springer.com/chapter/10.1007/978-3-030-39442-4_10
URI Fuente: https://link.springer.com/book/10.1007/978-3-030-39442-4
ISBN : 978-303039441-7
ISSN : 2194-5357
Aparece en las colecciones: Artículos

Ficheros en este ítem:
Fichero Tamaño Formato  
documento.pdf
  Until 2050-01-12
363.43 kBAdobe PDFVisualizar/Abrir     Solicitar una copia


Este ítem está protegido por copyright original



Los ítems de DSpace están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.

 

Centro de Documentacion Regional "Juan Bautista Vázquez"

Biblioteca Campus Central Biblioteca Campus Salud Biblioteca Campus Yanuncay
Av. 12 de Abril y Calle Agustín Cueva, Telf: 4051000 Ext. 1311, 1312, 1313, 1314. Horario de atención: Lunes-Viernes: 07H00-21H00. Sábados: 08H00-12H00 Av. El Paraíso 3-52, detrás del Hospital Regional "Vicente Corral Moscoso", Telf: 4051000 Ext. 3144. Horario de atención: Lunes-Viernes: 07H00-19H00 Av. 12 de Octubre y Diego de Tapia, antiguo Colegio Orientalista, Telf: 4051000 Ext. 3535 2810706 Ext. 116. Horario de atención: Lunes-Viernes: 07H30-19H00