Publication:
A Comparative evaluation of preprocessing techniques for short texts in spanish

dc.contributor.authorOrellana Cordero, Marcos Patricio
dc.contributor.authorTrujillo, Andrea
dc.contributor.authorCedillo Orellana, Irene Priscila
dc.date.accessioned2020-06-12T16:08:08Z
dc.date.available2020-06-12T16:08:08Z
dc.date.issued2020
dc.descriptionNatural Language Processing (NLP) is used to identify key information, generating predictive models, and explaining global events or trends. Also, NLP is supported during the process to create knowledge. Therefore, it is important to apply refinement techniques in major stages such as preprocessing, when data is frequently produced and processed with poor results. This document analyzes and measures the impact of combinations of preprocessing techniques and libraries for short texts that have been written in Spanish. These techniques were applied in tweets for analysis of sentiments considering evaluation parameters in its analysis, the processing time and characteristics of the techniques for each library. The performed experimentation provides readers insights for choosing the appropriate combination of techniques during preprocessing. The results show improvement of up to 5% to 9% in the performance of the classification.
dc.description.abstractNatural Language Processing (NLP) is used to identify key information, generating predictive models, and explaining global events or trends. Also, NLP is supported during the process to create knowledge. Therefore, it is important to apply refinement techniques in major stages such as preprocessing, when data is frequently produced and processed with poor results. This document analyzes and measures the impact of combinations of preprocessing techniques and libraries for short texts that have been written in Spanish. These techniques were applied in tweets for analysis of sentiments considering evaluation parameters in its analysis, the processing time and characteristics of the techniques for each library. The performed experimentation provides readers insights for choosing the appropriate combination of techniques during preprocessing. The results show improvement of up to 5% to 9% in the performance of the classification.
dc.description.citySan Francisco
dc.identifier.doi10.1007/978-3-030-39442-4_10
dc.identifier.isbn978-303039441-7
dc.identifier.issn2194-5357
dc.identifier.urihttps://link.springer.com/chapter/10.1007/978-3-030-39442-4_10
dc.language.isoes_ES
dc.publisherSpringer
dc.sourceAdvances in Intelligent Systems and Computing
dc.subjectNatural language processing
dc.subjectPreprocessing
dc.subjectTwitter
dc.subjectSentiment analysis
dc.subjectText mining
dc.titleA Comparative evaluation of preprocessing techniques for short texts in spanish
dc.title.alternativeA comparative evaluation of preprocessing techniques for short texts in spanish
dc.typeARTÍCULO DE CONFERENCIA
dc.ucuenca.afiliacionOrellana, M., Universidad del Azuay, Cuenca, Ecuador
dc.ucuenca.afiliacionTrujillo, A., Universidad del Azuay, Cuenca, Ecuador
dc.ucuenca.afiliacionCedillo, I., Universidad del Azuay, Cuenca, Ecuador; Cedillo, I., Universidad de Cuenca, Cuenca, Ecuador
dc.ucuenca.areaconocimientofrascatiamplio5. Ciencias Sociales
dc.ucuenca.areaconocimientofrascatidetallado5.1.2 Psicología Especial(Terapia de Aprendizaje, Habla
dc.ucuenca.areaconocimientofrascatiespecifico5.1 Psicología y Ciencias Cognitivas
dc.ucuenca.areaconocimientounescoamplio03 - Ciencias Sociales, Periodismo e Información
dc.ucuenca.areaconocimientounescodetallado0313 - Psicología
dc.ucuenca.areaconocimientounescoespecifico031 - Ciencias Sociales y Ciencias del Comportamiento
dc.ucuenca.comiteorganizadorconferenciaOrganización de Ciencia e Información (SAI)
dc.ucuenca.conferenciaFuture of Information and Communication Conference (FICC) 2020
dc.ucuenca.correspondenciaCedillo Orellana, Irene Priscila, priscila.cedillo@ucuenca.edu.ec
dc.ucuenca.cuartilQ3
dc.ucuenca.embargoend2050-01-12
dc.ucuenca.embargointerno2050-01-12
dc.ucuenca.factorimpacto0.184
dc.ucuenca.fechafinconferencia2020-03-06
dc.ucuenca.fechainicioconferencia2020-03-05
dc.ucuenca.idautor0102668209
dc.ucuenca.idautorSgrp-3157-2
dc.ucuenca.idautor0102815842
dc.ucuenca.indicebibliograficoSCOPUS
dc.ucuenca.numerocitaciones0
dc.ucuenca.organizadorconferenciaOrganización de Ciencia e Información (SAI)
dc.ucuenca.paisESTADOS UNIDOS
dc.ucuenca.urifuentehttps://link.springer.com/book/10.1007/978-3-030-39442-4
dc.ucuenca.versionVersión publicada
dc.ucuenca.volumenVolumen 1130
dspace.entity.typePublication
relation.isAuthorOfPublication9ecaad85-5b06-4b92-b05c-0d89c7b10660
relation.isAuthorOfPublication.latestForDiscovery9ecaad85-5b06-4b92-b05c-0d89c7b10660

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
documento.pdf
Size:
363.43 KB
Format:
Adobe Portable Document Format
Description:
document

Collections