Repositorio Institucional

Please use this identifier to cite or link to this item: http://dspace.ucuenca.edu.ec/handle/123456789/34510
Full metadata record
DC Field | Value | Language
dc.contributor.author | Orellana Cordero, Marcos Patricio | -
dc.contributor.author | Cedillo Orellana, Irene Priscila | -
dc.date.accessioned | 2020-06-15T22:05:16Z | -
dc.date.available | 2020-06-15T22:05:16Z | -
dc.date.issued | 2019 | -
dc.identifier.isbn | 978-1-7281-5581-4 | -
dc.identifier.issn | 0000-0000 | -
dc.identifier.uri | https://ieeexplore.ieee.org/document/9052236 | -
dc.description.abstract | Outlier detection in data mining and Knowledge Discovery from Data (KDD) is attracting particular interest because of its benefits. It can be applied in the financial area, where the patterns obtained from data can help find possible fraud and user errors. It is therefore essential to assess the truthfulness of the information. In this context, the data auditing process uses data mining techniques that play a significant role in detecting unusual behavior. Here, a method is proposed for detecting values that can be considered outliers in a nominal database. The method combines a Global k-Nearest Neighbors algorithm, the k-means clustering algorithm, and the chi-square statistical method. The algorithms were applied to a database of loan applicants. Each test was run on a dataset of 1180 records into which outliers had been deliberately introduced. The experimental results show that the method detects all of the introduced values, which had been labeled beforehand so that they could be distinguished. In total, 48 tuples with outliers were found across 11 nominal columns. © 2019 IEEE. | -
dc.language.iso | es_ES | -
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | -
dc.source | Proceedings - 2019 International Conference on Information Systems and Computer Science, INCISCOS 2019 | -
dc.subject | Chi-square | -
dc.subject | Data-mining | -
dc.subject | Financial-fraud | -
dc.subject | KNN | -
dc.subject | Outlier | -
dc.title | Outlier detection with data mining techniques and statistical methods | -
dc.type | Conference paper | -
dc.description.city | Quito | -
dc.ucuenca.idautor | 0102668209 | -
dc.ucuenca.idautor | 0102815842 | -
dc.identifier.doi | 10.1109/INCISCOS49368.2019.00017 | -
dc.ucuenca.embargoend | 2050-06-15 | -
dc.ucuenca.version | Published version | -
dc.ucuenca.embargointerno | 2050-06-15 | -
dc.ucuenca.areaconocimientounescoamplio | 07 - Engineering, Manufacturing and Construction | -
dc.ucuenca.afiliacion | Orellana, M., Universidad del Azuay, Cuenca, Ecuador | -
dc.ucuenca.afiliacion | Cedillo, I., Universidad de Cuenca, Departamento de Ciencias de la Computación, Cuenca, Ecuador | -
dc.ucuenca.volumen | Volume 11, no. 1 | -
dc.ucuenca.indicebibliografico | SCOPUS | -
dc.ucuenca.numerocitaciones | 0 | -
dc.ucuenca.areaconocimientofrascatiamplio | 2. Engineering and Technology | -
dc.ucuenca.pais | ECUADOR | -
dc.ucuenca.conferencia | 4th International Conference on Information Systems and Computer Science, INCISCOS 2019 | -
dc.ucuenca.areaconocimientofrascatiespecifico | 2.2 Electrical Engineering, Electronic Engineering, Information Engineering | -
dc.ucuenca.areaconocimientofrascatidetallado | 2.2.4 Communication Engineering and Systems | -
dc.ucuenca.areaconocimientounescoespecifico | 071 - Engineering and Engineering Trades | -
dc.ucuenca.areaconocimientounescodetallado | 0714 - Electronics and Automation | -
dc.ucuenca.fechainicioconferencia | 2019-11-20 | -
dc.ucuenca.fechafinconferencia | 2019-11-22 | -
dc.ucuenca.organizadorconferencia | Institute of Electrical and Electronics Engineers Inc. | -
dc.ucuenca.comiteorganizadorconferencia | Sergio Luján, Oswaldo Moscoso, Luis Terán, R.S. Nithin, Giancarlo Agostini, Diego Ordóñez, William Chamorro, Joel Paredes, Guillermo Mosquera, Estevan Gómez. | -
dc.ucuenca.urifuente | https://ieeexplore.ieee.org/xpl/conhome/9039808/proceeding | -
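
Illustrative note: the abstract names a Global k-Nearest Neighbors step but this record does not describe how it was implemented. The sketch below is therefore only one plausible reading, not the authors' method. It scores each nominal tuple by its mean distance to its k nearest neighbours. The Hamming distance, the value of k, the column values, and the function names are all assumptions made for illustration. The k-means and chi-square steps are not covered.

```python
from typing import List, Sequence


def hamming(a: Sequence[str], b: Sequence[str]) -> float:
    """Fraction of columns in which two nominal tuples differ (assumed metric)."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def global_knn_scores(rows: List[Sequence[str]], k: int = 3) -> List[float]:
    """Score each row by the mean distance to its k nearest neighbours.

    A higher score means the row is farther from its neighbours, so it is a
    stronger outlier candidate.
    """
    scores = []
    for i, row in enumerate(rows):
        # Distances from this row to every other row, smallest first.
        dists = sorted(
            hamming(row, other) for j, other in enumerate(rows) if j != i
        )
        scores.append(sum(dists[:k]) / k)
    return scores


# Hypothetical toy data: three nominal columns, with the last row injected
# as an obvious outlier. These values are not from the paper's dataset.
rows = [
    ("employed", "owner", "married"),
    ("employed", "owner", "married"),
    ("employed", "owner", "single"),
    ("employed", "renter", "married"),
    ("employed", "owner", "married"),
    ("retired", "other", "widowed"),
]

scores = global_knn_scores(rows, k=3)
# Index of the row with the largest outlier score.
outlier_index = max(range(len(rows)), key=scores.__getitem__)
```

On this toy data the injected row differs from every other row in all three columns, so it receives the highest score. A real audit would still need a threshold or ranking rule to decide which scores count as outliers.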
Appears in Collections: Artículos

Files in This Item:
File | Description | Size | Format
documento.pdf (embargoed until 2050-06-15) | document | 56.85 kB | Adobe PDF


This item is protected by original copyright



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 
