Logo Repositorio Institucional

Please use this identifier to cite or link to this item: http://dspace.ucuenca.edu.ec/handle/123456789/35471
Title: A ranking-based approach for supporting the initial selection of primary studies in a Systematic Literature Review
Authors: Freire Zurita, Renan Gonzalo
Gonzalez Toral, Hernan Santiago
Saquicela Galarza, Victor Hugo
Gualan Saavedra, Ronald Marcelo
Keywords: machinelearning
data mining
ranking indexing
Systematic literature review
knowledge graphs
NLPPCA
text mining
automation
metadata.dc.ucuenca.areaconocimientofrascatiamplio: 2. Ingeniería y Tecnología
metadata.dc.ucuenca.areaconocimientofrascatidetallado: 2.2.4 Ingeniería de La Comunicación y de Sistemas
metadata.dc.ucuenca.areaconocimientofrascatiespecifico: 2.2 Ingenierias Eléctrica, Electrónica e Información
metadata.dc.ucuenca.areaconocimientounescoamplio: 06 - Información y Comunicación (TIC)
metadata.dc.ucuenca.areaconocimientounescodetallado: 0613 - Software y Desarrollo y Análisis de Aplicativos
metadata.dc.ucuenca.areaconocimientounescoespecifico: 061 - Información y Comunicación (TIC)
Issue Date: 2019
metadata.dc.ucuenca.embargoend: 31-Dec-2050
metadata.dc.ucuenca.volumen: September 2019
metadata.dc.source: Proceedings - 2019 45th Latin American Computing Conference, CLEI 2019
metadata.dc.identifier.doi: 10.1109/CLEI47609.2019.235079
Publisher: Institute of Electrical and Electronics Engineers Inc.
metadata.dc.description.city: 
Panamá
metadata.dc.type: ARTÍCULO DE CONFERENCIA
Abstract: 
Traditionally most of the steps involved in a Systematic Literature Review (SLR) process are manually executed, causing inconvenience of time and effort, given the massive amount of primary studies available online. This has motivated a lot of research focused on automating the process. Current state-of-the-art methods combine active learning methods and manual selection of primary studies from a smaller set so they can maximize the finding of relevant papers while at the same time minimizing the number of manually reviewed papers. In this work, we propose a novel strategy to further improve these methods whose early success heavily depends on an effective selection of initial papers to be read by researchers using a PCAbased method which combines different document representation and similarity metric approaches to cluster and rank the content within the corpus related to an enriched representation of research questions within the SLR protocol. Validation was carried out over four publicly available data sets corresponding to SLR studies from the Software Engineering domain. The proposed model proved to be more efficient than a BM25 baseline model as a mechanism to select the initial set of relevant primary studies within the top 100 rank, which makes it a promising method to bootstrap an active learning cycle.
URI: http://dspace.ucuenca.edu.ec/handle/123456789/35471
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85084734905&doi=10.1109%2fCLEI47609.2019.235079&partnerID=40&md5=dda5a12dd48231eea588da4d463b39b4
metadata.dc.ucuenca.urifuente: https://ieeexplore.ieee.org/xpl/conhome/9042768/proceeding
ISBN: 978-172815574-6
ISSN: 0000-0000
Appears in Collections:Artículos

Files in This Item:
File Description SizeFormat 
documento.pdf
  Until 2050-12-31
document227.8 kBAdobe PDFView/Open Request a copy


This item is protected by original copyright



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Centro de Documentacion Regional "Juan Bautista Vázquez"

Biblioteca Campus Central Biblioteca Campus Salud Biblioteca Campus Yanuncay
Av. 12 de Abril y Calle Agustín Cueva, Telf: 4051000 Ext. 1311, 1312, 1313, 1314. Horario de atención: Lunes-Viernes: 07H00-21H00. Sábados: 08H00-12H00 Av. El Paraíso 3-52, detrás del Hospital Regional "Vicente Corral Moscoso", Telf: 4051000 Ext. 3144. Horario de atención: Lunes-Viernes: 07H00-19H00 Av. 12 de Octubre y Diego de Tapia, antiguo Colegio Orientalista, Telf: 4051000 Ext. 3535 2810706 Ext. 116. Horario de atención: Lunes-Viernes: 07H30-19H00