Logo Repositorio Institucional

Please use this identifier to cite or link to this item: http://dspace.ucuenca.edu.ec/handle/123456789/30707
Title: Integrating text mining and citation analysis in the decision-making process for library collections
Other Titles: 
Authors: Illescas Peña, Lourdes Eugenia
Siguenza Guzman, Lorena Catalina
Sucozhañay Calle, Dolores Catalina
Keywords: Library Collection
Scientific Publications
University Libraries
Text Mining
metadata.dc.ucuenca.areaconocimientofrascatiamplio: 5. Ciencias Sociales
metadata.dc.ucuenca.areaconocimientofrascatidetallado: 5.8.3 Bibliotecología
metadata.dc.ucuenca.areaconocimientofrascatiespecifico: 5.8 Comunicación y Medios
metadata.dc.ucuenca.areaconocimientounescoamplio: 03 - Ciencias Sociales, Periodismo e Información
metadata.dc.ucuenca.areaconocimientounescodetallado: 0322 - Biblioteca, Información y Archivística
metadata.dc.ucuenca.areaconocimientounescoespecifico: 032 - Periodismo e Información
Issue Date: 2018
metadata.dc.ucuenca.embargoend: 1-Jan-2050
metadata.dc.ucuenca.volumen: volumen 0, número 0
metadata.dc.source: INTED2018 Proceedings
metadata.dc.identifier.doi: 10.21125/inted.2018.1754
Publisher: 
metadata.dc.description.city: 
Valencia
metadata.dc.type: ARTÍCULO DE CONFERENCIA
Abstract: 
In recent years, the scientific production in Ecuador has registered a considerable increase, due to the implementation of government policies designed to improve the quality of education. Higher Education Institutions (HEI) have also tried to stimulate research and scientific production to even higher quality standards with the pressure to rack up publications in high-impact journals. However, research and scientific production can flourish only in an environment where access to scientific knowledge is easily available. Consequently, Ecuadorian universities have increased their budget by approximately five times in order to provide access to digital databases and other electronic resources. Unfortunately, these efforts have not yielded the expected results to cover the minimum level of access to knowledge, due to the high costs of subscriptions to scientific journals. Therefore, decision making in library collection development becomes a very important process that needs to get the attention deserved. In general, at the University of Cuenca, funds for library collection development are allocated by faculties; each faculty decides what to subscribe or unsubscribe, generally following historical spending patterns, electronic journal usage data, and in some cases, based on their own finances and priorities. Nevertheless, these indicators have been subject to recurring debates, due to their unclear relation with the current and future library needs of information. More research is required for the construction of accurate indicators regarding the library collection performance and the growing needs of collection development. The aim of this article is to have a deep insight of the local use of the collection, contextualised to the references cited in scientific articles published by authors affiliated to the University of Cuenca. To achieve this goal, a set of the last 10-year publications were analysed. The full article and reference list were extracted using text mining methods. Text parsing and text filtering techniques were used for data extraction of each text corpus. Each word was classified as a text tree; in which, through the recognition of identities and the extraction of relationships, a data structure was constructed. This structure allowed the application of data mining techniques, such as clustering, decision trees and classification methods. By integrating text mining and citation analysis in the decision-making process for library collections, the authors aim to provide a dynamic solution that assists library managers to make economic decisions based on an “as realistic as possible” perspective of the users' needs.
Description: 
In recent years, the scientific production in Ecuador has registered a considerable increase, due to the implementation of government policies designed to improve the quality of education. Higher Education Institutions (HEI) have also tried to stimulate research and scientific production to even higher quality standards with the pressure to rack up publications in high-impact journals. However, research and scientific production can flourish only in an environment where access to scientific knowledge is easily available. Consequently, Ecuadorian universities have increased their budget by approximately five times in order to provide access to digital databases and other electronic resources. Unfortunately, these efforts have not yielded the expected results to cover the minimum level of access to knowledge, due to the high costs of subscriptions to scientific journals. Therefore, decision making in library collection development becomes a very important process that needs to get the attention deserved. In general, at the University of Cuenca, funds for library collection development are allocated by faculties; each faculty decides what to subscribe or unsubscribe, generally following historical spending patterns, electronic journal usage data, and in some cases, based on their own finances and priorities. Nevertheless, these indicators have been subject to recurring debates, due to their unclear relation with the current and future library needs of information. More research is required for the construction of accurate indicators regarding the library collection performance and the growing needs of collection development. The aim of this article is to have a deep insight of the local use of the collection, contextualised to the references cited in scientific articles published by authors affiliated to the University of Cuenca. To achieve this goal, a set of the last 10-year publications were analysed. The full article and reference list were extracted using text mining methods. Text parsing and text filtering techniques were used for data extraction of each text corpus. Each word was classified as a text tree; in which, through the recognition of identities and the extraction of relationships, a data structure was constructed. This structure allowed the application of data mining techniques, such as clustering, decision trees and classification methods. By integrating text mining and citation analysis in the decision-making process for library collections, the authors aim to provide a dynamic solution that assists library managers to make economic decisions based on an “as realistic as possible” perspective of the users' needs.
URI: https://library.iated.org/view/ILLESCAS2018INT
metadata.dc.ucuenca.urifuente: https://library.iated.org/publications/INTED2018/start/150
ISBN: 978-84-697-9480-7
ISSN: 2340-1079
Appears in Collections:Artículos

Files in This Item:
File Description SizeFormat 
documento.pdf
  Until 2050-01-01
document159.66 kBAdobe PDFView/Open Request a copy


This item is protected by original copyright



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Centro de Documentacion Regional "Juan Bautista Vázquez"

Biblioteca Campus Central Biblioteca Campus Salud Biblioteca Campus Yanuncay
Av. 12 de Abril y Calle Agustín Cueva, Telf: 4051000 Ext. 1311, 1312, 1313, 1314. Horario de atención: Lunes-Viernes: 07H00-21H00. Sábados: 08H00-12H00 Av. El Paraíso 3-52, detrás del Hospital Regional "Vicente Corral Moscoso", Telf: 4051000 Ext. 3144. Horario de atención: Lunes-Viernes: 07H00-19H00 Av. 12 de Octubre y Diego de Tapia, antiguo Colegio Orientalista, Telf: 4051000 Ext. 3535 2810706 Ext. 116. Horario de atención: Lunes-Viernes: 07H30-19H00