Integrating text mining and citation analysis in the decision-making process for library collections

dc.contributor.author: Illescas Peña, Lourdes Eugenia
dc.contributor.author: Sigüenza Guzmán, Lorena Catalina
dc.contributor.author: Sucozhañay Calle, Dolores Catalina
dc.contributor.ponente: Illescas Peña, Lourdes Eugenia
dc.date.accessioned: 2018-07-13T13:22:41Z
dc.date.available: 2018-07-13T13:22:41Z
dc.date.issued: 2018
dc.description: In recent years, scientific production in Ecuador has increased considerably as a result of government policies designed to improve the quality of education. Higher Education Institutions (HEIs) have also sought to raise research and scientific output to higher quality standards, under pressure to publish in high-impact journals. However, research and scientific production can flourish only in an environment where scientific knowledge is readily accessible. Consequently, Ecuadorian universities have increased their budgets approximately fivefold to provide access to digital databases and other electronic resources. Unfortunately, these efforts have not achieved even the minimum expected level of access to knowledge, because of the high cost of subscriptions to scientific journals. Decision making in library collection development is therefore an important process that deserves closer attention. At the University of Cuenca, funds for library collection development are generally allocated by faculty; each faculty decides which titles to subscribe to or cancel, usually following historical spending patterns and electronic journal usage data and, in some cases, its own finances and priorities. Nevertheless, these indicators have been the subject of recurring debate because their relation to the library's current and future information needs is unclear. More research is required to construct accurate indicators of library collection performance and of the growing needs of collection development. The aim of this article is to gain deep insight into the local use of the collection, contextualised through the references cited in scientific articles published by authors affiliated with the University of Cuenca. To achieve this goal, a set of publications from the last 10 years was analysed.
The full text and reference list of each article were extracted using text mining methods. Text parsing and text filtering techniques were used to extract data from each text corpus. Each word was classified as a text tree in which, through the recognition of identities and the extraction of relationships, a data structure was constructed. This structure allowed the application of data mining techniques such as clustering, decision trees and classification methods. By integrating text mining and citation analysis into the decision-making process for library collections, the authors aim to provide a dynamic solution that helps library managers make economic decisions based on an “as realistic as possible” view of users' needs.
dc.description.city: Valencia
dc.identifier.doi: 10.21125/inted.2018.1754
dc.identifier.isbn: 978-84-697-9480-7
dc.identifier.issn: 2340-1079
dc.identifier.uri: https://library.iated.org/view/ILLESCAS2018INT
dc.language.iso: es_ES
dc.publisher:
dc.source: INTED2018 Proceedings
dc.subject: Library Collection
dc.subject: Scientific Publications
dc.subject: University Libraries
dc.subject: Text Mining
dc.title: Integrating text mining and citation analysis in the decision-making process for library collections
dc.title.alternative:
dc.type: Conference article
dc.ucuenca.afiliacion: Illescas, L., Universidad de Cuenca, Facultad de Filosofía, Letras y Ciencias de la Educación, Cuenca, Ecuador
dc.ucuenca.afiliacion: Sigüenza, L., Universidad de Cuenca, Departamento de Ciencias de la Computación, Cuenca, Ecuador
dc.ucuenca.afiliacion: Sucozhañay, D., Universidad de Cuenca, Departamento de Espacio y Población, Cuenca, Ecuador; Sucozhañay, D., Universidad de Cuenca, Facultad de Ciencias Económicas y Administrativas, Cuenca, Ecuador
dc.ucuenca.areaconocimientofrascatiamplio: 5. Social Sciences
dc.ucuenca.areaconocimientofrascatidetallado: 5.8.3 Library Science
dc.ucuenca.areaconocimientofrascatiespecifico: 5.8 Media and Communications
dc.ucuenca.areaconocimientounescoamplio: 03 - Social Sciences, Journalism and Information
dc.ucuenca.areaconocimientounescodetallado: 0322 - Library, Information and Archival Studies
dc.ucuenca.areaconocimientounescoespecifico: 032 - Journalism and Information
dc.ucuenca.comiteorganizadorconferencia: Red de apoyo a la gestión educativa
dc.ucuenca.conferencia: 12th International Technology, Education and Development Conference
dc.ucuenca.embargoend: 2050-01-01
dc.ucuenca.embargointerno: 2050-01-01
dc.ucuenca.fechafinconferencia: 2018-03-07
dc.ucuenca.fechainicioconferencia: 2018-03-05
dc.ucuenca.idautor: 0102074622
dc.ucuenca.idautor: 0102659687
dc.ucuenca.idautor: 0102680709
dc.ucuenca.indicebibliografico: ISI WEB OF SCIENCE (WEB OF KNOWLEDGE)
dc.ucuenca.numerocitaciones: 0
dc.ucuenca.organizadorconferencia: RED AGE
dc.ucuenca.pais: Spain
dc.ucuenca.urifuente: https://library.iated.org/publications/INTED2018/start/150
dc.ucuenca.version: Published version
dc.ucuenca.volumen: volume 0, number 0

Files

Original bundle

Name: documento.pdf
Size: 159.66 KB
Format: Adobe Portable Document Format
Description: document
