Skip to main content
article

Data-driven collection development: Text mining college course catalogs

Authors

Abstract

Academic librarians are trained to develop and manage collections. They rely on their own subject expertise in academic disciplines, input from teaching faculty, and professional training to make informed selections to support institutional curriculum. Professional training in collection development has, in recent years, focused on evidence-based acquisition methods (Johnson, 2018, p. 134). College and university course catalogs are a potential but untapped source of evidence for identifying topics of importance to institutional curricula. Course descriptions are concise descriptions of the subjects covered in college or university courses and therefore the topics about which students may require additional sources of information. Until recently, examining course catalogs was a time-consuming prospect. The advent of data and text mining techniques, however, makes it possible to analyze course descriptions with much less time and effort expended. This article contains a brief introduction to data science in libraries; details of tools and processes used for collecting and cleaning course catalog data; and preliminary results of a project to mine course catalogs for changes in curriculum focus to benefit library collection development decisions.

Keywords: Python, R, collection development, text mining, data science, academic libraries

How to Cite:

Sutton, S. W., Tolbert, H. & Harris, K., (2024) “Data-driven collection development: Text mining college course catalogs”, Kansas Library Association College and University Libraries Section Proceedings 1(2024). doi: https://doi.org/10.4148/2160-942X.1093

Downloads:

Downloads are not available for this article.

0 Views

Published on
2024-11-13