Quantcast
Channel: University Library
Viewing all articles
Browse latest Browse all 136

Data mining of Citations in Theses: a workflow for automated analysis of Open Access and library holdings coverage

$
0
0
Data mining of Citations in Theses: a workflow for automated analysis of Open Access and library holdings coverage Martin, Jose; Han, Lee Yen A systems specialist and a liaison librarian worked together in this project to analyze resource usage, Open Access coverage, library holdings coverage and citation patterns from a collection of doctoral theses in a graduate research university. The extracted citations and some basic metadata about the theses and their authors were processed using a workflow created with KNIME, an open source data-mining software. The workflow uses Summon and Crossref APIs for library holdings coverage, CORE and Unpaywall APIs for Open Access coverage and an SQLite database to store the output and enable detailed analysis. This tool provides an insight into the resources that have been effectively used to produce doctoral theses. It would be useful for academic libraries interested in evaluating the impact of Open Access resources and how they contribute to their scholarly output, and to evaluate the coverage provided by their holdings to the research activity in their institutions beyond the usual usage reports provided by publishers or third parties.

Viewing all articles
Browse latest Browse all 136

Trending Articles