The Open UniversitySkip to content

Connecting repositories in the open access domain using text mining and semantic data

Knoth, Petr; Robotka, Vojtech and Zdrahal, Zdenek (2011). Connecting repositories in the open access domain using text mining and semantic data. In: Research and Advanced Technology for Digital Libraries, pp. 483–487.

Full text available as:
Full text not publicly available (Version of Record)
Due to publisher licensing restrictions, this file is not available for public download
Click here to request a copy from the OU Author.
DOI (Digital Object Identifier) Link:
Google Scholar: Look up in Google Scholar


This paper presents CORE (COnnecting REpositories), a system that aims to facilitate the access and navigation across scientific papers stored in Open Access repositories. This is being achieved by harvesting metadata and full-text content from Open Access repositories, by applying text mining techniques to discover semanticly related articles and by representing and exposing these relations as Linked Data. The information about associations between articles expressed in an interoperable format will enable the emergence of a wide range of applications. The potential of CORE can be demonstrated on two use-cases: (1) Improving the the navigation capabilities of digital libraries by the means of a CORE pluging, (2) Providing access to digital content from smart phones and tablet devices by the means of the CORE Mobile application.

Item Type: Conference or Workshop Item
Copyright Holders: 2011 Springer-Verlag Berlin Heidelberg
ISSN: 0302-9743
Extra Information: Published in Lecture Notes in Computer Science, Volume 6966/2011, ISBN 978-3-642-24468-1, pp. 483-487.
Keywords: digital library federations; automatic link generation; text mining; semantic similarity; content harvesting; mobile devices; CORE
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Research Group: Centre for Research in Computing (CRC)
Big Scientific Data and Text Analytics Group (BSDTAG)
Item ID: 32180
Depositing User: Kay Dave
Date Deposited: 07 Feb 2012 15:56
Last Modified: 04 Apr 2019 09:37
Share this page:


Altmetrics from Altmetric

Citations from Dimensions

Actions (login may be required)

Policies | Disclaimer

© The Open University   contact the OU