Knoth, Petr; Anastasiou, Lucas; Basile, Giorgio; Pearce, Samuel and Pontika, Nancy (2017). Machine accessibility of Open Access scientific publications from publisher systems via ResourceSync. OAI10.
Abstract
In this poster, we outline the technical difficulties and present how we succeeded in harvesting metadata records and full text content of millions of OA articles from publisher APIs. We also show how we have managed to provide an interoperable layer over these data using ResourceSync.
To achieve this, we have created a publisher connector, which harvests open access scientific papers from publishers and exposes the content through a standardised API. Our contribution can be summarised as: a) the creation of a seamless layer for accessing content across publishers, b) a generic, integrated access point to these data via ResourceSync, and c) a high-performance access interface that will be constantly updated. This is the first service to provide a harmonised access layer over non-standardised publisher APIs for retrieving gold and hybrid gold scholarly content, as well as the first implementation of ResourceSync scaling to millions of documents with the potential for fast real-time updates.
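For readers unfamiliar with ResourceSync, the following is a minimal sketch of the kind of document the abstract refers to: a Resource List, which is a Sitemap XML file extended with `rs:md` metadata elements. It is not the connector described in the poster. The function names `build_resource_list` and `changed_since` and the `example.org` URLs are hypothetical, and the sketch assumes all timestamps are UTC in the same ISO 8601 format so that they can be compared as strings.

```python
import xml.etree.ElementTree as ET

# Namespaces used by Sitemap-based ResourceSync documents.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
RS_NS = "http://www.openarchives.org/rs/terms/"


def build_resource_list(resources, at):
    """Serialise (url, lastmod, mime_type) tuples as a Resource List.

    `at` is the timestamp at which the list was generated.
    Hypothetical helper, for illustration only.
    """
    ET.register_namespace("", SITEMAP_NS)
    ET.register_namespace("rs", RS_NS)
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    # Document-level metadata declares the capability of this document.
    ET.SubElement(urlset, f"{{{RS_NS}}}md",
                  {"capability": "resourcelist", "at": at})
    for loc, lastmod, mime_type in resources:
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url, f"{{{SITEMAP_NS}}}loc").text = loc
        ET.SubElement(url, f"{{{SITEMAP_NS}}}lastmod").text = lastmod
        # Per-resource metadata, here only the media type.
        ET.SubElement(url, f"{{{RS_NS}}}md", {"type": mime_type})
    return ET.tostring(urlset, encoding="unicode")


def changed_since(resource_list_xml, since):
    """Return the URLs whose lastmod is later than `since`.

    Assumes uniform UTC ISO 8601 timestamps, so string comparison
    matches chronological order.
    """
    ns = {"sm": SITEMAP_NS}
    root = ET.fromstring(resource_list_xml)
    return [
        url.findtext("sm:loc", namespaces=ns)
        for url in root.findall("sm:url", ns)
        if url.findtext("sm:lastmod", namespaces=ns) > since
    ]


if __name__ == "__main__":
    xml_doc = build_resource_list(
        [
            ("https://example.org/papers/a.pdf", "2017-01-01T00:00:00Z", "application/pdf"),
            ("https://example.org/papers/b.pdf", "2017-06-01T00:00:00Z", "application/pdf"),
        ],
        at="2017-06-02T00:00:00Z",
    )
    print(changed_since(xml_doc, "2017-03-01T00:00:00Z"))
```

A client synchronising against such a list would fetch only the resources returned by `changed_since`. At the scale of millions of documents, a real deployment would likely split the list across several files and publish incremental change lists rather than one monolithic document.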
About
- Item ORO ID: 50181
- Item Type: Other - Other
- Keywords: CORE; publisher; connector; open access; journals; gold; hybrid; aggregation; text and data mining
- Academic Unit or School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi); Faculty of Science, Technology, Engineering and Mathematics (STEM)
- Research Group: Big Scientific Data and Text Analytics Group (BSDTAG); Centre for Research in Computing (CRC)
- Copyright Holders: © 2017 The Authors
- Depositing User: Lucas Anastasiou