The Open UniversitySkip to content
 

Digging in the library

King, David (2013). Digging in the library. In: Biodiversity Informatics Horizons 2013, 3-6 Sep 2013, CNR & La Sapienza, Rome, Italy.

Full text available as:
[img] ZIP archive (Version of Record)
Download (3MB)
URL: http://conference.lifewatch.unisalento.it/index.ph...
Google Scholar: Look up in Google Scholar

Abstract

“You want to talk about climate change – change from what? Or invasive species – coming from where? To help us understand what is happening today, we need to unlock the old literature to understand what was happening yesterday.”

To understand fully the world around us as it is now we need to understand how it was before. We are fortunate in biodiversity to have more than 250 years of observations of the natural world sitting on library shelves that can help us understand how the world used to be. Or could, if we could had the time to read all 300 million pages. If humans don’t have the time, perhaps computers can help. How can we liberate the data and reveal what we already know?

This talk covers some of the computing techniques developed to extract explicit data, such as taxon names and locations, as well as implicit data, such as concepts and relationships, while overcoming the problems inherent in digitising printed matter. However, just extracting data is not enough to make it usable. Hence, the talk concludes with examples of how linked open data addresses this second problem and thereby lets us reveal what we already know.

Item Type: Conference or Workshop Item
Copyright Holders: 2013 ViBRANT
Project Funding Details:
Funded Project NameProject IDFunding Body
ViBRANTRI-261532European Union 7th Framework Programme within the Research Infrastructures group
Keywords: text mining; OCR; dirty data; XML; RDF; LOD
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Computing and Communications
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Research Group: Centre for Research in Computing (CRC)
Related URLs:
Item ID: 38409
Depositing User: David King
Date Deposited: 20 Sep 2013 09:53
Last Modified: 07 Dec 2018 10:18
URI: http://oro.open.ac.uk/id/eprint/38409
Share this page:

Download history for this item

These details should be considered as only a guide to the number of downloads performed manually. Algorithmic methods have been applied in an attempt to remove automated downloads from the displayed statistics but no guarantee can be made as to the accuracy of the figures.

Actions (login may be required)

Policies | Disclaimer

© The Open University   contact the OU