Challenging knowledge extraction to support the curation of documentary evidence in the humanities

Daga, Enrico and Motta, Enrico (2019). Challenging knowledge extraction to support the curation of documentary evidence in the humanities. In: Third International Workshop on Capturing Scientific Knowledge (Sciknow). Collocated with the tenth International Conference on Knowledge Capture (K-CAP) (Garijo, Daniel; Markovic, Milan; Groth, Paul; Santana, Idafen and Belhajjame, Khalid eds.), Los Angeles, CA, USA.

Abstract

The identification and cataloguing of documentary evidence from textual corpora is an important part of empirical research in the humanities. In this position paper, we consider the applicability of knowledge extraction techniques to support the data acquisition process. First, we characterise the task by analysing the end-to-end process of the data curation activity. We then examine general knowledge extraction tasks and discuss their relation to the problem at hand. Taking the Listening Experience Database (LED) as a case study, we perform an empirical analysis focusing on two roles: the 'listener' and the 'place'. The results show, among other things, that these entities are often mentioned many paragraphs away from the evidence text or do not appear in the source at all. We discuss the challenges that emerged from the point of view of scientific knowledge acquisition.
