Daga, Enrico and Motta, Enrico
(2019).
DOI (Digital Object Identifier) Link: | https://doi.org/10.1145/3360901.3364415 |
---|---|
Google Scholar: | Look up in Google Scholar |
Abstract
The task of identifying pieces of evidence in texts is of fundamental importance in supporting qualitative studies in various domains, especially in the humanities. In this paper, we coin the expression themed evidence, to refer to (direct or indirect) traces of a fact or situation relevant to a theme of interest and study the problem of identifying them in texts. We devise a generic framework aimed at capturing themed evidence in texts based on a hybrid approach, combining statistical natural language processing, background knowledge, and Semantic Web technologies. The effectiveness of the method is demonstrated in a case study of a digital humanities database aimed at collecting and curating a repository of evidence of experiences of listening to music. Extensive experiments demonstrate that our hybrid approach outperforms alternative solutions. We also evidence its generality by testing it on a different use case in the digital humanities.
Item Type: | Conference or Workshop Item | ||||||
---|---|---|---|---|---|---|---|
Project Funding Details: |
|
||||||
Academic Unit/School: | Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi) Faculty of Science, Technology, Engineering and Mathematics (STEM) |
||||||
Research Group: | Centre for Research in Computing (CRC) Digital Humanities at the Open University (DH_OU) |
||||||
Item ID: | 67014 | ||||||
Depositing User: | Enrico Daga | ||||||
Date Deposited: | 30 Sep 2019 13:56 | ||||||
Last Modified: | 02 Oct 2019 19:12 | ||||||
URI: | http://oro.open.ac.uk/id/eprint/67014 | ||||||
Share this page: | ![]() ![]() ![]() ![]() |
Metrics
Altmetrics from Altmetric | Citations from Dimensions |