The Open UniversitySkip to content
 

Measuring Accuracy of Triples in Knowledge Graphs

Liu, Shuangyan; d'Aquin, Mathieu and Motta, Enrico (2017). Measuring Accuracy of Triples in Knowledge Graphs. In: Language, Data, and Knowledge: First International Conference, LDK 2017 Galway, Ireland, June 19–20, 2017 Proceedings (Gracia, Jorge; Bond, Francis; McCrae, John P.; Buitelaar, Paul; Chiarcos, Christian and Hellmann, Sebastian eds.), Lecture Notes in Computer Science, Springer, Cham, pp. 343–357.

Full text available as:
[img]
Preview
PDF (Accepted Manuscript) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (789kB) | Preview
DOI (Digital Object Identifier) Link: https://doi.org/10.1007/978-3-319-59888-8_29
Google Scholar: Look up in Google Scholar

Abstract

An increasing amount of large-scale knowledge graphs have been constructed in recent years. Those graphs are often created from text-based extraction, which could be very noisy. So far, cleaning knowledge graphs are often carried out by human experts and thus very inefficient. It is necessary to explore automatic methods for identifying and eliminating erroneous information. In order to achieve this, previous approaches primarily rely on internal information i.e. the knowledge graph itself. In this paper, we introduce an automatic approach, Triples Accuracy Assessment (TAA), for validating RDF triples (source triples) in a knowledge graph by finding consensus of matched triples (among target triples) from other knowledge graphs. TAA uses knowledge graph interlinks to find identical resources and apply different matching methods between the predicates of source triples and target triples. Then based on the matched triples, TAA calculates a confidence score to indicate the correctness of a source triple. In addition, we present an evaluation of our approach using the FactBench dataset for fact validation. Our findings show promising results for distinguishing between correct and wrong triples.

Item Type: Conference or Workshop Item
Copyright Holders: 2017 Springer International Publishing AG
ISBN: 3-319-59887-2, 978-3-319-59887-1
Keywords: data quality; triple matching; predicate semantic similarity; knowledge graphs; algorithm configuration optimisation
Academic Unit/School: Faculty of Science, Technology, Engineering and Mathematics (STEM) > Knowledge Media Institute (KMi)
Faculty of Science, Technology, Engineering and Mathematics (STEM)
Research Group: Centre for Research in Computing (CRC)
Item ID: 51017
Depositing User: Shuangyan Liu
Date Deposited: 25 Sep 2017 09:35
Last Modified: 11 Sep 2018 08:42
URI: http://oro.open.ac.uk/id/eprint/51017
Share this page:

Metrics

Altmetrics from Altmetric

Citations from Dimensions

Download history for this item

These details should be considered as only a guide to the number of downloads performed manually. Algorithmic methods have been applied in an attempt to remove automated downloads from the displayed statistics but no guarantee can be made as to the accuracy of the figures.

Actions (login may be required)

Policies | Disclaimer

© The Open University   contact the OU