Book section. Year: 2018

Detecting Erroneous Identity Links on the Web using Network Metrics

Abstract

In the absence of a central naming authority on the Semantic Web, it is common for different datasets to refer to the same thing by different IRIs. Whenever multiple names are used to denote the same thing, owl:sameAs statements are needed in order to link the data and foster reuse. Studies dating back as far as 2009 have observed that the owl:sameAs property is sometimes used incorrectly. In this paper, we show how network metrics, such as the community structure of the owl:sameAs graph, can be used to detect such possibly erroneous statements. One benefit of the approach presented here is that it can be applied to the network of owl:sameAs links itself and does not rely on any additional knowledge. To illustrate its ability to scale, the approach is evaluated on the largest collection of identity links to date, containing over 558M owl:sameAs links scraped from the LOD Cloud.
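To make the idea of using only the link network concrete, the following is a minimal sketch and not the method evaluated in the paper. It assumes a deliberately crude stand-in for community detection: two linked IRIs are placed in the same community only if they share at least one neighbour, and any owl:sameAs link that ends up crossing two communities is flagged for review. The function names and the `ex:` IRIs are hypothetical, chosen for illustration only.

```python
from collections import defaultdict


def build_neighbours(links):
    """Undirected adjacency map for (subject, object) owl:sameAs pairs."""
    neighbours = defaultdict(set)
    for s, o in links:
        if s != o:  # reflexive links carry no structural information
            neighbours[s].add(o)
            neighbours[o].add(s)
    return neighbours


def communities(neighbours):
    """Label each IRI with a community id (assumed heuristic, not Louvain).

    A link is followed only when its two endpoints share a neighbour,
    so sparsely supported "bridge" links do not merge communities.
    """
    label = {}
    for start in sorted(neighbours):
        if start in label:
            continue
        label[start] = start
        stack = [start]
        while stack:
            node = stack.pop()
            for other in neighbours[node]:
                if other not in label and neighbours[node] & neighbours[other]:
                    label[other] = start
                    stack.append(other)
    return label


def suspicious_links(links):
    """Return the links whose endpoints fall into different communities."""
    label = communities(build_neighbours(links))
    return sorted(
        {tuple(sorted((s, o))) for s, o in links if s != o and label[s] != label[o]}
    )


# Two tightly linked groups of IRIs joined by a single link.
links = [
    ("ex:a1", "ex:a2"), ("ex:a2", "ex:a3"), ("ex:a1", "ex:a3"),
    ("ex:b1", "ex:b2"), ("ex:b2", "ex:b3"), ("ex:b1", "ex:b3"),
    ("ex:a3", "ex:b1"),
]
print(suspicious_links(links))  # [('ex:a3', 'ex:b1')]
```

Under this heuristic an isolated pair of IRIs is also flagged, since its endpoints share no neighbour; a real community detection algorithm with a graded score per link would handle such cases more gracefully.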
Main file: ISWC 2018.pdf (484.16 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01899407, version 1 (19-10-2018)

Identifiers

  • HAL Id: hal-01899407, version 1

Cite

Joe Raad, Wouter Beek, Frank van Harmelen, Nathalie Pernelle, Fatiha Saïs. Detecting Erroneous Identity Links on the Web using Network Metrics. The Semantic Web – ISWC 2018. ISWC 2018. Lecture Notes in Computer Science, vol 11136. Springer, Cham, 2018. ⟨hal-01899407⟩
