Wouter Beek (wouter@triply.cc), Joe Raad (j.raad@vu.nl), Erman Acar (erman.acar@vu.nl), Frank van Harmelen (frank.van.harmelen@vu.nl)
“Include links to other URIs, so that [data clients] can discover more things.”
“Link your data to other data to provide context.”
owl:sameAs
Linked Data is not possible without formal logic.
dbr:President_Barack_Obama
.owl:sameAs
link to go to node fb:m.05b6w1g
.owl:sameAs
link to go to node dbr:Barack_Obama_Cabinet
.Oops! President Obama ≠ The Obama Cabinet
We need a Travel Guide that informs us where we can go (and what the risks are).
owl:sameAs
statements is…2.8%
4%
20%
owl:sameAs
isn't the Same: An Analysis of Identity in Linked Data”. International
Semantic Web Conference, pp. 305–320.Communities correspond to roles:
Class | № instances |
---|---|
meta:IdentityStatement | 556,152,454 |
meta:Community | 55,697,160 |
meta:EquivalenceSet | 48,999,148 |
№ implicit identity statements | 35,201,120,188 |
Property | № triples |
---|---|
meta:error | 556,152,454 |
meta:community | 410,706,139 |
meta:fromCommunity | 145,446,315 |
meta:toCommunity | 145,446,315 |
meta:cardinality | 48,999,148 |
meta:equivalenceSet | 55,697,160 |
meta:cardinality | 48,999,148 |
Total | 4,352,602,480 |
fb:m.05b6w1g owl:sameAs dbr:President_Barack_Obama. # ←
fb:m.05b6w1g owl:sameAs dbr:Barack_Obama_Cabinet. # →
select ?error {
[ rdf:subject fb:m.05b6w1g;
rdf:object dbr:President_Barack_Obama;
meta:error ?error ].
}
owl:sameAs
link is (run online).
Through which countries does the Yenisei river flow? (Lopez et al. 2013)
select distinct ?uri ?string {
dbr:Yenisei_River owl:sameAs*/dbp:country/owl:sameAs* ?uri.
optional {
?uri rdfs:label ?string.
filter(lang(?string) = "en")
}
}
Returns over 30K results, including hundreds of unrelated geographic places, the concept of creative writing, and the mythical creature Gorgon.
Setting error value < 0.3 only returns identifiers for Russia and Mongolia (the correct answers).
What are the band members of ABBA? (Buikstra et al. 2011)
Result | ≤ 1.0 | ≤ 0.8 | ≤ 0.6 | ≤ 0.4 | ≤ 0.2 | ≤ 0.0 |
---|---|---|---|---|---|---|
Björn Ulvaeus (band member) | 28 | 8 | 8 | 3 | 2 | 2 |
Agnetha Fältskog (band member) | 26 | 4 | 4 | 2 | 1 | 1 |
Anni-Frid Lyngstad (band member) | 9 | 3 | 3 | 2 | 1 | 1 |
Benny Andersson (band member) | 6 | 2 | 2 | 1 | 1 | 1 |
Ola Brukert (drummer) | 3 | 2 | 2 | 1 | 1 | 1 |
Agnetha Ulvaeus (Agnetha F. married name) | 2 | 0 | 0 | 0 | 0 | 0 |
Stig Andersson (band manager) | 9 | 4 | 4 | 1 | 1 | 1 |
Gert van der Graaf (stalker of Agnetha Fältskog) | 2 | 0 | 0 | 0 | 0 | 0 |
Benny Anderssons Orkester (new band) | 5 | 3 | 3 | 0 | 0 | 0 |
Stig Andersson (sportsman) | 2 | 2 | 2 | 0 | 0 | 0 |
Fuzzy identity function (t-conorm): commutative, monotonic, associative (compositionality).
This still allows for Linked Data reuse: see i1
This even allows for better Linked Data reuse: see i3
Apply other, more computationally intensive, approaches for links with error values [0.3,0.7].
Current sets are small / dataset-specific: DBpedia-based, OAEI.
https://krr.triply.cc/krr/metalink
Wouter Beek (wouter@triply.cc), Joe Raad (j.raad@vu.nl), Erman Acar (erman.acar@vu.nl), Frank van Harmelen (frank.van.harmelen@vu.nl)