This Linked Hypernym dataset attaches entity articles in English, German and Dutch Wikipedia with a DBpedia resource or a DBpedia ontology concept as their type. The types are hypernyms mined from articles' free text using hand-crafted lexicosyntactic patterns.
The dataset contains 2.8 million entity-type assignments, out of which nearly 2.5 million are novel with respect to DBpedia and 2 million w.r.t. Yago 2s and DBpedia.
The dataset was generated with DBpedia 3.8 and Wikipedia snapshots in 2012.
LHD 1.0
Download highlights | |||
---|---|---|---|
Dataset | Dutch | English | German |
Ontology Hypernym Types with DBpedia Ontology mappings Complete Linked Hypernyms dataset, types are DBpedia ontology classes (preferred) or DBpedia resources | nt stat 665k 0.89 +- 2% | nt stat 1,309k 0.85 +- 2.5% | ntstat 828k 0.77 +- 2.5% |
Complete "Plain Text" Hypernym Dataset Complete Hypernyms dataset, all hypernyms are textual strings | nt 866k 0.91 (F1) | nt 1,507k 0.91 (F1) | nt 913k 0.92 (F1) |
DBpedia enrichment Subset containing only Entity-type assignments which are novel w.r.t. DBpedia (instance file, v3.8) | nt stat 705k | nt stat 1,060k 0.85 +- 2% | nt
stat 648k |
YAGO enrichment Entity-type assinments which are novel w.r.t. to YAGO 2s and DBpedia (instances file, v3.8) | nt stat 624k | nt stat 602k 0.81* | nt stat730k |
High accuracy: Linked Hypernym confirmed with YAGO Entity-type assignments which are redundant w.r.t. YAGO 2s ontology, all entries are resolvable to DBpedia ontology classes | nt stat 3k | nt stat 59k 0.994 [0.99;1] | nt stat 9k |
LHD 2.0 (draft)
The version 2.0 of the Linked Hypernyms Dataset increases the number of entities with a type in DBpedia Ontology namespace to nearly 100% (for English). This is accomplished by mapping the original types (DBpedia resources), to DBpedia Ontology concepts.
Raw data and intermediary results | |||
---|---|---|---|
Dataset | Dutch | English | German |
Mapping from DBpedia resources to DBpedia Ontology classes via the subclass relation, each record is preceded by its support in the comment | nt | nt | nt |
Entities mapped via LHD2.0 mappings to DBpedia Ontology Entity type assignment is confirmed in DBpedia |
nt stat | nt stat | nt stat |
Entities mapped via LHD2.0 mappings to DBpedia Ontology Entity is in DBpedia, but the type is not confirmed |
nt stat | nt stat | nt stat |
Entities mapped via LHD2.0 mappings to DBpedia Ontology Entity is not in DBpedia |
nt stat | nt stat | nt stat |
All entities with DBpedia Ontology classes replacing DBpedia resources The complete LHD dataset with some type assignments overriden by LHD 2.0 mappings |
nt stat | nt stat | nt stat |
Publications
- T. Kliegr, V. Zeman, M. Dojchinovski. Linked Hypernyms Dataset - Generation Framework and Use Cases. In Linguistic Linked Data (LDL'14) Challenge collocated with LREC 2014, Reykjavik, Iceland, May, 2014.
- T. Kliegr., O. Zamazal Towards Linked Hypernyms Dataset 2.0: complementing DBpedia with hypernym discovery. In 9th International Language Resources and Evaluation Conference (LREC'14), Reykjavik, Iceland, May, 2014.