The Linked Hypernyms Dataset (LHD) assigns each entity article in the English, German and Dutch Wikipedia a DBpedia resource or a DBpedia ontology concept as its type. The types are hypernyms mined from the articles' free text using hand-crafted lexico-syntactic patterns.
The datasets were generated using DBpedia 2015 and Wikipedia snapshots from March 2015.
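
To give a feel for what mining a hypernym from free text with a lexico-syntactic pattern can look like, here is a minimal Python sketch. It is a toy illustration, not the grammar used to build the dataset: the regular expression, the `extract_hypernym` function name, and the "take the last word of the noun phrase" heuristic are all assumptions made for this example.

```python
import re

# Toy pattern (an assumption for illustration, not the dataset's grammar):
# "<subject> is/was a/an/the <noun phrase>", where the noun phrase ends at
# punctuation or at a common function word.
PATTERN = re.compile(
    r"\b(?:is|was)\s+(?:a|an|the)\s+"
    r"(?P<phrase>[A-Za-z\- ]+?)"
    r"(?=\s+(?:in|of|from|that|who|which|and)\b|[.,;])"
)


def extract_hypernym(first_sentence):
    """Return a candidate hypernym (lowercased head noun) or None."""
    match = PATTERN.search(first_sentence)
    if match is None:
        return None
    # Heuristic: treat the last word of the matched noun phrase as its head.
    return match.group("phrase").split()[-1].lower()


print(extract_hypernym("Diego Maradona is an Argentine football player."))
# player
print(extract_hypernym("Václav Havel was a Czech writer and politician."))
# writer
```

The extracted word corresponds to a plain-text hypernym; mapping it to a DBpedia resource or ontology concept is a separate step that this sketch does not attempt.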
The latest version of the Linked Hypernyms Dataset - October 2015!
All partitions of the dataset, as described in the dataset description section, can be downloaded from here.
Download highlights

Dataset | Dutch | English | German
---|---|---|---
Core Dataset (most accurate; result of pattern matching) | nt | nt | nt
Inference Dataset (types are in the DBpedia ontology namespace; merge of Core, STI) | nt | nt | nt
Extension Dataset (types are in the DBpedia resource namespace; highest type specificity) | nt | nt | nt
Raw "Plain Text" Dataset (all hypernyms are string literals, i.e. the original extracted word) | nt | nt | nt
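
The "nt" entries above indicate N-Triples files. As a rough sketch of how such a file could be read, the Python snippet below parses simple N-Triples lines and groups the objects by subject. The function names, the sample triples, and the assumption that each statement fits on one line with an IRI subject are illustrative only; they are not taken from the dataset itself. For production use, a full RDF parser is the safer choice.

```python
import re
from collections import defaultdict

# One simple N-Triples statement: <subject> <predicate> object .
# The object may be an IRI or a literal (optionally language-tagged or typed).
# Blank nodes are not handled in this sketch.
TRIPLE = re.compile(
    r'^\s*<(?P<s>[^>]*)>\s+<(?P<p>[^>]*)>\s+'
    r'(?P<o><[^>]*>|"(?:[^"\\]|\\.)*"(?:@[A-Za-z0-9-]+|\^\^<[^>]*>)?)'
    r'\s*\.\s*$'
)


def parse_nt_lines(lines):
    """Yield (subject, predicate, object) for each parseable line."""
    for line in lines:
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue  # skip blank lines and comments
        match = TRIPLE.match(line)
        if match is None:
            continue  # skip anything this simple pattern cannot handle
        obj = match.group("o")
        if obj.startswith("<"):
            obj = obj[1:-1]  # strip angle brackets from IRI objects
        yield match.group("s"), match.group("p"), obj


def types_by_entity(lines):
    """Group object values by subject IRI."""
    grouped = defaultdict(list)
    for subject, _predicate, obj in parse_nt_lines(lines):
        grouped[subject].append(obj)
    return dict(grouped)


# Hypothetical sample lines, written for this example only.
SAMPLE = [
    "<http://dbpedia.org/resource/Diego_Maradona> "
    "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> "
    "<http://dbpedia.org/ontology/SoccerPlayer> .\n",
    "# a comment line\n",
    "<http://dbpedia.org/resource/Diego_Maradona> "
    "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> "
    '"player"@en .\n',
]

for entity, types in types_by_entity(SAMPLE).items():
    print(entity, types)
```

To read a downloaded file, pass an open file object instead of `SAMPLE`, for example `types_by_entity(open(path, encoding="utf-8"))`, where `path` points to one of the N-Triples files.
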
Publications
- T. Kliegr, O. Zamazal. LHD 2.0: A Text Mining Approach to Typing Entities In Knowledge Graphs. Web Semantics, 2016 (preprint)
- T. Kliegr. Linked Hypernyms: Enriching DBpedia with Targeted Hypernym Discovery. Web Semantics, 2015