After quite some work into improving the DBpedia information extraction framework, we have released a new version of the DBpedia dataset today.
The renewed DBpedia dataset describes 1,950,000 “things”, including at least 80,000 persons, 70,000 places, 35,000 music albums, 12,000 films. It contains 657,000 links to images, 1,600,000 links to relevant external web pages and 440,000 external links into other RDF datasets. Altogether, the DBpedia dataset now consists of around 103 million RDF triples.
We worked on improving the data quality in order to make the dataset more usable and useful to developers and fixed a lot of bugs submitted by our growing developer-community. We also reworked our framework to enable developers to extend the dataset with their own extractors.
We are grateful for all contributions and are looking forward to support new projects based on DBpedia data.