All posts by JensLehmann

New DBpedia Overview Article

We are pleased to announce that a new overview article for DBpedia is available.

The article covers several aspects of the DBpedia community project:

  • The DBpedia extraction framework.
  • The mappings wiki as the central structure for maintaining the community-curated DBpedia ontology.
  • Statistics on the multilingual support in DBpedia.
  • DBpedia live synchronisation with Wikipedia.
  • Statistics on the interlinking of DBpedia with other parts of the LOD cloud (incoming and outgoing links).
  • Several usage statistics: What kind of queries are asked against DBpedia and how did that change over the past years? How much traffic do the official static and live endpoint as well as the download server have? What are the most popular DBpedia datasets?
  • A description of use cases and applications of DBpedia in several areas (drop me mail if important applications are missing).
  • The relation of DBpedia to the YAGO, Freebase and WikiData projects.
  • Future challenges for the DBpedia project.

After our ISWC 2009 paper on DBpedia, this is the (long overdue) new reference article for DBpedia, which should provide a good introduction to the project. We submitted the article as a system report to the Semantic Web journal.

Download article as PDF.

DBpedia 3.1 breaks 100 million triples barrier

Today, we released DBpedia 3.1. As always in the past years, the size of Wikipedia increased a lot over the past months. The new extraction contains 116,7 million triples, marking an increase of 27% over the previous version.

Apart from the more recent Wikipedia dumps we used, some notable improvements are a much better YAGO mapping, providing a more complete (more classes assigned to instances) and accurate (95% accuracy) class hierarchy for DBpedia. The Geo extractor code has been improved and is now run for all 14 languages. URI validation has switched to the PEAR validation class.

Downloads | ChangeLog

DBpedia 3.0 Release

We announce the availability of the DBpedia 3.0 final release.

Downloads are available at http://wiki.dbpedia.org/Downloads. For a list of changes since DBpedia 2.0, see the Changelog. Most notably, multi-language support was improved, new linked data sets added, and extraction code improved. Compared to the 3.0 release candidate, a number of extraction framework and data set bugs reported at our sourceforge.net bug tracker were fixed.

Overall, the combined download size of all provided NT and CSV files is 5,0 GB (uncompressed: 48,1 GB). The available data sets contain 92M triples (excluding 126M triples for internal Wikipedia links). DBpedia’s coverage grows to 2.4M entities for the English edition in this release, thanks to the hard-working Wikipedia contributors.

The extraction was performed on a server of the AKSW research group. I would like to thank Jörg Schüppel, Sören Auer, Chris Bizer, Richard Cyganiak, Georgi Kobilarov, the OpenLink team, and many other contributors for their DBpedia support.