Category Archives: tools

New DBpedia Overview Article

We are pleased to announce that a new overview article for DBpedia is available.

The article covers several aspects of the DBpedia community project:

  • The DBpedia extraction framework.
  • The mappings wiki as the central structure for maintaining the community-curated DBpedia ontology.
  • Statistics on the multilingual support in DBpedia.
  • DBpedia live synchronisation with Wikipedia.
  • Statistics on the interlinking of DBpedia with other parts of the LOD cloud (incoming and outgoing links).
  • Several usage statistics: What kind of queries are asked against DBpedia and how did that change over the past years? How much traffic do the official static and live endpoint as well as the download server have? What are the most popular DBpedia datasets?
  • A description of use cases and applications of DBpedia in several areas (drop me mail if important applications are missing).
  • The relation of DBpedia to the YAGO, Freebase and WikiData projects.
  • Future challenges for the DBpedia project.

After our ISWC 2009 paper on DBpedia, this is the (long overdue) new reference article for DBpedia, which should provide a good introduction to the project. We submitted the article as a system report to the Semantic Web journal.

Download article as PDF.

DBpedia Spotlight – Text Annotation Toolkit released

We are happy to announce a first release of DBpedia Spotlight – Shedding Light on the Web of Documents. 

The amount of data in the Linked Open Data cloud is steadily increasing. Interlinking text documents with this data enables the Web of Data to be used as background knowledge within document-oriented applications such as search and faceted browsing. 

DBpedia Spotlight is a tool for annotating mentions of DBpedia resources in text, providing a solution for linking unstructured information sources to the Linked Open Data cloud through DBpedia. The DBpedia Spotlight Architecture is composed by the following modules:

  • Web application, a demonstration client (HTML/Javascript UI) that allows users to enter/paste text into a Web browser and visualize the resulting annotated text.

  • Web Service, a RESTful Web API that exposes the functionality of annotating and/or disambiguating entities in text. The service returns XML, JSON or RDF.

  • Annotation Java / Scala API, exposing the underlying logic that performs the annotation/disambiguation.

  • Indexing Java / Scala API, executing the data processing necessary to enable the annotation/disambiguation algorithms used.

More information about DBpedia Spotlight can be found at: 

http://spotlight.dbpedia.org 

DBpedia Spotlight is provided under the terms of the Apache License, Version 2.0. Part of the code uses LingPipe under the Royalty Free License.

 

The source code can be downloaded from: 

http://sourceforge.net/projects/dbp-spotlight 

The development of DBpedia Spotlight was supported by: 

  • Neofonie GmbH, a Berlin-based company offering leading technologies in the area of Web search, social media and mobile applications (http://www.neofonie.de/).

  • The European Commission through the project LOD2 – Creating Knowledge out of Linked Data (http://lod2.eu/). 

Lots of thanks to:

  • Andreas Schultz for his help with the SPARQL endpoint.

  • Paul Kreis for his help with evaluations.

  • Robert Isele and Anja Jentzsch for their help in early stages with the DBpedia extraction framework.

Cheers,

 Pablo N. Mendes, Max Jakob, Andrés García-Silva and Chris Bizer.