Category Archives: NLP

Have you backlinked your data yet? – A retrospective of the 6th DBpedia community meeting in The Hague

 

We thought it was about time to go orange again, meet the Dutch DBpedia Chapter and to meet and celebrate the growing dutch DBpedia community. Thus, following our successful US-event past November, the National Library of the Netherlands hosted the 6th DBpedia community meeting in The Hague on February 12th. 

First and foremost, we would like to thank TNO for organizing the pre-event and the National Library of the Netherlands, especially Menno Rasch (Director of KB operations), for sponsoring the catering during the DBpedia community meeting.

Pre-event

Before diving into DBpedia topics, we had a welcome reception on February 11th with snacks and drinks at TNO – New Babylon.  Around 40 people from the DBpedia community, members from TNO and its Data Science Department and representatives from the Platform Linked Data Netherlands engaged in vital exchanges about Linked Data topics in the Netherlands.

Sebastian Hellmann gave a short introduction about DBpedia and the recently found DBpedia Association. After Jean-Louis Roso talked about the TNO Data Science Department and current developments and projects, Erwin Folmer presented the platform Linked Data Netherlands (PiLOD).

A poster and demo session right after gave people from TNO the opportunity to present and discuss projects currently carried out at TNO.

Following, you find a short list of poster-presentation during the pre-event:

The following social gathering with snacks and drinks, encouraged talks about current developments in the DBpedia community and about ongoing projects. According to TNO representative Laura Daniel, the pre-event was very successful. She summarized the evening of the welcome reception: “It was very inspiring to see the DBpedia community in action. There were lots of interesting projects that use DBpedia as well as lively discussions on the challenges faced by the community, and of course, the event was a great opportunity for networking!”

 

Main event

2016-02-12 10.38.18 (1)
Gerard Kuys, Ordina, during the opening session @ The Hague

Opening Session

Following the pre-event, the main event attracted 95 participants and featured special session dedicated to the DBpedia showcases, the DBpedia ontology and challenges of DBpedia and Digital Heritage.

During the opening session, Menno Rasch, host of the meeting and Director of KB operations, highlighted the importance to raise awareness of the DBpedia brand in order to build a DBpedia community.

2016-02-12 10.41.17 (1)
Sebastian Hellmann, AKSW/ KILT and DBpedia Association @ the DBpedia community meeting

The newly found DBpedia Association and the related new charter regulating organizational issues in the DBpedia community was one of the focuses during the early morning hours, right before several interesting keynote presentations opened the discussion about DBpedia and its usage in the Netherlands.

2016-02-12 10.57.09
Marco De Niet, DEN Foundation @ the meeting in The Hague

Marco de Niet, representative of Digital Heritage Foundation (DEN Foundation), the Dutch knowledge centre for digital heritage, talked about “the National Strategy for Digital Heritage in the Netherlands”.

Marco Brattinga and Arjen Santema from the Land Registry and Mapping Agency (Kadaster) presented a framework to describe the data and metadata in registration in relation to a concept schema that describes what the registration is about. Apart from the ideas behind the framework, their presentation included a showcase of examples from the cadastral registration as well as the topographic map and the information node addresses and buildings.

The morning session was closes by Paul Groth, from Elsevier giving a presentation about knowledge graph construction and the Role of DBPedia and other Wikipedia based knowledge. He discussed  the importance of structured data as key to coordinate data in order to build better taxonomies. He also pointed towards the importance of having an updated publicly available knowledge graph as a reference for constructing internal knowledge graphs.

2016-02-12 11.28.03 (1)
Paul Groth, Elsevier, discussing knowledge graphs and the role of DBpedia @ the community meeting

After Lunch Track

DBpedia is one of the biggest and most important focal point of the Linked Open Data movement. Thus, the after-lunch-track focused very much on DBpedia Usages during the dedicated showcase session, which started with the new DBpedia & DBpedia+ Data Stack  release (planned for  2016-04).

Afterwards, the session continued with further DBpedia related discussions, in which various practical DBpedia matters such as DBpedia in the EUROPEANA Food and Drink project, the use of DBpedia for improved vaccine information systems or using Elasticsearch + DBpedia to maintain a searchable database of global power plants were tackled.

Afternoon Track

The afternoon track came along with four DBpedia highlight-sessions, namely DBpedia and Ontologies, DBpedia and Heritage, DBpedia hands-on development and DBpedia and NLP. Firstly, the DBpedia ontology group discussed possible ontology usages and presented the results of the latest DBpedia Ontology survey. In the following 75 minutes during the DBpedia and Heritage session, special challenges and opportunities of reference data for digital heritage were addressed by experts from EUROPEANA, iMinds, RCE and KB, the National Library of the Netherlands. Thirdly, members of the DBpedia Association and the AKSW/KILT group from Leipzig led a practical session for developers and DBpedia enthusiasts to talk about technical issues and challenges in DBpedia as well as they held a Tutorial session for DBpedia Newbies.

The end of the event was dedicated to NLP and the application of Linked Data on Language Technologies, especially entity linking, topics which are of vital importance for the research of AKSW/KILT members at the University of Leipzig.

Following, you find a list of all presentations given during the meeting.

All slides and presentations are also available on our Website and you will find more feedback and photos about the event on Twitter via #DBpediaDenHaag.

Summing up, the 6th community meeting brought together more than 95 DBpedia enthusiast from the Netherlands and Europe which engaged in vital conversations about interesting projects and approaches to questions/problems revolving around DBpedia, not only during the dedicated session but also during networking breaks. The recently found DBpedia Association was strongly represented with presentations from Sebastian Hellmann, Dimitris Kontokostas,  Nilesh Chakraborty, as well as Markus Freudenberg.

Finally, we would like to thank the organizers Enno Meijers, Richard Nagelmaker, Gerald Wildenbeest, Gerard Kuys, Monika Solanki and representatives of the DBpedia Association such as Dimitris Kontokostas and Sebastian Hellmann for devoting their time to the organization of the meeting and the programme. We are now looking forward to the 7th DBpedia Community Meeting, which will be held in the city of Leipzig again, during the Semantics conference in September 15th, 2016.

For updates, just follow us on Facebook, Twitter or check the following websites: http://www.semantics.cc/ and http://wiki.dbpedia.org/.

 

DBpedia Spotlight V0.7 released

DBpedia Spotlight is an entity linking tool for connecting free text to DBpedia through the recognition and disambiguation of entities and concepts from the DBpedia KB.

We are happy to announce Version 0.7 of DBpedia Spotlight, which is also the first official release of the probabilistic/statistical implementation.

More information about as well as updated evaluation results for DBpedia Spotlight V0.7 are found in this paper:

Joachim Daiber, Max Jakob, Chris Hokamp, Pablo N. Mendes: Improving Efficiency and Accuracy in Multilingual Entity ExtractionISEM2013. 

The changes to the statistical implementation include:

  • smaller and faster models through quantization of counts, optimization of search and some pruning
  • better handling of case
  • various fixes in Spotlight and PigNLProc
  • models can now be created without requiring a Hadoop and Pig installation
  • UIMA support by @mvnural
  • support for confidence value

See the release notes at [1] and the updated demos at [4].

Models for Spotlight 0.7 can be found here [2].

Additionally, we now provide the raw Wikipedia counts, which we hope will prove useful for research and development of new models [3].

A big thank you to all developers who made contributions to this version (with special thanks to Faveeo and Idio). Huge thanks to Jo for his leadership and continued support to the community.

Cheers,
Pablo Mendes,

on behalf of Joachim Daiber and the DBpedia Spotlight developer community.

[1] – https://github.com/dbpedia-spotlight/dbpedia-spotlight/releases/tag/release-0.7

[2] – http://spotlight.sztaki.hu/downloads/

[3] – http://spotlight.sztaki.hu/downloads/raw

[4] – http://dbpedia-spotlight.github.io/demo/

(This message is an adaptation of Joachim Daiber’s message to the DBpedia Spotlight list. Edited to suit this broader community and give credit to him.)

DBpedia Spotlight has been selected for Google Summer of Code. Please apply now!

The Google Summer of Code (GSoC) is a global program that offers student developers (BSc,MSc,PhD) stipends to write code for open source software projects. It has had thousands of participants since the first edition in 2005, connecting prospective students with mentors from open source communities such as Debian, KDE, Gnome, Apache Software Foundation, Mozilla, etc.

For the students, it is a great chance to get real-world software development experience. For the open source communities, it is a chance to expand their development community. For everybody else, more source code is created and released for the benefit of all!

We are thrilled to announce that our open source project DBpedia Spotlight has been selected for the Google Summer of Code 2012.

We are now seeking students interested in working with us to enhance operational aspects of DBpedia Spotlight, as well as to engage in research activities in collaboration with our team. If you are an energetic developer, passionate for open source and interested in areas related to DBpedia Spotlight, please get in touch with us!

We have shared a number of project ideas to get you started.

To apply, visit: http://www.google-melange.com/gsoc/org/google/gsoc2012/dbpediaspotlight

If you would like to see DBpedia Spotlight in action, helping you to explore available projects within GSoC 2012, please visit our demonstration page at: http://spotlight.dbpedia.org/gsoc/


DBpedia Spotlight – Text Annotation Toolkit released

We are happy to announce a first release of DBpedia Spotlight – Shedding Light on the Web of Documents. 

The amount of data in the Linked Open Data cloud is steadily increasing. Interlinking text documents with this data enables the Web of Data to be used as background knowledge within document-oriented applications such as search and faceted browsing. 

DBpedia Spotlight is a tool for annotating mentions of DBpedia resources in text, providing a solution for linking unstructured information sources to the Linked Open Data cloud through DBpedia. The DBpedia Spotlight Architecture is composed by the following modules:

  • Web application, a demonstration client (HTML/Javascript UI) that allows users to enter/paste text into a Web browser and visualize the resulting annotated text.

  • Web Service, a RESTful Web API that exposes the functionality of annotating and/or disambiguating entities in text. The service returns XML, JSON or RDF.

  • Annotation Java / Scala API, exposing the underlying logic that performs the annotation/disambiguation.

  • Indexing Java / Scala API, executing the data processing necessary to enable the annotation/disambiguation algorithms used.

More information about DBpedia Spotlight can be found at: 

http://spotlight.dbpedia.org 

DBpedia Spotlight is provided under the terms of the Apache License, Version 2.0. Part of the code uses LingPipe under the Royalty Free License.

 

The source code can be downloaded from: 

http://sourceforge.net/projects/dbp-spotlight 

The development of DBpedia Spotlight was supported by: 

  • Neofonie GmbH, a Berlin-based company offering leading technologies in the area of Web search, social media and mobile applications (http://www.neofonie.de/).

  • The European Commission through the project LOD2 – Creating Knowledge out of Linked Data (http://lod2.eu/). 

Lots of thanks to:

  • Andreas Schultz for his help with the SPARQL endpoint.

  • Paul Kreis for his help with evaluations.

  • Robert Isele and Anja Jentzsch for their help in early stages with the DBpedia extraction framework.

Cheers,

 Pablo N. Mendes, Max Jakob, Andrés García-Silva and Chris Bizer.