Only 8 days left to reserve your seat for our 3rd US DBpedia Community Meeting. We are happy to announce that the 11th DBpedia Meeting will be held in Cupertino, California on October 12th 2017, hosted by Apple Inc.
The meetup focuses on connecting the community interested in DBpedia and Knowledge Graphs in general, has included lightning talks by distinguished speakers (e.g. from Stanford, Google, IBM Watson, Netflix, LinkedIn, Wikimedia Foundation, Nuance, etc.). Talk topics have extended also to natural language processing, knowledge representation, information extraction, integration and retrieval, graph databases, knowledge base embeddings and machine learning.
We are looking forward to meeting again in person with the US-based DBpedia Community.
After the success of the last two community meetings in Sunnyvale and in Galway, we thought it is time to go Orange again. During the SEMANTiCS 2017 in Amsterdam, Sep 11-14, the DBpedia Community met on the 14th of September. First and foremost, we would like to thank the Institute for Applied Informatics for supporting our community and many thanks to the Meervaart Theatre and the SEMANTiCS for hosting our community meeting.
During the opening session, Chris Welty, Google Researcher, presented Even the Changes Are Changing: A New Age of Cognitive Computing. He introduced the impact and challenges of question answering & AI as well as the development of Jeopardy through technical changes. Victor de Boer from the VU University talked about Semantic Technology for Development: Semantic Web without the Web?. He demonstrated the use of semantic technology in the challenging technical environment of developing countries. Both talks illustrated the ever growing importance of semantic technology and AI each placed at opposite sites of the technology spectrum, from Raspberry PIs to High Performance Clusters.
The DBpedia Showcase Session started with an interactive interview. Sebastian Hellmann (AKSW/KILT) talked with Jan-Bart de Vreede (Kennisnet, former member of the Wikimedia Foundation) about the challenges of growing an open community and creating a more formal structure. They discussed advantages, pitfalls and what lessons can be learned from other communities such as Wikimedia. Afterwards Markus Freudenberg (AKSW/KILT) introduced the highlights of the 2016-10 DBpedia Release.
At this session, five speakers presented how to utilize DBpedia in novel and interesting ways. Including:
Virtuoso 8 and Scalable Attributed-based Access Controls (ABAC) by Patrick van Kleef (Openlink Software)
Learning to Associate DBpedia Entities like Humans by Joern Hees (DFKI) (demo)
Towards Using UnifiedViews for Executing DBpedia Data Extraction and Curation Tasks by Tomas Knap (Semantic Web Company)
Sustainable Linked Data Generation: The Case of DBpedia by Wouter Maroy (imec)
As a regular part of the DBpedia Community Meeting, we had two parallel sessions in the afternoon where DBpedia newbies can learn about what DBpedia is and how to use the DBpedia datasets. Participants who wanted to learn DBpedia basics joined the tutorial session by Markus Freudenberg (DBpedia Release Manager). The DBpedia Association Hour provided a platform for the community to discuss the results of the DBpedia Strategy Survey 2017. This survey was prepared by Sören Auer and the DBpedia Board members to get to know what the DBpedia Community thinks about DBpedia’s strategic priorities and how the funds of the DBpedia Association should be spent. Even if 45 minutes were not adequate to review all survey questions, this session proved to be beneficial due to a really agile and dynamic discussion. A better cooperation and communication between the Association and the different national and language chapter is only one suitable key which was embraced by the community to facilitate problem solving and DBpedia’s organization.
The sessions in the afternoon highlighted two important fields of research and development, namely DBpedia Ontology and DBpedia & NLP. At the DBpedia Ontology Session, Gustavo Publio (AKSW/KILT) presented data quality issues in DBpedia and highlighted the challenges on redesign the DBpedia Ontology (slides). Wouter Maroy (imec) and Ismael Rodríguez (Polytechnic University of Catalonia) showcased the DBpedia Mappings Front-End Administration, which they created during this year’s Google Summer of Code project. If you are interested in career opportunities at DBpedia, check out Wouter’s success story here.
At the same time, Milan Dojchinovski (AKSW/KILT) chaired the DBpedia & NLP session with five very interesting talks. In the following you will find all presentations given during this session:
DBpedia Spotlight 1.0 – A new Release by Sandro Coelho (DBpedia Association) slides
TextExt winner – Lector: RDF Triples Extraction from Wikipedia Text by Matteo Cannaviccio, Roma Tre University slides
Chaudron: Extending DBpedia with Measures by Julien Subercaze, Télécom Saint-Etienne & Université Jean Monnet slides
Dutch DBpedia Hour & Joint Workshop
Enno Meijers (National Library of the Netherlands) chaired the Dutch DBpedia Hour. In this open session members of the Dutch DBpedia Language Chapter discussed tasks and responsibilities for sustaining and developing the Dutch DBpedia as well as communication, technical infrastructure and content improvement of the DBpedia Dutch Language Chapter. The reference for this discussion was the tasks and responsibilities stated in the Memorandum of Understanding signed by Huygens ING, Koninklijke Bibliotheek, Vrije Universiteit Amsterdam, iMec and Beeld en Geluid. Outcome of this session was an agreement on the approach for creating an operational plan.
Simultaneously, DBpedia joint a session with the Workshop “Linked Data Quality Assessment and Improvement from Academia to Industry”. The presentations are available below:
In the closing session, Sebastian Hellmann (AKSW/KILT) announced a new collaboration to strengthen the DBpedia NLP Department. Via videostream we talked with Mike Tung and Filipe Mesquita from diffbot, about NLP and the relation extraction from Wikipedia articles. If you are interested in the new collaboration, please check diffbot’s slides here.
All slides and presentations are also available on our Website and you will find more feedback and photos about the event on Twitter via #DBpediaAmsterdam17.
We would like to thank the DBpedia Dutch language chapter, especially Enno Meijers (National Library of the Netherlands), Lieke Verhelst (Linked Data Factory, Informagic), Victor de Boer (Vrije Universiteit Amsterdam), Roland Cornelissen (metamatter), Gerald Wildenbeest (Saxion), Gerard Kuys (Ordina), Maarten Brinkerink (The Netherlands Institute for Sound and Vision) as well as Julia Holze (DBpedia Association), Dimitris Kontokostas (DBpedia Chapter Coordinator) and Sebastian Hellmann (AKSW/KILT, DBpedia Association) for devoting their time to curating the program and organizing the meeting.
Special thanks go to Katharina Weissenberg and Anna Keil for supporting the meeting by taking pictures of the community and the event.
We are now looking forward to the 11th DBpedia Community Meeting which will be held on 12th of October 2017 in Cupertino, California. Visit our event page for further updates.
We are very pleased to announce that all of this year’s Google Summer of Code students made it successful through the program and passed their projects. All codes have been submitted, merged and are ready to be examined by the rest of the world.
Marco Fossati, Dimitris Kontokostas, Tommaso Soru, Domenico Potena, Emanuele Storti , anastasia Dimiou, Wouter Maroy, Peng Xu, Sandro Coelho and Ricardo Usbeck, members of the DBpedia Community, did a great job in mentoring 7 students from around the world.All of the students enjoyed the experiences made during the program and will hopefully continue to contribute to DBpedia in the future.
“GSoC is the perfect opportunity to learn from experts, get to know new communities, design principles and work flows.” (Ram G Athreya)”
Now, we would like to take that opportunity to give you a little recap of the projects mentored by DBpedia members during the past months. Just click below for more details .
The goal of the project was to create a front-end application that provides a user-friendly interface so the DBPedia community can easily view, create and administrate DBpedia mapping rules using RML. The developed system includes user administration features, help posts, Github mappings synchronization, and rich RML related features such as syntax highlighting, RML code generation from templates, RML validation, extraction and statistics. Part of these features are possible thanks to the interaction with the DBPedia Extraction Framework. In the end, all the functionalities and goals that were required have been developed, with many functional tests and the approval of the DBpedia community. The system is ready for production deployment. For further information, please visit the project blog. Mentors: Anastasia Dimou and Wouter Maroy (Ghent University), Dimitris Kontokostas (GeoPhy HQ).
DBpedia Chatbot is a conversational chatbot for DBpedia which is accessible through the following platforms: a Web Interface, Slack and Facebook Messenger.
The bot is capable of responding to users in the form of simple short text messages or through more elaborate interactive messages. Users can communicate or respond to the bot through text and also through interactions (such as clicking on buttons/links). The bot tries to answer text based questions of the following types: natural language questions, location information, service checks, language chapters, templates and banter. For more information, please follow the link to the project site.Mentor: Ricardo Usbeck (AKSW).
Knowledge base embeddings has been an active area of research. In recent years a lot of research work such as TransE, TransR, RESCAL, SSP, etc. has been done to get knowledge base embeddings. However none of these approaches have used DBpedia to validate their approach. In this project, I want to achieve the following tasks: i) Run the existing techniques for KB embeddings for standard datasets. ii) Create an equivalent standard dataset from DBpedia for evaluations. iii) Evaluate across domains. iv) Compare and Analyse the performance and consistency of various approaches for DBpedia dataset along with other standard datasets. v) Report any challenges that may come across implementing the approaches for DBpedia. For more information, please follow the links to her project blog and GitHub-repository.Mentors: Tommaso Soru (AKSW) and Sandro Coelho (KILT).
The project defined embeddings to represent classes, instances and properties by implementing Random Vector Accumulators with additional features in order to better encode the semantic information held by the Wikipedia corpus and DBpedia graphs. To test the quality of embeddings generated by the RVA, lexical memory vectors of locations were generated and tested on a modified subset of the Google Analogies Test Set. Check out further information via Akshay’s GitHub-repo. Mentors: Tommaso Soru (AKSW) and Xu Peng (University of Alberta).
Wikipedia is full of data hidden in tables. The aim of this project was to explore the possibilities of exploiting all the data represented with the appearance of tables in Wiki pages, in order to populate the different chapters of DBpedia through new data of interest. The Table Extractor has to be the engine of this data “revolution”: it would achieve the final purpose of extracting the semi structured data from all those tables now scattered in most of the Wiki pages. In this page you can observe dataset (english and italian) extracted using table extractor . Furthermore you can read log file created in order to see all operations made up for creating RDF triples. I recommend to also see this page, that contains the idea behind the project and an example of result extracted from log files and .ttl dataset. For more details see Luca’s Git-Hub repository.Mentors: Domenico Potena and Emanuele Storti (Università Politecnica delle Marche).
Wikipedia represents a comprehensive cross-domain source of knowledge with millions of contributors. The DBpedia project tries to extract structured information from Wikipedia and transform it into RDF.
The main classification system of DBpedia depends on human curation, which causes it to lack coverage, resulting in a large amount of untyped resources. DBTax provides an unsupervised approach that automatically learns a taxonomy from the Wikipedia category system and extensively assigns types to DBpedia entities, through the combination of several NLP and interdisciplinary techniques. It provides a robust backbone for DBpedia knowledge and has the benefit of being easy to understand for end users. details about his work and his code can e found on the projects site. Mentors: Marco Fossati (Università degli Studi di Trento) and Dimitris Kontokostas (GeoPhy HQ).
This project aimed to augment upon the already existing list-extractor project by Federica in GSoC 2016. The project focused on the extraction of relevant but hidden data which lies inside lists in Wikipedia pages. Wikipedia, being the world’s largest encyclopedia, has humongous amount of information present in form of text. While key facts and figures are encapsulated in the resource’s infobox, and some detailed statistics are present in the form of tables, but there’s also a lot of data present in form of lists which are quite unstructured and hence its difficult to form into a semantic relationship. The main objective of the project was to create a tool that can extract information from Wikipedia lists and form appropriate RDF triplets that can be inserted in the DBpedia dataset. Fore details on the code and about the project check Krishanu’s blog and GitHub-repository.Mentors: Marco Fossati (Università degli Studi di Trento), Domenico Potena and Emanuele Storti (Università Politecnica delle Marche).
We are regularly growing our community through GSoC and can deliver more and more opportunities to you. Ideas and applications for the next edition of GSoC are very much welcome. Just contact us via email or check our website for details.
Again, DBpedia is planning to be a vital part of the GSoC Mentor Summit, from October 13th -15th, at the Google Campus in Sunnyvale California. This summit is a way to say thank you to the mentors for the great job they did during the program. Moreover it is a platform to discuss what can be done to improve GSoC and how to keep students involved in their communities post-GSoC.
And there is more good news to tell. DBpedia wants to meet up with the US community during the 11th DBpedia Community Meeting in California. We are currently working on the program and keep you posted as soon as registration is open.
We are happy to announce that the 10th DBpedia Community Meeting will be held in Amsterdam, Netherlands. During the SEMANTiCS 2017, Sep 11-14, the DBpedia Community will get together on the 14th of September for the DBpedia Day.
What cool things do you do with DBpedia? Present your tools and datasets at the DBpedia Community Meeting. Please submit your proposal in our form.
– Where: Meervaart Theatre, Meer en Vaart 300, 1068 LE Amsterdam, Netherlands
– Call for Contribution: Please submit your proposal in our form.
– Attending the DBpedia Community Meeting costs €40 (excl. registration fee and VAT). DBpedia members get free admission, please contact your nearest DBpedia chapter or the DBpedia Association for a promotion code.
If you can’t stand it till the end of the SEMANTiCS, you can already participate in the workshop “Two worlds, one goal: A Reliable Linked Data ecosystem for media” held by DBpedia and Wolters Kluwer on the 11th of September. This half-day workshop aims at exploring major topics for publishers and libraries from DBpedia’s and Wolters Kluwer’s perspective. Therefore, both communities will dive into core areas like Interlinking, Metadata and Data Quality and address challenges such as fundamental requirements when publishing data on the web. Did we spark your interest? Check our detailed program here and get your ticket today.
After our 2nd Community Meeting in the US, we delighted the Irish DBpedia Community with the 9th DBpedia Community Meeting, which was co-located with the Language, Data and Knowledge Conference 2017 in Galway at the premises of the NUI Galway.
First and foremost, we would like to thank John McCrae (Insight Centre for Data Analytics, NUI Galway) and the LDK Conference for co-hosting and support the event.
The focus of this Community Meeting was the Irish DBpedia and Linked Data Community in Ireland. Therefore we invited local data scientists as well as European DBpedia enthusiasts to discuss the state of Irish Linked Data.
The meeting started with two compelling keynotes by Brian Ó Raghallaigh, Dublin City University and Logainm.ie, and Sharon Flynn, NUI Galway and Wikimedia Ireland. Brian presented Logainm.ie, a data use case about placenames in Ireland with a special focus on linked Logainm and machine-readable data.
His insightful presentation was followed by Sharon Flynn talking about Wikimedia in Ireland and the challenges of “this monumental undertaking” with particular reference to the Wikimedia Community in Ireland.
For more details on the content of the presentations, follow the links to the slides.
As a regular part of the DBpedia Community Meeting we have two parallel sessions in the afternoon where DBpedia newbies can learn about what DBpedia is and how to use the DBpedia data sets.
Participants who wanted to learn DBpedia basics joined the DBpedia Tutorial Session byMarkus Freudenberg (DBpedia Release Manager). The DBpedia Association Hour provided a platform for the community to discuss and give feedback.
Additionally, Sebastian Hellmann and Julia Holze, members of the DBpedia Association, updated the participants about the growing number of the DBpedia Association members, the formalized DBpedia language chapters, the established DBpedia Community Committee and they informed about technical developments such as the DBpedia API.
Ontology Engineering and Software Alignment in the ALIGNED Project
The afternoon session started with the DBpedia 2016-10 release update by Markus Freudenberg (DBpedia Release Manager). Following this, Kevin Chekov Feeney, (Trinity College Dublin) presented the software alignment in the ALIGNED project. He talked about “Generating correct-by-construction semantic datasets from unstructured, semi-structured and badly structured data sources”.
At this point, we also like to thank the ALIGNED project for the development ofDBpedia as a project use case and for covering parts of the travel cost.
Session about Irish Linked Data Projects
Chaired by Rob Brennan and Bianca Pereira, the speakers in the last session presented new Irish Linked Data Projects, for example GeoHive, BIOOPENER and the TCD Open Linked Data Engagement Fund Project. The following panel session gave DBpedia and Linked Data enthusiasts a platform for exchange and discussion. Outcome of this session was the creation of a roadmap for the Irish Linked Data with all participants.
Following, you find a list of all presentations of this session:
Closing this session John McCrae announced that the next edition of the Language, Data and Knowledge (LDK) Conference is scheduled for 2019 in Germany. We at the DBpedia Association are now looking forward to welcome the LDK Community in Leipzig!
Social Evening Event
The Community Meeting slowly came to an end with our social evening event, which was held at the PorterShed in Galway. The evening session revolved around the topic How to exploit data commercially? and featured two short impulse talks. Paul Buitelaar started the session by presenting “Kibi”, which is an Open Source platform for Data Intelligence based on the search engine Elasticsearch. Finally, Sebastian Hellmann talked about “Improving the Utility of DBpedia by co-designing a public and commercial DBpedia API” (slides).
Summing up, the 9th DBpedia Community Meeting brought together more than 45 DBpedia enthusiasts from Ireland and Europe who engaged in vital discussions about Linked Data, DBpedia use cases and services.
Special thanks go to LDK 2017 for hosting the meeting.
Thanks Ireland and hello Amsterdam!
We are looking forward to the next DBpedia Community Meeting which will be held in Amsterdam, Netherlands. Co-located with the SEMANTiCS17, the Community will get together on the 14th of September on the DBpedia Day.
We are happy to announce that the 9th DBpedia Community meeting will be held in Galway, Ireland on June 21st 2017. DBpedia will be part of the Language, Data and Knowledge conference (LDK) in Galway. This new biennial conference series aims at bringing together researchers from across disciplines. The DBpedia Meeting is part of the conference and is scheduled for the last day.
Only few seats are left: So come and get your ticket to be part of the 9th DBpedia Community meeting in Galway.
Keynote #1: Logainm.ie data use cases by Brian Ó Raghallaigh (Dublin City University & Logainm)
Keynote #2: Wikimedia in Ireland: A Monumental Undertaking by Sharon Flynn (NUI Galway & Wikimedia Ireland)
DBpedia Association hour
A session about Irish Linked data projects (and DBpedia)
The social event will be held in the evening (starting at 6pm) at the PorterShed around the topic How to exploit data commercially? featuring several short impulse talks. We still have some remaining slots and would welcome you to present your success stories as well as use cases, but also tell us about your problems regarding the commercialisation of data. If you are interested in presenting, please email firstname.lastname@example.org.
DBpedia will participate for a fifth time in the Google Summer of Code program (GSoC) and now we are looking for students who will share their ideas with us. We are regularly growing our community through GSoC and can deliver more and more opportunities to you. We got excited with our new ideas, we hope you will get excited too!
What is GSoC?
Google Summer of Code is a global program focused on bringing more student developers into open source software development. Funds will given to students (BSc, MSc, PhD) to work for three months on a specific task. At first open source organizations announce their student projects and then students should contact the mentor organizations they want to work with and write up a project proposal for the summer. After a selection phase, students are matched with a specific project and a set of mentors to work on the project during the summer.
As previous years, we would like your input for DBpedia related project ideas for GSoC 2017.
For those who are unfamiliar with GSoC (Google Summer of Code), Google pays students (BSc, MSc, PhD) to work for 3 months on an open source project. Open source organizations announce their student projects and students apply for projects they like. After a selection phase, students are matched with a specific project and a set of mentors to work on the project during the summer.
After our successful meeting in Poznan in 2015, we thought it is time to meet the Polish DBpedia community again. The DBpedia meetup will be held on 22th of November 2016 at the Poznań University of Economics and Business. This meetup aims at the presentation of semantic web technologies and their use in applications by entrepreneurs.
After the largest DBpedia meeting to date we decided it was time to cross the Atlantic for the second time for another meetup. Two weeks ago the 8th DBpedia Community Meeting was held in Sunnyvale, California on October 27th 2016.
Pablo Mendes from Lattice Data Inc. opened the main event with a short introduction setting the tone for the evening. After that Dimitris Kontokostas gave technical and organizational DBpedia updates. The main event attracted attendees with lightning talks from major companies actively using DBpedia or interested in knowledge graphs in general.
Four major institutions described their efforts to organize reusable information in a centralized knowledge representation. Google’s Tatiana Libman presented (on behalf of Denny Vrandečić) the impressive scale of the Google Knowledge graph, with 1B+ entities and over 100 billion facts.
Yahoo’s Nicolas Torzec presented the Yahoo knowledge graph, with focus on their research on extracting data from Web tables to expand their knowledge which includes DBpedia as an important part. Qi He from LinkedIn focused mostly on how to model a knowledge graph of people and skills, which becomes particularly interesting with the possibility of integration with Microsoft’s Satori Graph. Such an integration would allow general domain knowledge and very specific knowledge about professionals complementing one another. Stas Malyshev from Wikidata presented statistics on their growth, points of contact with DBpedia as well as an impressive SPARQL query interface that can be used to query the structured data that they are generating.
Three other speakers focused on the impact of DBpedia in machine learning and natural language processing. Daniel Gruhl from IBM Watson gave the talk “Truth for the impatient” where he showed that a knowledge model built from DBpedia can help costs and time to value for extracting entity mentions with higher accuracy. Pablo Mendes from Lattice Data Inc. presented their approach that leverages DBpedia and other structured information sources for weak supervision to obtain very strong NLP extractors. Sujan Perera from IBM Watson discussed the problem of identifying implicit mentions of entities in tweets and how the knowledge represented in DBpedia can be used to help uncover those references.
Another three speakers focused on applications of DBpedia and knowledge graphs. Margaret Warren from Metadata Authoring Systems, LLC presented ImageSnippets and how background knowledge from DBpedia allows better multimedia search through inference. For instance, by searching for “birds” you may find pictures that haven’t been explicitly tagged as birds but for which the fact can be inferred from DBpedia. Jans Aasman from Franz Inc presented their company’s approach to Data Exploration with Visual SPARQL Queries. They described opportunities for graph analytics in the medical domain, and discussed how DBpedia has been useful in their applications. Finally, Wang-Chiew Tan presented their research at RIT relating to building chatbots, among other projects that relate to using background knowledge stored in computers to enrich real life experiences.
Overall the talks were very high quality and fostered plenty of discussions afterwards. We finalized the event with a round of introductions where every attendee got to say their name and affiliation to help them connect with one another throughout the final hour of the event.
All slides and presentations are also available on ourWebsite and you will find more feedback and photos about the event on Twitter via #DBpediaCA.