Just recently, DBpedia Association member and hosting specialist, OpenLink released the DBpedia Usage report, a periodic report on the DBpedia SPARQL endpoint and associated Linked Data deployment.
The report not only gives some historical insight into DBpedia’s usage, number of visits and hits per day but especially shows statistics collected between July 2017 and September 2020, spanning more than 3 years of logs from the DBpedia web service operated by our partner OpenLink Software at http://dbpedia.org/sparql/.
Before we want to highlight a few aspects of DBpedia’s usage we would like to thank OpenLink for the continuous hosting of the DBpedia Endpoint and the creation of this report.
DBpedia Usage Report: Historical Overview
The first table shows the average numbers of Visits and Hits per day during the time each DBpedia dataset was live on the http://dbpedia.org/sparql endpoint. Similarly to the hits, we also see a huge increase in visits coinciding with the DBpedia 2015–10 release on April 1st, 2016.
This boost was attributed to an intensive promotion of DBpedia via community meetings, and exchange with various partners in the Linked Data community. In addition, our Social Media activity in the community increased backlinks. Since then, not only the numbers of hits rose but DBpedia also provided for better data quality. We are constantly working on improving accessibility, data quality and stability of the SPARQL endpoint.
Kudos to Open Link for maintaining the technical baseline for DBpedia.
The next graph shows the percentage of the total number of hits in a given time period that can be attributed to the /sparql endpoint. If we look at the historical data from 2014–09 onward, we can see the requests to /sparql were about 60.16% of the total number of hits.
DBpedia Usage Report: Current Statistics
If we focus on the last 12 months, we can see a slightly lower average of 48.10%, as shown in the graph below. This means that around 50% of traffic uses Linked Data constructions to view the information available through DBpedia. To put this into perspective, that means that of the average of 7.2 million hits to the endpoint on a given day, 3.6 million hits are Linked Data Deployment hits.
The following table shows the information on visits, sited and hits for each month between September 2019 and 2020.
For detailed information on the specific usage numbers, please visit the original report by Openlink published here. Also, older reporst are available through their site.
The SEMANTiCS Onsite Conference 2020 had to be postponed till September 2021. To bridge the gap until 2021, we took this opportunity to organize the Knowledge Graphs in Action (KGiA) online track as a SEMANTiCS satellite event on October 6, 2020. This new online conference is a combination of two existing events: the DBpedia Community Meeting and the annual Spatial Linked Data conference organised by EuroSDR and the Platform Linked Data Netherlands. We combined the best of both and as a bonus we added a track about Geo-information Integration organized by EuroSDR. As special joint sessions we presented four keynote speakers.
Following, we will give you a brief retrospective about the keynote presentations and talks.
Opening & Keynote #1
The Knowledge Graphs in Action conference was opened with a keynote presentation ‘Data Infrastructure for Energy System Models’ by Carsten Hoyer-Klick (German Aerospace Center). He presented LOD GEOSS, a project for the development of a distributed data infrastructure for the analysis of energy systems. The project is about the development of networked database concepts based on the ideas of linked open data and the semantic web for input and output data of energy system models in energy systems analysis. Afterwards the conference chairs offered three parallel sessions in the morning.
Session 1: Spatial Linked Data Country Update
In this session 7 speakers presented the uptake and latest progress of Spatial Linked Data adoption in European countries, either within national mapping agencies or beyond.
Session 2: VGI country presentations
There is an increasing use of crowdsourced geo-information (CGI) in spatial data applications by National Mapping and Cadastral Agencies (NMCAs). Applications range from using CGI for supporting the actualisation of spatial data to adding extra content, such as land use, building entrances, road barriers, sensors placed in the public space and many more. This session hosted five presentations from NMCAs showing the status of their CGI integration in mapping applications and processes.
Session 3: DBpedia Member presentations
Members of the DBpedia Association presented their latest tools, applications and technical developments in this session. Filipe Mesquita (Diffbot) opened the member session with his talk ‘Beyond Human Curation: How Diffbot Is Building A Knowledge Graph of the Web’. Also ImageSnippets, timbr.ai and GNOSS gave interesting and delightful talks about their technical developments. Vassil Momtchev from Ontotext closed the session by giving insights into the GraphDB 9.4.
For further details of the presentations follow the links to the slides on the event page.
The afternoon sessions started with an interesting keynote by Peter Mooney (Maynooth University). He talked about the opportunities for a more integrated approach to Geo-information integration.
After the second keynote Sebastian Hellmann, the CEO of the DBpedia Association, presented the development and methodology of the National Knowledge Graph for the Netherlands. In cooperation with Dutch partners, DBpedia invested two months to develop this new knowledge graph. His insightful presentation was followed by Benedicte Bucher (University Gustave Eiffel) talking about ‘Knowledge Graph on spatial digital assets in European’. She also presented the EuroSDR LDG initiative in many details.
Afternoon Parallel Sessions
Session 4: Transforming Linked Data into a networked data economy – DBpedia Chapter Session
In the DBpedia Chapter Session, members of different European DBpedia chapters gave an overview about the data landscape in their countries. They presented identified business opportunities and important challenges, such as automated clearance of licenses in their countries. Enno Meijers (National Library of the Netherlands) summarized the data landscape in the Netherlands. There were also presentations about the data landscape in Brazil, Spain, Austria and Poland.
Session 5: EuroSDR VGI data wrangling
This session intends to uncover new combinations and integration of CGI data with data from NMCAs which demonstrate the added value for map creation and map usage. Data wrangling (the process of creating small reproducible data processing workflows) is deployed for this work by using and combining existing geospatial software (desktop, web and mobile). In this session the results of the data wrangling process were presented.
Session 6: Spatial Session
In this session, two speakers presented how they built knowledge graphs, and in the second part three presenters gave insights into tooling and presented the state of the art on working with Linked Data.
For further details of the presentations follow the links to the slides on the event page.
Keynote #3 and #4
Keynote #3 ‘Spatial Knowledge in Action – Deep semantics, geospatial thinking, and new cartographies’ was given by Marinos Kavouras (National Technical University of Athens). Marinos stated that the power of maps and modern cartographic language proves to have a new role for society at large, as an indispensable communication and cognitive tool. The KG in Action conference ended with the keynote presentation ‘Know, Know Where, KnowWhereGraph’ by Krzysztof Janowicz (University of California). During his live talk from California, Krzysztof provided an overview of ideas and hopes for creating geo-specific knowledge graphs and geo-enrichment services on top of this graph to address some of the aforementioned challenges.
In case you missed the event, all slides and presentations are also available on the DBpeda website. We will upload all recordings on the DBpedia youtube channel. Further insights, feedback and photos about the event are available on Twitter (#KGiA hashtag).
With 45 project proposals, this GSoC edition marked a new record for DBpedia.
Oh, what a year! For the 9th year in a row, we were part of this incredible journey of young ambitious developers who joined us as an open source organization to work on a GSoC coding project all summer.
Each year has brought us new project ideas, many amazing students and mostly great project results that shaped the future of DBpedia.
Even though Covid-19 changed a lot in the world, it couldn’t shake GSoC much. The program, designed to mentor youngsters from afar is almost too perfect for the current world situation. One of the advantages of Google Summer of Code is, especially in times like these, the chance to work on projects remotely, but still obtain a first deep dive into Open Source projects like us – DBpedia.
Meet the students and their projects
This year, we had notably more applications than in the previous ones. With 45 project proposals, this GSoC edition marked a new record for DBpedia. Throughout the summer program, our seven finalists worked intensely on their challenging DBpedia projects with great outcomes to show to the public. Projects ranged from extending our DBpedia extraction framework to a DBpedia Database project as well as to an online tool to generate RDF from DBpedia abstracts. If you want to have deeper insights into our GSoC student’s work you can find their blogs and repos in the following list. Check them out!
Thanks to all our mentors around the world for joining us in this endeavour, for mentoring with kindness and technical expertise. A huge shout out to those who have been by our side for so many years in a row. Many thanks to Tommaso Soru, Beyza Yaman, Diego Moussalem, Edgard Marx, Mariano Rico, Thiago Castro Ferreira, Luca Virgili as well as Sebastian Hellmann, Stuart Chan, Amandeep Srivastava, Julio Hernandez and Jan Forberg.
During the previous years you might have noticed that we always organized a little lottery to decide which mentor or organization admin can join the annual GSoC mentor summit. As this year’s event will be held online, space is not limited to 300 something mentors but is open to all organization admins and mentors alike. The GSoC Virtual Mentor Summit takes place October 15- 16, 2020 and this year we hope all our mentors will find the time to join and exchange with fellow mentors from around dozens of open source projects.
After GSoC is before the next GSoC
We can not wait for the 2021 edition. Likewise, if you are an ambitious student who is interested in open source development and working with DBpedia you are more than welcome to either contribute your own project idea or apply for project ideas we offer starting in early 2021.
In case you like to mentor a project do not hesitate to also get in touch with us via firstname.lastname@example.org.
Stay tuned, frequently check Twitter, LinkedIn or the DBpedia Forum to stay in touch and don’t miss your chance of becoming a crucial force in this endeavour as well as a vital member of the DBpedia community.