The German federal government has named Faceted Wikipedia Search one of the 365 most innovative ideas in Germany as part of the Deutschland – Land der Ideen competition. The competition showcases innovative ideas in areas such as science and technology, business, education, art and ecology. The patron of the competition is the German President, Horst Köhler.
Faceted Wikipedia Search allows users to pose complex queries against Wikipedia, such as “Which rivers flow into the Rhine and are longer than 50 kilometers?” or “Which skyscrapers in China have more than 50 floors and were constructed before 2000?”. The answers to these queries are not generated by keyword matching, as they are in search engines like Google or Yahoo, but from structured information that has been extracted from many different Wikipedia articles. Faceted Wikipedia Search thus lets users query Wikipedia like a structured database and enables them to truly exploit Wikipedia’s collective intelligence.
Faceted Wikipedia Search can be tested online at http://dbpedia.neofonie.de/browse/
Please click on the example queries below to experience Faceted Wikipedia Search in action:
- Rivers that flow into the Rhine and are longer than 50 kilometers
- Japanese cities in the Kansai region with more than 100000 inhabitants
- French scientists who were born in the 19th century
- Skyscrapers in China that were constructed before 2000 and have more than 50 floors
- Actors in the American TV series Lost who were born in England
- Endangered Primates
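To make the idea of querying structured data concrete, the first example query (rivers that flow into the Rhine and are longer than 50 kilometers) can be sketched as a combination of attribute constraints, or facets, over a set of records. This is only a minimal illustration and not the actual implementation of Faceted Wikipedia Search: the `River` record, the `facet_filter` helper and the placeholder data are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class River:
    """Hypothetical record type standing in for extracted structured data."""
    name: str
    flows_into: str
    length_km: float


# Made-up placeholder records for illustration only, not real DBpedia data.
RIVERS = [
    River("River A", "Rhine", 120.0),
    River("River B", "Rhine", 35.0),
    River("River C", "Danube", 300.0),
    River("River D", "Rhine", 50.0),
]


def facet_filter(items, **constraints):
    """Return the items that satisfy every constraint.

    Each keyword argument names an attribute and supplies a predicate
    that the attribute's value must satisfy.
    """
    return [
        item
        for item in items
        if all(pred(getattr(item, attr)) for attr, pred in constraints.items())
    ]


# "Rivers that flow into the Rhine and are longer than 50 kilometers"
result = facet_filter(
    RIVERS,
    flows_into=lambda value: value == "Rhine",
    length_km=lambda value: value > 50,
)
names = [river.name for river in result]
print(names)
```

Each facet narrows the result set independently, which is why such constraints can be combined freely; a keyword search has no comparable notion of a numeric condition such as "longer than 50".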
Faceted Wikipedia Search has been jointly developed by neofonie GmbH, Berlin and the Web-based Systems Group at Freie Universität Berlin. Technically, Faceted Wikipedia Search is based on the DBpedia data extraction framework and the neofonie search engine. The DBpedia data extraction framework extracts structured information from Wikipedia, such as the content of infoboxes, which summarize the relevant facts as a table at the top right-hand side of Wikipedia articles, and represents the extracted data using the Resource Description Framework (RDF), a data model for web-based systems. Currently, the framework extracts around 190 million facts from the English edition of Wikipedia and 289 million facts from Wikipedia editions in 90 further languages. The DBpedia data extraction framework is developed by the Web-based Systems Group at Freie Universität Berlin and the Agile Knowledge Engineering and Semantic Web group at Universität Leipzig. The neofonie search engine is employed to search and navigate the extracted data.
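The extraction step described above, turning infobox entries into RDF statements, can be sketched roughly as follows. This is a heavily simplified illustration and not the DBpedia framework itself: the sample infobox, the `example.org` namespaces and the helper functions are assumptions made for this example, and real infobox markup is considerably messier (nested templates, links, units).

```python
import re

# Hypothetical, simplified infobox markup; not taken from a real article.
INFOBOX = """{{Infobox river
| name = Example River
| mouth = Rhine
| length_km = 120
}}"""

# Placeholder namespaces chosen for this sketch.
RESOURCE_NS = "http://example.org/resource/"
PROPERTY_NS = "http://example.org/property/"

# Matches lines of the form "| key = value".
LINE_RE = re.compile(r"^\|\s*(\w+)\s*=\s*(.+?)\s*$")


def extract_triples(page_title, wikitext):
    """Turn each "| key = value" line into a (subject, predicate, object) triple."""
    subject = RESOURCE_NS + page_title.replace(" ", "_")
    triples = []
    for line in wikitext.splitlines():
        match = LINE_RE.match(line)
        if match:
            key, value = match.groups()
            triples.append((subject, PROPERTY_NS + key, value))
    return triples


def escape_literal(text):
    """Escape backslashes and double quotes for use in a quoted literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def to_ntriples(triples):
    """Serialize triples in an N-Triples-like form with plain string literals."""
    lines = []
    for subject, predicate, obj in triples:
        lines.append(f'<{subject}> <{predicate}> "{escape_literal(obj)}" .')
    return "\n".join(lines)


triples = extract_triples("Example River", INFOBOX)
print(to_ntriples(triples))
```

Every infobox row becomes one statement about the article's subject, so facts from many articles end up in a single uniform data set that a search engine can filter by property and value.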
In the context of the W3C Linking Open Data initiative, the DBpedia dataset is currently being interlinked with various other open-license databases and is developing into a crystallization point of the emerging Web of Data. In the future, the links between databases will allow applications like Faceted Wikipedia Search to answer queries based not only on Wikipedia knowledge but also on knowledge from a worldwide web of databases.
Faceted Wikipedia Search will be presented as part of the Land der Ideen series on April 12th, 2010 at neofonie, Berlin.
Additional information about the Land der Ideen competition, DBpedia, neofonie and the Web of Data can be found at:
- Deutschland – Land der Ideen
- Deutschland – Land der Ideen: Winners 2010
- Deutschland – Land der Ideen: Background information on the 2010 competition
- Press release about Faceted Wikipedia Search (in German)
- DBpedia website
- Neofonie website
- DBpedia – A Crystallization Point for the Web of Data (article)
- Linked Data – The Story So Far (article)