2013 Special Theme: Towards Industrial Linked Data Ecosystems
Abstract
The quantity of published Linked Data continues to increase. However, applications that exploit Linked Data are not yet widespread. Reasons may include a lack of suitable solutions for a number of open problems. The diversity and dynamics of LOD sources have brought new challenges in seamless data integration, dynamic discovery, provenance tracking, and quality assessment at the Web scale. Addressing these issues requires joint community efforts from both LOD provision and consumption perspectives, including the development and investigation of concepts that can lead towards the realisation of a sustainable Linked Data ecosystem. The objective of this workshop is to provide a focused venue for academic and industrial discussions on concepts, algorithms, infrastructure and tools (including systematic analysis and rigorous evaluation) that help to exploit Linked Data (and not just to produce it).
The workshop will be co-located with the 12th International Semantic Web Conference (ISWC) in Sydney, Australia. Networked communication will be encouraged during the workshop using IRC, microblogging and other services, provided with the official hashtag #cold2013
to follow the live-stream of the event.
News
- 2013-10-14: The workshop program is now online.
- 2013-10-11: Yves Raimond (BBC Research & Development) will be this year's keynote speaker. He is going to talk about "Consuming Linked Data at the BBC" (see our program for an abstract).
- 2013-09-12: The workshop proceedings are online as CEUR-WS.org Vol-1034.
Objectives
The term Linked Data refers to a practice for publishing and interlinking structured data on the Web. Since the practice has been proposed in 2006, a grass-roots movement has started to publish and to interlink multiple open databases on the Web following the Linked Data principles. Due to conference workshops, tutorials, and general evangelism an increasing number of data publishers such as the BBC, Thomson Reuters, The New York Times, the Library of Congress, and the UK and US governments have adopted Linked Data principles. The ongoing effort resulted in bootstrapping the Web of Data which, today, comprises billions of RDF triples including millions of links between data sources. The published datasets include data about books, movies, music, radio and television programs, reviews, scientific publications, genes, proteins, medicine, and clinical trials, geographic locations, people, companies, statistical and census data, etc.
Access to Linked Data presents exciting opportunities for the next generation of Web-based applications: data from different providers can be aggregated and fragmentary information from multiple sources can be integrated to achieve a more comprehensive view. While a few applications, such as the BBC music guide have used Linked Data to significant benefit, the deployment methodology has been to harvest the data of interest from the Web to create a private, disconnected repository for each specific application. Such an approach can only be the beginning; new concepts to consume Linked Data are required in order to exploit the Web of Linked Data to its full potential. The concepts, patterns and tools necessary are very different from situations when resource identifiers are local or known a-priori, whole-repository queries are possible, access to the repository is reliable and relevant data sources are known to be trustworthy.
Several open issues that make the development of Linked Data based applications a challenging or still impossible task. These issues include the lack of approaches for seamless integration of Linked Data from multiple sources, for dynamic, on-the-fly discovery of available data, for information quality assessment, and for elaborate end user interfaces. These open issues can only be addressed appropriately when they are conceived as research problems that require the development and systematic investigation of novel approaches. The International Workshop on Consuming Linked Data (COLD) aims to provide a platform for the presentation and discussion of such approaches. Our main objective is to receive submissions that present scientific discussion (including systematic evaluation) of concepts and approaches, instead of exposition of features implemented in Linked Data based applications. For practical systems without formalization or evaluation we refer interested participants to other offerings at ISWC, such as the Semantic Web Challenge or the Demo Track. As such, we see our workshop as orthogonal to these events.
Program
- 9:00 - 9:10: Workshop Introduction
- 09:10 - 10:10: Keynote: Consuming Linked Data at the BBC (Yves Raimond - BBC Research & Development)
Abstract: The BBC consumes a large amount of Linked Data, aggregated from a mixture of external, internal and commercial data sources, both for managing parts of its web site and for data mining projects. In this talk we are going to talk about various uses of Linked Data at the BBC and about some of the challenges we encountered.Bio: Yves Raimond holds a PhD from Queen Mary, University of London. His thesis was entitled "A Distributed Music Information System", and defined a framework for applying a range of Semantic Web technologies for managing and distributing music-related information. As part of his thesis, he contributed extensively to what would become the "Linking Open Data" community project. Since 2008, he has been working for the BBC, first on the bbc.co.uk/programmes service, publishing structured data about all BBC programmes, and then in BBC R&D on the ABC-IP Technology Strategy Board collaborative project, aiming at unlocking archives by interlinking them with related datasets. As part of this project he has worked on a prototype combining automated interlinking with Linked Data sources and crowdsourcing to open up the BBC World Service archive.
QUERY track
- 10:10 - 10:30:
Choosing Between Graph Databases and RDF Engines for Consuming and Mining Linked Data (Domingo De Abreu, Alejandro Flores, Guillermo Palma, Valeria Pestana, Jose Pinero, Jonathan Queipo, Jose Sanchez and Maria-Esther Vidal)
Additional material: see the project Web site of Graphium (the graph database benchmark described in the paper), and check out the source code and datasets used for the benchmark
- 10:30 - 11:00: BREAK
- 11:00 - 11:20: Natural Language Query Translation into SPARQL using Patterns (Camille Pradel, Ollivier Haemmerlé and Nathalie Hernandez)
- 11:20 - 11:40: Including Co-referent URIs in a SPARQL Query (Christian Y. A. Brenninkmeijer, Carole Goble, Alasdair J. G. Gray, Paul Groth, Antonis Loizou and Steve Pettifer)
DYNAMICS track
- 11:40 - 12:00: On-the-fly Integration of Static and Dynamic Sources (Andreas Harth, Craig Knoblock, Steffen Stadtmüller, Rudi Studer and Pedro Szekely)
- 12:00 - 12:20: Change-a-LOD: Does the Schema on the Linked Data Cloud Change or Not? (Renata Dividino, Ansgar Scherp, Gerd Gröner and Thomas Grotton)
- 12:20 - 12:40: Self-Sustaining Platforms: a Semantic Workflow Engine (Sam Coppens, Ruben Verborgh, Erik Mannens and Rik Van de Walle)
- 12:40 - 13:45: LUNCH
META track
- 13:45 - 14:05: Consuming Linked data in Supply Chains: Enabling data visibility via Linked Pedigrees (Monika Solanki and Christopher Brewster)
- 14:05 - 14:25: Pleasantly Consuming Linked Data with RDF Data Descriptions (Michael Schmidt and Georg Lausen)
- 14:25 - 14:45: Content-Preserving Graphics (Timothy Lebo, Alvaro Graves and Deborah McGuinness)
- 14:45 - 15:05:
Bounds: Expressing Reservations about Incoming Data (Martin G. Skjæveland and Audun Stolpe)
Additional material: see the Boundz project Web site, for the Boundz vocabulary, the prototype implementation, and the evaluation test and results discussed in the paper
POSITION track
- 15:05 - 15:20: Towards an RDF Analytics Language: Learning from Successful Experiences (Fadi Maali and Stefan Decker)
- 15:20 - 16:00: BREAK
- 16:05 - 16:15: Rights declaration in Linked Data (Víctor Rodríguez-Doncel, Asunción Gómez-Pérez and Nandana Mihindukulasooriya)
- 16:15 - 16:30: Linked Data for Financial Reporting (Masatomo Goto, Bo Hu, Aisha Naseer, and Pierre-Yves Vandenbussche)
- 16:30 - 16:45: Linked Data Platform as a novel approach for Enterprise Application Integration (Nandana Mihindukulasooriya, Raúl García Castro and Miguel Esteban Gutiérrez)
PANEL
- 16:45 - 17:45:
Panel Title: Enterprise Linked Data
Panelists: Pascal Hitzler (Wright State University), Peter Mika (Yahoo!), Spyros Kotoulas (IBM Research), Yves Raimond (BBC R&D)
SOCIAL GATHERING
- 20:30 - ... : Linked Data Gathering at Palace Hotel, just 5 minutes walking from the workshop venue.
Proceedings
The workshop proceedings are online as CEUR-WS.org Vol-1034.
Organizing Committee
Steering Committee
Program Committee
- Mathieu d'Aquin, Open University, UK
- Cosmin Basca, University of Zurich, Switzerland
- Gong Cheng, Nanjing University, China
- Oscar Corcho, Universidad Politecnica de Madrid, Spain
- Aba-Sah Dadzie, University of Birmingham, UK
- Christina Feilmayr, Johannes Kepler University of Linz, Austria
- Yolanda Gil, University of Southern California, USA
- Claudio Gutierrez, Universidad de Chile, Chile
- Andreas Harth, Karlsruhe Institute of Technology, Germany
- Katja Hose, Max-Planck-Institut für Informatik, Germany
- Bo Hu, Fujitsu, UK
- Hak-Lae Kim, Samsung R&D, Korea
- Terunobu Kume, Fujitsu, Japan
- Roger Menday, Fujitsu, UK
- Adrian Mocan, SAP, Germany
- Aisha Naseer, Fujitsu, UK
- Alexandre Passant, seevl.net, MDG Web ltd, Ireland
- Giuseppe Pirro, Free University of Bolzano, Italy
- Axel Polleres, Siemens AG Österreich, Austria
- Matthew Rowe, Open University, UK
- Kai-Uwe Sattler, TU Illmenau, Germany
- Bernhard Schandl, Gnowsis.com, Austria
- Stefan Schlobach, Vrije Universiteit, Netherlands
- Aibo Tian, University of Texas at Austin, USA
- Raphael Troncy, EURECOM, France
- Pierre-Yves Vandenbussche, Fujitsu, Ireland
- Boris Villazon-Terrazas, iSOCO, Intelligent Software Components, Spain
Contact
For further information about the workshop, please contact the workshops chairs at cold.org.ws@googlemail.com
History
COLD 2013 is the fourth edition of the Consuming Linked Data workshop series. Previous editions are COLD 2012, COLD 2011, and COLD 2010.
Sponsors
