Damien Graux - On the Efficient Distributed Evaluation of SPARQL Queries

13:00

Thursday

Dec

2016

Thesis defence

Place:

Montbonnot, INRIA

Organized by:

Damien Graux

Speaker:

Damien Graux

Teams:

TYREX

Lieu de soutenance :

Grand Amphi Inria Grenoble Alpes

Membres du jury :

M. Mohand-Saïd Hacid, Pr, Univ Claude Bernard Lyon 1, rapporteur
M. Patrick Valduriez, DR, Inria Sophia Antipolis, rapporteur
M. Jérôme Euzenat, DR, Inria Grenoble Alpes, examinateur
M. Farouk Toumani, Pr, Univ Blaise Pascal Clermont-Ferrand II, examinateur
M. Nabil Layaïda, DR, Inria Inria Grenoble Alpes, directeur de thèse
M. Pierre Genevès, CR, Cnrs, co-directeur de thèse

The Semantic Web standardized by the World Wide Web Consortium aims at providing a common framework that allows data to be shared and analyzed across applications. The Resource Description Framework (RDF) and the query language SPARQL constitute two major components of this vision.

Because of the increasing amounts of RDF data available, dataset distribution across clusters is poised to become a standard storage method. As a consequence, efficient and distributed SPARQL evaluators are needed.

To tackle these needs, we first benchmark several state-of-the-art distributed SPARQL evaluators while monitoring a set of metrics which is appropriate in a distributed context (e.g. network traffic). Then, an analysis driven by typical use cases leads us to define new development perspectives in the field of distributed SPARQL evaluation. On the basis of these perspectives, we design several efficient distributed SPARQL evaluators whose performances are validated and compared to state-of-the-art evaluators. For instance, our distributed SPARQL evaluator named SPARQLGX offers efficient time performances while being resilient to the loss of nodes.