The SmartDataLake Toolkit

The SmartDataLake toolkit offers a full stack of components for exploring and analyzing data in a data lake, aiming to support data analysts and data scientists on their journey from raw data to actionable insights. In particular, SDL-Virt provides tools […]

Open data exploration – Pilot testing by SYNYO

Throughout the third pilot use case of the SmartDataLake project, SYNYO has applied selected components from the SmartDataLake toolkit to complement the end-to-end workflow of processing and analyzing open data and converting it into insights and actionable information. This was performed […]

Proteus-RAW integration

Connecting two distinct systems is a legitimate approach when crafting a standalone software solution that includes features of both. But handy as standard APIs and serialization formats can be, the combination won’t always perform satisfactorily. Integration […]

B2B Portfolio recommendation

B2B Portfolio recommendation is one of the pilot use cases in SmartDataLake, led by SpazioDati. Given the list of clients of a company, the objective of business-to-business (B2B) portfolio recommendation is to find other potential clients (called “leads”) for the […]

Fast Heterogeneous Analytics in SmartDataLake

Complex analytical queries with multiple joins over vast amounts of heterogeneous data usually take a considerable amount of time to execute and prevent interactive analysis. In SmartDataLake (SDL), the EPFL team accelerates execution by taking advantage of the diverse hardware […]

Project Facts
SmartDataLake is a Research and Innovation action funded by the Horizon 2020 Framework Programme of the European Union.

Project Full Title: Sustainable Data Lakes for Extreme-Scale Analytics

Topic: ICT-12-2018-2020 - Big Data technologies and extreme-scale analytics

Grant Agreement No: 825041

Duration: 36 months (1/2019 – 12/2021)

Coordinated by: IMSI / Athena RC