Polytechnic University of Valencia Congress, CARMA 2018 - 2nd International Conference on Advanced Research Methods and Analytics

Font Size: 
Blockchain-backed analytics. Adding blockchain-based quality gates to data science projects.
Markus Herrmann, Jörg Petzold, Vivek Bombatkar

Last modified: 11-07-2018


A typical analytical lifecycle in data science projects starts with the process of data generation and collection, continues with data preparation and preprocessing and heads towards project specific analytics, visualizations and presentations. In order to ensure high quality trusted analytics, every relevant step of the data-model-result linkage needs to meet certain quality standards that furthermore should be certified by trusted quality gate mechanisms.

We propose “blockchain-backed analytics”, a scalable and easy-to-use generic approach to introduce quality gates to data science projects, backed by the immutable records of a blockchain. For that reason, data, models and results are stored as cryptographically hashed fingerprints with mutually linked transactions in a public blockchain database.

This approach enables stakeholders of data science projects to track and trace the linkage of data, applied models and modeling results without the need of trust validation of escrow systems or any other third party.

Full Text: PDF