Personal Information
Organization / Workplace
Eindhoven Area, Netherlands Netherlands
Occupation
Principal Consultant
Industry
Technology / Software / Internet
- Presentations
- Documents
- Infographics
Automated Data Quality Assurance with Machine Learning and Autoencoders
Institute of Contemporary Sciences
•
4 years ago
Business Value Metrics for Data Governance
DATAVERSITY
•
5 years ago
Data-Ed Webinar: Data Quality Engineering
DATAVERSITY
•
10 years ago
Simplifying Real-Time Architectures for IoT with Apache Kudu
Cloudera, Inc.
•
6 years ago
Apache Spark At Scale in the Cloud
Databricks
•
4 years ago
Extreme Apache Spark: how in 3 months we created a pipeline that can process 2.5 billion rows a day
Josef A. Habdank
•
8 years ago
The alignment
Alberto Brandolini
•
6 years ago
Chasing elephants
Alberto Brandolini
•
7 years ago
Data Audit Approach To Developing An Enterprise Data Strategy
Alan McSweeney
•
9 years ago
EuroBSDcon 2017 System Performance Analysis Methodologies
Brendan Gregg
•
6 years ago
Migrating existing monolith to serverless in 8 steps
Yan Cui
•
4 years ago
HBase Advanced Schema Design - Berlin Buzzwords - June 2012
larsgeorge
•
11 years ago
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
larsgeorge
•
7 years ago
An Overview on Optimization in Apache Hive: Past, Present Future
DataWorks Summit/Hadoop Summit
•
7 years ago
Efficient Data Formats for Analytics with Parquet and Arrow
DataWorks Summit/Hadoop Summit
•
7 years ago
Interactive Analytics at Scale in Apache Hive Using Druid
DataWorks Summit/Hadoop Summit
•
7 years ago
Interactive SQL POC on Hadoop (Hive, Presto and Hive-on-Tez)
Sudhir Mallem
•
8 years ago
Introduction to Kafka Streams
Guozhang Wang
•
8 years ago
Kafka 102: Streams and Tables All the Way Down | Kafka Summit San Francisco 2019
Michael Noll
•
4 years ago