11th Swiss Big Data User Group Meeting

Monday, 7th April 2014 at 5pm - 8pm

Location: ETH Zurich, building CHN, room E46, Universitätstrasse 16, 8092 Zurich


When: Monday, April 7 2014, 6:00 PM
Where:
 ETH Zurich, CHN Building, Room E46 (Map: http://bit.ly/1gSswOw)


Agenda for meetup #11:

Building a Hadoop Data Warehouse with Impala (40')
Marcel Kornacker, Impala architect and tech lead for new product development at Cloudera

Impala (impala.io) raises the bar for SQL query performance on Apache Hadoop. With Impala, you can run SQL queries on Hadoop data, including SELECT, JOIN, and aggregate functions, in real time for BI-style analysis. As a result, Impala makes a Hadoop-based enterprise data hub function like an enterprise data warehouse for native Big Data.

Closing The Loop for Evaluating Big Data Analysis (30')
Karolina Alexiou, Data Scientist at Teralytics

Analysis of big data is useless (and a lot harder to sell) when you can't measure whether the resulting insights are correct. To develop sophisticated data analysis methodologies tailored to your particular use case, you need to be able to figure out what works and what doesn't. It is crucial to gather data independently of your analysis (ground truth), compare it to your results using the correct metrics, and account for biases. The sheer volume of data means that you also need a strategy for slicing and dicing the data to isolate the really valuable parts, as well as a keen eye for visualization, so that you can quickly compare methodologies and demonstrate the validity of your insights to third parties.

Networking apéro sponsored by consulteer


About the sponsors

Big thanks to consulteer for sponsoring food and drinks and Teralytics for providing the location!



