This event is in the past.
When: Monday, April 7 2014, 6:00 PM
Where: ETH Zurich, CHN Building, Room E46 (Map: http://bit.ly/1gSswOw)
Agenda for meetup #11:
Building a Hadoop Data Warehouse with Impala (40')
Marcel Kornacker, Impala achitect and tech lead at Cloudera for new products developments
Impala (impala.io) raises the bar for SQL query performance on Apache Hadoop. With Impala, you can query Hadoop data – including SELECT, JOIN, and aggregate functions – in real time to do BI-style analysis. As a result, Impala makes a Hadoop-based enterprise data hub function like an enterprise data warehouse for native Big Data.
Closing The Loop for Evaluating Big Data Analysis (30')
Karolina Alexiou, Data Scientist at Teralytics
Analysis of big data is useless (and a lot harder to sell) when you can't measure whether the resulting insights are correct. In order to develop sophisticated data analysis methodologies tailored to your particular use-case, you need to be able to figure out what works and what doesn't. It is crucial to gather data independently to your analysis (ground truth) and compare it to your results using the correct metrics and account for biases. The sheer volume of data means that you also need to have a strategy for slicing and dicing the data to isolate the really valuable parts, and also, a keen eye for visualization so that you can quickly compare methodologies and support the validity of your insights to third parties.
Networking apéro sponsored by consulteer
About the sponsors
Big thanks to consulteer for sponsoring food and drinks and Teralytics for providing the location!