Free

Data Science Tech Talk: Big Data! Interactively Analyse 100GB of Data using...

Event Information

Share this event

Date and Time

Location

Location

Oxford Centre for Innovation

New Road

Oxford

OX1 1BY

United Kingdom

View Map

Friends Who Are Going
Event description

Description

Join us for one of our Evening Tech Talk's exploring state-of-the-art techniques and their practical applications.

Evening Talk: Big Data! Interactively Analyse 100GB of Data using Spark, Amazon EMR and Zeppelin

You may have been hearing a lot of buzz around Big Data, Apache Spark, Amazon Elastic Map Reduce (EMR) and Apache Zeppelin. What’s the fuss about, and how can you benefit from these state of the art technologies?

In this highly interactive session, you will learn how to leverage Spark to rapidly mine a large real-world data set. We will conduct the analysis live entirely using an iPython Notebook to show you how easy it can be to get to grips with these technologies.

In the first part of the session, we characterise what Big Data is. We will then use a sample of data from the Open Library dataset, and you will learn how to apply common Spark patterns to extract insights and aggregate data. In the second part of the session, you will see how to leverage Spark on Amazon EMR to scale your data processing queries over a cluster of machines and interactively analyse a large data set (100GB) with a Zeppelin Notebook. Along the way, you will learn gotchas as well as useful performance and monitoring tips.

Speaker: Dr Raoul-Gabriel Urma, Cambridge Spark

This event is free to attend, but please make sure you register to secure a place. Thank you!

Share with friends

Date and Time

Location

Oxford Centre for Innovation

New Road

Oxford

OX1 1BY

United Kingdom

View Map

Save This Event

Event Saved