Introduction to Big Data and Apache Spark

Event Information

Location

kenic GmbH

Linienstraße 130

Germany

Description

Apache Spark (http://spark.apache.org) is currently one of the fastest-growing projects in the Big Data ecosystem. It lets you process large data sets faster and more easily than existing solutions. This workshop will jump-start your work with Spark and help you make the transition from analyst or developer to Big Data engineer.


Agenda


I. Introduction to Big Data

  • Definition
  • What is Big Data?
  • History of Big Data
  • Big Data problems

II. Apache Spark

  • Introduction
  • History
  • Spark vs Hadoop
  • Resilient Distributed Datasets (RDDs)
  • Architecture
  • Operation variants
  • Administration

III. Spark Core

  • Introduction
  • Java vs Scala vs Python
  • Connecting to cluster
  • Dataset distribution
  • RDD operations
  • Shared variables
  • Execution and testing

IV. Spark SQL

  • Introduction
  • Spark SQL vs Hive
  • Basic operation
  • Data and schema
  • Queries
  • Hive integration
  • Execution and testing


Minimum requirements


  • basic knowledge of Java / SQL / bash / Python (or another scripting language)
  • device: Intel Core i5 or better, 6GB RAM

This is a BYOD (bring your own device) workshop, so remember to bring your own laptop.

The workshop will be conducted in English.


Trainer - Jakub Nowacki


Jakub is a graduate of the University of Bristol, where he obtained a PhD in Engineering Mathematics. On a daily basis he applies his analytical and development skills in software development. He is mostly interested in distributed processing and analysis of big data sets. Jakub originally has a C/C++ background but currently works mostly in the JVM and Python world.


Ticket price includes


  • Full-day workshop
  • Lunch
  • Coffee & Tea


Last edition (2016)


Voxxed Days Berlin

 
