What you will learn
You will learn the fundamental skills you need to manage and manipulate large and/or fast datasets.
3Vs of big data
- Lambda architecture
- CAP theorem
- Resilient Distributed Datasets
- Pipeline tuning
Languages and technologies
- Python programming language
- Hadoop, Spark, Storm, Zeppelin, EC2
How you will learn
The one-day course is extremely interactive and hands-on. You will learn by working through concrete problems with real datasets. You will be taught by academic and industry experts in the field, who have a wealth of experience and knowledge to share.
Is it for me?
Audience: Those who are curious about the “Big Data” space and who want to feel comfortable getting their hands dirty with high volume, high velocity, diverse real-world datasets.
Level: Intermediate (ideally you will have attended the Introduction to Data Science bootcamp)
Prerequisites: good knowledge of python, some familiarity with matrices, basic understanding of machine learning practice (as taught in Introduction to Data Science)
For further information about this one-day course visit: http://cambridgecoding.com/workshop/bigdata