Actions and Detail Panel
Introduction to web crawling with StormCrawler (and Elasticsearch)
Mon 24 April 2017, 10:00 – 17:00 CEST
Price includes 19% VAT / Mwst.
In this course, we will explore [StormCrawler](http://stormcrawler.net), a collection of resources for building low-latency, large scale web crawlers on Apache Storm. After a short introduction to Apache Storm and an overview of what Storm-Crawler provides, we'll put it to use straight away for a simple crawl before moving on to the deployed mode of Storm
In the second part of the session, we will then introduce metrics and index documents with Elasticsearch and Kibana and dive into data extraction. Finally, we'll cover recursive crawls and scalability. This course will be hands-on: attendees will run the code on their own machines.
This course of workshops will run for one full day (with a break for lunch):
Monday April 24th 2017
10am - 5pm CET