A tanfolyamról
Apache Storm is an advanced and distributed open source stream engine processing big data at an extremely high speed. Written in Clojure programming language, it is a real-time computational system which makes it easier to process unbounded streams of data. One of its advantages is that it can be used on any programming language. Built to accept tons of data flowing in with high speed from various sources, it analyses and publishes real-time updates to a UI or any other specified target place.
Apache Storm training will help you master its concepts including its architecture, installation, planning, and configuration. You will learn to leverage the power of Storm and use it for real-time processing of big data. The course also helps you get an insight on the way Storm interfaces with other frameworks like Kafka, Java, and Cassandra. The explosion of big data and the fact that organizations have realized the potential of using tools such as Storm to analyse it, ensures that Apache Storm experts will have great opportunities to contribute to these enterprises and become indispensable to them.
Individual Benefits:
- An individual with Apache Storm course can use any programming language in Storm framework
- An individual can process huge volumes of data with Apache Storm training.
Organizational Benefits:
- Organizations can start data processing to gain early profit as soon as Storm is installed in their systems
- Organizations can perform data refresh and end-to-end delivery response in seconds
- With Apache Storm, organizations can process data without losing anything, which is a major concern of all the organizations
- Organizations are easily detecting frauds by early risk mitigation with Apache Storm
- Using Apache Storm, organizations can quickly redeploy unfinished tasks.
What you will learn
-
Big Data and its Uses
Learn what is Big Data and relevant concepts, where it is used, and various types of data analytics -
Storm Introduction
Understand the concepts of Storm architecture, use cases, & its usage in real-time stream processing -
Storm Installation and Configuration
Learn how to set up Storm and what system configuration is needed to create various topologies -
Spouts and Bolts
Get the knowledge on how to use spouts & bolts along with their mechanism and life cycle -
Storm Trident
Learn how to handle failures in Trident topologies & how to perform real-time computing in Storm -
Real-time industry level projects
Get your concepts clear by working on real-time projects on Twitter, Spotify and travel websites
Who should attend the course
- Software Professionals
- Data Scientists
- ETL Developers
- Project Managers
- Mainframe Professionals
- Analytics professionals
- Research professionals
- IT developers and testers
- Big Data Analysts and Hadoop Professionals
- Professionals who want to acquire a good understanding of Apache Storm
- An individual interested to perform real-time analytics
We provide the course in English.
Tematika
Curriculum
1 Introduction to Big Data and Storm
Learning Objectives:
Learn about the use of Big data, difference between Batch and Real-time Processing. How Apache Storm can be helpful for Real-time processing and so on.
Topics:
- What is Big Data, Big Data Analytics: Batch Vs Real Time
- Hadoop for Batch Analytics
- Shortcomings of Hadoop
- Storm for Real Time Analytics
- What is Storm
- Use Cases of Storm
- Components of Storm
- Properties of Storm
- Storm Vs Hadoop
- Storm Data Model
- Storm Architecture
- Storm Bolt
- Storm Topology
- Serialization-Deserialization
- Types of Topologies
Hands-on:
- You will learn various use cases of Apache Storm-
- Batch processing vs real-time processing
- Aggregating click and impression data from different streams
- Trending search on any e-commerce portal
- Twitter Streaming
2 Storm Installation and Configuration
Learning Objectives:
Learn How to install Storm and various Groupings architecture
Topics:
- Storm Installation
- Storm Running Modes
- Creating First Storm Topology
Hands-on:
- Setting up Storm Cluster
- Various Components of Cluster
- Storm Grouping
3 Storm Advanced Concepts
Learning Objectives:
Learn about internal components of Storm and their working. You will be able to use Spouts and bolts and their mechanisms. Different type of Spouts and their working. Lifecycle of bolts and its working.
Topics:
- Spouts
- Types of Spouts
- Structure of Spout
- Bolt
- Structure of Bolt
- Stream Groupings
- Ack and Fail
- Ack Timeout
- Anchoring
- Topology Lifecycle
- Data Ingestion in Storm
- Topology–Connecting Spout and Bolt
Hands-on:
- Trending Search topology
- You will be given file of various search keywords you have to find top 10 search keywords in last 60 seconds at any moment
4 Storm Trident
Learning Objectives:
Learn about Trident topology. Performing complex transformations on the fly using the Trident topology: Map, Filter, Windowing and Partitioning operations.
Topics:
- Trident
- Trident Data Model
- Stateful Processing using Trident
- Operations in Trident
- Trident State
- Trident Topology
- Fault-tolerance Levels
- Pipelining
- Trident Advantages
Hands-on:
- Twitter Data Analysis using Trident
Kinek ajánljuk
Előfeltételek
Prerequisites
There are no prerequisites required to attend the Apache Storm training. But, having elementary knowledge of programming languages like Java, Object-Oriented Programming, Hadoop, Linux, will help you understand concepts easily.