Apache Kafka Certification Training

Apache Kafka Certification Training helps you in learning the concepts about Kafka Architecture, Configuring Kafka Cluster, Kafka Producer, Kafka Consumer, Kafka Monitoring. Apache Kafka Certification Training is designed to provide insights into Integration of Kafka with Hadoop, Storm and Spark, understand Kafka Stream APIs, implement Twitter Streaming with Kafka, Flume through real life cases studies.

Goal: In this module, you will understand where Kafka fits in the Big Data space, and Kafka Architecture. In addition, you will learn about Kafka Cluster, its Components, and how to Configure a Cluster

Skills:
 
  • Kafka Concepts
  • Kafka Installation
  • Configuring Kafka Cluster
GoalKafka Producers send records to topics. The records are sometimes referred to as Messages. In this Module, you will work with different Kafka Producer APIs.
Skills:
  • Configure Kafka Producer
  • Constructing Kafka Producer
  • Kafka Producer APIs
  • Handling Partitions
Objectives:
At the end of this module, you should be able to:
  • Construct a Kafka Producer
  • Send messages to Kafka
  • Send messages Synchronously & Asynchronously
  • Configure Producers
  • Serialize Using Apache Avro
  • Create & handle Partitions
Goal: Applications that need to read data from Kafka use a Kafka Consumer to subscribe to Kafka topics and receive messages from these topics. In this module, you will learn to construct Kafka Consumer, process messages from Kafka with Consumer, run Kafka Consumer and subscribe to Topics

Skills:

  • Configure Kafka Consumer
  • Kafka Consumer API
  • Constructing Kafka Consumer

Objectives: At the end of this module, you should be able to:

  • Perform Operations on Kafka
  • Define Kafka Consumer and Consumer Groups
  • Explain how Partition Rebalance occurs
  • Describe how Partitions are assigned to Kafka Broker
  • Configure Kafka Consumer
  • Create a Kafka consumer and subscribe to Topics
  • Describe & implement different Types of Commit
  • Deserialize the received messages
Goal: Apache Kafka provides a unified, high-throughput, low-latency platform for handling real-time data feeds. Learn more about tuning Kafka to meet your high-performance needs.
Skills:
  • Kafka APIs
  • Kafka Storage
  • Configure Broker
Goal:  Kafka Cluster typically consists of multiple brokers to maintain load balance. ZooKeeper is used for managing and coordinating Kafka broker. Learn about Kafka Multi-Cluster Architectures, Kafka Brokers, Topic, Partitions, Consumer Group, Mirroring, and ZooKeeper Coordination in this module.
Skills:
  • Administer Kafka
Objectives:
At the end of this module, you should be able to
  • Understand Use Cases of Cross-Cluster Mirroring
  • Learn Multi-cluster Architectures
  • Explain Apache Kafka’s MirrorMaker
  • Perform Topic Operations
  • Understand Consumer Groups
  • Describe Dynamic Configuration Changes
  • Learn Partition Management
  • Understand Consuming and Producing
  • Explain Unsafe Operations
Goal: Learn about the Kafka Connect API and Kafka Monitoring. Kafka Connect is a scalable tool for reliably streaming data between Apache Kafka and other systems.
Skills:
  • Kafka Connect
  • Metrics Concepts
  • Monitoring Kafka
Objectives: At the end of this module, you should be able to:
  • Explain the Metrics of Kafka Monitoring
  • Understand Kafka Connect
  • Build Data pipelines using Kafka Connect
  • Understand when to use Kafka Connect vs Producer/Consumer API
  • Perform File source and sink using Kafka Connect
Goal: In this module, you will learn about Apache Hadoop, Hadoop Architecture, Apache Storm, Storm Configuration, and Spark Ecosystem. In addition, you will configure Spark Cluster, Integrate Kafka with Hadoop, Storm, and Spark.
Skills:
  • Kafka Integration with Hadoop
  • Kafka Integration with Storm
  • Kafka Integration with Spark
Objectives:
At the end of this module, you will be able to:
  • Understand What is Hadoop
  • Explain Hadoop 2.x Core Components
  • Integrate Kafka with Hadoop
  • Understand What is Apache Storm
  • Explain Storm Components
  • Integrate Kafka with Storm
  • Understand What is Spark
  • Describe RDDs
  • Explain Spark Components
  • Integrate Kafka with Spark
 Objectives:
At the end of this module, you should be able to,
  • Understand Flume
  • Explain Flume Architecture and its Components
  • Setup a Flume Agent
  • Integrate Kafka with Flume
  • Understand Cassandra
  • Learn Cassandra Database Elements
  • Create a Keyspace in Cassandra
  • Integrate Kafka with Cassandra
  • Understand Talend
  • Create Talend Jobs
  • Integrate Kafka with Talend
Goal: In this module, you will work on a project, which will be gathering messages from multiple
sources.
Scenario:
In E-commerce industry, you must have seen how catalog changes frequently. Most deadly problem they face is “How to make their inventory and price
consistent?”.
There are various places where price reflects on Amazon, Flipkart or Snapdeal. If you will visit Search page, Product Description page or any ads on Facebook/google. You will find there are some mismatch in price and availability. If we see user point of view that’s very disappointing because he spends more time to find better products and at last if he doesn’t purchase just because of consistency.
Here you have to build a system which should be consistent in nature. For example, if you are getting product feeds either through flat file or any event
stream you have to make sure you don’t lose any events related to product specially inventory and price.
If we talk about price and availability it should always be consistent because there might be possibility that the product is sold or the seller doesn’t want to sell it anymore or any other reason. However, attributes like Name, description doesn’t make that much noise if not updated on time.
This Project enables you to gain Hands-On experience on the concepts that you have learned as part of this Course.
You can email the solution to our Support team within 2 weeks from the Course Completion Date. Edureka will evaluate the solution and award a Certificate with a Performance-based Grading.
Problem Statement:
You are working for a website techreview.com that provides reviews for different technologies. The company has decided to include a new feature in the website which will allow users to compare the popularity or trend of multiple technologies based on twitter feeds. They want this comparison to happen in real time. So, as a big data developer of the company, you have been task to implement following things:
• Near Real Time Streaming of the data from Twitter for displaying last minute’s count of people tweeting about a particular technology.
• Store the twitter count data into Cassandra.
Month End Offer - Flat 20% Off + 20% Cashback  
+
WhatsApp us