Talend for Data Integration and Big Data


Talend Training for Data Integration and Big Data teaches you how to use Talend Open Studio to simplify Big Data integration. After this training, you will be able to work with Apache Hadoop, Apache Spark, Apache Hive, Apache Pig, and NoSQL databases using Talend. Talend Open Studio speeds up job development with its drag-and-drop UI and pre-built components for offloading data and ingesting it into your data centers.

Data Integration Curriculum

Talend – A Revolution in Big Data
Learning Objectives: In this module of Talend Training, you will get an overview of ETL technologies and the reasons why Talend is referred to as a next-generation leader in Big Data integration. You will be introduced to the products Talend has released to date and their relevance to Data Integration and Big Data. You will also learn about Talend Open Studio (TOS), its architecture and GUI, and how to install it.
  • Core ETL concepts
  • Talend products and their features
  • Design and implementation of Talend Open Studio
  • Working with ETL
  • Rise of Big Data
  • Role of Open Source ETL Technologies in Big Data
  • Comparison with other market-leading tools in the ETL domain
  • Importance of Talend (Why Talend)
  • Talend and its Products
  • Introduction of Talend Open Studio
  • TOS for Data Integration
  • GUI of TOS with Demo
Working with Talend Open Studio for DI
Learning Objectives: In this module of the Talend course, you will learn to work with the data sources and target systems supported by Talend, manage metadata, and read from and write to common file formats such as CSV/delimited and fixed-width files. You will also connect to a database to read, write, and update data, and read complex sources such as Excel and XML, using basic TOS components like tLogRow and tMap.
  • Create jobs with different components and link them
  • Read and write files of various formats
  • Work with databases
  • Launching Talend Studio
  • Working with different workspace directories
  • Working with projects
  • Creating and executing jobs
  • Connection types and triggers
  • Most frequently used Talend components [tJava, tLogRow, tMap]
  • Read & Write Various Types of Source/Target Systems
  • Working with files [CSV, XLS, XML, Positional]
  • Working with databases [MySQL DB]
  • Metadata management
  • Creating a Business Model
  • Adding Components to a Job
  • Connecting the Components
  • Reading and writing Delimited File
  • Reading and writing Positional File
  • Reading and writing XML and XLS/XLSX Files
  • Connecting to a Database (MySQL)
  • Retrieving Schema from the Database
  • Reading from Database Metadata
  • Retrieving data from a file and inserting it into the Database
  • Deleting data from Database
  • Working with Logs and Errors
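
As a rough illustration of what a "read a delimited file and log the rows" job does, here is a minimal plain-Java sketch. It is not the code Talend generates; the class name, method name, and sample data are hypothetical, and the parser is deliberately naive (it does not handle quoted fields).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical sketch of a delimited-file read step; not Talend-generated code.
final class DelimitedReadSketch {

    // Splits each line on the separator and skips the first headerRows lines.
    // Naive on purpose: quoted fields containing the separator are not handled.
    static List<String[]> parseDelimited(List<String> lines, String separator, int headerRows) {
        List<String[]> rows = new ArrayList<>();
        String regex = Pattern.quote(separator); // treat the separator literally
        for (int i = headerRows; i < lines.size(); i++) {
            String line = lines.get(i);
            if (line.isEmpty()) {
                continue; // ignore blank lines
            }
            rows.add(line.split(regex, -1)); // -1 keeps trailing empty fields
        }
        return rows;
    }

    public static void main(String[] args) {
        // Sample input held in memory so the sketch needs no file on disk.
        List<String> lines = Arrays.asList("id;name;city", "1;Ada;London", "2;Grace;");
        for (String[] row : parseDelimited(lines, ";", 1)) {
            System.out.println(String.join("|", row)); // print one line per row
        }
    }
}
```

In Talend Open Studio the same settings (field separator, number of header rows) are configured on the input component rather than written by hand.
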
Basic Transformations in Talend
Learning Objectives: In this module of Talend Training, you will understand Data Mapping and Transformations using TOS. In addition, you will learn how to filter and join various Data Sources using lookups and search and sort through them. 
  • Create and use context variables
  • Mapping and Transformations
  • Work with components like tFilter, tJoin, tSortRow, tReplicate, tSplit, Lookup
  • Context Variables
  • Using Talend components
  • tJoin
  • tFilter
  • tSortRow
  • tAggregateRow
  • tReplicate
  • tSplit
  • Lookup
  • tRowGenerator
  • Accessing job-level/component-level information within the job
  • SubJob (using tRunJob, tPreJob, tPostJob)
  • Embedding Context Variables
  • Adding different environments
  • Data Mapping using tMap
  • Using functions in Talend
  • tJava
  • tSortRow
  • tAggregateRow
  • tReplicate
  • tFilter
  • tSplit
  • tRowGenerator
  • Perform Lookup operations using tJoin
  • Creating SubJob (using tRunJob, tPreJob, tPostJob)
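
The lookup operations listed above boil down to joining a main flow against a keyed lookup flow. The plain-Java sketch below shows that idea under stated assumptions: the row layout ({orderId, customerId}), the class and method names, and the sample data are all hypothetical, and this is not how Talend implements its components internally.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of a main-flow/lookup-flow join; not Talend-generated code.
final class LookupJoinSketch {

    // mainRows: {orderId, customerId}; lookup: customerId -> customerName.
    // innerJoin = true drops unmatched rows; false keeps them with a null name
    // (left outer join).
    static List<String[]> join(List<String[]> mainRows, Map<String, String> lookup, boolean innerJoin) {
        List<String[]> out = new ArrayList<>();
        for (String[] row : mainRows) {
            String name = lookup.get(row[1]); // null when the key has no match
            if (name == null && innerJoin) {
                continue; // inner join: reject the unmatched row
            }
            out.add(new String[] {row[0], name});
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, String> customers = new HashMap<>();
        customers.put("c1", "Ada");

        List<String[]> orders = Arrays.asList(
                new String[] {"o1", "c1"},
                new String[] {"o2", "c9"}); // c9 has no matching customer

        for (String[] row : join(orders, customers, false)) {
            System.out.println(row[0] + " -> " + row[1]);
        }
    }
}
```

Loading the lookup side into a hash map first is what makes each main-row match a constant-time operation, which is why the smaller data set is usually chosen as the lookup.
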
Advanced Transformations and Executing Jobs Remotely in Talend
Learning Objectives: In this module of Talend Certification, you will understand advanced transformations and the steps involved in looping within a Talend job, and learn how to search for files in a directory and process them in sequence. You will also learn to work with FTP connections, export and import jobs, run jobs remotely, and parameterize them from the command line.
  • Use various file components like tFileList, tFileCopy, tFileExists, tFileDelete, tFileArchive
  • Handle logs and errors
  • Cast data types using tConvert and tMap expression builder
  • Iterate components using tLoop
  • Store and retrieve files from FTP
  • Remotely access Talend
  • Various components of file management (like tFileList, tFileArchive, tFileTouch, tFileDelete)
  • Error Handling [tWarn, tDie]
  • Type Casting (convert datatypes among source-target platforms)
  • Looping components (like tLoop, tForeach)
  • Using FTP components (like tFTPFileList, tFTPFileExists, tFTPGet, tFTPPut)
  • Exporting and Importing Talend jobs
  • How to schedule and run Talend DI jobs externally (using the command line)
  • Parameterizing a Talend job from command line
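
Exported Talend jobs are typically launched from a generated script and accept context overrides of the form `--context_param name=value`. The sketch below illustrates the override idea only: defaults are loaded first, then command-line values replace them. It is a simplified stand-in, not Talend's generated argument handling, and the class name, method name, and parameter names (`inputDir`, `dbHost`) are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of context-variable overrides; not Talend-generated code.
final class ContextParamSketch {

    // Returns a copy of the defaults with every "--context_param key=value"
    // pair from args applied on top. Malformed pairs are ignored.
    static Map<String, String> resolveContext(Map<String, String> defaults, String[] args) {
        Map<String, String> context = new HashMap<>(defaults);
        for (int i = 0; i + 1 < args.length; i++) {
            if ("--context_param".equals(args[i])) {
                String keyValue = args[++i]; // consume the token after the flag
                int eq = keyValue.indexOf('=');
                if (eq > 0) {
                    context.put(keyValue.substring(0, eq), keyValue.substring(eq + 1));
                }
            }
        }
        return context;
    }

    public static void main(String[] args) {
        Map<String, String> defaults = new HashMap<>();
        defaults.put("inputDir", "/tmp/in");  // assumed default value
        defaults.put("dbHost", "localhost");  // assumed default value

        Map<String, String> context = resolveContext(defaults, args);
        System.out.println("inputDir=" + context.get("inputDir"));
        System.out.println("dbHost=" + context.get("dbHost"));
    }
}
```

Keeping environment-specific values in context variables means the same exported job can run against development and production systems without being rebuilt.
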
Big Data and Hadoop with Talend
Learning Objectives: In this module of Talend Training, you will learn about Big Data and Hadoop concepts such as HDFS (Hadoop Distributed File System) architecture and MapReduce, and how Talend integrates with Big Data. You will learn to set up and use Talend Open Studio for Big Data. In addition, you will learn to use the Big Data connectors in TOS (Talend offers more than 800 connectors for Big Data environments) and to access the Hadoop ecosystem from Talend.
  • Understand scope of Talend Open Studio for Big Data
  • Integrate Hadoop HDFS and Talend
  • Use Hadoop operations like Map and Aggregate through TOS Big Data
  • Perform multiple analyses and store results in HDFS
  • Big Data and Hadoop
  • HDFS and MapReduce
  • Benefits of using Talend with Big Data
  • Integration of Talend with Big Data
  • HDFS commands vs. Talend HDFS utility
  • Big Data setup using Hortonworks Sandbox on your personal computer
  • Explaining the TOS for Big Data Environment
  • Creating a Project and a Job
  • Adding Components in a Job
  • Connecting to HDFS
  • 'Putting' files on HDFS
  • Using tMap, tAggregate functions
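
Leaving Hadoop itself aside, the logic of an aggregate step is a group-by followed by a reduction. The plain-Java sketch below sums an amount per key; the row layout ({region, amount}), the class and method names, and the sample data are hypothetical, and it says nothing about how Talend distributes the work on a cluster.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of a group-by-and-sum step; not Talend-generated code.
final class AggregateSketch {

    // rows: {groupKey, integerAmount}. Returns the total amount per key,
    // preserving the order in which keys first appear.
    static Map<String, Integer> sumByKey(List<String[]> rows) {
        Map<String, Integer> totals = new LinkedHashMap<>();
        for (String[] row : rows) {
            totals.merge(row[0], Integer.parseInt(row[1]), Integer::sum);
        }
        return totals;
    }

    public static void main(String[] args) {
        List<String[]> sales = Arrays.asList(
                new String[] {"north", "10"},
                new String[] {"south", "7"},
                new String[] {"north", "5"});

        for (Map.Entry<String, Integer> e : sumByKey(sales).entrySet()) {
            System.out.println(e.getKey() + "=" + e.getValue());
        }
    }
}
```
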
Hive in Talend
Learning Objectives: In this module of Talend Certification Training, you will learn Hive concepts and the setup of Hive environment in Talend. You will learn how to use Hive Big Data connectors in TOS and implement Use Cases using Hive in Talend. 
  • Integrate Hive with TOS Big Data
  • Perform complex Hive queries in Talend
  • Hive and Its Architecture
  • Connecting to Hive Shell
  • Set connection to Hive database using Talend
  • Create Hive Managed and external tables through Talend
  • Load and Process Hive data using Talend
  • Transform data from Hive using Talend
  • Process and transform data from Hive
  • Load data from HDFS & Local File Systems to Hive Table using Hive Shell
  • Execute the HiveQL query using Talend
Pig and Kafka in Talend
Learning Objectives: In this module of the Talend course, you will learn Pig concepts, set up the Pig environment in Talend, use the Pig Big Data connectors in TOS for Big Data, and implement use cases using Pig in Talend. You will also gain insight into Apache Kafka, its architecture, and its integration with Talend through a real-life use case.
  • Integrate Talend projects with Pig and Kafka
  • Use Pig for scripting and Kafka for streaming jobs in TOS Big Data
  • Use TOS Big Data for running Pig and Kafka along with DI, Hadoop HDFS, and Hive
  • Pig Environment in Talend
  • Pig Data Connectors
  • Integrate Personalized Pig Code into a Talend job
  • Apache Kafka
  • Kafka Components in TOS for Big Data
  • Use Pig and Kafka connectors in Talend
End to End Project in Talend

Learning Objectives: In this module of Talend Training, you will be developing a Project using Talend DI and Talend BD with MySQL, Hadoop, HDFS, Hive, Pig, and Kafka.

Data Integration Description

About Talend Training

Talend Training is designed to help you master Talend and Big Data integration using Talend Open Studio, a free, open source ETL tool with which you can integrate your data with your data warehouse and applications or synchronize data between systems. You’ll also use the Talend ETL tool with HDFS, Pig, and Hive in real-life case studies.

What are the objectives of our Talend Training Course?

Talend Training for Data Integration and Big Data is designed by industry experts to make you a Certified Talend Practitioner. This Talend Certification Training course offers:

  • Complete understanding of the ETL concepts and ability to solve the real-time business problems using Talend
  • Comprehensive knowledge of Talend Architecture and its various Components
  • Familiarity with the Talend tool to automate your complete Data Integration/Data Analysis/Data Warehousing requirements
  • Interaction with various source and target platforms, such as flat files (CSV, fixed width), XML, Excel, and databases
  • Implementation of the real-time scenarios for Data Transformation, File & Error Handling, Scheduling Talend jobs, Automation/Parameterization
  • Understanding of Big Data and Hadoop concepts and the benefits of integrating Talend with Hadoop
  • Easy integration and Access to Hadoop Ecosystem using Talend
  • Implementation of Talend with HDFS, Pig, and Hive (among the most in-demand skills)
  • Rigorous involvement of an SME throughout the Talend Training to learn industry standards and best practices
Why should you go for Talend Training?

Talend is one of the first providers of open source Data Integration software, and it provides specialized support for Big Data integration. With Talend, a Big Data solution can be designed using drag-and-drop controls, and native code is generated automatically, so no hand coding is required. Talend is built to sit flexibly between a wide range of data sources and platforms. Its solutions portfolio includes Data Integration, Data Quality, Master Data Management, Enterprise Service Bus, and Business Process Management, covering what you need to make your data work for you.

What are the skills that you will be learning with our Talend Training?

Talend Certification Training will help you become a Talend expert. It builds comprehensive knowledge of ETL processes and Big Data integration with Talend, along with the hands-on experience needed to solve real-world, industry-based data integration and Big Data projects. During the Talend Certification Training course at Edureka, you will be trained by expert instructors so that you can:

  • Master the Core ETL concepts
  • Understand various Talend products and their features
  • Design and implement jobs/tasks in Talend Open Studio
  • Work with different source files, target files, and databases
  • Perform Mapping and Transformation of data
  • Learn File Management and Error & Log Handling
  • Store & Retrieve Files using FTP
  • Remotely Access and Parameterize a Job using Talend
  • Understand the scope of Talend in Big Data
  • Integrate HDFS, Hive, Pig, and Kafka with Talend
  • Perform complex Hive queries and store data in HDFS
  • Use Pig for Scripting and Kafka for Streaming jobs from Talend
  • Solve a real-life project on Data Integration and Big Data using Talend
Who should go for this Talend Training Course?
The market for Big Data is growing across the world, and Talend makes it easier to work with Big Data. This strong growth opens up a great opportunity for IT professionals. Our Talend Certification Training will help you seize this opportunity and accelerate your career. It is best suited for:
  • Business Analysts
  • Data Warehousing Professionals
  • Data Analysts
  • Solution & Data Architects
  • System Administrators
  • Software Engineers
What are the prerequisites for the Edureka Talend Training Course?

There are no prerequisites for the Talend course. Knowledge of Data Warehousing is beneficial but not mandatory. To help you learn or brush up on Data Warehousing concepts, Coursesit offers a complimentary self-paced course, "Data Warehousing Certification Training", when you enroll in the Talend Certification course.


Course Content

Time: 10 weeks
