HADOOP BIGDATA ONLINE TRAINING

HADOOP BIGDATA ONLINE TRAINING

FOR FREE DEMO contact us at:

Email : training@apex-online-it-training.com

HADOOP Using Cloudera Development & Admin Course content

Introduction to BigData, Hadoop:-

  • Big Data Introduction
  • Hadoop Introduction
  • What is Hadoop? Why Hadoop?
  • Hadoop History?
  • Different types of Components in Hadoop?
  • HDFS, MapReduce, PIG, Hive, SQOOP, HBASE, OOZIE, Flume, Zookeeper and so on…
  • What is the scope of Hadoop?

Deep Drive in HDFS (for Storing the Data):-

  • Introduction of HDFS
  • HDFS Design
  • HDFS role in Hadoop
  • Features of HDFS
  • Daemons of Hadoop and its functionality
    • Name Node
    • Secondary Name Node
    • Job Tracker
    • Data Node
    • Task Tracker
  • Anatomy of File Wright
  • Anatomy of File Read
  • Network Topology
    • Nodes
    • Racks
    • Data Center
  • Parallel Copying using DistCp
  • Basic Configuration for HDFS
  • Data Organization
    • Blocks and
    • Replication
  • Rack Awareness
  • Heartbeat Signal
  • How to Store the Data into HDFS
  • How to Read the Data from HDFS
  • Accessing HDFS (Introduction of Basic UNIX commands)
  • CLI commands

MapReduce using Java (Processing the Data):-

  • Introduction of MapReduce.
  • MapReduce Architecture
  • Data flow in MapReduce
    • Splits
    • Mapper
    • Portioning
    • Sort and shuffle
    • Combiner
    • Reducer
  • Understand Difference Between Block and InputSplit
  • Role of RecordReader
  • Basic Configuration of MapReduce
  • MapReduce life cycle
    • Driver Code
    • Mapper
    • and Reducer
  • How MapReduce Works
  • Writing and Executing the Basic MapReduce Program using Java
  • Submission & Initialization of MapReduce Job.
  • File Input/output Formats in MapReduce Jobs
    • Text Input Format
    • Key Value Input Format
    • Sequence File Input Format
    • NLine Input Format
  • Joins
    • Map-side Joins
    • Reducer-side Joins
  • Word Count Example
  • Partition MapReduce Program
  • Side Data Distribution
    • Distributed Cache (with Program)
  • Counters (with Program)
    • Types of Counters
    • Task Counters
    • Job Counters
    • User Defined Counters
    • Propagation of Counters
  • Job Scheduling

PIG:-

  • Introduction to Apache PIG
  • Introduction to PIG Data Flow Engine
  • MapReduce vs PIG in detail
  • When should PIG used?
  • Data Types in PIG
  • Basic PIG programming
  • Modes of Execution in PIG
    • Local Mode and
    • MapReduce Mode
  • Execution Mechanisms
    • Grunt Shell
    • Script
    • Embedded
  • Operators/Transformations in PIG
  • PIG UDF’s with Program
  • Word Count Example in PIG
  • The difference between the MapReduce and PIG

SQOOP:-

  • Introduction to SQOOP
  • Use of SQOOP
  • Connect to mySql database
  • SQOOP commands
    • Import
    • Export
    • Eval
    • Codegen and etc…
  • Joins in SQOOP
  • Export to MySQL

HIVE:-

  • Introduction to HIVE
  • HIVE Meta Store
  • HIVE Architecture
  • Tables in HIVE
    • Managed Tables
    • External Tables
  • Hive Data Types
    • Primitive Types
    • Complex Types
  • Partition
  • Joins in HIVE
  • HIVE UDF’s and UADF’s with Programs
  • Word Count Example

HBASE:-

  • Introduction to HBASE
  • Basic Configurations of HBASE
  • Fundamentals of HBase
  • What is NoSQL?
  • HBase DataModel
    • Table and Row
    • Column Family and Column Qualifier
    • Cell and its Versioning
  • Categories of NoSQL Data Bases
    • Key-Value Database
    • Document Database
    • Column Family Database
  • SQL vs NOSQL
  • How HBASE is differ from RDBMS
  • HDFS vs HBase
  • Client side buffering or bulk uploads
  • HBase Designing Tables
  • HBase Operations
    • Get
    • Scan
    • Put
    • Delete

MongoDB:--

  • What is MongoDB?
  • Where to Use?
  • Configuration On Windows
  • Inserting the data into MongoDB?
  • Reading the MongoDB data.

Cluster Setup:--

  • Downloading and installing the Ubuntu12.x
  • Installing Java
  • Installing Hadoop
  • Creating Cluster
  • Increasing Decreasing the Cluster size
  • Monitoring the Cluster Health
  • Starting and Stopping the Nodes

OOZIE

  • Introduction to OOZIE
  • Use of OOZIE
  • Where to use?


Hadoop Ecosystem Overview

Oozie

HBase

Pig

Sqoop

Casandra

Chukwa

Mahout

Zoo Keeper

Flume

 

 

  • Case Studies Discussions

  • Certification Guidance

  • Real Time Certification and

  • interview Questions and Answers

  • Resume Preparation

  • Providing all Materials nd Links

  • Real time Project Explanation and Practice

FOR FREE DEMO contact us at:

Email : training@apex-online-it-training.com