Learn Data Science – Big data: Hadoop

Course Name: – Learn Data Science – Big data: Hadoop

Date: – Sat 07th Sep and Sun 08th Sep 2019.

Cost: –

Booking between 27 Jul to 17 Aug 2019 – 1000 INR discount, you pay 6000 INR
Booking between 18 Aug to 31 Aug 2019 – 500 INR discount, you pay 6500 INR
Booking between 01 Sep to 06 Sep 2019 – 0 INR discount, you pay 7000 INR

How to Join?

Click Here

Key Features

- - No PPT’s completely Hands-on Apache Hadoop training.
  - Tea/Coffee as refreshment will be provided.
  - All at only 7000 INR
  - Installation required in your laptop for training
    - Ubuntu Virtual Machine Images for VirtualBox and VMware download link, get from here

Day 1 – First Day

1st Hour	Introduction to Hadoop (1 hr) Hadoop Distributed File System Hadoop Architecture Map Reduce & HDFS Hadoop Eco Systems Introduction to Pig Introduction to Hive Introduction to HBase Other eco system Map HDFS: Hadoop Distributed File System: – Significance of HDFS in Hadoop • Features of HDFS Nodes Name Node, Secondary Name Node and its functionality
2nd Hour	Data Storage in Hadoop (1 hr) Data Storage in HDFS Introduction about Blocks Data replication • Accessing HDFS • Fault tolerance • Installation and set-up of Hadoop Start-up & Shut down process
3rd Hour	Map Reduce: (1 hr) • Map Reduce Story • Map Reduce Architecture • How Map Reduce works • Developing Map Reduce • Map Reduce Programming Model
4th Hour	Input and Output Formats (1 hr) • Creating Input and Output Formats in Map Reduce Jobs Text Input Format Key Value Input Format Sequence File Input Format Data localization in Map Reduce Moving the Data into Hadoop
5th Hour	Reading and Writing the files in HDFS using Java program (1 hr) The Hadoop Java API for MapReduce Mapper Class Reducer Class Driver Class Writing Basic MapReduce Program In java Understanding the MapReduce Internal Components
6th Hour	Exploring more – Hadoop (30 mins)
7th & 8th Hour	PIG (2 hrs) • Introduction to Apache Pig • Map Reduce vs. Apache Pig • SQL vs. Apache Pig • Different data types in Pig • Modes of Execution in Pig • Grunt shell • Loading data • Exploring Pig

Day 2 – Next Day

1st to 3rd Hour	HIVE: (3 hrs) • Hive introduction • Hive architecture • Hive vs. RDBMS • HiveQL and the shell • Managing tables (external vs managed) • Data types and schemas • Partitions and buckets
4th and 5th Hour	HBASE: (2 hrs) • Architecture and schema design • HBase vs. RDBMS • HMaster and Region Servers • Column Families and Regions • Write pipeline • Read pipeline • HBase commands
6th to 8th Hour	Deep dive into Apache Flume & Sqoop (3 hrs) Flume SQOOP