Course Name: – Learn Data Science – Big data: Hadoop
Date: – Sat 07th Sep and Sun 08th Sep 2019.
Cost: –
Booking between 27 Jul to 17 Aug 2019 – 1000 INR discount, you pay 6000 INR
Booking between 18 Aug to 31 Aug 2019 – 500 INR discount, you pay 6500 INR
Booking between 01 Sep to 06 Sep 2019 – 0 INR discount, you pay 7000 INR
How to Join?
Click Here
Key Features
-
-
- No PPT’s completely Hands-on Apache Hadoop training.
- Tea/Coffee as refreshment will be provided.
- All at only 7000 INR
- Installation required in your laptop for training
- Ubuntu Virtual Machine Images for VirtualBox and VMware download link, get from here
-
Day 1 – First Day
1st Hour | Introduction to Hadoop (1 hr) Hadoop Distributed File System Hadoop Architecture Map Reduce & HDFS Hadoop Eco Systems Introduction to Pig Introduction to Hive Introduction to HBase Other eco system Map HDFS: Hadoop Distributed File System: – Significance of HDFS in Hadoop • Features of HDFS Nodes Name Node, Secondary Name Node and its functionality |
2nd Hour | Data Storage in Hadoop (1 hr) Data Storage in HDFS Introduction about Blocks Data replication • Accessing HDFS • Fault tolerance • Installation and set-up of Hadoop Start-up & Shut down process |
3rd Hour | Map Reduce: (1 hr) • Map Reduce Story • Map Reduce Architecture • How Map Reduce works • Developing Map Reduce • Map Reduce Programming Model |
4th Hour | Input and Output Formats (1 hr) • Creating Input and Output Formats in Map Reduce Jobs Text Input Format Key Value Input Format Sequence File Input Format Data localization in Map Reduce Moving the Data into Hadoop |
5th Hour | Reading and Writing the files in HDFS using Java program (1 hr) The Hadoop Java API for MapReduce Mapper Class Reducer Class Driver Class Writing Basic MapReduce Program In java Understanding the MapReduce Internal Components |
6th Hour | Exploring more – Hadoop (30 mins) |
7th & 8th Hour | PIG (2 hrs) • Introduction to Apache Pig • Map Reduce vs. Apache Pig • SQL vs. Apache Pig • Different data types in Pig • Modes of Execution in Pig • Grunt shell • Loading data • Exploring Pig |
Day 2 – Next Day
1st to 3rd Hour | HIVE: (3 hrs) • Hive introduction • Hive architecture • Hive vs. RDBMS • HiveQL and the shell • Managing tables (external vs managed) • Data types and schemas • Partitions and buckets |
4th and 5th Hour | HBASE: (2 hrs) • Architecture and schema design • HBase vs. RDBMS • HMaster and Region Servers • Column Families and Regions • Write pipeline • Read pipeline • HBase commands |
6th to 8th Hour | Deep dive into Apache Flume & Sqoop (3 hrs) Flume SQOOP |