Session on Big Data

(0 review)
Free
Session on Big Data

Introduction to Big Data and Hadoop

  • Evaluation of Big Data
  • Google’s White Paper on Distributed File System and Map Reduce Algorithm
  • IBM’s Definition of Big Data (4Vs)
  • Limitations of Existing Solution
  • Hadoop Components-DFS, MR Algorithm

Hadoop Architecture

  • Concept of Nodes, Block Size, Replication factor and Racks
  • HDFS Architecture – Namenode, Datanode
  • Anatomy Of File Read
  • Anatomy Of File Write
  • Mapreduce Flow- Job Tracker, Task Tracker
  • Differentiate Between Block And Split
  • Hadoop Partioners And Combiners

Hadoop Environment Setup

  • Hadoop Cluster
  • Hadoop Installation Modes
  • Hadoop Configueration Files
  • Web UI Status URLs
  • Mapreduce Demo Excercices

Introduction To Hadoop Ecosystem

  • Sqoop
  • Flume
  • Hive and PIG
  • Mahout
  • Apache Ooize

Analytics Using PIG And PIG Latin

  • Need For PIG (PIG Vs Hadoop)
  • PIG Usecases
  • Scenarios For Not Using PIG
  • PIG Data Models – Data Types And Operations
  • PIG Latin Program – LOAD, SPLIT, FILTER, GROUP, COUNT, COGROUP, STORE And UDFs Etc
  • Demo Excersices on PIG Commands

 Analytics Using Hive

  • Hive Feautures
  • Hive Usecases
  • Compare Hive and RDBMS, Hive And PIG
  • Hive Architecture – Metastore, Execution Engine, Compiler Etc.
  • Hive Data Model – Partitions And Bucketsconcept Of External And Internal Tables
  • Demo Excercies And Hive Commands

Hadoop 2.0 And Apache Oozie

  • Challenges of Hadoop 1.0
  • Features of Hadoop 2.0
  • Architecture of Hadoop 2.0
  • Solution Provide By Hadoop 2.0 – Yarn MR Work Flow
  • Apache Oozie as a Scheduling Service
  • Features and Usecase Of Apache Oozie

Course Features

  • Lectures 0
  • Quizzes 0
  • Duration 50 hours
  • Skill level All levels
  • Language English
  • Students 0
  • Certificate No
  • Assessments Self
Curriculum is empty.

Reviews

Average Rating

0
0 rating

Detailed Rating

5 stars
0
4 stars
0
3 stars
0
2 stars
0
1 star
0
Free

Leave A Reply

Your email address will not be published. Required fields are marked *