Day1

  • Big Data Overview
  • Different roles in Big Data
  • Skills needed
  • Hadoop Architecture
  • Tools used/needed in class
  • Environment setup
  • Major distributions, differences and market share
  • Platforms used for Hadoop
  • Cluster building: Manual, Using GUI

Day2

  • Linux commands overview
  • Cluster bring up and down
  • Different deployments
  • Ecosystem tools overview
  • Hive in practice

  • Total 8 Weekends (~70 hours)
  • Class material
  • 8 – Saturdays ( 9am – 5pm) In-class presentation, hands-on practice, Q & A
  • Lunch break – 30minutes
  • Bring your own laptop (16GB RAM preferred)
  • Starting Date:

Day3

  • PIGS can Fly
  • No SQL DBs
  • HBASE

Day4

  • Data Ingestion tools/Frameworks
  • Oozie

Day5

  • Ecosystem tools Integration
  • External interfaces to Hadoop
  • New trends In-memory tools
  • New tools in the market

Day6

  • Development tools
  • Developing MapReduce applications

Day7

  • Performance tuning intro.
  • Cluster tuning
  • Map reduce tuning
  • Hive Queries Tuning
  • Pig performance
  • JVM tuning

Day8

  • Intro to Security Kerberos, Knox and Ranger
  • General Use case Patterns
  • Miscellaneous
  • Q&A