Basic computer skills and basic knowledge of programming
Module 01 - Big Data. Motivation, Hadoop components
Module 02 - Using the Hadoop HDFS Distributed Storage
Module 03 - Distributed processing with MapReduce
Module 04 - Word count Java program in MapReduce
Module 05 - A better word count program
Module 06 - MapReduce and other languages (a simple example in Python)
Hive -
Module 01. Hive – Basic Concepts
Module 02. Hive – Joins
Module 03. Hive – Partitions
Module 04. Hive – Bucketing and external tables
Module 05. Hive – Data pipeline version 1 (a basic data warehouse)
Module 06. Hive – Data pipeline upgrade (builds on the previous case)
Pig -
Module 01. Pig – Basic Concepts and comparison with Hive
Module 02. Pig – Programming language
Module 03. Pig – Programming language (continuation)
Module 04. Pig – Reading data from Hive tables
Module 05. Pig – Ad hoc data analytics with Pig
Module 06. Pig – Re-implementing the data pipeline using Pig