Hadoop workshop for Administrators (Live Virtual)

LIVE VIRTUAL

Duration: 3 Days

Videos: 0

COST: $795

AUDIENCE: Administrator, Architect, Developer

PREREQS:

Basic Linux system administration and scripting

DATES:

Introduction
Hadoop history, concepts, ecosystem, distributions, high-level architecture, Hadoop myths, Hadoop challenges (hardware and software)

Planning and installation
Selecting software and Hadoop distributions, sizing the cluster, planning for growth, selecting hardware and network, rack topology, installation, multi-tenancy, directory structure, logs, benchmarking

HDFS operations
Concepts (horizontal scaling, replication, data locality, rack awareness), nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode), health monitoring, command-line and browser-based administration, adding storage, replacing defective drives

MapReduce operations
Parallel computing before MapReduce (comparing HPC and Hadoop administration), MapReduce cluster loads, nodes and daemons (JobTracker, TaskTracker), MapReduce UI walkthrough, MapReduce configuration, job configuration, job schedulers, administrator's view of MapReduce best practices, optimizing MapReduce, foolproofing, YARN architecture and use

Advanced topics
Hardware monitoring, system software monitoring, Hadoop cluster monitoring, adding and removing servers, upgrading Hadoop, backup, recovery and business continuity planning, cluster configuration tweaks, hardware maintenance schedule, Oozie scheduling for administrators, securing your cluster with Kerberos

 

 
