Monday, 19 August 2013

Hadoop training in Melbourne | Australia

Introduction to Big Data and Hadoop

What is Big Data?
What are the challenges for processing big data?
What technologies support big data?
Distributed systems
What is Hadoop?
Why Hadoop?
History of Hadoop
Use Cases of Hadoop
Hadoop eco System
HDFS
Map Reduce
Statistics
 

Understanding the Cluster

Typical workflow
Writing files to HDFS
Reading files from HDFS
Rack Awareness
5 daemons

Best Practices for Cluster Setup
Best Practices
How to choose the right hadoop distribution
How to choose right hardware
 

Cluster Setup

Install Pseudo cluster
Install Multi node cluster
Configuration
Setup cluster on Cloud - EC2
Tools
Security
Benchmarking the cluster

Routine Admin procedures
Metadata & Data Backups
Filesystem check (fsck)
File system Balancer
Commissioning and decommissioning nodes
Upgrading
Using DFSAdmin

Monitoring the Cluster
Using the Web user interfaces
Hadoop Log files
Setting the log levels
Monitoring with Nagios

Install ,Configure and use
PIG
HIVE
HBASE
Flume and Sqoop
Zookeeper


 

Hadoop Developer 



Introduction to Big Data and Hadoop
What is Big Data?
What are the challenges for processing big data?
What technologies support big data?
Distribution systems.
What is Hadoop?
Why Hadoop?
History of Hadoop
Use Cases of Hadoop
Hadoop eco System
HDFS
Map Reduce
Statistics

Understanding the Cluster
Typical workflow
Writing files to HDFS
Reading files from HDFS
Rack Awareness
5 daemons

Developing the Map Reduce Application
Configuring development environment - Eclipse
Writing Unit Test
Running locally
Running on Cluster
MapReduce workflows

How MapReduce Works
Anatomy of a MapReduce job run
Failures
Job Scheduling
Shuffle and Sort
Task Execution

MapReduce Types and Formats
MapReduce Types
Input Formats - Input splits & records, text input, binary input, multiple inputs & database input
Output Formats - text Output, binary output, multiple outputs, lazy output and database output
 

MapReduce Features

Counters
Sorting
Joins - Map Side and Reduce Side
Side Data Distribution
MapReduce Combiner
MapReduce Partitioner
MapReduce Distributed Cache
 

Hive and PIG

Fundamentals
When to Use PIG and HIVE
Concepts
 

HBASE

CAP Theorem
Hbase Architecture and concepts
Programming.

For any further details please contact +91-9052666559 or
visit www.hadooponlinetraining.net
please mail us all queries to
info@magnifictraining.com

3 comments: