Introduction to Big Data and Hadoop
What is Big Data?
What are the challenges for processing big data?
What technologies support big data?
Distributed systems
What is Hadoop?
Why Hadoop?
History of Hadoop
Use Cases of Hadoop
Hadoop Ecosystem
HDFS
MapReduce
Statistics
Understanding the Cluster
Typical workflow
Writing files to HDFS
Reading files from HDFS
Rack Awareness
5 daemons
Best Practices for Cluster Setup
Best Practices
How to choose the right Hadoop distribution
How to choose the right hardware
Cluster Setup
Install Pseudo cluster
Install Multi-node cluster
Configuration
Set up cluster on Cloud - EC2
Tools
Security
Benchmarking the cluster
Routine Admin procedures
Metadata & Data Backups
Filesystem check (fsck)
File system Balancer
Commissioning and decommissioning nodes
Upgrading
Using DFSAdmin
Monitoring the Cluster
Using the Web user interfaces
Hadoop Log files
Setting the log levels
Monitoring with Nagios
Install, configure and use
Pig
Hive
HBase
Flume and Sqoop
Zookeeper
Hadoop Developer
Introduction to Big Data and Hadoop
What is Big Data?
What are the challenges for processing big data?
What technologies support big data?
Distributed systems
What is Hadoop?
Why Hadoop?
History of Hadoop
Use Cases of Hadoop
Hadoop Ecosystem
HDFS
MapReduce
Statistics
Understanding the Cluster
Typical workflow
Writing files to HDFS
Reading files from HDFS
Rack Awareness
5 daemons
Developing the Map Reduce Application
Configuring development environment - Eclipse
Writing Unit Tests
Running locally
Running on Cluster
MapReduce workflows
How MapReduce Works
Anatomy of a MapReduce job run
Failures
Job Scheduling
Shuffle and Sort
Task Execution
MapReduce Types and Formats
MapReduce Types
Input Formats - Input splits & records, text input, binary input, multiple inputs & database input
Output Formats - text Output, binary output, multiple outputs, lazy output and database output
MapReduce Features
Counters
Sorting
Joins - Map Side and Reduce Side
Side Data Distribution
MapReduce Combiner
MapReduce Partitioner
MapReduce Distributed Cache
Hive and Pig
Fundamentals
When to Use Pig and Hive
Concepts
HBase
CAP Theorem
HBase Architecture and concepts
Programming