- HDFS Architecture
- Interfaces and data read and write process
- distcp
- Command Line Interface
- SequenceFile and MapFile, Checksumming, codecs and Writables
MapReduce
- Parallelizing Map and Reduce
- MapReduce Workflow
- MapReduce Framework
- Hadoop Data Types
- MapReduce Internals – The Map Phase
- MapReduce – The Reduce Phase
- Job Format
- Debugging & Profiling
- Counters, Sorting and Joins
- Streaming
Hadoop Cluster Management
Administration
Pig
HBase
ZooKeeper
- Installation
- Group membership and management
- Znodes
- API, triggers and ACL
- States, consistency and sessions
- Implementation
Apply for Big Data and Hadoop Developer Certification
https://www.vskills.in/certification/certified-big-data-and-apache-hadoop-developer