Installation

HBase can be installed in 3 modes:

  1. Standalone Mode
  2. Pseudo-Distributed Mode
  3. Fully-Distributed Mode

The objective of this post is to provide a tried-and-true procedure for installing HBase in Fully-Distributed Mode. But before we dive deeper into installation nuts and bolts, here are some hbase preliminaries, that I feel I should include as a startup.

1. When talking about installing HBase in fully distributed mode we’ll be addressing the following:

  • HDFS: A running instance of HDFS is required for deploying HBase in distributed mode.
  • HBase Master: HBase cluster has a master-slave architecture where the HBase Master is responsible for monitoring all the slaves i.e. Region Servers.
  • Region Servers: These are the slave nodes responsible for storing and managing regions.
  • Zookeeper Cluster: A distributed Apache HBase installation depends on a running ZooKeeper cluster. All participating nodes and clients need to be able to access the running ZooKeeper ensemble.
2. The coin of setting up a Fully Distributed HBase Cluster has got two sides to it:
  • When Zookeeper cluster is managed by HBase internally
  • When Zookeeper cluster is managed externally
3. HBase is overparticular about the DNS entries of its cluster nodes. Therefore, to avert imminent discrepancies we would be assigning host names to the cluster nodes and using them for installation.

Get industry recognized certification – Contact us

Menu