What's the pros and cons to install HBase + Hadoop together vs. install HBase and Hadoop separately? -


i mean , 2 options : 1. install hbase on hadoop cluster offline computing, means 1 hadoop cluster. 2. install hadoop cluster offline computing , install hadoop cluster hbase use hdfs.

so 2 options : 1 integrated cluster , 2 clusters.

what's pros & cons these 2 options ?

option 1: integrated cluster.

pros: mapreduce reads or writes hbase more efficient data locality.

cons: hbase region server reduce performance of machine (datanode , tasktracker) need hold cpu , memory. hbase latency may seconds if there many mapreduce jobs. if want make hbase response in time, need more work (for example, using memcache improve read performance).

option 2: 2 clusters.

prons: hbase region server not impact performance of hdfs datenode , tasktracker.

cons: mapreduce needs read , write data remotely if wants access hbase. option needs more machines.


Comments

Popular posts from this blog

javascript - Count length of each class -

What design pattern is this code in Javascript? -

hadoop - Restrict secondarynamenode to be installed and run on any other node in the cluster -