What's the pros and cons to install HBase + Hadoop together vs. install HBase and Hadoop separately? -
i mean , 2 options : 1. install hbase on hadoop cluster offline computing, means 1 hadoop cluster. 2. install hadoop cluster offline computing , install hadoop cluster hbase use hdfs.
so 2 options : 1 integrated cluster , 2 clusters.
what's pros & cons these 2 options ?
option 1: integrated cluster.
pros: mapreduce reads or writes hbase more efficient data locality.
cons: hbase region server reduce performance of machine (datanode , tasktracker) need hold cpu , memory. hbase latency may seconds if there many mapreduce jobs. if want make hbase response in time, need more work (for example, using memcache improve read performance).
option 2: 2 clusters.
prons: hbase region server not impact performance of hdfs datenode , tasktracker.
cons: mapreduce needs read , write data remotely if wants access hbase. option needs more machines.
Comments
Post a Comment