When to use Hadoop, HBase, Hive and Pig?

前端 未结 16 1157
时光说笑
时光说笑 2020-12-04 04:21

What are the benefits of using either Hadoop or HBase or Hive ?

From my understanding, HBase avoi

16条回答
  •  温柔的废话
    2020-12-04 05:11

    Let me try to answer in few words.

    Hadoop is an eco-system which comprises of all other tools. So, you can't compare Hadoop but you can compare MapReduce.

    Here are my few cents:

    1. Hive: If your need is very SQLish meaning your problem statement can be catered by SQL, then the easiest thing to do would be to use Hive. The other case, when you would use hive is when you want a server to have certain structure of data.
    2. Pig: If you are comfortable with Pig Latin and you need is more of the data pipelines. Also, your data lacks structure. In those cases, you could use Pig. Honestly there is not much difference between Hive & Pig with respect to the use cases.
    3. MapReduce: If your problem can not be solved by using SQL straight, you first should try to create UDF for Hive & Pig and then if the UDF is not solving the problem then getting it done via MapReduce makes sense.

提交回复
热议问题