What should be considered before choosing HBase?

不知归路 2021-01-03 14:44

I am very new to the big data space.

Our team suggested that we use HBase instead of an RDBMS for high performance. We do not have any idea what should/must be

3 answers
  •  没有蜡笔的小新
    2021-01-03 15:16

    HBase, like all NoSQL databases, comes with great new features, but sadly nothing is free (not even mentioning the monetary cost).

    In HBase, you really should check whether all the queries you might want to run can be fulfilled with the HBase data model. An important thing to consider is the schema design (first and foremost, how the row key is modeled). I advise you to read this really good paper:

    http://0b4af6cdc2f0c5998459-c0245c5c937c5dedcca3f1764ecc9b2f.r43.cf2.rackcdn.com/9353-login1210_khurana.pdf
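
    To make the row key point concrete, here is a minimal sketch in plain Python (no HBase client involved). HBase keeps rows sorted by the raw bytes of the row key, so the key layout decides which queries are cheap. The layout below (salt bucket, then user id, then a reversed timestamp) is only one commonly described pattern, not something taken from the paper; NUM_BUCKETS, make_row_key and user_scan_prefix are hypothetical names chosen for illustration.

```python
import hashlib
import struct

NUM_BUCKETS = 16           # hypothetical number of salt buckets
MAX_TS_MS = 2**63 - 1      # upper bound used to reverse timestamps


def _salt(user_id: str) -> int:
    # Deterministic salt, so a reader can recompute it from the user id.
    return hashlib.md5(user_id.encode("utf-8")).digest()[0] % NUM_BUCKETS


def user_scan_prefix(user_id: str) -> bytes:
    # All rows of one user share this prefix, so a prefix scan finds them.
    return bytes([_salt(user_id)]) + user_id.encode("utf-8") + b"\x00"


def make_row_key(user_id: str, ts_ms: int) -> bytes:
    # Reversed timestamp, packed big-endian: newer events sort first
    # when keys are compared byte by byte.
    reversed_ts = MAX_TS_MS - ts_ms
    return user_scan_prefix(user_id) + struct.pack(">Q", reversed_ts)


newer = make_row_key("alice", 2000)
older = make_row_key("alice", 1000)
print(newer < older)  # the newer event sorts before the older one
```

    With a layout like this, "latest events for one user" becomes a short prefix scan, while "all events in a time range across all users" would need a scan per bucket or a different key design. That trade-off is the kind of thing to check against your queries before committing.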

    I think that a really good answer to your question can be found on the HBase official site.

    "HBase isn’t suitable for every problem.

    First, make sure you have enough data. If you have hundreds of millions or billions of rows, then HBase is a good candidate. If you only have a few thousand/million rows, then using a traditional RDBMS might be a better choice due to the fact that all of your data might wind up on a single node (or two) and the rest of the cluster may be sitting idle.

    Second, make sure you can live without all the extra features that an RDBMS provides (e.g., typed columns, secondary indexes, transactions, advanced query languages, etc.) An application built against an RDBMS cannot be "ported" to HBase by simply changing a JDBC driver, for example. Consider moving from an RDBMS to HBase as a complete redesign as opposed to a port.

    Third, make sure you have enough hardware. Even HDFS doesn’t do well with anything less than 5 DataNodes (due to things such as HDFS block replication which has a default of 3), plus a NameNode.

    HBase can run quite well stand-alone on a laptop - but this should be considered a development configuration only. "

    https://hbase.apache.org/book.html
