Migrate Hive table to Google BigQuery

难免孤独 2020-12-18 10:49

I am trying to design a data pipeline to migrate my Hive tables into BigQuery. Hive is running on an on-premises Hadoop cluster. This is my current design, actually,

1 answer
  • 2020-12-18 10:53

    Yes, your migration logic makes sense.

    I personally prefer to CAST specific types directly in the initial Hive query that generates your Avro data. For instance, the Hive "decimal" type maps to this Avro type: {"type":"bytes","logicalType":"decimal","precision":10,"scale":2}

    BQ will just take the primitive type (here "bytes") instead of the logicalType, which is why I find it easier to cast directly in Hive (here to "double"). The same problem occurs with the Hive date type.
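
To make the casting idea concrete, here is a minimal Python sketch that builds a Hive export query with the casts applied. The table names, column list, and helper functions are hypothetical, and so is the choice of "string" as the cast target for date columns; the answer only names decimal to double explicitly.

```python
from typing import List, Tuple

# Hypothetical mapping from Hive column types to the type to CAST to
# before the data is written as Avro. Only decimal -> double comes from
# the answer; date -> string is an assumption made for illustration.
CAST_TARGETS = {
    "decimal": "double",
    "date": "string",
}


def select_expression(name: str, hive_type: str) -> str:
    """Return the SELECT expression for one column, casting if needed."""
    # Strip precision and scale, e.g. "decimal(10,2)" -> "decimal".
    base_type = hive_type.split("(")[0].strip().lower()
    target = CAST_TARGETS.get(base_type)
    if target is None:
        # Type is not in the mapping, so the column is passed through.
        return f"`{name}`"
    return f"CAST(`{name}` AS {target}) AS `{name}`"


def build_export_query(
    source_table: str,
    export_table: str,
    columns: List[Tuple[str, str]],
) -> str:
    """Build a CREATE TABLE ... AS SELECT that stores the result as Avro."""
    select_list = ",\n  ".join(select_expression(n, t) for n, t in columns)
    return (
        f"CREATE TABLE {export_table} STORED AS AVRO AS\n"
        f"SELECT\n  {select_list}\nFROM {source_table}"
    )


if __name__ == "__main__":
    # Hypothetical table with one plain, one decimal, and one date column.
    print(build_export_query(
        "sales",
        "sales_avro",
        [("id", "bigint"), ("price", "decimal(10,2)"), ("day", "date")],
    ))
```

Keep in mind that casting decimal to double trades exact decimal values for floating-point approximations, so this only suits columns where that loss is acceptable.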
