aws-dms

Two way syncing between local mysql database and AWS RDS

余生长醉 submitted on 2019-12-23 16:33:08
Question: My question is very similar to "Two way syncing between on-premise SQL Server database and Azure SQL database", but I want it between my on-premises MySQL database and an AWS RDS instance. The functional need is this: when I am inside the organization, connected to the local server (without internet), I will access the localhost database (hosted on the on-premises server), and from outside the organization I will access the website, which is hosted on EC2, and EC2 will be connected to the RDS database. So both databases need to be in

Athena can't resolve CSV files from AWS DMS

Deadly submitted on 2019-12-08 05:23:10
Question: I have DMS configured to continuously replicate data from MySQL RDS to S3. This creates two types of CSV files: a full load and change data capture (CDC). According to my tests, I have the following files: testdb/addresses/LOAD001.csv.gz and testdb/addresses/20180405_205807186_csv.gz. After DMS is running properly, I trigger an AWS Glue Crawler to build the Data Catalog for the S3 bucket that contains the MySQL replication files, so the Athena users will be able to build queries in our S3 based Data

How to handle TIMESTAMP_MICROS parquet fields in Presto/Athena

帅比萌擦擦* submitted on 2019-12-08 03:56:17
Question: Presently, we have a DMS task that will take the contents of a MySQL DB and dump files to S3 in parquet format. The format for the timestamps in parquet ends up being TIMESTAMP_MICROS. This is a problem as Presto (the underlying implementation of Athena) does not support timestamps in microsecond precision and makes the assumption that all timestamps are in millisecond precision. This does not cause any errors directly but it makes the times display as some extreme future date as it is

How to handle TIMESTAMP_MICROS parquet fields in Presto/Athena

柔情痞子 submitted on 2019-12-06 16:00:44
Presently, we have a DMS task that will take the contents of a MySQL DB and dump files to S3 in parquet format. The format for the timestamps in parquet ends up being TIMESTAMP_MICROS. This is a problem as Presto (the underlying implementation of Athena) does not support timestamps in microsecond precision and makes the assumption that all timestamps are in millisecond precision. This does not cause any errors directly but it makes the times display as some extreme future date as it is interpreting the number of microseconds as number of milliseconds. We are currently working around this by
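
To make the unit mismatch described above concrete, here is a minimal Python sketch of the arithmetic. It is not the poster's workaround (which is cut off in the excerpt) and does not involve Presto, Athena, or DMS APIs. The function names and the sample value are hypothetical, chosen only for illustration.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
SECONDS_PER_YEAR = 365.2425 * 24 * 60 * 60  # average Gregorian year


def micros_to_datetime(micros: int) -> datetime:
    """Interpret an epoch value correctly, as microseconds."""
    return EPOCH + timedelta(microseconds=micros)


def approx_year_if_read_as_millis(micros: int) -> float:
    """Approximate calendar year that results when a microsecond
    count is misread as a millisecond count (1000x too large)."""
    seconds = micros / 1000  # what a millisecond-based reader would compute
    return 1970 + seconds / SECONDS_PER_YEAR


# Hypothetical sample: a TIMESTAMP_MICROS value for a date in late 2019.
sample_micros = 1_575_648_044_000_000

correct = micros_to_datetime(sample_micros)
wrong_year = approx_year_if_read_as_millis(sample_micros)

print(correct.isoformat())
print(round(wrong_year))
```

The misread value lands tens of thousands of years in the future, which matches the "extreme future date" symptom in the question. Dividing the stored integer by 1000 before treating it as milliseconds removes the discrepancy, at the cost of sub-millisecond precision.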