etl

“pivot” table Oracle - how to change row items into columns

ⅰ亾dé卋堺 提交于 2020-01-19 06:25:05
问题 I have the following simple example: CREATE TABLE Cars ( Cars, Item, Value ) AS SELECT 'bmw', 'wheels', '4' FROM DUAL UNION ALL SELECT 'bmw', 'color', 'red' FROM DUAL UNION ALL SELECT 'bmw', 'price', '5' FROM DUAL UNION ALL SELECT 'mercedes', 'wheels', '4' FROM DUAL UNION ALL SELECT 'mercedes', 'color', 'black' FROM DUAL UNION ALL SELECT 'lambo', 'wheels', '5' FROM DUAL UNION ALL SELECT 'lambo', 'color', 'yellow' FROM DUAL UNION ALL SELECT 'lambo', 'price', '7' FROM DUAL UNION ALL SELECT

“pivot” table Oracle - how to change row items into columns

橙三吉。 提交于 2020-01-19 06:24:45
问题 I have the following simple example: CREATE TABLE Cars ( Cars, Item, Value ) AS SELECT 'bmw', 'wheels', '4' FROM DUAL UNION ALL SELECT 'bmw', 'color', 'red' FROM DUAL UNION ALL SELECT 'bmw', 'price', '5' FROM DUAL UNION ALL SELECT 'mercedes', 'wheels', '4' FROM DUAL UNION ALL SELECT 'mercedes', 'color', 'black' FROM DUAL UNION ALL SELECT 'lambo', 'wheels', '5' FROM DUAL UNION ALL SELECT 'lambo', 'color', 'yellow' FROM DUAL UNION ALL SELECT 'lambo', 'price', '7' FROM DUAL UNION ALL SELECT

Staging in ETL: Best Practices?

此生再无相见时 提交于 2020-01-17 04:16:06
问题 Currently, the architecture I work with takes a few data sources out of which one is staged locally because it's hosted in the cloud. The others are hosted locally anyway, so the ETL I perform takes it directly from the source. I don't really see the point in creating a stage for the other sources. 1) Is there a distinct benefit to duplicating the locally hosted source into a local stage? 2) Is it a better idea to host the stage on a separate machine or the same one as the Warehouse? 3) If I

How to load a huge CSV file in orient db

送分小仙女□ 提交于 2020-01-16 18:32:48
问题 I want to load a huge CSV file in my orient Db database.there is some checklist for the database ,that should our db follows. 1- there would be a single csv file and this CSV file will have millions of records and more then 20 Columns. 2- From this csv i have to create multiple Classes and each class will have different Properties (is it possible with Orient db). 3- i have to create index too Please help for this.how should i create Etl config file for this Thanks in advance. please let me

Creating a CSV file per Loop | PLSQL Oracle SQL Developer

匆匆过客 提交于 2020-01-16 15:26:50
问题 Updated Rad-folks! TL;DR Need working code to loop through an existing list of all my tables that will create CSV files selecting the top 100 rows of each table. There are variables set up to capture the table names which will be used to dynamically, call the table for the select and name the file. Must be done through PLSQL and SQLDeveloper. Do u kno da wae? Here is the situation: Must gather all tables (base tables non temporary) and row count (row count>0) Create a loop to select(*) top

Pentaho Kettle split CSV into multiple records

这一生的挚爱 提交于 2020-01-16 06:41:09
问题 I'm new to Kettle, but getting on well with it so far. However I can't figure out how to do this. I have a csv which looks something like this a, col1, col2, col3 a, col1, col2, col3 a, col1, col2, col3 b, col1, col2, col3 b, col1, col2, col3 c, col1, col2, col3 c, col1, col2, col3 The first column starts with a key (a,b,c), and then the rest of the columns follow. What I want to do is read in the csv (got that covered) and then split the csv based on key, so I have 3 chunks/ groups of data

SSIS: Dynamic Lookup Query

点点圈 提交于 2020-01-15 11:45:27
问题 I would like to implement the following logic to my Lookup query as an expression: SELECT ID, CASE @[User::ConnectionStringID] WHEN 1 THEN "NE" WHEN 2 THEN "BALT" WHEN 3 THEN "NY" WHEN 4 THEN "PA" END AS Region FROM dbo.Table; My package is executed in a loop where the connection string is dynamically updated using a config file and it loops 4 times as shown in the above logic. I realized today that the Lookup transformation has no expressions available to it and I haven't been able to find

Why does Bonobo's CsvReader() method yield tuples and not dicts?

非 Y 不嫁゛ 提交于 2020-01-15 09:41:11
问题 I can't seem to get the CsvReader in the Bonobo ETL library to yield anything other than tuples. The documentation seems to indicate that it should be yielding dicts and not tuples but try as I might I can't seem to get it to pass anything other than tuples. I'd really like to have access to the column names attached to each value. It throws an error that suggests the column names are present when passed but in the transform method I have defined, only the values themselves are available.

Trying to loop through, and delete from, a list of tables using SSIS

China☆狼群 提交于 2020-01-15 08:57:27
问题 I am currently working on establishing an archive and purge process for our database. I inherited this task from an employee who left a few days ago - he'd been working on it (infrequently) for the past several months so I've had to spend quite awhile retracing his steps so to speak and trying to figure out how he had this all set up. We are using a scheduled task to perform our archiving process and I have that part mostly ready to deploy. However, we're using an SSIS package to handle the

SSIS job fails - Access denied to flat file

眉间皱痕 提交于 2020-01-15 03:07:09
问题 I have created an SSIS package that reads a csv file from a network shared drive and imports the data into our table. The package works fine in the Visual studio. I imported the package to Integration Services Catalog and when I execute the package, it gives access denied to flat file error. What is required to run the package from Sql server, I created a job to run this package nightly and assigned a proxy for the job. Not sure what permissions I am missing, any help please 回答1: When SSIS