etl

SSIS Sequential Processing

Deadly 提交于 2019-12-10 20:39:59
问题 I have 5 independent data flows in the same data flow task each having a source and destination. How can I make them run Sequentially .they seem to run in parallel . I may do it in different data flow tasks. but how can i do it in a single data flow task 回答1: Don't have independent data flows in the same task. I know the Import/Export wizard will do that but just because a team at Microsoft does something, doesn't make it a best practice. The Data Flow gets its power and performance through

Loading SSIS Package Programmatically in C#

只愿长相守 提交于 2019-12-10 19:24:36
问题 I'm loading a ssis package from my application which works fine. However, what I'm trying to work out and failing miserably is that the package executes in 1 second (which is great) but the loading of the package takes 9 seconds. Working locally in visual studio 2015 connecting remotely to a SQL Server 2014 instance. The package is being loaded in a business object behind an async web api call from a console application. Heres the code (standard stuff)... Application application = new

How can I loop though columns by name in an SSIS Script component?

ぐ巨炮叔叔 提交于 2019-12-10 18:22:17
问题 I'm loading a pipe delimited flat file into a staging table. During the load process an SSIS script component performs some operations on a row. It may set a flag in one field based on values in another field, add a prefix to certain columns, or apply formatting. For example, if a date is missing, the field is assigned to a default date. (if Row.EndDate_isNull then Row.EndDate = defaultDate) These scripts become cumbersome when the same transformation needs to be applied to a series of rows.

Execute stored procedure for each row in the table in SSIS

我是研究僧i 提交于 2019-12-10 18:19:25
问题 I would like to execute a stored procedure MyProc with certain input parameters for each row in the table MyTable . Column for each row will act like input values for MyProc . How do I accomplish this in SSIS? I have been told by upper management to use SSIS and I don't have any choice in this matter. So.. I have done the following: Create package Create Execute SQL Task Created query inside ExecuteSQLTask Create Foreach Loop Container Create Data Flow Task inside Foreach loop container.

DTS vs. SSIS vs. Informatica vs. PL/SQL Scripting

↘锁芯ラ 提交于 2019-12-10 18:10:09
问题 In the past, I have used Informatica for some ETL (Extraction Transformation Loading) but found it rather slow and usually replaced it with some PL/SQL scripts (was using Oracle at the time). (questions revised based on feedback in answers) I gather that DTS was Microsoft's ETL tool prior to SSIS. Would it be difficult to convert an existing application using DTS to SSIS? Given that SSIS is a Microsoft tool and tightly integrated with SQL Server (virtually a part of it) are there any

Outputting a single Excel file with multiple worksheets

寵の児 提交于 2019-12-10 18:08:45
问题 Is there a component in Talend Open Studio for Data Integration to be able to output a single Excel file but with 2 separate sheets in it? I want to separate some columns in the original file into another sheet and another set of columns to the second sheet. 回答1: You'll need to output your data into two separate tFileOutputExcel components with the second one set to append the data to the file as a different sheet. A quick example has some name and age data held against a unique id that needs

Release Lock on variables used in Execute Process Task | SSIS

与世无争的帅哥 提交于 2019-12-10 18:05:36
问题 I have a package with a Foreach Container and Execute Process Task inside ForEach Container . On some error in Execute Process Task it redirects to OnError Event handler of ForEach Container . I am capturing the Error from .exe using StandardErrorvariable property of the Task and using this in the script task which is present in OnError Event Handler. The Script Task fails saying Error: A deadlock was detected while trying to lock variable "User::ErrorExcelName, User::ErrorFolder, User:

Easiest way to import a simple csv file to a graph with OrientDB ETL

纵饮孤独 提交于 2019-12-10 17:47:40
问题 I would like to import a very simple directed graph file in csv to OrientDB. Concretely, the file is the roadNet-PA dataset from the SNAP collection https://snap.stanford.edu/data/roadNet-PA.html. The first lines of the file are as follows: # Directed graph (each unordered pair of nodes is saved once) # Pennsylvania road network # Nodes: 1088092 Edges: 3083796 # FromNodeId ToNodeId 0 1 0 6309 0 6353 1 0 6353 0 6353 6354 There is only one type of vertex (a road intersection) and edges have no

Display foreach loop iteration number in SSIS

隐身守侯 提交于 2019-12-10 17:36:54
问题 I need to keep a check on the iteration number for a foreach loop container task that I am running in Visual Studio 2017. How could I achieve this ? 回答1: (1) Count iterations using Expression Task Task available in SSIS 2012+ In Foreach Loop container, there is no properties that contains the iteration number. You can achieve this by creating a SSIS variable of type Int, with a initial value equal 0 . example @[User::Counter] Inside the Foreach loop container, add an Expression Task with the

Create n new rows from raw data such as (1000…1000+n)

落爺英雄遲暮 提交于 2019-12-10 17:34:47
问题 I need to read data from an Excel workbook, where data is stored in this manner: Company Accounts Company1 (#3000...#3999) Company2 (#4000..4019)+(#4021..4024) where the expected output, using a OLE DB Destination in SSIS would be: Company Accounts Company1 3000 Company1 3001 Company1 3002 . . . . . . Company1 3999 Company2 4000 Company2 4001 . . . . . . Company2 4019 Company2 4021 . . . . Company2 4024 This has me perplexed, I don't know how to even begin process this problem. Does someone