etl

Pentaho Spoon Tool Transformation Order

梦想与她 提交于 2019-12-11 02:45:21
问题 I am trying to design an ETL structure and i stucked in the below step. As you can see i have 3 steps and each step holding a FK value from previous step. For example TABLE3 has a column with FK Constraints which shows the PK value in TABLE2 and TABLE2 has same relationship with TABLE1 . The problem is when i start this job all these 3 steps are running simultaneously and error is occuring about FK constraints since TABLE2 trying to set a FK before TABLE1 creation completes. I am not sure if

SSIS - Changing references to a Variable/Parameter through all the project

為{幸葍}努か 提交于 2019-12-11 02:39:41
问题 I've created an SSIS project, thinking it will be deployed in Project Deployment mode. A Lot of my packages have 2 Parameters (called P_OdsTbl and P_SrcEtl ), and whenever I used Execute package task I bound the Child Parameters to the Parent Parameters. I've since discovered that the project would be deployed in Package Deployment mode, which means I need to change the Parameters to Variables ( V_OdsTbl and V_SrcEtl ) and set their values with Variable Configurations. I have many references

How to specify join types in AWS Glue?

别等时光非礼了梦想. 提交于 2019-12-11 02:29:01
问题 I am using AWS Glue to join two tables. By default, it performs INNER JOIN. I want to do a LEFT OUTER JOIN. I referred the AWS Glue documentation but there is no way to pass the join type to the Join.apply() method. Is there a way to achieve this in AWS Glue? ## @type: Join ## @args: [keys1 = id, keys2 = "user_id"] ## @return: cUser ## @inputs: [frame1 = cUser0, frame2 = cUserLogins] #cUser = Join.apply(frame1 = cUser0, frame2 = +, keys1 = "id", keys2 = "user_id", transformation_ctx = "

SSIS : Conversion text stream DT_TEXT to DT_WSTR

谁说胖子不能爱 提交于 2019-12-11 02:25:28
问题 I am creating a package in SSIS, and want to convert a file with one large column into multiple columns. I have a table containing several rows with a single column of raw data. The data was copied from a notepad file, and each row contains pipe delimiters to separate each column, but because it is a notepad file, each row is copied as one large column. I want to convert each column per row to multiple columns based on their start/end positions. I tried using SSIS Derived Column

Filtering data values in one column based on another column and then inserting values into different columns in same SQL Table

倾然丶 夕夏残阳落幕 提交于 2019-12-11 02:11:14
问题 This is a bit of a conundrum I am trying to solve using SSIS and a conditional-split transformation. I have a .csv file that contains attribute data in one row for each unique user and the values for each attribute in another column. i.e.: Attribute, Attribute Type ID, 0000000001 Birthdate, 09/02/1976 Role, Manager Or something of the sort. I need to split the attributes into columns that include the Attribute Type Data. So the desired outcome would be: ID, Birthdate, Role, 0000000001, 09/02

How do i set password to zip file in SSIS?

这一生的挚爱 提交于 2019-12-11 01:40:39
问题 I have compressed file by 7-zip using Execute Process Task in SSIS. Now i have to set password to zip file How can i achieve that ? 回答1: I dont think this is a ssis issue. You have to take a look at 7zip command line user guide and check if this option is provided. Also there are other useful links: https://www.dotnetperls.com/7-zip-examples Update 1 I think this is the link you are looking for https://sevenzip.osdn.jp/chm/cmdline/switches/password.htm 来源: https://stackoverflow.com/questions

how to skip a bad row in ssis flat file source

佐手、 提交于 2019-12-11 01:27:11
问题 I am reading in a 17-column CSV file into a database. once in a while the file has a "less then 17-column" row. I am trying to ignore the row, but even when all columns are set to ignore, I can't ignore that row and the package fails. How to ignore those rows? 回答1: Solution Overview you can do this by adding one Flat File Connection Manager add only one column with Data type DT_WSTR and a length of 4000 (assuming it's name is Column0 ) - So all column are considered as one big column In the

how to capture timestamp with milliseconds in ssis package 2016

余生颓废 提交于 2019-12-11 00:40:05
问题 How to capture current timestamp in ssis package 2016 I declared a variable and using expression but milliseconds are missing @currenttimestamp = (DT_WSTR,50)(DT_DBTIMESTAMP)@[System::StartTime] I want the milliseconds too Thanks 回答1: The issue was solved when casting to (DT_DBTIMESTAMP2,3) : (DT_WSTR,50)(DT_DBTIMESTAMP2,3)GETDATE() but the time get changes as it using the getdate() asit returns the current time, I need the package start time I used the following (DT_WSTR,50)(DT_DBTIMESTAMP2

Maximum Number of User Per Mapping In ODI12c

余生长醉 提交于 2019-12-10 22:22:33
问题 I am new to ODI. While working i ODI Project, I am facing one issue. I have 10 mappings In ODI12c,and all are using same target table,but due to some performance issue, I want that at a time only max 2 users can Execute mappings(Max 2 mappings), since they are using same target table. If more then 2 user uses that same target then it should not execute. How should I implement this in ODI12c? 回答1: You can do something, but not exactly what you said. You can setup and option called "Concurrent

Strategy to load a set of files in Talend

风流意气都作罢 提交于 2019-12-10 21:25:18
问题 I want to know which is best strategy to aboard the following problem in Talend: I need to load data from a set of delimited files that are stored in a directory with names like (SAMPLE1.DAT, SAMPLE2.DAT, ... , SAMPLEX.DAT) The target will be a table in a MySQL database I have to load all data at once because after this task I need to work with all records in the same table I'm a bit confused because I don't know if it possible in Talend. I was seeing the tFileInputDelimited component but I