talend

Read a list of CSV files in Talend with ; in field

早过忘川 提交于 2019-12-25 09:01:12
问题 I have a list of CSV files which i receive for ETL into database every month. Its in a folder. My data has ; in many columns as well. For example, in the location column values like New York; USA are present, which i want to appear in a single column instead of splitting into many columns. How do i specify delimiter then? 回答1: I think you cannot have the field separator included in the field content or you have to incluse these values between "". For example: blabla;"New York; USA";blabla

How to push a big file data in talend?

a 夏天 提交于 2019-12-25 05:33:38
问题 I have created a table where I have a text input file which is 7.5 GB in size and there are 65 million records and now I want to push that data into an Amazon RedShift table. But after processing 5.6 million records it's no longer moving. What can be the issue? Is there any limitation with tFileOutputDelimited as the job has been running for 3 hours. Below is the job which I have created to push data in to Redshift table. tFileInputDelimited(.text)---tMap--->tFilOutputDelimited(csv) | |

How to implement tLoop in Talend?

我的梦境 提交于 2019-12-25 03:35:17
问题 I'm new to Talend and need an example job to implement tLoop . I want to run a job 10 times if it fails. I've looked at the documents, but I can't seem to figure this out. 回答1: This answer has 2 sections Creating a loop with tJava Retying a failed connection to a data source 5 times (with adding tJavaFlex ) ___________________________________ SECTION 1 : Creating a loop with tJava ----------------------------------------------------------- I just write a tJava component and then iterate to

Closed loop does not work in a Talend Job

落爺英雄遲暮 提交于 2019-12-24 21:36:24
问题 I have a Talend Job, where somehow a closed loop is formed by the components. Image is as follows: The schemas of both the tMap outputs is same. Now after connecting any tMap to tUnite, when I try to connect the second tMap, it does not connect to it. I heard that Talend does not allow, a closed loop in a Job. Is that true? If yes, the Why? Someone had a similar question here, but found no answers. 回答1: Talend actually creates a Java program; essentially that is the reason for the limitation

How to split a column which has data in XML form to different rows of new Database as KEY VALUE in TALEND

纵饮孤独 提交于 2019-12-24 20:17:35
问题 In old DB i have a data in one column as <ADDRESS> <CITY>ABC</CITY> <STATE>PQR</SERVICE> </ADDRESS> In my new DB i want this data to be stored in KEY VALUE fashion like: USER_ID KEY VALUE 1 CITY ABC 1 STATE PQR Someone please help me how to migrate this kind of data using TALEND tool. 回答1: Design job like below. tOracleInput---tExtractXMLFiled---output. tOracleInput component you can select XML column and make datatype as String. tExtractXmlFiled component pass this XML column as " XML Filed"

Read EDI X12 File and convert Using Talend Open Studio

空扰寡人 提交于 2019-12-24 11:29:38
问题 I am new to EDI. I got some information about EDI from Here. I heard that Talend supports reading of EDI X12 files using some technique called Smooks . I downloaded Talend Open Studio for Data Integration v5.3.1 . But I don't know how to Use it for reading EDI file I got a EDI text from one site ISA*00* *00* *12*3109992367T *ZZ*IAISNOKIST *070103*0839*^*00307*000024398*0*P*>~ GS*OG*3109992367*IAISNOKIST*20070103*0839*24398*T*004010UCS~ ST*875*000024479~ G50*N*20071230*59590001~ G62*10

utf8mb4 setting for talend - not working

徘徊边缘 提交于 2019-12-24 10:01:04
问题 I am migrating the data from sql server to mysql. I am using the tool Talend(ETL) for the same. The problem comes when I have emojis in the source (sql server) , it does not get inserted to the table in mysql. So, I know I must use utf8mb4 on mysql side. The client settings character encoding has to be set, for the smileys to get inserted. The database, tables and the server are all on utf8mb4 But, the client i.e., talend is not utf8mb4. So where do I set this? I tried with 'set names utf8mb4

Get output of TMSSqlRow in Talend

戏子无情 提交于 2019-12-24 09:23:11
问题 I would like to get the number of row affected / deleted / updated with a TMSSqlRow. Here is how the job is: the file use contains a lot of sql statement like DELETE ... INSERT ... UPDATE... each row are separate by ";" But now, I would like to get result of each statement (x rows updated, like results are display in management studio). When I go to "advanced settings" tab of tmssqlrow, I select " Propagate QUERY's recordset" and select a column I created before (Object Type). On execution, I

How to use the output of a query as input to another query

我怕爱的太早我们不能终老 提交于 2019-12-24 08:48:18
问题 I am new to Talend, but I want to have a job similar to this: in a tMysqlInput, make a select statement that returns a single row with a single column and then use that value in a tMongoDBInput query to get multiple documents. How can I do that in a single job, as I am unable to link tMysqlInput to tMongoDBInput with a row(Main)? 回答1: You can do this : tMysqlInput -- main -- tFlowToIterate -- Iterate -- tMongoDBInput -- tMap -- etc The input flow of tFlowToIterate gets converted to global

Run a single job in parallel

和自甴很熟 提交于 2019-12-24 08:35:17
问题 I need to know that how can we run a single job in parallel with different parameters in talend. 回答1: The answer is straightforward, but rather depends on what you want, and whether you are using free Talend or commercial. As far as parameters go, make sure that your jobs are using context variables - this is the preferred way of passing parameters in. As for running in parallel, there are a few options. Talend's studio is a java code generator, so you can export your job (it's just java code