talend

How to set context variable from database

只愿长相守 posted on 2019-12-11 16:27:38
Question: I'm trying to set a context variable from a row in an Oracle database. I already have the table set up in the database like so: key | variable name | BigDecimal. I have also created a BigDecimal context variable. How do I link the context variable to the database so it can be used in a SQL query?
Answer 1: If you have a table with schema (variable_name, variable_value) and each row is dedicated to a single job, then I recommend you add a third column so it will be (variable_name, variable_value,

Comparison between current row, previous row and next row in Talend

巧了我就是萌 posted on 2019-12-11 15:52:40
Question: I want to know whether my value falls within the interval formed by the previous row and the next row. How can I do this in Talend? I tried a tMemorizeRows component that saves 3 rows each time, but I hit a java.lang.NullPointerException: on the first row, the next row is not yet known. What am I missing?
Answer 1: You can try this: add a column "sequence" with the value Numeric.sequence("s1",1,1) to your data. This way you'll know the exact order between rows in
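
The comparison the question describes can be sketched in plain Java, outside of any Talend component. This is only an illustration of the logic, not the answer's method: the class and method names are hypothetical, and it assumes that a row with a missing neighbour (the first and last rows) simply counts as "not inside" rather than raising an exception.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class NeighbourInterval {
    // For each row, report whether its value lies within the closed interval
    // spanned by the previous and the next row.
    static List<Boolean> insideNeighbourInterval(List<Double> values) {
        List<Boolean> result = new ArrayList<>();
        for (int i = 0; i < values.size(); i++) {
            // Assumption: the first and last rows have no complete interval,
            // so they are reported as false instead of dereferencing a
            // neighbour that does not exist.
            if (i == 0 || i == values.size() - 1) {
                result.add(false);
                continue;
            }
            double prev = values.get(i - 1);
            double next = values.get(i + 1);
            double low = Math.min(prev, next);   // the bounds may come in either order
            double high = Math.max(prev, next);
            double current = values.get(i);
            result.add(current >= low && current <= high);
        }
        return result;
    }

    public static void main(String[] args) {
        List<Double> rows = Arrays.asList(1.0, 2.0, 3.0, 10.0, 4.0);
        System.out.println(insideNeighbourInterval(rows));
    }
}
```

In a job, the same guard (skip the comparison while a neighbour is still null) would apply wherever the memorized rows are read.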

Talend Open Studio : scripting languages versus Microsoft SSIS

不打扰是莪最后的温柔 posted on 2019-12-11 14:48:36
Question: I have been trying to find out whether Talend Open Studio has a scripting language. I hope it might be Perl or Python. I have been using the Microsoft SSIS ETL tool, which has a Script Component to handle more complex ETL tasks. The SSIS Script Component uses C# and VB.NET as its scripting languages. Does Talend Open Studio have an equivalent to the MS SSIS Script Component? I could not find much on the web about this. The amount of material available for Talend Open Studio is

Talend- Memory issues. Working with big files

倾然丶 夕夏残阳落幕 posted on 2019-12-11 14:21:53
Question: Before the admins start eating me alive, I would like to say in my defense that I cannot comment on the original posts because I do not have the privilege, so I have to ask about this again. I have issues running a job in Talend (Open Studio for Big Data!). I have a 3 GB archive. I do not consider this too much, since my computer has 32 GB of RAM. While trying to run my job, I first got an error related to heap memory, then it changed to a garbage

converting Big Decimal(15,12) to string

人走茶凉 posted on 2019-12-11 14:06:30
Question: Hi, I have an input BigDecimal(15,12) value in my source, and I expect it as a string in the output. For example, my source file has the value 0.000000000000, and I use tMap to convert it to a string with "String.valueOf(column name)". The output I get is 0E-12; the expected output is 0.0000000000. Can anyone provide a solution?
Answer 1: If you're not applying any transformations to your BigDecimal column, you can just read it as a String from your file. If you must read it as a BigDecimal and then
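
The 0E-12 output comes from BigDecimal.toString(), which String.valueOf calls and which switches to scientific notation for values like this one. A minimal sketch of one way around it is the standard BigDecimal.toPlainString() method; the class and helper names below are hypothetical, and this is not necessarily what the truncated answer goes on to suggest.

```java
import java.math.BigDecimal;

class BigDecimalPlainString {
    // Convert without scientific notation; null is passed through so that
    // nullable columns do not throw.
    static String toPlain(BigDecimal value) {
        return value == null ? null : value.toPlainString();
    }

    public static void main(String[] args) {
        BigDecimal zero = new BigDecimal("0.000000000000");
        System.out.println(String.valueOf(zero)); // scientific notation
        System.out.println(toPlain(zero));        // plain notation, scale preserved
    }
}
```

toPlainString() keeps the value's scale (12 decimals here). If fewer decimals are wanted, setScale with an explicit RoundingMode can be applied before the conversion.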

Java Timestamp to BigInt for Impala

强颜欢笑 posted on 2019-12-11 11:49:49
Question: I am reading a text file that has a Timestamp field in the format "yyyy-MM-dd HH:mm:ss". I want to convert it in Java to a BigInt field in Impala that looks like yyyyMMddHHmmss. I am using Talend for the ETL, but I get the error "schema's dbType not correct for this component", so I want to have the right transformation in my tImpalaOutput component.
Answer 1: One obvious option is to read the date in as a string, format it to the output you want and then convert it to a long
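
The approach the answer outlines (read as a string, reformat, convert to a long) can be sketched with java.time as follows. The class and method names are hypothetical, and the sketch assumes the input always matches the "yyyy-MM-dd HH:mm:ss" pattern; anything else throws a DateTimeParseException.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

class TimestampToLong {
    private static final DateTimeFormatter INPUT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss");
    private static final DateTimeFormatter OUTPUT =
            DateTimeFormatter.ofPattern("yyyyMMddHHmmss");

    // Parse the textual timestamp, re-format it without separators,
    // then convert the 14-digit string to a long.
    static long toCompactLong(String timestamp) {
        LocalDateTime parsed = LocalDateTime.parse(timestamp, INPUT);
        return Long.parseLong(parsed.format(OUTPUT));
    }

    public static void main(String[] args) {
        System.out.println(toCompactLong("2019-12-11 11:49:49"));
    }
}
```

A 14-digit value does not fit in an int, so the target column in the output schema would need a long-compatible type.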

Naming a Talend Custom Component - Not working except for tTutorialRow

眉间皱痕 posted on 2019-12-11 10:19:21
Question: I have been following the Talend tutorial on "How to Create a Custom Component". If I follow the example verbatim, with my files named as follows and placed inside the folder tTutorialRow, the tutorial works:
tTutorialRow_java.xml
tTutorialRow_messages.properties
tTutorialRow_begin.javajet
tTutorialRow_main.javajet
tTutorialRow_end.javajet
This is using the following code: tTutorialRow_java.xml: <COMPONENT> <HEADER PLATEFORM="ALL" SERIAL="" VERSION="2.0" STATUS="ALPHA" COMPATIBILITY="ALL"

Building a cache with Talend SOAP webservices

放肆的年华 posted on 2019-12-11 08:45:42
Question: I am trying to build a cache for a SOAP web service in Talend: an input parameter "reinitCache" can be set to true if the cache has to be rebuilt (for example, once a day). Otherwise the cache is simply read, and the SOAP response is sent. I tried to do this with tHash components, and while it works well inside the Studio, it no longer works once deployed on Karaf: around 50% of the time, an error "tHash is not initialized" is raised. It seems like tHash components are

Talend 10 GB input and lookup out of memory error

别说谁变了你拦得住时间么 posted on 2019-12-11 07:55:55
Question: I'm new to Talend and trying to migrate a simple process from an existing ETL tool into Talend. The process itself is: input file --> tMap (a few string manipulations and a lookup) --> write output. The lookup file has 3 columns (long, 1-char string, 2-char string); the long value is the key. The input and lookup files are each around 10 GB. The server has 16 cores (2.9 GHz), 64 GB RAM and 8 GB swap, and runs Linux. I executed the job with Xmx/Xms values of 30g, 45g and 50g, but each time it failed with either GC overhead limit

Talend - URI based on values obtained from a list

折月煮酒 posted on 2019-12-11 06:35:29
Question: I need to feed a collection in MongoDB with values obtained from web services that provide JSON, but I'm having trouble assembling the job because the URIs of some of the web services rely on values that can be obtained from other web services. For example, the URI http://172.31.15.180:80/ws/getAgenciasUF/52 provides a JSON with a collection, in this format: { "COD_AGENCIA", "521800300", "NAME", "PORANGATU", "UORG": "902", "INTRA_MUNICIPAL": "0"}, { "COD_AGENCIA", "521830000", "NAME", "HOLD",