kettle

Pivots using ETL Metadata Injection

眉间皱痕 posted on 2019-12-13 01:27:24
Question: It's quite simple to use the Row Denormaliser step to build pivots when there are only a few records whose values can be typed manually into the step, but what about when there are hundreds of thousands of records? I tried the ETL Metadata Injection step but could not get my desired output. My previous post defines my source data, and here is what I have tried: the Row Denormaliser options and pivot_inject_etl_metadata.ktr

Answer 1: Try something like this: pivot_inject_etl_metadata.ktr

Kettle - Append XML files content

荒凉一梦 posted on 2019-12-12 23:58:46
Question: How can I append two (or more) XML files? I find the files with the 'Get File Names' step. If more than one file is found, how can I append their content so that the result is a single XML file? Thanks

Answer 1: Check the "XML Join" step. It is curiously named, as it's not really a join (although when you think about it, I guess it is), but it should do what you want.

Source: https://stackoverflow.com/questions/19223203/kettle-append-xml-files-content

Replace string using regular expression in KETTLE

点点圈 posted on 2019-12-12 04:24:28
Question: I would like to use a regular expression to replace a certain pattern in Kettle. For example, given AAAA >5< BBBB, I want to replace it with AAAA 555 BBBB. I know how to find the pattern, but I am not sure how to replace it with a new string. The one constraint is that I have to match > and < together as a pair, not separately as > or <, because there is another pattern, <5>.

Answer 1: You can use the "Replace in String" step in a transformation. Set use RegEx to "Y", type your regex on the
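
The answer above is cut off, so here is a minimal sketch of the kind of pattern the question describes, written as plain JavaScript rather than as Kettle step settings. It assumes that `>5<` should become the digit repeated three times (`555`), which is only one reading of the example. The function name `replaceWrappedDigit` and the pattern itself are illustrative and are not taken from the original answer.

```javascript
// Hypothetical helper: replaces ">d<" (one digit wrapped as "> <") with the digit repeated three times.
// The "<d>" form is left untouched because the pattern requires ">" before the digit and "<" after it.
function replaceWrappedDigit(text) {
  return text.replace(/>(\d)</g, "$1$1$1");
}

console.log(replaceWrappedDigit("AAAA >5< BBBB"));
console.log(replaceWrappedDigit("AAAA <5> BBBB"));
```

Matching the two brackets and the digit as one unit is what keeps `<5>` from being affected; matching `>` or `<` on their own would not.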

error while running pan.bat in command line

若如初见. posted on 2019-12-12 04:18:30
Question: Following up on my previous post, I am still facing some errors. Other .ktr files run successfully, but when I try to run "pivot_inject_etl_metadata.ktr" I get the following error:

C:\pdi-ce-5.2.0.0-209\data-integration>pan.bat /file:E:\Practise_TRANSFORMATION_OUTPUT\dynamic_pivot\pivot_inject_etl_metadata.ktr /level:Basic
DEBUG: Using PENTAHO_JAVA_HOME
DEBUG: _PENTAHO_JAVA_HOME=C:\Program Files\Java\jre1.8.0_51
DEBUG: _PENTAHO_JAVA=C:

Fetching the max value from ROWS in pentaho

一笑奈何 posted on 2019-12-12 03:23:49
Question: I have this table structure:

ID  Col_1  col_2  col_3  col_4
1   34     23     45     32
2   20     19     67     18
3   40     10     76     86

I want the max value of col_1, col_2, col_3 and col_4 for each row, so that my output looks like this:

ID  Col_1  col_2  col_3  col_4  max
1   34     23     45     32     45
2   20     19     67     18     67
3   40     10     76     86     86

Any help would be much appreciated.

Answer 1: Use a Modified Java Script Value step with the following code: var max = Math.max(col_1,col_2,col_3,col_4);

Answer 2: You can use the Memory Group By or Group By steps in Pentaho. Use the aggregation method as
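
To make the `Math.max` approach from Answer 1 concrete, here is a minimal sketch that runs as plain JavaScript outside Kettle, using the three sample rows from the question. The helper name `addRowMax` and the lowercase field names are assumptions for illustration; inside a Modified Java Script Value step the fields would be supplied by the incoming row instead of an array.

```javascript
// Sample rows copied from the question's table.
const rows = [
  { ID: 1, col_1: 34, col_2: 23, col_3: 45, col_4: 32 },
  { ID: 2, col_1: 20, col_2: 19, col_3: 67, col_4: 18 },
  { ID: 3, col_1: 40, col_2: 10, col_3: 76, col_4: 86 },
];

// Hypothetical helper: returns a copy of the row with a "max" field
// holding the largest of the four value columns.
function addRowMax(row) {
  return { ...row, max: Math.max(row.col_1, row.col_2, row.col_3, row.col_4) };
}

const withMax = rows.map(addRowMax);
console.log(withMax.map((r) => r.max)); // 45, 67, 86
```

Note that `Math.max` returns `NaN` if any argument is `undefined`, so rows with missing columns would need a default value first.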

Count the number of null value per column with pentaho

萝らか妹 posted on 2019-12-12 02:49:30
Question: I have a CSV file that contains more than 60 columns and 2,000,000 lines. I'm trying to count the number of null values per variable (per column), and then sum that new row to get the total number of null values in the entire CSV. For example, if we have this file as input: We expect this other file as output: I know how to count the number of null values per line, but I haven't figured out how to count the number of null values per column.

Answer 1: There has to be a better way to do this, but I
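
The counting logic the question asks for can be sketched in plain JavaScript, independent of any Kettle step. This is only an illustration under stated assumptions: an empty cell is treated as null, the file has a header row, and no field contains a quoted comma. The sample data is made up, since the question's example files did not survive, and `countNullsPerColumn` is a hypothetical name.

```javascript
// Hypothetical sample data: the question's example files are not available.
const csv = ["a,b,c", "1,,3", ",,6", "7,8,"].join("\n");

// Counts empty cells per column. Assumes a header row and no quoted commas.
function countNullsPerColumn(text) {
  const [headerLine, ...lines] = text.split("\n");
  const headers = headerLine.split(",");
  const counts = Object.fromEntries(headers.map((h) => [h, 0]));
  for (const line of lines) {
    const cells = line.split(",");
    headers.forEach((h, i) => {
      const cell = cells[i];
      // A missing or blank cell counts as null.
      if (cell === undefined || cell.trim() === "") counts[h] += 1;
    });
  }
  return counts;
}

const perColumn = countNullsPerColumn(csv);
// Summing the per-column counts gives the total for the whole file.
const total = Object.values(perColumn).reduce((sum, n) => sum + n, 0);
console.log(perColumn, total);
```

For a file with 2,000,000 lines, the same per-column counters could be updated while reading the file line by line instead of loading it into memory at once.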

How upload file to Pentaho User Console server?

无人久伴 posted on 2019-12-12 01:27:21
Question: I need to:
1) Let the user select a file from their local PC
2) Upload that file to the Pentaho server
3) Process the file using a Kettle transformation
I tried a CSV data source in Pentaho User Console (PUC) 5.0 but found no way to access it from a .ktr file uploaded to the PUC repository. I also tried uploading the CSV file to a folder and still could not access it from a .ktr file.

Answer 1: I think this requirement is valid: upload a CSV data file and a .ktr file to a PUC folder. The .ktr should be

RabbitMQ Java Client - Java Dependencies (standard libraries?)

徘徊边缘 posted on 2019-12-11 22:26:39
Question: I need to integrate RabbitMQ with another system that runs on Java. So I went to the official website, downloaded the package and copied it to my classpath. Then I followed the basic tutorial and verified that importing external libraries works. The problem I face now is probably very easy for a Java developer to solve: which standard Java libraries do I have to import to make the following code work?

import com.rabbitmq.client.Connection;
import com.rabbitmq.client.Channel;
import com

PDI - Read CSV Files, if missing field/data then move to the next file

核能气质少年 posted on 2019-12-11 17:49:07
Question: I'm new to PDI and still learning it. I'm trying to create a transformation that reads all the CSV files in one folder, checks that each file's data is correct (meaning there are no rows with missing, erroneous or wrongly formatted values), and then stores the data in a database. What I have tried is:
1) Use Text File Input to access the CSV file over FTP using Apache Commons VFS.
2) Validate the data in the CSV (checking the filename and whether fields exist) using conditions in Filter Rows
3) Output into a PostgreSQL table using

After seeing SAIC Brake's digitalization, I realized all my earlier data visualization dashboards were wasted effort

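你说的曾经没有我的故事 posted on 2019-12-11 17:16:27
In 2019 the automotive industry is undergoing enormous change, and market competition has become fiercer. Since last year, a series of major national and automotive industry policies has been issued, with a significant impact on the car market. To cope with such a complex situation, automakers and their related supply chains are all seeking to transform and upgrade in order to adapt to a constantly changing and fiercely competitive market, and digitalization has become a key breakthrough point for the industry's transformation. Today we share how Shanghai Automotive Brake Systems Co., Ltd. (SABS) brought its digital operations management cockpit project into production, against the backdrop of smart manufacturing, where IT supports the transformation and upgrading of the manufacturing sector.

Company profile

Shanghai Automotive Brake Systems Co., Ltd. (SABS) is a joint venture between Shanghai Automotive Industry Corporation (SAIC) and Continental AG of Germany. Founded in July 1995, it was the first company in China to introduce advanced international technology to produce brake calipers, anti-lock braking systems, master cylinder boosters and rear wheel cylinders. The anti-lock braking system (ABS) the company produces was listed as a national key project during the Tenth Five-Year Plan period, and the company is currently the only specialized enterprise in China with system development and supply capability. In 2004 it completed its merger with Shanghai Kangdi Automotive Brake Co., Ltd., further expanding its product range and becoming a system supplier for automotive braking.

Project background

SABS's IT build-out is in a process of continuous improvement, seen mainly in hardware upgrades and the steady rollout of software. This build-out links the company's internal functional departments with its upstream and downstream partners and connects every stage, with the aim of optimizing each stage and making the response as fast as possible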