apache-nifi

Defining Apache Avro Schema fullname in Apache NiFi

落爺英雄遲暮 submitted on 2019-12-24 07:14:53
Question: Using NiFi 1.7.1 (which uses Java Avro 1.8.1) and the AvroSchemaRegistry, I'm trying to define a schema that has the fields name and app.name at the top level. According to the Avro docs [1], I assumed I could define the fullname as usual with "name": "app.name", but I hit the error Illegal character in: app.name. It's true that the name portion of a fullname does not allow dots, but according to the docs: "If the name specified contains a dot, then it is assumed to be a
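
The naming rules the question refers to can be sketched in a few lines of Python. This is only an illustration of the rules in the Avro specification, not the validation code that Java Avro or NiFi actually runs, and the function names are hypothetical. In the specification, records, enums and fixed types have fullnames (optional namespace plus name), while record fields have a name but no namespace. A field name is therefore checked as a simple name, which may be why app.name is rejected.

```python
import re

# Per the Avro specification, a name starts with [A-Za-z_] and continues
# with [A-Za-z0-9_]. A fullname is a dot-separated sequence of such names.
NAME_RE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def is_valid_name(name: str) -> bool:
    """Return True if `name` is a legal simple Avro name (no dots)."""
    return NAME_RE.fullmatch(name) is not None


def split_fullname(fullname: str):
    """Split a dotted fullname into (namespace, name); namespace may be ''."""
    namespace, _, name = fullname.rpartition(".")
    return namespace, name


def is_valid_fullname(fullname: str) -> bool:
    """Return True if every dot-separated component is a legal simple name."""
    return all(is_valid_name(part) for part in fullname.split("."))
```

Under these rules, "app.name" passes is_valid_fullname (namespace "app", name "name") but fails is_valid_name, which is the check that applies to a field name.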

Nifi commands on windows

泪湿孤枕 submitted on 2019-12-24 00:29:31
Question: In my current project, I have been using apache-nifi on Windows. I extracted nifi-0.7.0-bin.zip to C:\ . Now, when I run \bin\run-nifi.bat as an administrator, I see the following message on the command line and NiFi does not start. Could you please suggest what I should do to get NiFi running? The content of the app.log file is as follows:
2016-08-08 11:41:57,716 INFO [main] org.apache.nifi.NiFi Launching NiFi...
2016-08-08 11:41:57,851 INFO [main] org.apache

Nifi DistributedCache lookup issue

烈酒焚心 submitted on 2019-12-23 23:23:55
Question: I have configured a flow as follows:
GetFile
SplitText -> splits the input into flowfiles
ExtractText -> adds attributes with two keys
PutDistributedMapCache -> Cache Entry Identifier is ${Key1}_${Key2}
I then configured a sample GenerateFlowFile that generates a sample record and feeds LookupRecord (concat(/Key1,'_',/Key2)), which looks up the same key in the cache. I see a problem in my caching flow, because when I configure a GenerateFlowFile to cache the same records, I am able to do

Using dynamic property in connection pooling service for ExecuteSQL processor

自古美人都是妖i submitted on 2019-12-23 23:20:06
Question: I am using ExecuteSQL to read tables from one or more databases, but I want to provide the URL, username, and password dynamically to the DBCPConnectionPool service so that I don't need more than one ExecuteSQL processor to retrieve data from different databases. I found that a controller service does not accept attributes from an incoming flowfile. How can this be achieved, if it is possible at all? Someone pointed out that it can only be done using the REST API; if that is the case, please provide a working

Apache NiFi: How to compare multiple rows in a csv and create new column

≯℡__Kan透↙ submitted on 2019-12-23 19:17:57
Question: I have a CSV that looks like this:
Jc,TXF,timer,alpha,beta
15,44,55,12,33
18,87,33,111
9,87,61,29,77
Alpha and beta combined make up a city code. I want to add the name of the city to the CSV as a new column:
Jc,TXF,timer,alpha,beta,city
15,44,55,12,33,York
18,87,33,111,London
9,87,61,29,77,Sydney
I have another CSV with only the columns alpha,beta,city, which looks like this:
alpha,beta,city
12,33,York
33,111,London
29,77,Sydney
How can I achieve this using Apache NiFi? Please suggest the
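
The enrichment described in the question amounts to a lookup join on the (alpha, beta) pair. The following is a minimal plain-Python sketch of that logic, not a NiFi flow; the function name add_city_column is hypothetical. Because the second data row in the question has only four values, the sketch assumes that alpha and beta are always the last two fields of a row.

```python
import csv
import io

MAIN_CSV = """Jc,TXF,timer,alpha,beta
15,44,55,12,33
18,87,33,111
9,87,61,29,77
"""

LOOKUP_CSV = """alpha,beta,city
12,33,York
33,111,London
29,77,Sydney
"""


def add_city_column(main_text: str, lookup_text: str) -> str:
    """Append a city column by looking up each row's (alpha, beta) pair."""
    lookup_rows = csv.reader(io.StringIO(lookup_text))
    next(lookup_rows)  # skip the lookup header
    cities = {(alpha, beta): city for alpha, beta, city in lookup_rows}

    main_rows = csv.reader(io.StringIO(main_text))
    header = next(main_rows)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(header + ["city"])
    for row in main_rows:
        if not row:
            continue  # ignore blank lines
        # Assumption: alpha and beta are the last two fields of every row.
        key = (row[-2], row[-1])
        writer.writerow(row + [cities.get(key, "")])
    return out.getvalue()


print(add_city_column(MAIN_CSV, LOOKUP_CSV))
```

Rows whose pair is missing from the lookup table get an empty city value rather than raising an error; whether that is the desired behavior is a design choice the question does not settle.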

Nifi- Parallel and concurrent execution with ExecuteStreamCommand

廉价感情. submitted on 2019-12-23 11:35:32
Question: Currently, I have NiFi running on an edge node that has 4 cores. Say I have 20 incoming flowfiles and I set Concurrent tasks to 10 for the ExecuteStreamCommand processor. Does that mean I get only concurrent execution, or both concurrent and parallel execution?
Answer 1: In this case you get both concurrency and parallelism, as noted in the Apache NiFi User Guide (emphasis added): Next, the Scheduling Tab provides a configuration option named Concurrent tasks. This controls how many threads the Processor
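
The distinction can be illustrated with a small Python analogy. This is not NiFi code: a thread pool of 10 workers stands in for Concurrent tasks = 10, and process is a hypothetical placeholder for the work done per flowfile. The pool size caps how many tasks are in flight at once (concurrency); how many of them execute at the same instant (parallelism) is bounded by the available cores, 4 in the question.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

CONCURRENT_TASKS = 10  # analogous to the processor's Concurrent tasks setting
FLOWFILES = 20         # number of queued items in the question

lock = threading.Lock()
in_flight = 0
peak_in_flight = 0


def process(flowfile_id: int) -> int:
    """Hypothetical stand-in for handling one flowfile."""
    global in_flight, peak_in_flight
    with lock:
        in_flight += 1
        peak_in_flight = max(peak_in_flight, in_flight)
    time.sleep(0.05)  # simulates waiting on an external command
    with lock:
        in_flight -= 1
    return flowfile_id


def run() -> list:
    """Process all items with at most CONCURRENT_TASKS in flight."""
    with ThreadPoolExecutor(max_workers=CONCURRENT_TASKS) as pool:
        return list(pool.map(process, range(FLOWFILES)))


results = run()
print(len(results), "items processed; peak in flight:", peak_in_flight)
```

The peak never exceeds 10 regardless of core count, which is the concurrency limit. Whether those 10 tasks also run in parallel depends on the operating system scheduling them across cores.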

Nifi- Parallel and concurrent execution with ExecuteStreamCommand

雨燕双飞 submitted on 2019-12-23 11:35:11
Question: Currently, I have NiFi running on an edge node that has 4 cores. Say I have 20 incoming flowfiles and I set Concurrent tasks to 10 for the ExecuteStreamCommand processor. Does that mean I get only concurrent execution, or both concurrent and parallel execution?
Answer 1: In this case you get both concurrency and parallelism, as noted in the Apache NiFi User Guide (emphasis added): Next, the Scheduling Tab provides a configuration option named Concurrent tasks. This controls how many threads the Processor

How to stop a single processor in NiFi 1.1.0 using the REST API

纵饮孤独 submitted on 2019-12-23 10:26:41
Question: I am trying to stop a running processor using the PUT method of the REST API endpoint /processors/{id}. I am able to start the processor by setting the state in the component to "state": "RUNNING" and runStatus in the aggregateSnapshot to "runStatus": "Running". Similarly, I tried to stop the processor by changing the state to STOPPED, but I get the error 9204b68d-0159-1000-7d8f-720592b2a2dd is not stopped (409 Conflict and 400 Bad Request). Please let me know how to
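
A minimal sketch of a request body for such a state change is shown below. It assumes the processor entity shape of a revision object plus a component object, with the revision version taken from a prior GET of the same processor. The helper name build_state_update is hypothetical. One possible cause of the "is not stopped" error, stated here as an assumption rather than a confirmed diagnosis, is sending the full component (configuration included) back while the processor is running, so the sketch sends only the id and the desired state.

```python
import json


def build_state_update(processor_id, revision_version, state, client_id=None):
    """Build a minimal body for PUT /processors/{id} that only changes state.

    Assumption: `revision_version` is the current revision version obtained
    from a prior GET of the same processor.
    """
    if state not in ("RUNNING", "STOPPED"):
        raise ValueError("state must be 'RUNNING' or 'STOPPED'")
    revision = {"version": revision_version}
    if client_id is not None:
        revision["clientId"] = client_id
    return {
        "revision": revision,
        # Only id and state: no config or other component fields are sent.
        "component": {"id": processor_id, "state": state},
    }


payload = build_state_update(
    "9204b68d-0159-1000-7d8f-720592b2a2dd", revision_version=3, state="STOPPED"
)
print(json.dumps(payload, indent=2))
```

The revision version 3 is only an example value. The resulting JSON would be sent with Content-Type: application/json to the /processors/{id} endpoint named in the question.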

Nifi - “Connection refused” in gui after installation

蹲街弑〆低调 submitted on 2019-12-23 06:49:32
Question: I'm installing a three-node non-secure NiFi cluster (1.5) with embedded ZooKeeper. The installation has completed, and the cluster startup and election have finished without any obvious issues. However, when I open the GUI I see the following:
javax.ws.rs.ProcessingException: java.net.ConnectException: Connection refused (Connection refused)
In nifi-apps.log there is:
2018-03-07 14:41:37,857 WARN [Replicate Request Thread-7] o.a.n.c.c.h.r.ThreadPoolRequestReplicator Failed to replicate request

Telegraf tail with grok pattern error

旧时模样 submitted on 2019-12-23 05:43:09
Question: I am using Telegraf to collect log information from Apache NiFi. For this task I am using this config:
[[inputs.tail]]
  ## files to tail.
  files = ["/var/log/nifi/nifi-app.log"]
  ## Read file from beginning.
  from_beginning = true
  #name_override = "nifi_app"
  ## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_INPUT.md
  data_format = "grok"
  grok_patterns = [ "%{DATE:date} %{TIME:time} %{WORD:EventType} \[%{GREEDYDATA:NifiTask} %{NOTSPACE:Thread}\] %{NOTSPACE:NifiEventType} %
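
Before translating a line layout into grok, it can help to pin down the field boundaries with an ordinary regular expression. The sketch below uses Python's re module, not grok syntax. The sample line and the group names are illustrative assumptions about what a nifi-app.log entry looks like, not the pattern from the question, which is cut off.

```python
import re

# Assumed layout: date, time with milliseconds, level, [thread], logger, message.
LOG_LINE_RE = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) "
    r"(?P<time>\d{2}:\d{2}:\d{2},\d{3}) "
    r"(?P<level>[A-Z]+) "
    r"\[(?P<thread>[^\]]+)\] "
    r"(?P<logger>\S+) "
    r"(?P<message>.*)$"
)


def parse_log_line(line: str):
    """Return a dict of named fields, or None if the line does not match."""
    match = LOG_LINE_RE.match(line)
    return match.groupdict() if match else None


# Illustrative sample line, assumed to resemble a typical log entry.
sample = "2016-08-08 11:41:57,716 INFO [main] org.apache.nifi.NiFi Launching NiFi..."
print(parse_log_line(sample))
```

Note that the thread name between the brackets may contain spaces, so the sketch matches everything up to the closing bracket rather than a single non-space token.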