Difference between Apache NiFi and StreamSets

前端 未结 4 532
执笔经年
执笔经年 2020-12-22 21:19

I am planning to do a class project and was going through few technologies where I can automate or set the flow of data between systems and found that there are couple of th

4条回答
  •  野趣味
    野趣味 (楼主)
    2020-12-22 21:47

    Two of the key differentiators between the two IMHO are.

    1. Apache NiFi is a Top Level Apache project, meaning it has gone through the incubation process described here, http://incubator.apache.org/policy/process.html, and can accept contributions from developers around the world who follow the standard Apache process which ensures software quality. StreamSets, is Apache LICENSED, meaning anyone can reuse the code, etc. But the project is not managed as an Apache project. In fact, in order to even contribute to Streamsets, you are REQUIRED to sign a contract. https://streamsets.com/contributing/ . Contrast this with the Apache NiFi contributor guide, which wasn't written by a lawyer. https://cwiki.apache.org/confluence/display/NIFI/Contributor+Guide#ContributorGuide-HowtocontributetoApacheNiFi

    2. StreamSets "runs atop Spark on YARN/Mesos instead, leveraging existing cluster resources you may have." which imposes a bit of restriction if you want to deploy your dataflows further toward the Edge where the Devices that are generating the data live. Apache MiniFi, a sub-project of NiFi can run on a single Raspberry Pi, while I am fairly confident that StreamSets cannot, as YARN or Mesos require more resources than a Raspberry Pi provides.

    Disclosure: I am a Hortonworks employee

提交回复
热议问题