How to create an HDInsightOnDemand LinkedService with a script action in Data Factory?

Submitted by 纵饮孤独 on 2020-01-03 02:47:07

Question


We are creating a Data Factory to run a PySpark job on an HDInsight on-demand cluster.

The problem is that the job needs additional Python dependencies, such as numpy, that are not installed on the cluster.

We believe the way to do this is to configure a Script Action for the HDInsightOnDemandLinkedService, but we cannot find this option in Data Factory or in the LinkedServices settings.

Is there an alternative way to automate installing the dependencies on the HDInsightOnDemand cluster?


Answer 1:


Currently, Script Actions are not supported for HDInsightOnDemandLinkedService. As a workaround, you can use Azure Automation to run a PowerShell script that does the following:

  1. Create the HDInsight cluster.
  2. Execute the Script Action.
  3. Run the pipeline in your Data Factory.
  4. Delete the cluster.
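
The answer suggests PowerShell in Azure Automation. The ordering of the four steps can be sketched in Python as below. This is only an illustration of the control flow, not the answerer's script: the four callables (`create_cluster`, `run_script_action`, `run_pipeline`, `delete_cluster`) are hypothetical placeholders that you would implement with whatever Azure tooling you use, and the pipeline is assumed to target the cluster created in step 1. The one design point it shows is that the cluster is deleted even if the Script Action or the pipeline fails.

```python
from typing import Callable


def run_with_temporary_cluster(
    create_cluster: Callable[[], str],
    run_script_action: Callable[[str], None],
    run_pipeline: Callable[[str], None],
    delete_cluster: Callable[[str], None],
) -> None:
    """Run the four steps in order, always deleting the cluster afterwards.

    All four callables are hypothetical placeholders supplied by the caller.
    create_cluster returns the name of the cluster it created.
    """
    cluster_name = create_cluster()        # step 1: create the HDInsight cluster
    try:
        run_script_action(cluster_name)    # step 2: install numpy etc.
        run_pipeline(cluster_name)         # step 3: run the Data Factory pipeline
    finally:
        delete_cluster(cluster_name)       # step 4: runs even if step 2 or 3 raises


if __name__ == "__main__":
    # Stand-in implementations that only print, to show the call order.
    def fake_create() -> str:
        print("creating cluster")
        return "demo-cluster"

    run_with_temporary_cluster(
        fake_create,
        lambda name: print(f"running script action on {name}"),
        lambda name: print(f"running pipeline against {name}"),
        lambda name: print(f"deleting {name}"),
    )
```

If the cluster creation itself fails, nothing is deleted, since no cluster name was returned. Any partial cleanup in that case would have to be handled inside your own `create_cluster` implementation.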


Source: https://stackoverflow.com/questions/49456110/how-to-create-a-hdinsightondemand-linkedservice-with-a-script-action-in-data-fac
