Generating Separate Output files in Hadoop Streaming

后端 未结 3 996
一向
一向 2020-12-28 10:25

Using only a mapper (a Python script) and no reducer, how can I output a separate file with the key as the filename, for each line of output, rather than having long files o

3条回答
  •  余生分开走
    2020-12-28 10:55

    You can either write to a text file on the local filesystem using python file functions or if you want to use HDFS use the Thrift API.

提交回复
热议问题