Simplest method for text lemmatization in Scala and Spark

梦谈多话 2020-12-30 13:49

I want to use lemmatization on a text file:

surprise heard thump opened door small seedy man clasping package wrapped.

upgrading system found review spring

3 answers
  •  自闭症患者
    2020-12-30 14:42

    I would suggest using the Stanford CoreNLP wrapper for Apache Spark, as it wraps the official Stanford CoreNLP API for basic NLP functions such as lemmatization and tokenization.

    I have used it for lemmatization on a Spark DataFrame.

    Link: https://github.com/databricks/spark-corenlp
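
For illustration, here is a minimal sketch of how the wrapper might be applied to the sample lines from the question. It assumes the spark-corenlp package, Stanford CoreNLP, and the CoreNLP English models jar are all on the classpath; the object name `LemmatizeExample`, the helper `addLemmas`, and the column names `text` and `lemmas` are made up for this example. The `lemma` function comes from the wrapper's `functions` object and returns the lemmas of a sentence as an array of strings.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.col
import com.databricks.spark.corenlp.functions.lemma

object LemmatizeExample {

  // Hypothetical helper: adds a "lemmas" column (array of strings)
  // computed from the "text" column by the wrapper's lemma function.
  def addLemmas(df: DataFrame): DataFrame =
    df.withColumn("lemmas", lemma(col("text")))

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("LemmatizeExample")
      .master("local[*]") // local mode, for experimentation only
      .getOrCreate()
    import spark.implicits._

    // Sample lines taken from the question. For a real file,
    // spark.read.text(path) yields a single column named "value",
    // which can be renamed with .toDF("text").
    val input = Seq(
      "surprise heard thump opened door small seedy man clasping package wrapped.",
      "upgrading system found review spring"
    ).toDF("text")

    val lemmatized = addLemmas(input)
    lemmatized.show(false) // print full column contents without truncation

    spark.stop()
  }
}
```

Note that `lemma` expects sentence-level input, so for longer documents it may be worth splitting into sentences first with the wrapper's `ssplit` function and then applying `lemma` to each sentence.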
