R break corpus into sentences

面向向阳花 2020-12-14 03:59
  1. I have a number of PDF documents, which I have read into a corpus with the tm library. How can I break the corpus into sentences?

  2. It can

7 answers
  •  我在风中等你
    2020-12-14 04:23

    openNLP has had some major changes. The bad news is that it looks very different from how it used to. The good news is that it's more flexible, and the functionality you enjoyed before is still there; you just have to find it.

    This will give you what you're after:

    ?Maxent_Sent_Token_Annotator

    Just work through the example and you'll see the functionality you're looking for.
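For a tm corpus specifically, the approach from that help page could be applied document by document. The following is only a minimal sketch, assuming the tm, NLP and openNLP packages (plus the default English sentence model and a working Java setup) are installed. The helper name `doc_to_sentences` and the two-document toy corpus are made up for illustration; a corpus built from PDFs would take the place of `corpus`.

```r
library(tm)
library(NLP)
library(openNLP)

# Hypothetical helper (not part of any package): split one tm document
# into a character vector of sentences.
doc_to_sentences <- function(doc, annotator) {
  # A document read from a PDF is usually a vector of lines; join them first.
  text <- as.String(paste(content(doc), collapse = " "))
  # NLP::annotate is called with its namespace in case another attached
  # package (e.g. ggplot2) masks annotate().
  spans <- NLP::annotate(text, annotator)
  # Subsetting a String with the sentence annotations extracts the sentences.
  as.character(text[spans])
}

# Toy corpus standing in for the PDF corpus from the question.
corpus <- VCorpus(VectorSource(c(
  "This is the first sentence. Here is another one.",
  "A second document. It also has two sentences."
)))

annotator <- Maxent_Sent_Token_Annotator()

# One character vector of sentences per document.
sentences <- lapply(corpus, doc_to_sentences, annotator = annotator)
```

`sentences` should then be a list with one element per document, each holding that document's sentences as plain strings. Where exactly the boundaries fall depends on the Maxent model, so it is worth spot-checking the result on real PDF text, which often contains stray line breaks and headers.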
