What is a regular expression for parsing out individual sentences?

前端 未结 6 928
攒了一身酷
攒了一身酷 2020-11-27 18:16

I am looking for a good .NET regular expression that I can use for parsing out individual sentences from a body of text.

It should be able to parse the following blo

6条回答
  •  一整个雨季
    2020-11-27 19:10

    This is not really possible with only regular expressions, unless you know exactly which "difficult" tokens you have, such as "i.d.", "Mr.", etc. For example, how many sentences is "Please show your I.D, Mr. Bond."? I'm not familiar with any C#-implementations, but I've used NLTK's Punkt tokenizer. Probably should not be too hard to re-implement.

提交回复
热议问题