I\'m using the RE expression in python and trying to split a chunk of text by period and by exclamation mark. However when I split it, I get a \"None\" in the result
<
Try the following:
re.split(r'((?<=\w)\.(?!\..)|!)', a)
You get the None because you have two capturing groups, and all groups are included as a part of the re.split() result.
So any time you match a . the second capture group is None, and any time you match a ! the first capture group is None.
Here is the result:
['This is my text...I want it to split by periods',
'.',
' I also want it to split by exclamation marks',
'!',
' Is that so much to ask?']
If you don't want to include '.' and '!' in your result, just remove the parentheses that surround the entire expression: r'(?<=\w)\.(?!\..)|!'