I have data, that I need to tokenize based on " - " notice the hyphen is surrounded by a space on each side, however, after parsing a hundred thousand
" - "