I have keywords that are all stored in lower case, e.g. \"discount nike shoes\", that I am trying to perform entity extraction on. The issue I\'ve run into is that spaCy seems t
In general, non-standardized casing is problematic for pre-trained models.
You have a few workarounds:
I would recommend Truecasing, as there are some decent open-source truecasers out there with good accuracy, and they allow you to then use pre-trained NER solutions such as spaCy.
Caseless and mixed-case models are more time-consuming to set up and won't necessarily give better results.