I'm using logistic regression on a feature vector composed of roughly 1,500 Boolean values (not a one-hot encoding; several can be set at once) and a training set of ~10,000