How do I use sklearn CountVectorizer with both 'word' and 'char' analyzers?
http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectoriz
You can combine arbitrary feature extraction steps with the FeatureUnion estimator: http://scikit-learn.org/dev/modules/pipeline.html#featureunion-combining-feature-extractors
In this case it is probably less efficient than larsmans' solution, but it might be easier to use.
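For illustration, here is a minimal sketch of the FeatureUnion approach. The sample documents, the step names "word" and "char", and the character n-gram range are my own assumptions, not something from the question, so adjust them to your data:

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.pipeline import FeatureUnion

# Toy corpus, purely illustrative (assumption, not from the question).
docs = ["the cat sat", "the dog sat down"]

# One CountVectorizer per analyzer; FeatureUnion stacks their outputs
# side by side into a single sparse feature matrix.
combined = FeatureUnion([
    ("word", CountVectorizer(analyzer="word")),
    # ngram_range=(2, 3) is an arbitrary choice for this sketch.
    ("char", CountVectorizer(analyzer="char", ngram_range=(2, 3))),
])

X = combined.fit_transform(docs)

# Look up the fitted vectorizers by the names given above.
fitted = dict(combined.transformer_list)
n_word = len(fitted["word"].vocabulary_)
n_char = len(fitted["char"].vocabulary_)

print(X.shape, n_word, n_char)
```

The word-count columns come first and the character n-gram columns follow, so the resulting matrix has one row per document and n_word + n_char columns.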