Has someone ever tested if you can make a boolean vector or a set representation from a word embedding, say by using one or multiple thresholds and whether the result still