goal: vectorize text at the character level
problem: the output is not a unique number per character/letter; instead, every letter is converted to 1
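
For reference, here is a minimal, library-free sketch of the behavior described in the goal: each distinct character gets its own integer. The post does not say which vectorizer or library is in use, so this does not reproduce the failing setup. The names `build_char_vocab` and `vectorize` are hypothetical helpers made up for illustration. One possible cause of seeing 1 everywhere is an encoding that records presence or counts per character rather than an index per character, but whether that applies here is an assumption.

```python
def build_char_vocab(texts):
    """Assign each distinct character a unique integer index, starting at 1."""
    # Sort the characters so the mapping is deterministic across runs.
    chars = sorted({ch for text in texts for ch in text})
    return {ch: i for i, ch in enumerate(chars, start=1)}


def vectorize(text, vocab, unknown=0):
    """Replace each character with its index; unseen characters map to `unknown`."""
    return [vocab.get(ch, unknown) for ch in text]


texts = ["abc", "cab"]
vocab = build_char_vocab(texts)
encoded = [vectorize(t, vocab) for t in texts]

print(vocab)    # {'a': 1, 'b': 2, 'c': 3}
print(encoded)  # [[1, 2, 3], [3, 1, 2]]
```

Index 0 is kept free for characters that were not seen when the vocabulary was built, so real characters never collide with it.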