I would not be surprised if my approach is far more complicated than it neecds to be, but what I am trying to do is build a tf-idf matrix, perform Latent Dirichlet Allocatio