Gensim example, TypeError:between str and int error

时间秒杀一切 提交于 2019-12-13 09:37:54

问题


When running the below code. this Python 3.6, latest Gensim library in Jupyter

for model in models:
       print(str(model))
       pprint(model.docvecs.most_similar(positive=["Machine learning"], topn=20))

[1]: https://github.com/RaRe-Technologies/gensim/blob/develop/docs/notebooks/doc2vec-wikipedia.ipynb


回答1:


string= "machine learning".split()

doc_vector = model.infer_vector(string)
out= model.docvecs.most_similar([doc_vector])

I'm not sure 100% since I'm using a more recent release, but I think that the issue is connected to the fact that the most_similar function is expecting a string mapped in the feature space and not the raw string.




回答2:


The main problem is that 'Machine learning' is not a known tag in your model. (Maybe your model knows 'machine learning' or 'machine_learning' or some other such?)

It's harder to recognize that's the real issue, because of a poor error message from the code in this case. It's a known issue at the gensim project:

https://github.com/RaRe-Technologies/gensim/issues/1737



来源:https://stackoverflow.com/questions/53130738/gensim-example-typeerrorbetween-str-and-int-error

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!