What are some good ways of estimating 'approximate' semantic similarity between sentences?

我在风中等你 2020-12-23 10:42

I have spent the past couple of hours looking through the nlp tag on SO and am fairly confident I did not miss an existing answer, but if I did, please point me to the relevant question.

2 answers

不知归路 (original poster)

2020-12-23 11:15

I suggest trying a topic model such as Latent Dirichlet Allocation (LDA). The idea is that documents (in your case sentences, which might prove to be a problem since they are so short) are generated from a set of latent (hidden) topics; LDA infers those topics and represents each one as a cluster of words.

An implementation of LDA in Python is available in the free Gensim package. You could apply it to your sentences, then run k-means on the resulting per-sentence topic distributions.
