How to perform the Text Similarity using BERT on 10M+ corpus? Using LSH/ ANNOY/ fiass or sklearn?

后端 未结 0 369
攒了一身酷
攒了一身酷 2021-01-03 15:23

My idea is to extract the CLS token for all the text in the DB and save it in CSV or somewhere else. So when a new text comes in, instead of using the Cos

相关标签:
回答
  • 消灭零回复
提交回复
热议问题