Choosing between SimHash and MinHash for a production system

后端 未结 2 1838
萌比男神i
萌比男神i 2020-12-29 09:13

I\'m familiar with the LSH (Locality Sensitive Hashing) techniques of SimHash and MinHash. SimHash uses cosine similarity over real-valued data. MinHash calculates resemblan

2条回答
  •  旧时难觅i
    2020-12-29 09:42

    This paper might give you some ideas on the two algorithms.

    http://jmlr.org/proceedings/papers/v33/shrivastava14.pdf

提交回复
热议问题