What does the Brown clustering algorithm output mean?

前端 未结 5 837
暗喜
暗喜 2020-12-25 15:05

I\'ve ran the brown-clustering algorithm from https://github.com/percyliang/brown-cluster and also a python implementation https://github.com/mheilman/tan-clustering. And th

5条回答
  •  旧巷少年郎
    2020-12-25 15:36

    In Percy Liang's implementation (https://github.com/percyliang/brown-cluster), the -C parameter allows you to specify the number of word clusters. The output contains all the words in the corpus, together with a bit-string annotating the cluster and the word frequency in the following format: . The number of distinct bit strings in the output equals the number of desired clusters and the words with the same bit string belong to the same cluster.

提交回复
热议问题