Difference between Fasttext .vec and .bin file

后端 未结 2 1288
死守一世寂寞
死守一世寂寞 2021-02-07 01:16

I recently downloaded fasttext pretrained model for english. I got two files:

  1. wiki.en.vec
  2. wiki.en.bin

I am not sure what is the difference

2条回答
  •  再見小時候
    2021-02-07 01:53

    The .vec files contain only the aggregated word vectors, in plain-text. The .bin files in addition contain the model parameters, and crucially, the vectors for all the n-grams.

    So if you want to encode words you did not train with using those n-grams (FastText's famous "subword information"), you need to find an API that can handle FastText .bin files (most only support the .vec files, however...).

提交回复
热议问题