Text similarity algorithm

后端 未结 5 1991
萌比男神i
萌比男神i 2020-12-24 15:08

I have two subtitles files. I need a function that tells whether they represent the same text, or the similar text

Sometimes there are comments like \"The w

5条回答
  •  粉色の甜心
    2020-12-24 16:02

    Have a look at approximate grep. It might give you pointers, though it's almost certain to perform abysmally on large chunks of text like you're talking about.

    EDIT: The original version of agrep isn't open source, so you might get links to OSS versions from http://en.wikipedia.org/wiki/Agrep

提交回复
热议问题