Fuzzy text (sentences/titles) matching in C#

后端 未结 5 583
春和景丽
春和景丽 2020-12-13 10:11

Hey, I\'m using Levenshteins algorithm to get distance between source and target string.

also I have method which returns value from 0 to 1:

/// <         


        
5条回答
  •  难免孤独
    2020-12-13 10:34

    Your problem here may be distinguishing between noise words and useful data:

    • Rolling_Stones.Best_of_2003.Wild_Horses.mp3
    • Super.Quality.Wild_Horses.mp3
    • Tori_Amos.Wild_Horses.mp3

    You may need to produce a dictionary of noise words to ignore. That seems clunky, but I'm not sure there's an algorithm that can distinguish between band/album names and noise.

提交回复
热议问题