Is there a way to measure string similarity in Google BigQuery

前端 未结 7 2381
礼貌的吻别
礼貌的吻别 2020-12-03 15:35

I\'m wondering if anyone knows of a way to measure string similarity in BigQuery.

Seems like would be a neat function to have.

My case is i need to compare

7条回答
  •  抹茶落季
    2020-12-03 16:14

    Try Flookup for Google Sheets... it's definitely faster than Levenshtein distance and it calculates percentage similarities right out of the box. One Flookup function you might find useful is this:

    FUZZYMATCH (string1, string2)

    Parameter Details

    1. string1: compares to string2.
    2. string2: compares to string1.

    The percentage similarity is then calculated based on these comparisons. Both parameters can be ranges.

    I'm currently trying to optimise it for large data sets so you feedback would be very welcome.

    Edit: I'm the creator of Flookup.

提交回复
热议问题