Fuzzy string matching with grep
Question

I am trying to match rows in a file that contain a given string, say ACTGGGTAAACTA. If I run

    grep "ACTGGGTAAACTA" file

it returns only the rows that contain an exact match. Is there a way to allow a certain number of mismatches (substitutions, insertions, or deletions)? For example, I am looking for sequences with:

- Up to 3 substitutions, like "AGTGGGTAACCAA"
- Insertions or deletions, giving a partial match like "ACTGGGAAAATAAACTA" or "ACTAAACTA"

Answer 1:

There used to be a tool called agrep for fuzzy regex
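If agrep is not available, the same idea can be sketched in a few lines of Python. This is not agrep and not a grep feature. It is a minimal illustration using the standard dynamic-programming approach to approximate substring matching, and the function names `min_edit_distance_in_line` and `fuzzy_grep` are made up for this example. It keeps a line if some substring of it is within `max_errors` edits (substitutions, insertions, or deletions) of the pattern.

```python
from typing import Iterable, Iterator


def min_edit_distance_in_line(pattern: str, line: str) -> int:
    """Smallest edit distance between `pattern` and any substring of `line`.

    Hypothetical helper for illustration. Runs in O(len(pattern) * len(line)).
    """
    # prev[i] = cost of matching pattern[:i] against a substring of the line
    # that ends just before the current character.
    prev = list(range(len(pattern) + 1))
    best = prev[-1]  # matching against the empty substring costs len(pattern)
    for ch in line:
        # An empty pattern prefix matches for free at any position,
        # which is what lets the match start anywhere in the line.
        curr = [0]
        for i, p in enumerate(pattern, start=1):
            cost = 0 if p == ch else 1
            curr.append(min(
                prev[i - 1] + cost,  # match or substitution
                prev[i] + 1,         # extra character in the line (insertion)
                curr[i - 1] + 1,     # pattern character missing (deletion)
            ))
        best = min(best, curr[-1])
        prev = curr
    return best


def fuzzy_grep(pattern: str, lines: Iterable[str], max_errors: int) -> Iterator[str]:
    """Yield the lines that contain `pattern` with at most `max_errors` edits."""
    for line in lines:
        if min_edit_distance_in_line(pattern, line.rstrip("\n")) <= max_errors:
            yield line
```

To use it on a file, something like `for row in fuzzy_grep("ACTGGGTAAACTA", open("file"), 3): print(row, end="")` would print the matching rows. Note that a budget of 3 edits covers the substitution example from the question, while the two insertion/deletion examples each differ from the pattern by a 4-character gap, so they may need a larger `max_errors`.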