Mysql optimization for REGEXP

前端 未结 3 1370
孤街浪徒
孤街浪徒 2020-12-07 01:35

This query (with different name instead of \"jack\") happens many times in my slow query log. Why?

The Users table has many fields (more than these three I\'ve selec

相关标签:
3条回答
  • 2020-12-07 01:55

    I reached 50% speedup just by adding fieldname != '' in where clause. It makes mysql to use indexes.

    SELECT name, username, id 
    FROM users 
    WHERE name != '' 
        AND (name REGEXP '[[:<:]]jack[[:>:]]' or username REGEXP '[[:<:]]jack[[:>:]]') 
    ORDER BY name 
    LIMIT 0,5;
    

    Not a perfect solution but helps.

    0 讨论(0)
  • 2020-12-07 02:03

    If you must use regexp-style WHERE clauses, you definitely will be plagued by slow-query problems. For regexp-style search to work, MySQL has to compare every value in your name column with the regexp. And, your query has doubled the trouble by also looking at your username column.

    This means MySQL can't take advantage of any indexes, which is how all DBMSs speed up queries of large tables.

    There are a few things you can try. All of them involve saying goodbye to REGEXP.

    One is this:

    WHERE name LIKE CONCAT('jack', '%') OR username LIKE CONCAT('jack', '%')
    

    If you create indexes on your name and username columns this should be decently fast. It will look for all names/usernames beginning with 'jack'. NOTICE that

    WHERE name LIKE CONCAT('%','jack') /* SLOW!!! */
    

    will look for names ending with 'jack' but will be slow like your regexp-style search.

    Another thing you can do is figure out why your application needs to be able to search for part of a name or username. You can either eliminate this feature from your application, or figure out some better way to handle it.

    Possible better ways:

    1. Ask your users to break up their names into given-name and surname fields, and search separately.
    2. Create a separate "search all users" feature that only gets used when a user needs it, thereby reducing the frequency of your slow regexp-style query.
    3. Break up their names into a separate name-words table yourself using some sort of preprocessing progam. Search the name-words table without regexp.
    4. Figure out how to use MySQL full text search for this feature.

    All of these involve some programming work.

    0 讨论(0)
  • 2020-12-07 02:15

    Add "LIKE" in front

    from

    SELECT cat_ID, categoryName FROM category WHERE cat_ID REGEXP '^15-64-8$' ORDER BY categoryName
    

    to

    SELECT cat_ID, categoryName FROM category WHERE cat_ID LIKE '15-64-8%' and cat_ID REGEXP '^15-64-8$' ORDER BY categoryName
    

    Of cos, that works only if U r search for phrases U know starting with what, else full text index is the solution.

    0 讨论(0)
提交回复
热议问题