PostgreSQL - GROUP BY clause

后端 未结 2 1531
囚心锁ツ
囚心锁ツ 2020-12-04 02:44

I want to search by tags, and then list all articles with that tag, and also how many of given tags they match. So for example I might have:

 Page1 - 2 (has          


        
相关标签:
2条回答
  • 2020-12-04 03:05

    First, to clarify, Postgres 9.1 or later (quoting release notes of 9.1) ...

    Allow non-GROUP BY columns in the query target list when the primary key is specified in the GROUP BY clause (Peter Eisentraut)

    More in this related answer:
    Return a grouped list with occurrences using Rails and PostgreSQL

    Next, the queries in the question and in @Michael's answer have got the logic backwards. We want to count how many tags match per article, not how many articles have a certain tag. So we need to GROUP BY w_article.id, not by a_tags.id.

    list all articles with that tag, and also how many of given tags they match

    To fix this:

    SELECT COUNT(t.tag) AS ct, a.* -- any column from a allowed ...
    FROM   a_tags         t
    JOIN   w_articles2tag a2t ON a2t.tag = t.id 
    JOIN   w_article      a   ON a.id = a2t.article 
    WHERE  t.tag IN ('css', 'php')
    GROUP  BY a.id           -- ... since grouped by pk column of a
    LIMIT  9
    

    Assuming id is the primary key of w_article.
    However, this form will be faster while doing the same:

    SELECT a.*, ct
    FROM  (
       SELECT a2t.article AS id, COUNT(*) AS ct
       FROM   a_tags         t
       JOIN   w_articles2tag a2t ON a2t.tag = t.id 
       GROUP  BY a.article 
       LIMIT  9      -- LIMIT early - cheaper
       ) sub
    JOIN   w_article a USING (id);  -- attached alias to article in the sub
    

    More in this closely related answer from just yesterday:
    Why does the following join increase the query time significantly?

    As an aside: It is an anti-pattern to use the generic, non-descriptive id as column name. Call it article_id etc. in both tables. Easier to join and you don't have to use aliases in queries all the time.

    0 讨论(0)
  • 2020-12-04 03:12

    When you use a "GROUP BY" clause, you need to enclose all columns that are not grouped in an aggregate function. Try adding title to the GROUP BY list, or selecting "min(a.title)" instead.

    SELECT COUNT(t.tag), a.title FROM a_tags t
    JOIN w_articles2tag a2t ON a2t.tag = t.id 
    JOIN w_article a ON a.id = a2t.article 
    WHERE t.tag = 'css' OR t.tag = 'php' GROUP BY t.tag, a.title LIMIT 9
    
    0 讨论(0)
提交回复
热议问题