How to unpivot a table in PostgreSQL

前端 未结 4 782
野趣味
野趣味 2020-12-07 03:39

I am having difficulties writing a Postgres function, as I am not familiar with it. I have multiple tables to import into Postgres with this format:

id | 196         


        
4条回答
  •  生来不讨喜
    2020-12-07 04:14

    A completely dynamic version requires dynamic SQL. Use a plpgsql function with EXECUTE:

    For Postgres 9.2 or older (before LATERAL was implemented):

    CREATE OR REPLACE FUNCTION f_unpivot_years92(_tbl regclass, VARIADIC _years int[])
      RETURNS TABLE(id int, year int, value int) AS
    $func$
    BEGIN
       RETURN QUERY EXECUTE '
       SELECT id
            , unnest($1) AS year
            , unnest(ARRAY["'|| array_to_string(_years, '","') || '"]) AS val
       FROM   ' || _tbl || '
       ORDER  BY 1, 2'
       USING _years;
    END
    $func$  LANGUAGE plpgsql;
    

    For Postgres 9.3 or later (with LATERAL):

    CREATE OR REPLACE FUNCTION f_unpivot_years(_tbl regclass, VARIADIC _years int[])
      RETURNS TABLE(id int, year int, value int) AS
    $func$
    BEGIN
       RETURN QUERY EXECUTE (SELECT
         'SELECT t.id, u.year, u.val
          FROM  ' || _tbl || ' t
          LEFT   JOIN LATERAL (
             VALUES ' || string_agg(format('(%s, t.%I)', y, y), ', ')
         || ') u(year, val) ON true
          ORDER  BY 1, 2'
          FROM   unnest(_years) y
          );
    END
    $func$  LANGUAGE plpgsql;
    

    About VARIADIC:

    • Return rows matching elements of input array in plpgsql function

    Call for arbitrary years:

    SELECT * FROM f_unpivot_years('tbl', 1961, 1964, 1963);
    

    Same, passing an actual array:

    SELECT * FROM f_unpivot_years('tbl', VARIADIC '{1960,1961,1962,1963}'::int[]);
    

    For a long list of sequential years:

    SELECT * 
    FROM f_unpivot_years('t', VARIADIC ARRAY(SELECT generate_series(1950,2014)));
    

    For a long list with regular intervals (example for every 5 years):

    SELECT *
    FROM f_unpivot_years('t', VARIADIC ARRAY(SELECT generate_series(1950,2010,5)));
    

    Output as requested.

    The function takes:
    1. A valid table name - double-quoted if it's otherwise illegal (like '"CaMeL"'). Using the object identifier type regclass to assert correctness and defend against SQL injection. You may want to schema-qualify the tale name to be unambiguous (like 'public."CaMeL"'). More:

    • Table name as a PostgreSQL function parameter

    2. Any list of numbers corresponding to (double-quoted) column names.
    Or an actual array, prefixed with the keyword VARIADIC.

    The array of columns does not have to be sorted in any way, but table and columns must exist or an exception is raised.

    Output is sorted by id and year (as integer). If you want years to be sorted according to the sort order of the input array, make it just ORDER BY 1. Sort order according to array is not strictly guaranteed, but works in the current implementation. More about that:

    • PostgreSQL unnest() with element number

    Also works for NULL values.

    SQL Fiddle for both with examples.

    References:

    • Is there something like a zip() function in PostgreSQL that combines two arrays?

    • Table name as a PostgreSQL function parameter

    • PostgreSQL generate_series() with SQL function as arguments

提交回复
热议问题