How can I get the set difference of two result sets?
Say I have a result set (just one column in each):
result1:
\'a\'
\'b\'
\'c\'
result2:
\'b\'
\'
I recently had this requirement where I had to find the difference between two resultsets. Although the above answers helped me wished they were a little detailed. For the given question I found two interpretation:
For the first one where the resultset can be from 2 different tables, let's take two tables: science_student and math_student.
science_student
math_student
result1 - result2
result1: select student_id from science_student where id > 2
result2: select student_id from math_student
The difference between result1 - result2 is STUD3
So the query to find the difference will be:
select result1.student_id
from
(select student_id from science_student where id > 2) result1
left join
(select student_id from math_student) result2
on result1.student_id = result2.student_id
where result2.student_id is null;
For the second interpretation where the resultset can be from the same table:
result1 - result2
result1: select student_id from science_student
result2: select student_id from science_student where id > 2
The difference between result1 - result2 is STUD1, STUD2
And the query for the same will be:
select result1.student_id
from
(select student_id from science_student) result1
left join
(select student_id from science_student where id > 2) result2
on result1.student_id = result2.student_id
where result2.student_id is null;
To perform result1 - result2, you can join result1 with result2, and only output items that exist in result1. For example:
SELECT DISTINCT result1.column
FROM result1 LEFT JOIN result2 ON result1.column = result2.column
WHERE result2.column IS NULL
Note that is not a set difference, and won't output items in result2 that don't exist in result1. It's set subtraction.
See also: Web archive'd version of relevant blog post.
If you want things in result1 that are not in result2, what about:
SELECT distinct result1
FROM t1
WHERE result1 NOT IN (select distinct result2 from t2);
Or:
SELECT distinct result
from t1 t
where NOT EXISTS (select 1 from t2 where result2 = t.result1)
NOTE: if result1 is a subset of result2 then the above queries will return an empty set (they won't show you things in result2 that are not in result1) so they are not set difference, but may be useful too (probably it's more efficient than the outer join).