LINQ Join with Multiple From Clauses

前端 未结 4 1790
耶瑟儿~
耶瑟儿~ 2020-12-08 02:18

When writing LINQ queries in C#, I know I can perform a join using the join keyword. But what does the following do?

from c in Companies
from e          


        
相关标签:
4条回答
  • 2020-12-08 02:23

    All the first set of objects will be joined with all the second set of objects. For example, the following test will pass...

        [TestMethod()]
        public void TestJoin()
        {
            var list1 = new List<Object1>();
            var list2 = new List<Object2>();
    
            list1.Add(new Object1 { Prop1 = 1, Prop2 = "2" });
            list1.Add(new Object1 { Prop1 = 4, Prop2 = "2av" });
            list1.Add(new Object1 { Prop1 = 5, Prop2 = "2gks" });
    
            list2.Add(new Object2 { Prop1 = 3, Prop2 = "wq" });
            list2.Add(new Object2 { Prop1 = 9, Prop2 = "sdf" });
    
            var list = (from l1 in list1
                        from l2 in list2
                        select l1).ToList();
    
            Assert.AreEqual(6, list.Count);
    
        }
    
    0 讨论(0)
  • 2020-12-08 02:24

    This will get translated into a SelectMany() call. It is essentially a cross-join.

    Jon Skeet talks about it on his blog, as part of the Edulinq series. (Scroll down to Secondary "from" clauses.)

    0 讨论(0)
  • 2020-12-08 02:27

    Multiple "from" statements are considered compound linq statments. They are like nested foreach statements. The msdn page does list a great example here

    var scoreQuery = from student in students
                     from score in student.Scores
                     where score > 90
                     select new { Last = student.LastName, score };
    

    this statement could be rewritten as:

    SomeDupCollection<string, decimal> nameScore = new SomeDupCollection<string, float>();
    foreach(Student curStudent in students)
    {
       foreach(Score curScore in curStudent.scores)
       {
          if (curScore > 90)
          {
             nameScore.Add(curStudent.LastName, curScore);
          }
       }
    }
    
    0 讨论(0)
  • 2020-12-08 02:30

    The code that you listed:

    from c in company
    from e in c.Employees
    select e;
    

    ... will produce a list of every employee for every company in the company variable. If an employee works for two companies, they will be included in the list twice.

    The only "join" that might occur here is when you say c.Employees. In an SQL-backed provider, this would translate to an inner join from the Company table to the Employee table.

    However, the double-from construct is often used to perform "joins" manually, like so:

    from c in companies
    from e in employees
    where c.CompanyId == e.CompanyId
    select e;
    

    This would have a similar effect as the code you posted, with potential subtle differences depending on what the employees variable contains. This would also be equivalent to the following join:

    from c in companies
    join e in employees
       on c.CompanyId equals e.CompanyId
    select e;
    

    If you wanted a Cartesian product, however, you could just remove the where clause. (To make it worth anything, you'd probably want to change the select slightly, too, though.)

    from c in companies
    from e in employees
    select new {c, e};
    

    This last query would give you every possible combination of company and employee.

    0 讨论(0)
提交回复
热议问题