Bad implementation of Enumerable.Single?

后端未结

关注

 7  585

佛祖请我去吃肉 2020-12-15 03:19

I came across this implementation in Enumerable.cs by reflector.

public static TSource Single(this IEnumerable source, Func<


      
      
        
          7条回答        

        
                    
            
            
                         
                
              
              
                
                   旧时难觅i
                                             
                
                
                (楼主)
            
              
              
                2020-12-15 04:00
              

            
            
                        
I think it's a premature optimization "bug".

Why this is NOT reasonable behavior due to side effects

Some have argued that due to side effects, it should be expected that the entire list is evaluated.  After all, in the correct case (the sequence indeed has just 1 element) it is completely enumerated, and for consistency with this normal case it's nicer to enumerate the entire sequence in all cases.

Although that's a reasonable argument, it flies in the face of the general practice throughout the LINQ libraries: they use lazy evaluation everywhere.  It's not general practice to fully enumerate sequences except where absolutely necessary; indeed, several methods prefer using IList.Count when available over any iteration at all - even when that iteration may have side effects.

Further, .Single() without predicate does not exhibit this behavior: that terminates as soon as possible.  If the argument were that .Single() should respect side-effects of enumeration, you'd expect all overloads to do so equivalently.

Why the case for speed doesn't hold

Peter Lillevold made the interesting observation that it may be faster to do...

foreach(var elem in elems)
    if(pred(elem)) {
        retval=elem;
        count++;
    }
if(count!=1)...


than

foreach(var elem in elems)
    if(pred(elem)) {
        retval=elem;
        count++;
        if(count>1) ...
    }
if(count==0)...


After all, the second version, which would exit the iteration as soon as the first conflict is detected, would require an extra test in the loop - a test which in the "correct" is purely ballast.  Neat theory, right?

Except, that's not bourne out by the numbers; for example on my machine (YMMV) Enumerable.Range(0,100000000).Where(x=>x==123).Single() is actually faster than Enumerable.Range(0,100000000).Single(x=>x==123)!

It's possibly a JITter quirk of this precise expression on this machine - I'm not claiming that Where followed by predicateless Single is always faster.

But whatever the case, the fail-fast solution is very unlikely to be significantly slower.  After all, even in the normal case, we're dealing with a cheap branch: a branch that is never taken and thus easy on the branch predictor.  And of course; the branch is further only ever encountered when pred holds - that's once per call in the normal case.  That cost is simply negligible compared to the cost of the delegate call pred and its implementation, plus the cost of the interface methods .MoveNext() and .get_Current() and their implementations.

It's simply extremely unlikely that you'll notice the performance degradation caused by one predictable branch in comparison to all that other abstraction penalty - not to mention the fact that most sequences and predicates actually do something themselves.
    
             
                                                        
            
            
              
                
                0
              
                   
                
               讨论(0)
              
                                                  
              
              
                          
             
       
          
              
                                       
     查看其它7个回答


            
                         
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
                              			
        
        
        
          
            
            
              
              
            
    


                                 
              
            
                          
    

        
         
                验证码
                
                  
                
                
                   看不清?
                
              
                                  
                    
   
                 
             
              提交回复