Function optimized to infinite loop at 'gcc -O2'

后端未结

关注

 3  1476

Context
I was asked the following puzzle by one of my friends:

void fn(void)
{
  /* write something after this comment so that the progr


                      
              相关标签:


      
      
        
          3条回答        

        
                         				            
            
           
            
                              
                
              
              
                
                  佛祖请我去吃肉        
                
              
                            
                2020-12-14 15:14
              
            
            
                                                                       
If the loop does get optimized out into an infinite loop, it could be due to static code analyzis seeing that your array is 


not volatile
contains only 0
never gets written to


and thus it is not possible for it to contain the number 5. Which means an infinite loop.

Even if it didn't do this, your approach could fail easily. For example, it's possible that some compiler would optimize your code without making your loop infinite, but would stuff the contents of i into a register, making it unavailable from the stack.

As a side note, I bet what your friend actually expected was this:

void fn(void)
{
  /* write something after this comment so that the program output is 10 */
  printf("10\n"); /* Output 10 */
  while(1); /* Endless loop, function won't return, i won't be output */
  /* write something before this comment */
}


or this (if stdlib.h is included):

void fn(void)
{
  /* write something after this comment so that the program output is 10 */
  printf("10\n"); /* Output 10 */
  exit(0); /* Exit gracefully */
  /* write something before this comment */
}

                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  野的像风        
                
              
                            
                2020-12-14 15:31
              
            
            
                                                                       
In the optimized version, the compiler has decided a few things:


The array a doesn't change before that test.
The array a doesn't contain a 5.


Therefore, we can rewrite the code as:

void fn(void) {
  int a[1] = {0};
  int j = 0;
  while(true) ++j;
  a[j] = 10;
}


Now, we can make further decisions:


All the code after the while loop is dead code (unreachable).
j is written but never read. So we can get rid of it.
a is never read.


At this point, your code has been reduced to:

void fn(void) {
  int a[1] = {0};
  while(true);
}


And we can make the note that a is now never read, so let's get rid of it as well:

void fn(void) {
  while(true);
}


Now, the unoptimized code:

In unoptimized generated code, the array will remain in memory. And you'll literally walk it at runtime. And it's possible that there will be a 5 thats readable after it once you walk past the end of the array.

Which is why the unoptimized version sometimes doesn't crash and burn.
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
            
           
            
                              
                
              
              
                
                  旧时难觅i        
                
              
                            
                2020-12-14 15:38
              
            
            
                                                                       
This is undefined behavior so the compiler can really do anything at all, we can find a similar example in GCC pre-4.8 Breaks Broken SPEC 2006 Benchmarks, where gcc takes a loop with undefined behavior and optimizes it to:

L2:
    jmp .L2


The article says (emphasis mine):


  Of course this is an infinite loop. Since SATD() unconditionally
  executes undefined behavior (it’s a type 3 function), any
  translation (or none at all) is perfectly acceptable behavior for a
  correct C compiler. The undefined behavior is accessing d[16] just
  before exiting the loop. In C99 it is legal to create a pointer to
  an element one position past the end of the array, but that pointer
  must not be dereferenced. Similarly, the array cell one element past
  the end of the array must not be accessed.


which if we examine your program with godbolt we see:

fn:
.L2:
    jmp .L2


The logic being used by the optimizer probably goes something like this:


All the elements of a are initialized to zero
a is never modified before or within the loop
So  a[j] != 5 is always true -> infinite loop
Because of the infinite, the a[j] = 10; is unreachable and so that can be optimized away, so can a and j since they are no longer needed to determine the loop condition.


which is similar to the case in the article which given:

int d[16];


analyzes the following loop:

for (dd=d[k=0]; k<16; dd=d[++k]) 


like this:


  upon seeing d[++k], is permitted to assume that the incremented value
  of k is within the array bounds, since otherwise undefined behavior
  occurs. For the code here, GCC can infer that k is in the range 0..15.
  A bit later, when GCC sees k<16, it says to itself: “Aha– that
  expression is always true, so we have an infinite loop.”


Perhaps an interesting secondary point, is whether an infinite loop is considered observable behavior(w.r.t. to the as-if rule) or not, which effects whether an infinite loop can also be optimized away. We can see from C Compilers Disprove Fermat’s Last Theorem that before C11 there was at least some room for interpretation:


  Many knowledgeable people (including me) read this as saying that the
  termination behavior of a program must not be changed.  Obviously some
  compiler writers disagree, or else don’t believe that it matters.  The
  fact that reasonable people disagree on the interpretation would seem
  to indicate that the C standard is flawed.


C11 adds clarification to section 6.8.5 Iteration statements and is covered in more detail in this answer.
                                                                        
                                                        
            
            
              
                
                0
              
                 
                
               讨论(0)
              
              
                                                   
              
                                                            
            
                      
                    


               
            
    发布评论:
    
         
                        
    
    提交评论 
  
  

                    
                    
                    
                        
                        
                         加载中...
                        
                    
                
          
          	          
                             
        
        
          
            
            
              
              
            
    


                                 
              
            
                          
    

        
         
                验证码
                
                  
                
                
                   看不清?
                
              
                                  
                    
   
                 
             
              提交回复