When using a mask register with AVX-512 load and stores, is a fault raised for invalid accesses to masked out elements?

前端 未结 1 1066
清酒与你
清酒与你 2020-12-20 14:28

When I do a writemasked AVX-512 store, like so:

vmovdqu8 [rsi] {k1}, zmm0

Will the instruction fault if some portion of the memory accessed

相关标签:
1条回答
  • 2020-12-20 15:35

    No fault is raised if masked out elements touch invalid memory.


    Here's some Windows test code to prove that masking does indeed suppress memory faults.

    #include <immintrin.h>
    #include <iostream>
    #include <Windows.h>
    using namespace std; 
    
    
    int main(){
        const size_t PAGE = 4096;
    
        //  Map 2 pages.
        char* ptr = (char*)VirtualAlloc(
            nullptr, 2*PAGE,
            MEM_COMMIT,
            PAGE_READWRITE
        );
    
        //  Store 64 bytes across page boundary.
        cout << "Store across page boundary." << endl;
        _mm512_storeu_si512(ptr + PAGE - 32, _mm512_set1_epi8(-1));
    
        //  Unmap top page.
        cout << "Unmap top page." << endl;
        VirtualFree(ptr + PAGE, PAGE, MEM_DECOMMIT);
    
        //  Write on boundary masking out the part that touches the top (unmapped page).
        //  Does not crash because bad accesses are masked out.
        cout << "Store across page boundary, but mask out bytes that are on unmapped page." << endl;
        _mm512_mask_storeu_epi8(ptr + PAGE - 32, 0x00000000ffffffff, _mm512_set1_epi8(-1));
    
        //  Store 64 bytes across page boundary.
        //  Crashes because of bad access.
        cout << "Store across page boundary." << endl;
        _mm512_storeu_si512(ptr + PAGE - 32, _mm512_set1_epi8(-1));
    
        cout << "Release bottom page." << endl;
        VirtualFree(ptr, 0, MEM_RELEASE);
    
        system("pause");
    }
    

    Output:

    Store across page boundary.
    Unmap top page.
    Store across page boundary, but mask out bytes that are on unmapped page.
    Store across page boundary.
    **Access violation**
    

    This test works as follows:

    1. Map 2 adjacent pages.
    2. Do an AVX512 store across the page boundary to prove that both pages are mapped.
    3. Unmap the upper page.
    4. Do the same AVX512 store, but mask out the bytes that are on the upper page. It does not crash.
    5. Repeat the 1st AVX512 store (without masking). It crashes, thus proving that the upper page has been unmapped and the masking suppressed the crash.
    0 讨论(0)
提交回复
热议问题