How to use clang to compile OpenCL to ptx code?

北恋 2021-02-02 00:31

Clang 3.0 can compile OpenCL to PTX, and NVIDIA's tools can then launch the PTX code on a GPU. How can I do this? Please be specific.

3 answers

天命终不由人 (original poster)

2021-02-02 00:51
With the current version of LLVM (3.4), libclc, and the NVPTX back end, the compilation process has changed slightly.

You have to explicitly tell the NVPTX back end which driver interface to use. Your options are nvptx-nvidia-cuda or nvptx-nvidia-nvcl (for OpenCL), and their 64-bit equivalents nvptx64-nvidia-cuda or nvptx64-nvidia-nvcl.
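As a small illustration of how the four target triples above are formed, here is a sketch of a shell helper that builds one from a driver interface and a pointer width. The function name `nvptx_triple` is hypothetical and not part of clang or LLVM; only the triple strings themselves come from the answer.

```shell
#!/bin/sh
# Hypothetical helper (not part of clang/LLVM): build an NVPTX target triple.
# $1 = driver interface: "cuda" or "nvcl" (OpenCL)
# $2 = pointer width:    "32" or "64"
nvptx_triple() {
    case "$1" in
        cuda|nvcl) ;;
        *) echo "unknown interface: $1" >&2; return 1 ;;
    esac
    case "$2" in
        32) arch=nvptx ;;
        64) arch=nvptx64 ;;
        *) echo "unknown width: $2" >&2; return 1 ;;
    esac
    echo "$arch-nvidia-$1"
}

# Default to the 64-bit OpenCL triple used in the steps of this answer.
nvptx_triple "${1:-nvcl}" "${2:-64}"
```

Run without arguments, this prints `nvptx64-nvidia-nvcl`, which is the value passed to `-target` for an OpenCL host.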

The generated .ptx code differs slightly depending on the chosen interface. In the assembly produced for the CUDA driver API, the .global and .ptr qualifiers are dropped from entry functions, but OpenCL requires them. I've modified Mikael's compile steps slightly to produce code that can be run with an OpenCL host:
1. Compile to LLVM IR:
```
clang -Dcl_clang_storage_class_specifiers -isystem libclc/generic/include -include clc/clc.h -target nvptx64-nvidia-nvcl -xcl test.cl -emit-llvm -S -o test.ll
```
2. Link the kernel against the libclc bitcode library:
```
llvm-link libclc/built_libs/nvptx64--nvidiacl.bc test.ll -o test.linked.bc
```
3. Compile to PTX:
```
clang -target nvptx64-nvidia-nvcl test.linked.bc -S -o test.nvptx.s
```
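The three steps above can be collected into one small script. The following is a sketch only: the helper name `ptx_commands` and the `LIBCLC_DIR` and `TRIPLE` variables are assumptions for illustration, and the script prints the commands rather than running them, so it can be inspected on a machine without clang installed.

```shell
#!/bin/sh
# Sketch: print the three compile steps for one OpenCL kernel file.
# LIBCLC_DIR, TRIPLE and ptx_commands are illustrative names, not clang features.
LIBCLC_DIR="${LIBCLC_DIR:-libclc}"       # assumed location of the libclc checkout
TRIPLE="${TRIPLE:-nvptx64-nvidia-nvcl}"  # 64-bit OpenCL triple from the answer

ptx_commands() {
    src="$1"
    base="${src%.cl}"   # strip the .cl suffix to name the intermediate files
    # Step 1: OpenCL C -> LLVM IR
    echo "clang -Dcl_clang_storage_class_specifiers -isystem $LIBCLC_DIR/generic/include -include clc/clc.h -target $TRIPLE -xcl $src -emit-llvm -S -o $base.ll"
    # Step 2: link against the libclc bitcode (file name matches the 64-bit triple)
    echo "llvm-link $LIBCLC_DIR/built_libs/nvptx64--nvidiacl.bc $base.ll -o $base.linked.bc"
    # Step 3: linked bitcode -> PTX assembly
    echo "clang -target $TRIPLE $base.linked.bc -S -o $base.nvptx.s"
}

# Print the commands for the given kernel (default: test.cl).
ptx_commands "${1:-test.cl}"
```

To actually run the steps, pipe the output to `sh`. If you switch `TRIPLE` to a 32-bit target, the libclc bitcode file in step 2 would need to change as well; the script does not handle that.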