Actually output file is created by clang itself, but isn't updated and is deleted after processing. In cooperation with -print-stat -no-output (or single -n) is used for examine the source CUDA code. Conflicting options: -inplce -o