-
Notifications
You must be signed in to change notification settings - Fork 5.9k
Closed
Labels
Description
问题描述 Issue Description
Build release/2.6 branch but triggers following error if -DWITH_TESTING=ON:
/workspace/paddle/paddle/phi/kernels/fusion/cutlass/cutlass_kernels/fpA_intB_gemm/fpA_intB_gemm_template.h:48:10: fatal error: paddle/phi/kernels/fusion/cutlass/cutlass_kernels/fpA_intB_gemm/autogen/arch_define.h: No such file or directory
48 | #include "paddle/phi/kernels/fusion/cutlass/cutlass_kernels/fpA_intB_gemm/autogen/arch_define.h"
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~According the error message, it needs files automatically generated by something.
Manually running phi/.../generic_mixed_gemm_kernelLauncher.py triggers another error - FileNotFoundError until I manually created the autogen folder:
$ cd /workspace/paddle/paddle/phi/kernels/fusion/cutlass/cutlass_kernels/fpA_intB_gemm
$ python ./generic_mixed_gemm_kernelLauncher.py
Traceback (most recent call last):
File "/home/scratch.rjeng_sw/baidu/paddle/paddle/paddle/phi/kernels/fusion/cutlass/cutlass_kernels/fpA_intB_gemm/./generic_mixed_gemm_kernelLauncher.py", line 212, in <module>
with open(header_name, "w") as f:
FileNotFoundError: [Errno 2] No such file or directory: 'autogen/arch_define.h'
$ mkdir autogen
$ python ./generic_mixed_gemm_kernelLauncher.py # generating needed filesThe CMakeLists should cover build process instead of asking users manually creating files and folder.
版本&环境信息 Version & Environment Information
CCACHE_DIR=/workspace/ccache_storage
ARCH_FLAGS="-march=sandybridge -mtune=broadwell"
CXX_FLAGS="$ARCH_FLAGS -Wno-error=range-loop-construct -Wno-error=maybe-uninitialized -Wno-error=switch -faligned-new=32"
CUDA_FLAGS="-t2 --forward-unknown-to-host-compiler -Xfatbin=-compress-all $ARCH_FLAGS -lineinfo"
SKIP_DOWNLOAD_INFERENCE_DATA=ON cmake -Bpaddle/build -Spaddle \
-GNinja \
-DINFERENCE_DEMO_INSTALL_DIR=/home/scratch.rjeng_sw/baidu/paddle/test_data \
-DCMAKE_CXX_FLAGS="$CXX_FLAGS" \
-DCMAKE_EXPORT_COMPILE_COMMANDS=ON \
-DCMAKE_BUILD_TYPE=Release \
-DCMAKE_CUDA_FLAGS="$CUDA_FLAGS" \
-DCUDA_ARCH_NAME=Manual \
-DCUDA_ARCH_BIN="80" \
-DWITH_INCREMENTAL_COVERAGE=OFF \
-DWITH_INFERENCE_API_TEST=OFF \
-DWITH_DISTRIBUTE=ON \
-DWITH_COVERAGE=OFF \
-DWITH_TENSORRT=OFF \
-DWITH_TESTING=ON \
-DWITH_CONTRIB=ON \
-DWITH_ROCM=OFF \
-DWITH_RCCL=OFF \
-DWITH_STRIP=ON \
-DWITH_MKL=OFF \
-DWITH_AVX=OFF \
-DWITH_GPU=ON \
-DWITH_PYTHON=ON \
-DWITH_CUDNN_FRONTEND=ON \
-DPY_VERSION=$PYVER \
-Wno-dev
cmake --build paddle/build -j$((`nproc`))