🐛 Describe the bug C10_CUDA_KERNEL_LAUNCH_CHECK calls cudaGetLastError: pytorch/c10/cuda/CUDAException.h Line 73 in 18b37bb #define C10_CUDA_KERNEL_LAUNCH_CHECK() C10_CUDA_CHECK(cudaGetLastError()) however, the result is discarded! pytor...
问CUDA GPU处理: TypeError: compile_kernel()获得意外的关键字参数“boundscheck”EN还是以谷歌的colab为...
2 changes: 1 addition & 1 deletion 2 ggml-cuda/fattn-vec-f32.cuh Original file line numberDiff line numberDiff line change @@ -149,7 +149,7 @@ static __global__ void flash_attn_vec_ext_f32( for (int i0 = 0; i0 < D/2; i0 += WARP_SIZE) { const int i = i0 + ...