启用cuda graph的情况下,tuning部分的代码不能执行在计算stream上,同时,tensor 也无法保证可用,所以重新分配 liujuncheng added 2 commits January 16, 2023 17:34 cutlass conv support CudaGraph 2ada25d enable CudaGraphSupport 7738127 liujuncheng added enhancement op labels Jan 16, 2023 liujuncheng requ...
🐛 Describe the bug import torch @torch.compile(fullgraph=True) def f(q, k, v): q = torch.cos(q) with torch.backends.cuda.sdp_kernel(enable_flash=True): return F.scaled_dot_product_attention(q, k, v) f(*[torch.randn(1,8,1024,64, dtype=tor...
All the 'graph_port' warnings have been fixed or have pending fixes, so we can enable it by default now. Signed-off-by: Rob Herring <robh@kernel.org>main robherring committed Nov 30, 2018 1 parent 70523a3 commit a2237fe Showing 1 changed file with 0 additions and 1 deletion. Whitesp...