Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. This error only happened with the `vit_h_14` model on a CUDA device (the CPU is fine). Also, I cannot reproduce the error on an AWS cluster machine. It seems this error is either machine- or environment-dependent and likely to be pyto...
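
Since the failure looks environment-dependent, one way to narrow it down might be to dump the same version details on the failing machine and on the AWS machine and compare them. Below is a minimal sketch of that idea, not a confirmed diagnosis: `collect_env` and `diff_env` are hypothetical helper names, and setting `CUDA_LAUNCH_BLOCKING=1` is an assumption that synchronous kernel launches will make the stack trace point at the failing operation.

```python
import os
import platform

# Assumption: set before importing torch so CUDA kernels launch synchronously
# and the Python stack trace points at the operation that actually failed.
os.environ.setdefault("CUDA_LAUNCH_BLOCKING", "1")


def collect_env():
    """Return a dict of version details worth comparing across machines."""
    info = {"python": platform.python_version(), "platform": platform.platform()}
    try:
        import torch
    except ImportError:
        info["torch"] = None  # PyTorch is not installed in this environment
        return info
    info["torch"] = torch.__version__
    info["cuda_build"] = torch.version.cuda  # None for CPU-only builds
    info["cuda_available"] = torch.cuda.is_available()
    if info["cuda_available"]:
        info["gpu"] = torch.cuda.get_device_name(0)
        info["cudnn"] = torch.backends.cudnn.version()
    return info


def diff_env(a, b):
    """Return {key: (a_value, b_value)} for every key whose values differ."""
    keys = sorted(set(a) | set(b))
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}


if __name__ == "__main__":
    # Run this on both machines and compare the printed values.
    for key, value in collect_env().items():
        print(f"{key}: {value}")
```

Running the script on both machines and passing the two resulting dicts to `diff_env` would show only the entries that differ (for example the PyTorch build or the GPU model), which is usually a reasonable first place to look.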