(device='cpu', nvme_path=None, buffer_count=5, buffer_size=100,000,000, max_in_cpu=1,000,000,000, pin_memory=False) offload_optimizer=None sub_group_size=1,000,000,000 cpu_offload_param=None cpu_offload_use_pin_memory=None cpu_offload=None prefetch_bucket_size=50,000,000 param_...
on Feb 9, 2024 I ran the script above, but I cannot repro it. Can you rerun the script withTORCH_SHOW_CPP_STACKTRACES=1, which will give a better clue where the assertion failed. PyTorch: 2.2.0a0+git197ff40 GPU: NVIDIA A100-PG509-200 Total GPU memory 39.4GB, alloc 381.5MB, free...
[W upsampling.h:67] Warning: The default behavior for interpolate/upsample with float scale_factor changed in 1.6.0 to align with other frameworks/libraries, and uses scale_factor directly, instead of relying on the computed output size. If you wish to keep the old behavior, please set reco...