(device='cpu', nvme_path=None, buffer_count=5, buffer_size=100,000,000, max_in_cpu=1,000,000,000, pin_memory=False) offload_optimizer=None sub_group_size=1,000,000,000 cpu_offload_param=None cpu_offload_use_pin_memory=None cpu_offload=None prefetch_bucket_size=50,000,000 param_...
cpuhrsch added module: CUDACachingAllocator on Feb 9, 2024 I ran the script above, but I cannot repro it. Can you rerun the script withTORCH_SHOW_CPP_STACKTRACES=1, which will give a better clue where the assertion failed. PyTorch: 2.2.0a0+git197ff40 GPU: NVIDIA A100-PG509-200 Tota...
and uses scale_factor directly, instead of relying on the computed output size. If you wish to keep the old behavior, please set recompute_scale_factor=True. See the documentation of nn.Upsample for details. (function _