sudo apt-get update wget https://repo.radeon.com/amdgpu-install/22.10.3/ubuntu/focal/amdgpu-install_22.10.3.50103-1_all.deb sudo apt-get install ./amdgpu-install_22.10.3.50103-1_all.deb amdgpu-install --usecase=dkms Pull and run pytorch docker image. sudo docker pull rocm/pytorch:late...
In general, it looks like settingHIP_VISIBLE_DEVICES=xleads to GPU(x+2)%4being used: HIP_VISIBLE_DEVICES=0-> GPU2 HIP_VISIBLE_DEVICES=1-> GPU3 HIP_VISIBLE_DEVICES=2-> GPU0 HIP_VISIBLE_DEVICES=3-> GPU1 Versions Used therocm/pytorch:latestdocker image (image id:b80124b96134) from ...
PyTorch version: 2.6.0+cu124 Is debug build: False CUDA used to build PyTorch: 12.4 ROCM used to build PyTorch: N/A OS: Ubuntu 24.04.2 LTS (x86_64) GCC version: (Ubuntu 13.3.0-6ubuntu2~24.04) 13.3.0 Clang version: 18.1.8 (++20240731025043+3b5b5c1ec4a3-1~exp1~20240731145144.92)...
onnxruntime_rocm, tensorflow_rocm' from AMD source. conda create -n pytorch python==3.10 conda activate pytorch pip3 install torch==2.3.0 torchvision==0.18.0 pytorch_triton_rocm==2.3.0 onnxruntime_rocm==1.18.0 tensorflow_rocm==2.16.2 -fhttps://repo.radeon.com/rocm/manylinux/rocm-rel-...
Your current environment Collecting environment information... PyTorch version: 2.3.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A OS: Ubuntu 22.04.4 LTS (x86_64) GCC version: (Ubuntu 11.4...
NVIDIA_PYTORCH_VERSION=24.08 TORCH_ALLOW_TF32_CUBLAS_OVERRIDE=1 Model Input Dumps No response 🐛 Describe the bug llm=LLM(model=model_path,task="reward") (output,)=llm.encode("Hello, my name is")data=output.outputs.dataprint(f"Data:{data!r}") ...
PyTorch version: 2.4.1.post3 Is debug build: False CUDA used to build PyTorch: Could not collect ROCM used to build PyTorch: N/A OS: AlmaLinux release 8.9 (Midnight Oncilla) (x86_64) GCC version: (GCC) 8.5.0 20210514 (Red Hat 8.5.0-20) Clang version: Could not collect CMake ...
speediedan opened on Aug 15, 2024· edited by pytorch-bot Edits 🐛 Describe the bug Testing a variety of TP requires_grad patterns (validating maximally flexible finetuning) revealed DTensor sharding propagation of aten.native_layer_norm_backward (default) fails with the following IndexError (...
Platform: Linux-5.4.56.bsk.10-amd64-x86_64-with-glibc2.31 Python version: 3.9.2 PyTorch version (GPU?): 2.0.0+cu117 (True) Huggingface_hub version: 0.15.1 Transformers version: 4.27.4 Accelerate version: 0.20.3 xFormers version: 0.0.18 Using GPU in script?: Using distributed or pa...
CUDA used to build PyTorch: 12.6 ROCM used to build PyTorch: N/A OS: Ubuntu 22.04.2 LTS (x86_64) GCC version: (conda-forge gcc 13.3.0-1) 13.3.0 Clang version: Could not collect CMake version: version 3.30.3 Libc version: glibc-2.35 ...