load_in_8bit_fp32_cpu_offload+true

2025-01-10 13:14:48

拼音 [ 拼音 ]

`load_in_8bit_fp32_cpu_offload=True · Issue #39 · Vision...

these modules in 32-bit, you need to set load_in_8bit_fp32_cpu_offload=True and pass a custom device_map to from_pretrained. Check https://huggingface.co/docs/transformers/main/en/main_classes/quantization#offload-between-cpu-and-gpu for more details. I have 48gb of vram the GPU RAM...
...with load_in_8bit=True, llm_int8_enable_fp32_cpu_offload=...

Huggingface_hub version: 0.12.1 PyTorch version (GPU?): 1.13.1+cu117 (True) Tensorflow version (GPU?): not installed (NA) Flax version (CPU?/GPU?/TPU?): not installed (NA) Jax version: not installed JaxLib version: not installed ...