gradient_checkpointing_kwargs = {"use_reentrant": False} model.config.use_cache = False peft_model = get_peft_model(model, LORA_CONFIG) tokenizer = AutoTokenizer.from_pretrained(MODEL, token=TOKEN, max_length=8192, padding_side="left") # load data data = load_dataset("json", data_...
training_args["gradient_checkpointing_kwargs"] = {"use_reentrant": False} if config.mixed_precision == "fp16": training_args["fp16"] = True if config.mixed_precision == "bf16": training_args["bf16"] = True if config.trainer == "reward": training_args["max_length"] = config.b...
Use accelerator to replace cuda in setup and runner by @Andy666G in #5769 Link GDS blog to site by @tjruwase in #5820 Non-reentrant checkpointing hook fix by @ic-synth in #5781 Fix NV references by @tjruwase in #5821 Fix docs building guide by @tjruwase in #5825 Update clang-fo...
/home/sourab/miniconda3/envs/hf/lib/python3.10/site-packages/torch/utils/checkpoint.py:426: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain curre...
We read every piece of feedback, and take your input very seriously. Include my email address so I can be contacted Cancel Submit feedback Saved searches Use saved searches to filter your results more quickly Cancel Create saved search Sign in Sign up Reseting focus {...
Use accelerator to replace cuda in setup and runner by @Andy666G in #5769 Link GDS blog to site by @tjruwase in #5820 Non-reentrant checkpointing hook fix by @ic-synth in #5781 Fix NV references by @tjruwase in #5821 Fix docs building guide by @tjruwase in #5825 Update clang-fo...
Use accelerator to replace cuda in setup and runner by @Andy666G in #5769 Link GDS blog to site by @tjruwase in #5820 Non-reentrant checkpointing hook fix by @ic-synth in #5781 Fix NV references by @tjruwase in #5821 Fix docs building guide by @tjruwase in #5825 Update clang-fo...