wandb: Currently logged in as: anony-moose-529863. Use `wandb login --relogin` to force relogin I have similar problems. I am trying to resume a wandb run on a SLURM job, by running: And I get the errors: Sign up for freeto join this conversation on GitHub. Already have an account...
The notebook was working fine till a day before and I was storing checkpoints but now when I try to run either from the checkpoint or by loading t5-small, I get asked for the wandb API key on running the trainer. I don't even have a profile. When I tried wandb off, it again as...
False, # for hybrid auto-labelling save_conf=False, # save auto-label confidences plots=True, wandb_logger=None, compute_loss=None, half_precision=True, trace=False, is_coco=False, v5_metric=False): # Initialize/load model and set device training = model is not None if training: # ...
= -1: # -1 rank indicates serial code dist.destroy_process_group() def get_batch(batch: Tuple[Tensor, Tensor], rank) -> Tuple[Tensor, Tensor]: x, y = batch if torch.cuda.is_available(): x, y = x.to(rank), y.to(rank) else: # I don't think this is needed... # x, ...