Can you provide more details (how many GBs allocated, which model, etc.?) Thanks! beyondguo commented Jun 19, 2023 Sure. Model: ChatGLM-6B device: 4 * A800-80G 70 GBs allocated for each GPU. The code I'm using is https://github.com/beyondguo/LLM-Tuning/blob/796384e837b3b6d705...
You indicated that it’s been working well “for quite some time”. Depending on how you’re using it, how much you write to it, the quality of the device, and just how long “some time” is, I’m willing to bet that the device has simply reached the end of its useful life. An...