2 days ago a notice appeared next to my pod: *We have detected a critical error on this machine which may affect some pods. We are looking into the root cause and apologize for any inconvenience. We would recommend backing up your data and creating a new pod in the meantime.* I guess this is a HW error. Since then, trying to boot with GPU gives me this error in log: `error creating container: nvidia-smi: parsing output of line 6: failed to parse ([GPU requires reset]) into int: strconv.Atoi: parsing "": invalid syntax` And wont boot up at all. If I try to bootup in CPU mode, the server seems to go online - with 512MB RAM which is immediately 100% utilized and 0.5 vCPU. Web terminal fails to launch and when I try to connect from my OSX terminal, I get through the authentication but end up with this: > `-- RUNPOD.IO -- > Enjoy your Pod #cqszf7XXXXXX ^_^ > > failed to resize tty, using default size > task 47a9a5d9f53e86219dfbxxxxf1469674753c965x778655d7132f05d469614845f not found: not found > Connection to 100.65.XX.XX closed. > Connection to ssh.runpod.io closed.` I am really desperate, I've been using runpod for over a month and have though what a great service it is. I've built and configured a perfect pod for my work workflow. Was currently running a big job for a client (which I have now loosed for not delivering on time). Despite the notice (quoted above) nobody is proactively looking into the issue, no updates. I have cotacted RunPods customer service and created a ticket (and have read the whole documentation). The support was completely useless - replying with some template answer telling me to create a network storage and migrate my data there, pointing me to two knowledgebase articles. But.. 1) I cannot connect to my pod to migrate the data 2) they've sent me two step-by-step articles which are useless - they miss a lots of steps and are not even correct (stating wrong parameters for runpodctl etc) 3) why did this even happen in the first place? 4) why are they keeping me paying for a pod and storage I can't use? 5) why is there no proactive monitoring that would sending me an email to inform me "hey, we are sorry but there is a malfunction on the server your pod is hosted on .." 6) how do I know this wont happen again in few weeks if I migrate the pod? (but for that I would need someone from runpod to actually reach out to me - but they are not very responsive - in the meantime I am loosing money) 7) Also they say to provide as much information as possible in these This is very unfortunate situation for me and terrible customer experience. I've though "This is it" when I first discovered runpod but if this is how they care about their customers and the level of SLA they provide .. Any ideas? Please help.