1

Our server Supermicro SYS-6016T-U often crashes due to https://bugs.centos.org/view.php?id=10642 .

Since it's unable to write to disk, crash info isn't saved. I also cannot login using either IPMI kvm console or ssh to run dmesg.

How to configure it to save its errors to some network location? (actually, NetworkManager crashes too, so the network card may lose its config) Or maybe the ipmi device has some buffer for that?

I made a mistake and reserved 100% disk space for software raid. Not sure I can use kdump now

basin
  • 548
  • 1
  • 3
  • 20

1 Answers1

0

I think you should be looking to configure kdump. It's pretty good at what it does as it runs a new kernel to capture the state of the old one. I've used it to capture kernel crash data that was later used by RH to patch a kenel bug.

It seems like there is also a work-around for the bug you are experiencing - downgrade your kernel to 3.10.0-229.4.2

user9517
  • 114,104
  • 20
  • 206
  • 289