First of all, I apologize if this questions seems too generic. I searched this site and a few others and really did not get the answers I was looking for.
Since migrating to a new virtual environment, we have been plagued with issues. It seems every day it's something else. From corporate firewall blocks to tmp disk filling up and shutting down services. It seems as if there is a new problem each and every day, which means I get screamed out every day.
I told my boss I cannot predict the future, and know what this firewall will do or who will have us black listed, etc. I told him I would do the best I can to prepare for these kind of issues, however every time something comes out of left field, most of the time I have no way of fixing it until the problem presents itself.
My boss is a "big picture" kind of guy. If One thing is broken, then the whole thing is broken. I did my best to explain to him I don't always see problems coming, however I do my best to monitor our server and take care of the things I know about.
Here is a list of things I have done to ensure the operation of our server:
- Checked sending mail IP against blacklist
- Moved TMP disk for MySQL to larger disk (we were having problems with it filling up tmp dir)
- Monitor CPU, Memory and Loads
- Removed firewall blocks from all known corporate firewalls such as McAfee
- Set up backups
- Made sure all DNS records were set up properly
- Set up rDNS and PTR record
- Check the reputation of our dedicated IP's on a regular basis
- Monitor disk usage
- Keep software updated
- Do my best to make sure our server is spam free by monitoring mail volume changes
- Have SPF and DKIM records set up
Is there anything else I can do to preemptively prevent down time and other problems?