I recently experienced a filesystem meltdown. I had a server running for about 180 days non stop without any issues, but then I noticed weird stuff happen and apparently the ext3 filesystem was in really bad shape. I had the drives and the memory tested and they were all fine. Ultimately, I was forced to hose the system and do a full reinstall. fsck.ext3
only made things worse.
Now, I don't want this to happen again so this time I went with XFS instead, which I feel is more mature than ext3, but I am at a loss how to monitor the health of the file system. xfs_check
simply won't let me scan the device while it is mounted.
So, how do you monitor the health of an XFS filesystem while the system is online?