I met a problem when executing FIO stress test on a RAID0 which built from 6x SSDs thru mdadm under Yocto OS, below is the information:
- 6x PCIe NVMe SSD are the same vendor and model which is with 1.02TB automotive grade.
- FIO parameter used for the test: fio --filename=/dev/md127 --direct=1 --rw=randrw --bs=64k --ioengine=libaio --iodepth=64 --runtime=43200 --numjobs=16 --time_based --group_reporting --name=randomrw --eta-newline=1
- The system auto restart after 30 minutes run
The question is that I'd want to know why it would cause the system auto restart randomly, is that a software issue or software limitation, or a hardware issue? Would you suggest on how to isolate the issue?
I'm going to delete RAID0 and try agaig with same FIO parameters and on a single SSD first, if it cannot be reproduced, then will run the test again on all 6 SSDs with same parameters but without RAID mode.
Thanks, Jacky