Our lab has a cluster with:
- 70 compute nodes
- 4 IO nodes
- InfiniBand QDR interconnect
- 12 TB disk array accessed over IB SRP
The main use is debugging and running MPI-based parallel scientific programs. The compute nodes (the clients) all write at the same time, several gigabytes of data in total, every few minutes.
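To put rough numbers on that write pattern, here is a back-of-the-envelope sketch. Only the 70 compute nodes come from our setup; the 5 GB total, the 5-minute interval, and the 30-second write window are placeholder assumptions, not measurements, and the function name is just for illustration.

```python
def burst_write_profile(total_gb, n_clients, interval_s, window_s):
    """Estimate the load of a periodic, simultaneous write burst.

    Returns (MB written per client per burst,
             aggregate MB/s while the burst is in progress,
             aggregate MB/s averaged over the whole interval).
    1 GB is taken as 1024 MB.
    """
    total_mb = total_gb * 1024
    per_client_mb = total_mb / n_clients
    burst_mb_s = total_mb / window_s
    average_mb_s = total_mb / interval_s
    return per_client_mb, burst_mb_s, average_mb_s


# Assumed values: 5 GB per burst, one burst every 300 s, each burst
# expected to finish within 30 s. Only n_clients=70 is a real figure.
per_client, burst, average = burst_write_profile(
    total_gb=5, n_clients=70, interval_s=300, window_s=30
)
print(f"{per_client:.1f} MB per node per burst")
print(f"{burst:.1f} MB/s aggregate during the burst")
print(f"{average:.1f} MB/s averaged over the interval")
```

Under those assumptions the average rate is small; what the filesystem has to absorb is the short burst from all clients at once.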
The filesystem used to be Lustre, chosen because it is similar to what mainstream supercomputer centers run. But the installation is too complicated and it is awful to maintain.
So is there an easy-to-use, small-scale distributed network filesystem? Or is NFS OK for this scenario?