Cooperative storage cloud
A cooperative storage cloud is a decentralized model of networked online storage where data is stored on multiple computers (nodes), hosted by the participants cooperating in the cloud. For the cooperative scheme to be viable, the total storage contributed in aggregate must be at least equal to the amount of storage needed by end users. However, some nodes may contribute less storage and some may contribute more. There may be reward models to compensate the nodes contributing more.
Unlike a traditional storage cloud, a cooperative does not directly employ dedicated servers for the actual storage of the data, thereby eliminating the need for a significant dedicated hardware investment. Each node in the cooperative runs specialized software which communicates with a centralized control and orchestration server, thereby allowing the node to both consume and contribute storage space to the cloud. The centralized control and orchestration server requires several orders of magnitude less resources (storage, computing power, and bandwidth) to operate, relative to the overall capacity of the cooperative.
Data security
Files hosted in the cloud are fragmented and encrypted before leaving the local machine. They are then distributed randomly using a load balancing and geo-distribution algorithm to other nodes in the cooperative. Users can add an additional layer of security and reduce storage space by compressing and encrypting files before they are copied to the cloud.
Data redundancy
In order to maintain data integrity and high availability across a relatively unreliable set of computers over a wide area network like the Internet, the source node will add some level of redundancy to each data block.[1] This allows the system to recreate the entire block even if some nodes are temporarily unavailable (due to loss of network connectivity, the machine being powered off or a hardware failure). The most storage and bandwidth efficient forms of redundancy use erasure coding techniques like Reed-Solomon. A simple, less CPU intensive but more expensive form of redundancy is duplicate copies.
Flexible contribution
Due to bandwidth or hardware constraints some nodes may not be able to contribute as much space as they consume in the cloud. On the other hand, nodes with large storage space and limited or no bandwidth constraints may contribute more than they consume, thereby the cooperative can stay in balance.
Examples
Examples include MIT's Chord[1], Filecoin, Sia, and Storj.
A partly centralized system was operated by Symform, Inc., a startup company based in Seattle.[1][2] Symform generated and kept the keys used to encrypt and decrypt, and since it also decided which server will host which parts of a file, users have to trust Symform not to share those with any other party or misuse the information.[3] Symform discontinued its service on July 31, 2016.
See also
- Cloud computing
- Cyber Resilience
- Distributed data store
- Freenet
- InterPlanetary File System
- Tahoe-LAFS
References
- Mearian, Lucas (2009-09-21). "Start-up unveils cloud storage co-op". Computerworld. Retrieved 2010-10-14.
- Huang, Gregory T (2009-02-19). "Symform, Founded by Ex-Microsoft Pair, Offers Cheap, Efficient Data Storage in the Cloud". Xconomy. Retrieved 2010-10-14.
- "Symform Security Analysis". VirtualServerGuy. December 19, 2012. Archived from the original on May 26, 2015. Retrieved May 26, 2015.