High Performance Storage System

High Performance Storage System (HPSS) is a flexible, scalable, policy-based hierarchical storage management (HSM) product developed by the HPSS Collaboration. It provides HSM, archive, and file system services, using cluster, LAN, and SAN technologies to aggregate the capacity and performance of many computers, disks, disk systems, tape drives, and tape libraries.[1]
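
To make the idea of policy-based hierarchical storage concrete, the following is a minimal, generic sketch of an idle-time migration policy between a fast disk tier and a high-capacity tape tier. It is an illustration of the general HSM concept only, not HPSS's actual policy engine or API; the names `FileRecord`, `apply_migration_policy`, and `max_idle_days` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FileRecord:
    """Hypothetical catalog entry for one stored file."""
    name: str
    size_bytes: int
    last_access_day: int  # day number of the most recent access
    tier: str = "disk"    # "disk" (fast) or "tape" (high capacity)


def apply_migration_policy(files, today, max_idle_days):
    """Move files idle for more than max_idle_days from disk to tape.

    Returns the names of the files that were migrated.
    """
    migrated = []
    for f in files:
        idle_days = today - f.last_access_day
        if f.tier == "disk" and idle_days > max_idle_days:
            f.tier = "tape"
            migrated.append(f.name)
    return migrated


# Illustrative data only: one file idle for 90 days, one idle for 5 days.
catalog = [
    FileRecord("run-001.dat", 4_000_000_000, last_access_day=10),
    FileRecord("run-002.dat", 2_500_000_000, last_access_day=95),
]
moved = apply_migration_policy(catalog, today=100, max_idle_days=30)
print(moved)  # ['run-001.dat']
```

A real HSM would also weigh factors such as file size, free space on each tier, and the cost of recalling data from tape; this sketch uses idle time alone to keep the policy easy to follow.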

High Performance Storage System
Developer(s): HPSS Collaboration (IBM, LANL, LBNL, LLNL, ORNL, SNL)
Stable release: 8.3 / March 2020
Operating system: Linux
Type: Hierarchical Storage Management
License: Proprietary
Website: hpss-collaboration

Architecture

HPSS supports several methods for creating and accessing data, including FTP, parallel FTP, FUSE (Linux), and a robust client API that supports parallel I/O.
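
Because a FUSE file system appears as an ordinary directory tree, a program can in principle use standard file operations to store data through it. The sketch below assumes such a mount exists; the function name `archive_via_mount` and all paths are hypothetical, and the demo uses a temporary directory as a stand-in for the mount point so that it runs anywhere. It does not use the HPSS client API, and actual mount locations and behavior depend on the site configuration.

```python
import shutil
import tempfile
from pathlib import Path


def archive_via_mount(src, mount_root, rel_dest):
    """Copy src to rel_dest under mount_root and return the destination path.

    mount_root is assumed to be the directory where the storage system
    is mounted (for example, through FUSE). Only standard file I/O is used.
    """
    dest = Path(mount_root) / rel_dest
    dest.parent.mkdir(parents=True, exist_ok=True)  # create missing directories
    shutil.copyfile(src, dest)
    return dest


# Demo: a temporary directory stands in for the (hypothetical) mount point.
with tempfile.TemporaryDirectory() as tmp:
    fake_mount = Path(tmp) / "hpss_mount"
    fake_mount.mkdir()
    source = Path(tmp) / "results.txt"
    source.write_text("simulation output\n")

    stored = archive_via_mount(source, fake_mount, "project/run1/results.txt")
    print(stored.relative_to(fake_mount))
```

The same function would work unchanged against a real mount point, which is the main appeal of the FUSE route; FTP, parallel FTP, or the client API are the alternatives when a mount is not available or parallel transfers are needed.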

As of version 7.5, HPSS is fully supported on Linux. The HPSS client API is supported on AIX, Linux, and Solaris.[1]

The implementation is built around IBM's Db2, a scalable relational database management system.

The HPSS Collaboration

The collaboration that produced HPSS began in the fall of 1992 and involved IBM's Houston Global Services and five United States Department of Energy (DOE) national laboratories (Lawrence Berkeley, Lawrence Livermore, Los Alamos, Oak Ridge, and Sandia).[1] At that time, the HPSS design team from IBM and the DOE national laboratories anticipated a data storage explosion. As computing power rose to teraops and petaops, the data stored in HSMs would have to grow to petabytes and beyond, data transfer rates to and from an HSM would have to rise to gigabytes per second and higher, and daily throughput would have to reach tens of terabytes per day. The collaboration therefore set out to design and deploy a system that would scale by a factor of 1,000 or more and evolve toward these expected targets and beyond.[2]

The HPSS collaboration is based on the premise that no single organization has the experience and resources to meet all the challenges posed by the growing imbalance between computing power and data collection capabilities on the one hand and storage system I/O, capacity, and functionality on the other. Over twenty organizations worldwide, including industry, the US Department of Energy (DOE), other federal laboratories, universities, National Science Foundation (NSF) supercomputer centers, and the French Commissariat à l'Énergie Atomique (CEA), have contributed to various aspects of this effort.

As of 2014, the primary HPSS development team consists of:

Notable achievements


References

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.