NIRD Data Peak vs NIRD Data Lake
NIRD consists of two distinct storage systems, namely NIRD Data Peak (known internally as TS) and NIRD Data Lake (codenamed DL).
NIRD Data Peak has several tiers spanned by single filesystem and designed for performance and used mainly for active project data.
NIRD Data Lake has a flat structure, designed mainly for less active data, sharing data across multiple projects, and interfacing with external storages.
Both are based on IBM Elastic Storage System.
Architecture comparison
NIRD Data Peak |
NIRD Data Lake |
|
---|---|---|
Tiers |
Performance and capacity tiers |
Flat architecture (no tiers) |
Designed for |
- high-performance storage for any type of active research data |
- long-term storage of non-persistent data |
Data integrity secured by |
- erasure coding |
- erasure coding |
Functionality comparison
NIRD Data Peak |
NIRD Data Lake |
|
---|---|---|
Protocols |
POSIX, GPFS and NFS |
POSIX, GPFS and S3[2] |
APIs |
GPFS, Discover REST API[3] |
GPFS, S3, Discover REST API[3] |
Possibilities for |
- file access logs |
- file access logs |
Access controls |
- ACLs |
- ACLs |
On-demand backup |
Yes |
No |
Filesystems
NIRD Data Peak
NIRD DL
–