NIRD Data Peak vs NIRD Data Lake

NIRD consists of two distinct storage systems, namely NIRD Data Peak (known internally as TS) and NIRD Data Lake (codenamed DL).

NIRD Data Peak has several tiers spanned by single filesystem and designed for performance and used mainly for active project data.

NIRD Data Lake has a flat structure, designed mainly for less active data, sharing data across multiple projects, and interfacing with external storages.

Both are based on IBM Elastic Storage System.

Architecture comparison

NIRD Data Peak

NIRD Data Lake

Tiers

Performance and capacity tiers
Automatic, transparent tiering
Dedicated pools for metadata

Flat architecture (no tiers)

Designed for

- high-performance storage for any type of active research data
- high-performance storage for I/O intensive computing such as HPC and AI workloads
- storage of valuable research data requiring a secondary copy
- storage for structured data of any volume larger than 1 TiB

- long-term storage of non-persistent data
- storage for any type of inactive/cold data
- storage for structured or unstructured data of any volume larger than 1 TiB
- sharing datasets and libraries for collaboration across projects and institutions in the sector
- interfacing with sensors or third party storage system
- object storage

Data integrity secured by

- erasure coding
- snapshots
- backup[1]

- erasure coding
- snapshots

Functionality comparison

NIRD Data Peak

NIRD Data Lake

Protocols

POSIX, GPFS and NFS

POSIX, GPFS and S3[2]

APIs

GPFS, Discover REST API[3]

GPFS, S3, Discover REST API[3]

Possibilities for

- file access logs
-data insight: metadata harvesting[3]

- file access logs
- data insight: metadata harvesting[3]
- encrypted projects

Access controls

- ACLs
- extended attributes

- ACLs
- extended attributes
- RBAC via S3[2]

On-demand backup

Yes

No

Filesystems

NIRD Data Peak

  • Project storage /nird/datapeak

  • User’s home /nird/home

  • Scratch storage /nird/scratch[4]

  • Archive /archive[5]

NIRD DL

  • Project storage /nird/datalake

  • Backup /backup[5]

  • Archive /archive[5]