NIRD
National Infrastructure for Research Data
NIRD is the National e-Infrastructure for Research Data. It is owned by Sigma2 and operated by NRIS.
Note
NIRD offers storage services for high-performance data analytics, unified file and object storage, archiving services, cloud services, and processing capabilities on the stored data. It offers services and capacities to any scientific discipline that requires access to advanced, large scale, or high-end resources for storing, processing, publishing research data or searching digital databases and collections.
NIRD is a high-performance storage system, capable of supporting AI and analytics workloads, offering simultaneous multi-protocol access to the same data.
The next generation NIRD storage system is installed in Lefdal Mine Datacenter. The new NIRD is redesigned for the evolving needs of Norwegian researchers and has been procured through the NIRD2020 project.
NIRD provides storage resources with yearly capacity upgrades, data security through backup services and adaptable application services, multiple storage protocol support, migration to third-party cloud providers and much more.
Alongside the national high-performance computing resources, NIRD forms the backbone of the national e-infrastructure for research and education in Norway, connecting data and computing resources for efficient provisioning of services.
Technical Specifications
Hardware
NIRD consists of two separate storage systems, namely NIRD Data Peak (known internally as TS) and NIRD Data Lake (codenamed DL), each tailored to optimally address two different categories of use cases. Commencing with the 2024.1 allocation, the array of functionalities provided by the TS and the DL resources are consolidated and presented as two distinct services. The total capacity of NIRD is 49 PB (24 PB on TS and 25 PB on DL).
NIRD Data Peak has several tiers spanned by single filesystem and designed for performance and used mainly for active project data.
NIRD Data Lake has a flat structure, designed mainly for less active data. The Data Lake provides a unified access, i.e., file- and object storage for sharing data across multiple projects, and interfacing with external storages.
NIRD is based on IBM Elastic Storage System, built using ESS3200, ESS3500 and ESS5000 building blocks. I/O performance is ensured with IBM POWER servers for I/O operations, having dedicated data movers, protocol nodes and more.
NIRD |
||
---|---|---|
System |
Building blocks |
IBM ESS3200 |
Clusters |
Two physically separated clusters |
NIRD TS |
Storage media |
NIRD Data Peak |
NVMe SSD & NL-SAS |
Capacity |
Total capacity: 49 PB |
NIRD Data Peak: 24 PB |
Performance |
Aggregated I/O throughput |
NIRD Data Peak: 209 GB/s |
Interconnect |
100 Gbit/s Ethernet |
NIRD Data Peak: balanced 400 Gbit/s |
Protocol nodes |
NFS |
4 x 200 Gbit/s |
Software
IBM Storage Scale (GPFS) is deployed on NIRD, providing a software-defined high-performance file- and object storage for AI and data intensive workloads.
Insight into data is ensured by IBM Storage Discover.
Backup services and data integrity is ensured with IBM Storage Protect.