Table of contents
TOC
Collapse the table of content
Expand the table of content

Resilient File System (ReFS) overview

Garrett Watumull|Last Updated: 2/17/2017
|
1 Contributor

Applies To: Windows Server 2016, Windows Server 2012 R2, Windows Server 2012

The Resilient File System (ReFS) is Microsoft's newest file system, designed to maximize data availability, scale efficiently to large data sets across diverse workloads, and provide data integrity by means of resiliency to corruption. It seeks to address an expanding set of storage scenarios and establish a foundation for future innovations.

Key benefits

Resiliency

ReFS introduces new features that can precisely detect corruptions and also fix those corruptions while remaining online, helping provide increased integrity and availability for your data:

  • Integrity-streams - ReFS uses checksums for metadata and optionally for file data, giving ReFS the ability to reliably detect corruptions.
  • Storage Spaces integration - When used in conjunction with a mirror or parity space, ReFS can automatically repair detected corruptions using the alternate copy of the data provided by Storage Spaces. Repair processes are both localized to the area of corruption and performed online, requiring no volume downtime.
  • Salvaging data - If a volume becomes corrupted and an alternate copy of the corrupted data doesn't exist, ReFS removes the corrupt data from the namespace. ReFS keeps the volume online while it handles most non-correctable corruptions, but there are rare cases that require ReFS to take the volume offline.
  • Proactive error correction - In addition to validating data before reads and writes, ReFS introduces a data integrity scanner, known as a scrubber. This scrubber periodically scans the volume, identifying latent corruptions and proactively triggering a repair of corrupt data.

Performance

In addition to providing resiliency improvements, ReFS introduces new features for performance-sensitive and virtualized workloads. Real-time tier optimization, block cloning, and sparse VDL are good examples of the evolving capabilities of ReFS, which are designed to support dynamic and diverse workloads:

  • Real-time tier optimization - Real-time tier optimization delivers both high performance and also capacity efficient storage for your data. To do so, ReFS divides a volume into two logical storage groups, known as tiers. These tiers can have their own drive and resiliency types, allowing each tier to optimize for either performance or capacity. Some example configurations include:

    Performance tierCapacity tier
    Mirrored SSDMirrored HDD
    Mirrored SSDParity SSD
    Mirrored SSDParity HDD
    • Once these tiers are configured, ReFS use them to deliver fast storage for hot data and capacity-efficient storage for cold data:
      • All writes will occur in the performance tier, and large chunks of data that remain in the performance tier will be efficiently moved to the capacity tier in real-time.
      • If using a hybrid deployment (mixing flash and HDD drives), the cache in Storage Spaces Direct will help accelerate reads, reducing the effect of data fragmentation characteristic of virtualized workloads. Otherwise, if using an all-flash deployment, reads will also occur in the performance tier.
  • Accelerated VM operations - ReFS introduces new functionality specifically targeted to improve the performance of virtualized workloads:

    • Block cloning - Block cloning accelerates copy operations, enabling quick, low-impact VM checkpoint merge operations.
    • Sparse VDL - Sparse VDL allows ReFS to zero files rapidly, reducing the time needed to create fixed VHDs from 10s of minutes to mere seconds.
  • Variable cluster sizes - ReFS supports both 4K and 64K cluster sizes. 4K is the recommended cluster size for most deployments, but 64K clusters are appropriate for large, sequential IO workloads.

Scalability

ReFS is designed to support extremely large data sets--millions of terabytes--without negatively impacting performance, achieving greater scale than prior file systems.

Supported deployments

Storage Spaces Direct

Deploying ReFS on Storage Spaces Direct is the recommended setup for virtualized workloads:

  • Real-time tier optimization and the cache in Storage Spaces Direct deliver high performance and capacity-efficient storage.
  • The introduction of block clone and sparse VDL dramatically accelerates .vhdx file operations, such as creation, merge, and expansion.
  • Built-in checksums, online repair, and alternate data copies enable ReFS and Storage Spaces Direct to jointly to detect and correct corruptions within both metadata and data.
  • ReFS provides the functionality to efficiently scale and support massive data sets.

Storage Spaces with SAS drive enclosures

Deploying ReFS on Storage Spaces with shared SAS enclosures is suitable for hosting archival data and storing user documents:

  • Built-in checksums, online repair, and alternate data copies enable ReFS and Classic Storage Spaces to jointly to detect and correct corruptions within both metadata and data.
  • ReFS provides the functionality to efficiently scale and support massive data sets.
  • Data will be stored on a non-tiered volume.

Basic disks

Deploying ReFS on basic disks is suitable for applications that provide their own resiliency and availibility solutions.

  • For example, Exchange and SQL Server AlwaysOn both introduce their own concepts of resiliency and availability suited for their specific workloads, and they can leverage the scalability offered in ReFS.

Feature comparison

Limits

FeatureReFSNTFS
Maximum file name length255 Unicode characters255 Unicode characters
Maximum path name length32K Unicode characters32K Unicode characters
Maximum file size18 EB (exabytes)18 EB (exabytes)
Maximum volume size4.7 ZB (zettabytes)256 TB

Functionality

The following features are available on ReFS and NTFS:

FunctionalityReFSNTFS
BitLocker encryptionYesYes
Cluster Shared Volume (CSV) supportYesYes
Soft linksYesYes
Failover cluster supportYesYes
Access-control listsYesYes
USN journalYesYes
Changes notificationsYesYes
Junction pointsYesYes
Mount pointsYesYes
Reparse pointsYesYes
Volume snapshotsYesYes
File IDsYesYes
OplocksYesYes
Sparse filesYesYes
Named streamsYesYes

The following features are only available on ReFS:

FunctionalityReFSNTFS
Block cloneYesNo
Sparse VDLYesNo
Real-time tier optimizationYes (on Storage Spaces Direct)No

The following features are unavailable on ReFS at this time:

FunctionalityReFSNTFS
File system compressionNoYes
File system encryptionNoYes
Data DeduplicationNoYes
TransactionsNoYes
Hard linksNoYes
Object IDsNoYes
Short namesNoYes
Extended attributesNoYes
Disk quotasNoYes
BootableNoYes
Supported on removable mediaNoYes
NTFS storage tiersNoYes

See also

© 2017 Microsoft