Large datasets often have redundant data. For example, with user file shares, multiple users tend to have files that are similar or identical. As another example, with software development shares, most binaries remain largely unchanged from build to build. Data Deduplication is a feature in Windows Server that reduces costs that are associated with redundant data by storing duplicated portions of files only once. It optimizes files transparently such that users and applications accessing the data are unaware of deduplication. Storage administrators can start saving on storage costs for their Amazon FSx file systems by turning on Data Deduplication with a single command. Typical savings are 50-60% for general-purpose file shares, 30-50% for user documents, and 70-80% for software development data sets.

