This document explains the infrastructural changes required in GlusterFS to support various data compliance features. Data management is a generic term that includes filesystem data handling and management activities such as locality-aware data placement, data tiering, BitRot detection and the like. The operational mechanisms of these features are more or less similar with respect to the input operation set being worked on. Additionally, and more importantly, the ordering of operations (or traces) tends to be much more relaxed, unlike replication, which relies on strict ordering of operations for correctness.
This document is split into two parts. The first part elaborates on the infrastructure design required for the correct functioning of various data classification mechanisms. Requirements for each sub-feature are presented briefly and correctness is proven as part of the design. Thereafter, the nature of changes for each component is listed and links to ap