2 Feb 2007 | Alexandros G. Dimakis, P. Brighten Godfrey, Martin J. Wainwright and Kannan Ramchandran
This paper introduces network coding for distributed storage systems to minimize bandwidth usage while maintaining data reliability. The authors propose two new schemes: Optimally Maintained MDS (OMMDS) and Regenerating Codes (RC). OMMDS optimally generates MDS fragments directly from existing fragments, while RC minimizes data transfer by allowing newcomers to store all downloaded data, resulting in slightly larger fragments but significantly lower maintenance bandwidth. Simulation results show that RC can reduce maintenance bandwidth by up to 25% compared to the best previous design, a hybrid of replication and erasure codes, while preserving the symmetry of MDS codes. The paper also presents a general graph-theoretic framework to analyze storage architectures and derive lower bounds on bandwidth requirements. It discusses the trade-offs between different redundancy strategies, showing that RC offers a promising alternative with lower bandwidth overhead and simpler system architecture. The study evaluates the performance of these schemes in realistic environments, demonstrating that RC provides better reliability-bandwidth trade-offs, especially in stable environments. The results challenge previous conclusions that erasure codes offer limited practical benefits due to their added complexity.This paper introduces network coding for distributed storage systems to minimize bandwidth usage while maintaining data reliability. The authors propose two new schemes: Optimally Maintained MDS (OMMDS) and Regenerating Codes (RC). OMMDS optimally generates MDS fragments directly from existing fragments, while RC minimizes data transfer by allowing newcomers to store all downloaded data, resulting in slightly larger fragments but significantly lower maintenance bandwidth. Simulation results show that RC can reduce maintenance bandwidth by up to 25% compared to the best previous design, a hybrid of replication and erasure codes, while preserving the symmetry of MDS codes. The paper also presents a general graph-theoretic framework to analyze storage architectures and derive lower bounds on bandwidth requirements. It discusses the trade-offs between different redundancy strategies, showing that RC offers a promising alternative with lower bandwidth overhead and simpler system architecture. The study evaluates the performance of these schemes in realistic environments, demonstrating that RC provides better reliability-bandwidth trade-offs, especially in stable environments. The results challenge previous conclusions that erasure codes offer limited practical benefits due to their added complexity.