
Decentralized Deduplication in SAN Cluster File Systems

Austin T. Clements, Irfan Ahmad, Murali Vilayannur, Jinyuan Li
VMware, Inc. and MIT CSAIL


  1. The Index: a map from hashes to block locators, kept as a list sorted by hash. Unique blocks are located in files and remain mutable. A virtual arena stores copy-on-write (COW) references to all shared blocks. [Figure: index entries 0e7a26.., 15ba2b.., 6cd412.., ab7373.., bc6887.., c277d6.., d5e341.., f2a4d2.., f4bea9.., with a Unique/Shared legend.]
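The index described on this slide can be pictured with a minimal sketch: a list of entries kept sorted by hash, searched by binary search. The `IndexEntry` type, its field names, the locator strings, and the unique/shared flags below are all hypothetical and chosen for illustration; only the hash prefixes come from the slide.

```python
from bisect import bisect_left
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class IndexEntry:
    digest: str    # content hash of a block (prefix only, as on the slide)
    locator: str   # where the block lives; this format is an assumption
    shared: bool   # False: unique, mutable in its file; True: COW reference in the arena


def lookup(index: List[IndexEntry], digest: str) -> Optional[IndexEntry]:
    """Binary-search a hash-sorted index; return the entry or None."""
    keys = [entry.digest for entry in index]
    pos = bisect_left(keys, digest)
    if pos < len(index) and index[pos].digest == digest:
        return index[pos]
    return None


# Hash prefixes from the slide; locators and flags are made up.
index = [
    IndexEntry("0e7a26", "file-a:0", False),
    IndexEntry("15ba2b", "arena:0", True),
    IndexEntry("6cd412", "file-b:3", False),
    IndexEntry("ab7373", "file-a:7", False),
    IndexEntry("bc6887", "arena:1", True),
    IndexEntry("c277d6", "file-c:2", False),
    IndexEntry("d5e341", "file-c:5", False),
    IndexEntry("f2a4d2", "arena:2", True),
    IndexEntry("f4bea9", "file-b:9", False),
]

print(lookup(index, "6cd412"))
print(lookup(index, "12067c"))
```

Keeping the list sorted is what lets a batch of new hashes be folded in with a single sequential pass rather than one random lookup per hash.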

  6. Indexing and Duplicate Elimination. [Animation, slides 6–25: the index starts with the entries 0e7a26.., 15ba2b.., 6cd412.., ab7373.., bc6887.., c277d6.., d5e341.., f2a4d2.., f4bea9.., under a Unique/Shared legend. Three new hashes arrive (6cd412.., 12067c.., c277d6..), are sorted by hash (12067c.., 6cd412.., c277d6..), and are merged into the index in order. 12067c.. becomes a new entry between 0e7a26.. and 15ba2b.. in the sorted list; 6cd412.. and c277d6.. each land next to an identical existing hash and collapse into a single entry.]
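The merge shown in this animation can be sketched as a single pass over two sorted lists. This is an illustration under stated assumptions, not the authors' implementation: the function name `merge_new_hashes` is hypothetical, entries are reduced to bare hash prefixes, and what happens after a match (for example, how blocks are remapped to a shared copy) is not modeled.

```python
from typing import Iterable, List, Tuple


def merge_new_hashes(index: List[str], new_hashes: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Merge new hashes into a hash-sorted index.

    Returns (merged_index, duplicates), where duplicates lists every
    new hash that matched an entry already present.
    """
    merged: List[str] = []
    duplicates: List[str] = []
    i = 0
    for h in sorted(new_hashes):
        # Copy existing entries that sort before the new hash.
        while i < len(index) and index[i] < h:
            merged.append(index[i])
            i += 1
        if i < len(index) and index[i] == h:
            duplicates.append(h)   # matches an existing index entry
        elif merged and merged[-1] == h:
            duplicates.append(h)   # repeated within the new batch
        else:
            merged.append(h)       # genuinely new hash
    merged.extend(index[i:])       # remaining existing entries
    return merged, duplicates


# Hash prefixes taken from the slides.
index = ["0e7a26", "15ba2b", "6cd412", "ab7373", "bc6887",
         "c277d6", "d5e341", "f2a4d2", "f4bea9"]
new = ["6cd412", "12067c", "c277d6"]

merged, dups = merge_new_hashes(index, new)
print(merged)
print(dups)
```

With the slide's data, the merged list gains only 12067c.., and 6cd412.. and c277d6.. are reported as duplicates, which matches the last frames of the animation.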

  26. Phew.

  27. Evaluation: How much space does DeDe save? How much overhead does DeDe introduce? How fast can DeDe deduplicate?

  31. Space Savings: VDI Cluster. Corporate Virtual Desktop Infrastructure (VDI) cluster: 113 desktop XP VMs with 6–12 months of active use ..., originally cloned from a small number of base images. [Figure labels: 1.9 TB, 1.3 TB.]

  34. Space Savings: VDI Cluster. [Figure labels: 1.3 TB; 237 GB; 173 GB unique; 61 GB shared; 2.7 GB; 1.3 GB index file; 194 MB virtual arena; 1.1 GB FS metadata.]
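The figures on these slides can be checked with a little arithmetic. The sketch below rests on assumptions the slide text does not state outright: that 1.3 TB is the size before deduplication and 237 GB the size after, that 237 GB is the sum of the unique, shared, and 2.7 GB parts, that the 2.7 GB is made up of the index file, virtual arena, and file system metadata, and that 1 TB is 1000 GB.

```python
GB_PER_TB = 1000  # assumption: decimal units

before_gb = 1.3 * GB_PER_TB   # assumed size before deduplication
after_gb = 237.0              # assumed size after deduplication

unique_gb, shared_gb, overhead_gb = 173.0, 61.0, 2.7
index_file_gb, arena_gb, fs_metadata_gb = 1.3, 0.194, 1.1

# Do the parts add up to the reported total?
components_gb = unique_gb + shared_gb + overhead_gb
overhead_parts_gb = index_file_gb + arena_gb + fs_metadata_gb

# Fraction of space saved under the before/after reading above.
reduction = 1 - after_gb / before_gb

print(f"unique + shared + overhead = {components_gb:.1f} GB")
print(f"index + arena + FS metadata = {overhead_parts_gb:.3f} GB")
print(f"space reduction = {reduction:.1%}")
```

Under these assumptions the three parts sum to 236.7 GB, in line with the 237 GB label, and the reduction comes to roughly 82%. The overhead parts sum to about 2.6 GB rather than 2.7 GB, which is plausibly rounding in the slide labels.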

  39. Runtime Effects: write monitoring; disk array caching. [Figure label: EMC CLARiiON CX3-40.]

  42. Runtime Overhead: Write Monitoring. Worst-case benchmark: 100% sequential write I/O, no computation. CPU: 33% baseline, 220% with write monitor.
