new bound for batch codes with restricted query size
play

New Bound for Batch Codes with Restricted Query Size Vitaly Skachek - PowerPoint PPT Presentation

New Bound for Batch Codes with Restricted Query Size Vitaly Skachek Joint work with Hui Zhang Estonian CS Theory Days 29 January 2016 Supported by the research grants PUT405 and IUT2-1 from the Estonian Research Council and by the COST Action


  1. New Bound for Batch Codes with Restricted Query Size Vitaly Skachek Joint work with Hui Zhang Estonian CS Theory Days 29 January 2016 Supported by the research grants PUT405 and IUT2-1 from the Estonian Research Council and by the COST Action IC1104 on random network coding and designs over F q . V. Skachek Bounds for batch codes

  2. Distributed storage systems Enormous amounts of data are stored in a large number of servers. Occasionally servers fail. Failed server is replaced and the data has to be copied to the new server. V. Skachek Bounds for batch codes

  3. Distributed storage systems Enormous amounts of data are stored in a large number of servers. Occasionally servers fail. Failed server is replaced and the data has to be copied to the new server. V. Skachek Bounds for batch codes

  4. Distributed storage systems Enormous amounts of data are stored in a huge number of servers. Occasionally servers fail. Failed server is replaced and the data has to be copied to the new server. V. Skachek Bounds for batch codes

  5. Distributed storage systems Enormous amounts of data are stored in a huge number of servers. Occasionally servers fail. Failed server is replaced and the data has to be copied to the new server. V. Skachek Bounds for batch codes

  6. Distributed storage systems Enormous amounts of data are stored in a huge number of servers. Occasionally servers fail. Failed server is replaced and the data has to be copied to the new server. V. Skachek Bounds for batch codes

  7. Locally repairable codes Consideration: minimize amount of transferred data. Proposed in [Dimakis, Godfrey, Wu, Wainwright, Ramchandran 2008]. V. Skachek Bounds for batch codes

  8. Locally repairable codes Consideration: minimize amount of transferred data. Proposed in [Dimakis, Godfrey, Wu, Wainwright, Ramchandran 2008]. Erasure-correcting codes! Additional property: erasures can be recovered by using a small number of other symbols (locality). V. Skachek Bounds for batch codes

  9. Locally repairable codes Consideration: minimize amount of transferred data. Proposed in [Dimakis, Godfrey, Wu, Wainwright, Ramchandran 2008]. Erasure-correcting codes! Additional property: erasures can be recovered by using a small number of other symbols (locality). 1 0 0 1 0 0 1 0 1 V. Skachek Bounds for batch codes

  10. Locally repairable codes Consideration: minimize amount of transferred data. Proposed in [Dimakis, Godfrey, Wu, Wainwright, Ramchandran 2008]. Erasure-correcting codes! Additional property: erasures can be recovered by using a small number of other symbols (locality). ? 1 0 0 1 0 1 0 1 V. Skachek Bounds for batch codes

  11. Locally repairable codes Consideration: minimize amount of transferred data. Proposed in [Dimakis, Godfrey, Wu, Wainwright, Ramchandran 2008]. Erasure-correcting codes! Additional property: erasures can be recovered by using a small number of other symbols (locality). ? 0 1 0 1 0 1 1 0 V. Skachek Bounds for batch codes

  12. Batch codes Proposed in [Ishai, Kushilevitz, Ostrovsky, Sahai 2004]. Can be used in: Load balancing. Private information retrieval. Distributed storage systems. V. Skachek Bounds for batch codes

  13. Batch codes Proposed in [Ishai, Kushilevitz, Ostrovsky, Sahai 2004]. Can be used in: Load balancing. Private information retrieval. Distributed storage systems. Constructions: [Ishai et al. 2004]: algebraic, expander graphs, subsets, RM codes, locally-decodable codes V. Skachek Bounds for batch codes

  14. Prior art Design-based constructions and bounds: [Stinson, Wei, Paterson 2009] [Brualdi, Kiernan, Meyer, Schroeder 2010] [Bujtas, Tuza 2011] [Bhattacharya, Ruj, Roy 2012] [Silberstein, Gal 2013] V. Skachek Bounds for batch codes

  15. Prior art Design-based constructions and bounds: [Stinson, Wei, Paterson 2009] [Brualdi, Kiernan, Meyer, Schroeder 2010] [Bujtas, Tuza 2011] [Bhattacharya, Ruj, Roy 2012] [Silberstein, Gal 2013] Application to distributed storage: [Rawat, Papailiopoulos, Dimakis, Vishwanath 2014] [Silberstein 2014] V. Skachek Bounds for batch codes

  16. Prior art Design-based constructions and bounds: [Stinson, Wei, Paterson 2009] [Brualdi, Kiernan, Meyer, Schroeder 2010] [Bujtas, Tuza 2011] [Bhattacharya, Ruj, Roy 2012] [Silberstein, Gal 2013] Application to distributed storage: [Rawat, Papailiopoulos, Dimakis, Vishwanath 2014] [Silberstein 2014] Graph-based constructions: [Dimakis, Gal, Rawat, Song 2014] V. Skachek Bounds for batch codes

  17. Batch codes Definition [Ishai et al. 2004] C is an ( k , N , t , n , ν ) Σ batch code over Σ if it encodes any string x = ( x 1 , x 2 , · · · , x k ) ∈ Σ k into n strings (buckets) of total length N over Σ, namely y 1 , y 2 , · · · , y n , such that for each t -tuple (batch) of (not neccessarily distinct) indices i 1 , i 2 , · · · , i t ∈ [ k ], the symbols x i 1 , x i 2 , · · · , x i t can be retrieved by t users, respectively, by reading ≤ ν symbols from each bucket, such that x i ℓ is recovered from the symbols read by the ℓ -th user alone. V. Skachek Bounds for batch codes

  18. Batch codes Definition [Ishai et al. 2004] C is an ( k , N , t , n , ν ) Σ batch code over Σ if it encodes any string x = ( x 1 , x 2 , · · · , x k ) ∈ Σ k into n strings (buckets) of total length N over Σ, namely y 1 , y 2 , · · · , y n , such that for each t -tuple (batch) of (not neccessarily distinct) indices i 1 , i 2 , · · · , i t ∈ [ k ], the symbols x i 1 , x i 2 , · · · , x i t can be retrieved by t users, respectively, by reading ≤ ν symbols from each bucket, such that x i ℓ is recovered from the symbols read by the ℓ -th user alone. Definition If ν = 1, then we use notation ( k , N , t , n ) Σ for it. Only one symbol is read from each bucket. V. Skachek Bounds for batch codes

  19. Batch codes Definition [Ishai et al. 2004] C is an ( k , N , t , n , ν ) Σ batch code over Σ if it encodes any string x = ( x 1 , x 2 , · · · , x k ) ∈ Σ k into n strings (buckets) of total length N over Σ, namely y 1 , y 2 , · · · , y n , such that for each t -tuple (batch) of (not neccessarily distinct) indices i 1 , i 2 , · · · , i t ∈ [ k ], the symbols x i 1 , x i 2 , · · · , x i t can be retrieved by t users, respectively, by reading ≤ ν symbols from each bucket, such that x i ℓ is recovered from the symbols read by the ℓ -th user alone. Definition If ν = 1, then we use notation ( k , N , t , n ) Σ for it. Only one symbol is read from each bucket. Definition An ( k , N , t , n , ν ) q batch code is linear , if every symbol in every bucket is a linear combination of original symbols. V. Skachek Bounds for batch codes

  20. Small buckets In what follows, consider linear codes with ν = 1 and N = n : each encoded bucket contains just one symbol in F q . V. Skachek Bounds for batch codes

  21. Small buckets In what follows, consider linear codes with ν = 1 and N = n : each encoded bucket contains just one symbol in F q . 1 1 0 0 0 0 1 0 1 x x x 2 3 2 V. Skachek Bounds for batch codes

  22. Linear batch codes For simplicity we refer to a linear ( k , N = n , t , n ) q batch code as [ n , k , t ] q batch code. V. Skachek Bounds for batch codes

  23. Linear batch codes For simplicity we refer to a linear ( k , N = n , t , n ) q batch code as [ n , k , t ] q batch code. Let x = ( x 1 , x 2 , · · · , x k ) be an information string. Let y = ( y 1 , y 2 , · · · , y n ) be an encoding of x . Each encoded symbol y i , i ∈ [ n ], is written as y i = � k j =1 g j , i x j . Form the matrix G : � � G = g j , i j ∈ [ k ] , i ∈ [ n ] ; the encoding is y = xG . V. Skachek Bounds for batch codes

  24. Retrieval Theorem Let C be an [ n , k , t ] q batch code. It is possible to retrieve x i 1 , x i 2 , · · · , x i t simultaneously if and only if there exist t non-intersecting sets T 1 , T 2 , · · · , T t of indices of columns in G , and for T r there exists a linear combination of columns of G indexed by that set, which equals to the column vector e T i r , for all r ∈ [ t ]. V. Skachek Bounds for batch codes

  25. Retrieval Theorem Let C be an [ n , k , t ] q batch code. It is possible to retrieve x i 1 , x i 2 , · · · , x i t simultaneously if and only if there exist t non-intersecting sets T 1 , T 2 , · · · , T t of indices of columns in G , and for T r there exists a linear combination of columns of G indexed by that set, which equals to the column vector e T i r , for all r ∈ [ t ]. Example [Ishai et al. 2004] Consider the following linear binary batch code C whose 4 × 9 generator matrix is given by  1 0 1 0 0 0 1 0 1  0 1 1 0 0 0 0 1 1   G =  .   0 0 0 1 0 1 1 0 1  0 0 0 0 1 1 0 1 1 V. Skachek Bounds for batch codes

  26. Retrieval (cont.) Example Let x = ( x 1 , x 2 , x 3 , x 4 ), y = xG . Assume that we want to retrieve the values of ( x 1 , x 1 , x 2 , x 2 ). We can retrieve ( x 1 , x 1 , x 2 , x 2 ) from the following set of equations:  = x 1 y 1   x 1 = y 2 + y 3  . = y 5 + y 8 x 2   x 2 = y 4 + y 6 + y 7 + y 9  It is straightforward to verify that any 4-tuple ( x i 1 , x i 2 , x i 3 , x i 4 ), where i 1 , i 2 , i 3 , i 4 ∈ [4], can be retrieved by using columns indexed by some four non-intersecting sets of indices in [9]. Therefore, the code C is a [9 , 4 , 4] 2 batch code. V. Skachek Bounds for batch codes

Recommend


More recommend