peculiarities of large dimensions and some repercussions
play

PECULIARITIES OF LARGE DIMENSIONS and some repercussions Anatoly - PowerPoint PPT Presentation

PECULIARITIES OF LARGE DIMENSIONS and some repercussions Anatoly Zhigljavsky Cardiff University, MATHS Cardiff, November 7, 2018 Plan I. Large dimensions II. Applications to global optimization III. Other repercussions IV. Conclusions


  1. PECULIARITIES OF LARGE DIMENSIONS and some repercussions Anatoly Zhigljavsky Cardiff University, MATHS Cardiff, November 7, 2018

  2. Plan I. Large dimensions II. Applications to global optimization III. Other repercussions IV. Conclusions

  3. Chapter I. Large dimensions where we learn that our intuition usually deceives us

  4. Chapter I. Large dimensions where we learn that our intuition often deceives us

  5. Dimension R d Small dimension: d = 1 , 2 , 3 Medium dimension: d = 10 , 20 (MANY) Large dimension: d = 100 (REALLY MANY)

  6. Volume of the d -dimensional unit ball B (0 , 1) = { x ∈ R d : � x � ≤ 1 } π d / 2 V d = vol ( B (0 , 1)) = Γ( d / 2 + 1)

  7. Volume of the d -dimensional unit ball log 10 V d as a function of d : F.e., V 100 ≃ 2 . 368 · 10 − 40

  8. d -dimensional ball Almost all the volume is near the equator: Th. For any c > 0 , the fraction of the volume of the unit ball above the √ d − 1 is less than 2 c exp {− c 2 / 2 } . plane x 1 = c /

  9. d -dimensional ball Almost all the volume is also there (in B (0 , 1) \ B (0 , 1 − ǫ ) with ǫ = c / d ): Indeed, vol( B (0 , 1 − ǫ ))/vol( B (0 , 1)) = (1 − ǫ ) d ≃ 0 for ǫ = c / d , large d and c fixed but large enough. Radius of a uniform random point has density p d ( r ) = dr d − 1 , 0 ≤ r ≤ 1 .

  10. Random points in a 100-d ball; projection to 2 dimensions B (0 , 1) = { x ∈ R d : x 2 1 + x 2 2 + . . . + x 2 d ≤ 1 }

  11. Random points in a 100-d ball; projection to 2 dimensions B (0 , 1) = { x ∈ R d : x 2 1 + x 2 2 + . . . + x 2 d ≤ 1 }

  12. d -dimensional cube and ball Unit cube: { x = ( x 1 , . . . , x d ) ∈ R d : | x i | ≤ 1 / 2 } Unit ball: B (0 , 1) = { x ∈ R d : � x � ≤ 1 } Length of the cube’s half-diagonal: √ �� 1 � 2 � 2 � 2 � 1 � 1 d + + . . . + = 2 2 2 2

  13. d -dimensional cube

  14. Shape of the d -dimensional cube [ − 1 2 , 1 [ − 1 2 , 1 [ − 1 2 , 1 2 ] 2 2 ] 3 2 ] 8

  15. Volume of the largest ball inscribed into the unit cube π d / 2 Volume of the cube =1, v d = d / 2) (volume of the ball of radius 1 / 2) 2 d Γ(1 + v 2 = π v 3 = π 4 ≃ 0 . 78 , 6 ≃ 0 . 52 v 10 ≃ 0 . 0025 , v 20 ≃ 0 . 25 · 10 − 7 , v 100 ≃ 10 − 70

  16. Volume of the largest ball inscribed into the unit cube π d / 2 Volume of the cube =1, v d = d / 2) (volume of the ball of radius 1 / 2) 2 d Γ(1 + v 2 = π v 3 = π 4 ≃ 0 . 78 , 6 ≃ 0 . 52 , v 10 ≃ 0 . 0025 v 20 ≃ 0 . 25 · 10 − 7 , v 100 ≃ 10 − 70

  17. Volume of the largest ball inscribed into the unit cube π d / 2 Volume of the cube =1, v d = d / 2) (volume of the ball of radius 1 / 2) 2 d Γ(1 + v 2 = π v 3 = π 4 ≃ 0 . 78 , 6 ≃ 0 . 52 , v 10 ≃ 0 . 0025 , v 20 ≃ 0 . 25 · 10 − 7 , v 100 ≃ 10 − 70

  18. small ball in-between large ones, d = 2

  19. small ball in-between large ones, d = 3

  20. ‘small’ ball in-between ‘large’ ones, d ≥ 3 Cube [ − 1 , 1] d ; centers of ‘large’ balls of radius 1 2 are ( ± 1 2 , . . . , 1 2 ). √ Therefore the radius of the ‘small’ ball is r d = 1 2 ( d − 1) . F.e., r 1 = 0, r 2 ≃ 0 . 207, r 3 ≃ 0 . 366, r 4 = 1 2 , r 9 = 1, r 100 = 4 . 5

  21. ‘small’ ball in-between ‘large’ ones, d ≥ 3 Cube [ − 1 , 1] d ; centers of ‘large’ balls of radius 1 2 are ( ± 1 2 , . . . , ± 1 2 ). √ Therefore the radius of the ‘small’ ball is r d = 1 2 ( d − 1) . F.e., r 1 = 0, r 2 ≃ 0 . 207, r 3 ≃ 0 . 366, r 4 = 1 2 , r 9 = 1, r 100 = 4 . 5

  22. ‘small’ ball in-between ‘large’ ones, d ≥ 3 Cube [ − 1 , 1] d ; centers of ‘large’ balls of radius 1 2 are ( ± 1 2 , . . . , ± 1 2 ). √ Therefore the radius of the ‘small’ ball is r d = 1 2 ( d − 1) . F.e., r 1 = 0, r 2 ≃ 0 . 207, r 3 ≃ 0 . 366, r 4 = 1 2 , r 9 = 1, r 100 = 4 . 5

  23. ‘small’ ball in-between ‘large’ ones, d ≥ 3 Cube [ − 1 , 1] d ; centers of ‘large’ balls of radius 1 2 are ( ± 1 2 , . . . , ± 1 2 ). √ Therefore the radius of the ‘small’ ball is r d = 1 2 ( d − 1) . F.e., r 1 = 0, r 2 ≃ 0 . 207, r 3 ≃ 0 . 366, r 4 = 1 2 , r 9 = 1, r 100 = 4 . 5

  24. ‘small’ ball in-between ‘large’ ones, d ≥ 3 Cube [ − 1 , 1] d ; centers of ‘large’ balls of radius 1 2 are ( ± 1 2 , . . . , ± 1 2 ). √ Therefore the radius of the ‘small’ ball is r d = 1 2 ( d − 1) . For d > 1205, the volume of the ‘small’ ball is larger than 2 d !

  25. Covering of the space (Conway & Sloan) Θ d (thickness) = average number of balls that contain a random point. Some values of this thickness are: Θ 2 ≃ 1 . 2092, Θ 3 ≃ 1 . 4635, Θ 10 ≃ 5 . 2517, Θ 20 ≃ 31 . 14.

  26. Packing (Conway & Sloan) ∆ d (density) = proportion of the space occupied by the balls. Some values of this density are: ∆ 2 ≃ 0 . 906, ∆ 3 ≃ 0 . 74, ∆ 10 ≃ 0 . 099, ∆ 20 ≃ 0 . 0032

  27. Covering and packing, d = 100 Θ d (thickness of covering) = average number of balls that contain a random point. ∆ d (packing density) = proportion of the space occupied by the balls. Θ 2 ≃ 1 . 2092, Θ 3 ≃ 1 . 4635, Θ 10 ≃ 5 . 2517, Θ 20 ≃ 31 . 14, Θ 100 ≃ ? ∆ 2 ≃ 0 . 906, ∆ 3 ≃ 0 . 74, ∆ 10 ≃ 0 . 099, ∆ 20 ≃ 0 . 0032, ∆ 100 ≃ ?

  28. Packing and covering, d = 100 Θ d (thickness of covering) = average number of balls that contain a random point. ∆ d (packing density) = proportion of the space occupied by the balls. Θ 2 ≃ 1 . 2092, Θ 3 ≃ 1 . 4635, Θ 10 ≃ 5 . 2517, Θ 20 ≃ 31 . 14 ∆ 2 ≃ 0 . 906, ∆ 3 ≃ 0 . 74, ∆ 10 ≃ 0 . 099, ∆ 20 ≃ 0 . 0032, ∆ 100 ≃ ? Θ 100 ≃ 4 . 28 · 10 7 (an average point is covered more than 40 million times!)

  29. Packing and covering, d = 100 Θ d (thickness of covering) = average number of balls that contain a random point. ∆ d (packing density) = proportion of the space occupied by the balls. Θ 2 ≃ 1 . 2092, Θ 3 ≃ 1 . 4635, Θ 10 ≃ 5 . 2517, Θ 20 ≃ 31 . 14 ∆ 2 ≃ 0 . 906, ∆ 3 ≃ 0 . 74, ∆ 10 ≃ 0 . 099, ∆ 20 ≃ 0 . 0032 Θ 100 ≃ 4 . 28 · 10 7 (an average point is covered more than 40 million times!) ∆ 100 < 10 − 26 (less than 0.000000000000000000000001% of the space is occupied by the balls!)

  30. Uniform random points on a square

  31. Uniform points in a cube are at almost the same distance from each other The distribution of the distances � d � � � ( x i − y i ) 2 � x − y � = � i =1 � is concentrated around its expected value which is approximately d / 6. Similar results hold for the unit ball and for the distributions different from the uniform.

  32. Gaussian distribution (density function)

  33. Gaussian random vectors If x is Gaussian N ( 0 , I d ) then the distance from the origin � d � � � x 2 r = � i i =1 √ is very close to d . √ More precisely, for any 0 < β < d , √ √ d + β } ≥ 1 − 3 β 2 / 64 Pr { d − β ≤ r ≤ Two i.i.d. Gaussian vectors are almost orthogonal to each other. Similar for uniform r.v. in a ball and in a cube.

  34. Random projections Johnson-Lindenstrauss Lemma. For any 0 < ε < 1 and any integer n , let k ≥ c ε 2 log n for some c > 0. For any set of n points in R d , the random projection f : R d → R k has the property that for all pairs of points v i and v j , with probability at least 1– 3 2 n , (1 − ε ) � v i − v j � ≤ � f ( v i ) − f ( v j ) � ≤ (1 − ε ) � v i − v j �

  35. Chapter II. Applications to global optimization where we do not see many reasons for optimism

  36. Chapter II. Applications to global optimization where we do not see many reasons for optimism

  37. Global optimization f ( x ) → min x ∈ A ; x ∗ = arg min x ∈ A f ( x )

  38. Random points in a ball; projection to 2 dimensions

  39. How far are the points from the boundary? d ∈ [5 , 200] Figure: The difference y 1 , n − f ∗ for n = 10 6 (solid) and n = 10 10 (dashed), where y 1 , n is the record of evaluations of the function f ( x ) = e T 1 x at points x 1 , . . . , x n with uniform distribution in the unit ball in the dimension d as d varies in [5 , 200].

  40. Are quasi-random points any better? y 1 , n y 4 , n Figure: Boxplots of y 1 , n and y 4 , n for 500 runs with points generated from the Sobol low-dispersion sequence (left) and the uniform distribution (right), d = 20.

  41. Rate of convergence of the simple random search The number of points n γ required to hit a ball or radius ε centered at the minimizer, with probability ≥ 1 − γ , for different dimensions d : d γ = 0 . 1 γ = 0 . 05 ε = 0 . 5 ε = 0 . 2 ε = 0 . 1 ε = 0 . 5 ε = 0 . 2 ε = 0 . 1 1 0 5 11 0 6 14 2 2 18 73 2 23 94 3 4 68 549 5 88 714 5 13 1366 43743 17 1788 56911 8.8 · 10 6 9.0 · 10 9 1.1 · 10 7 1.2 · 10 10 10 924 1202 9.4 · 10 7 8.5 · 10 15 8.9 · 10 21 1.2 · 10 8 1.1 · 10 16 1.2 · 10 22 20 1.5 · 10 28 1.2 · 10 48 1.3 · 10 63 1.9 · 10 28 1.5 · 10 48 1.7 · 10 63 50 1.2 · 10 70 7.7 · 10 109 9.7 · 10 139 1.6 · 10 70 1.0 · 10 110 1.3 · 10 140 100 n γ is roughly ε − d / V d (multiplied by − ln γ ); recall V 100 ≃ 10 − 40 .

Recommend


More recommend