how to estimate a density on a spider web
play

How to estimate a density on a spider web ? Dominique Picard How to - PowerPoint PPT Presentation

How to estimate a density on a spider web ? How to estimate a density on a spider web ? Dominique Picard How to estimate a density on a spider web ? 2020 1 / 37 Spider web from exhibition On Air Tomas Saraceno spider web spider web Spider


  1. How to estimate a density on a spider web ? How to estimate a density on a spider web ? Dominique Picard How to estimate a density on a spider web ? 2020 1 / 37

  2. Spider web from exhibition On Air Tomas Saraceno

  3. spider web

  4. spider web

  5. Spider

  6. Density estimation problem • one observes X 1 , . . . , X n that are i.i.d. random variables defined on a space M with the common density f . • M is equipped with a distance ρ and a measure µ . • our aim is to identify the density

  7. M = spider web We identify M with a graph (simple, undirected, no loops) with T vertices and edges. Adjacency matrix A ij = 1 if there is an edge between i and j , 0, oth- erwise. M is naturally equipped with a distance (geodesic-distance) L − 1 � ρ ( x, y ) = inf A γ ( i ) γ ( i +1) . pathsγ ⊂ M , γ (0)= x, γ ( L )= y i =0

  8. M graph Laplacian we calculate a Laplacian matrix L T × T as: L = D − A, where D is the (diagonal) degree matrix (popularity of each vertex) and A is the adjacency matrix of the graph. � D ii = A i,j j � = i is the degree of the vertex i .

  9. Spectral decomposition of L L has λ 1 ≤ . . . ≤ λ T for eigenvalues and V 1 , . . . , V T for eigenvectors (normed) : V j = ( V j 1 , . . . , V j T ) . Our estimator is : for x point of the graph : n T T n � � � � � � K δ ( x ) := 1 1 λ j ) V j V j ˆ X i V j λ j ) V j Φ( δ x = Φ( δ x X i n n i =1 j =1 j =1 i =1 δ is the bandwidth, Φ is a ’Littlewood-Paley function’ : Φ be a C ∞ ( R ) real-valued function with the following properties: supp (Φ) ⊂ [0 , 1] and Φ( λ ) = 1 for λ ∈ [0 , 1 / 2] .

  10. Theoretical part : Kernel and wavelet density estimators on manifolds and more general metric spaces Theoretical part : Kernel and wavelet density estimators on manifolds and more general metric spaces Issued from a joint work : G. Cleanthous, A. Georgiadis, G. Kerkyacharian, P. Petrushev, D. P. Theoretical part : Kernel and wavelet density estimators on manifolds and more general me 2020 10 / 37

  11. Setting and motivation We assume that ( M , ρ, µ ) is a metric measure space equipped with a distance ρ and a measure µ . X 1 , . . . , X n are i.i.d. random variables on M with a common density function (pdf) f with respect to the measure µ . Our purpose is to estimate the density f . To an estimator ˆ f n of f , we associate its risk: �� � 1 p R n ( ˆ | ˆ f n ( x ) − f ( x ) | p µ ( dx ) f, f, p ) = E f 1 ≤ p < ∞ M as well as its L ∞ risk: R n ( ˆ f, f, ∞ ) = E f � ˆ f n − f � ∞ . We will operate in the following setting. Most of the material can be found in an extended form in the papers [4, 8] Coulhon, Kerkyacharian, Petrushev.

  12. Doubling condition : setting the dimension C1. We assume that the metric space ( M , ρ, µ ) satisfies the so called doubling volume condition : µ ( B ( x, 2 r )) ≤ c 0 µ ( B ( x, r )) for any x ∈ M and r > 0 , (1) where B ( x, r ) := { y ∈ M : ρ ( x, y ) < r } and c 0 > 1 is a constant. The above implies that there exist constants c ′ 0 ≥ 1 and d > 0 such that µ ( B ( x, λr )) ≤ c ′ 0 λ d µ ( B ( x, r )) for any x ∈ M , r > 0 , and λ > 1 , (2) The least d such that (2) holds is the so called homogeneous dimension of ( M , ρ, µ )

  13. We can additionally assume that ( M , ρ, µ ) is a compact measure space with µ ( M ) < ∞ satisfying the following condition: C1A. Ahlfors regular volume condition: There exist constants c 1 , c 2 > 0 and d > 0 such that c 1 r d ≤ µ ( B ( x, r )) ≤ c 2 r d for any x ∈ M and 0 < r ≤ diam ( M ) . The doubling condition is precisely related to the metric entropy : For ǫ > 0 , we define, as usual, the covering number N ( ǫ, M ) as the smallest number of balls of radius ǫ covering M . Lemma Under the condition C1A and if M is compact, there exist constants c ′ > 0 , c ′′ > 0 and ǫ 0 > 0 such that � 1 � d � 1 � d ≤ N ( ǫ, M ) ≤ 2 d 1 , c ′ c ′′ ǫ ǫ for all 0 < ǫ ≤ ǫ 0 .

  14. spider web

  15. Smooth operator : Setting regularity, construct a kernel One rather standard method in density estimation is the kernel estimation method, i.e. considering a family of functions indexed by δ > 0 : K δ : M × M → R an associated kernel density estimator is defined by n � K δ ( x ) := 1 � K δ ( X i , x ) , x ∈ M . (3) n i =1 In R d , an important family is the family of translation kernels δ ) , where G is a function R d → R . When M is a δ ] d G ( x − y K δ ( x, y ) = [ 1 more involved set such as a spider web, a manifold or a set of graphs, of matrices, the simple operations of translation and dilation may not be meaningful . Hence, even finding a family of kernels to start with might be a difficulty.

  16. When dealing with a kernel estimation method, it is standard to consider two quantities : b δ ( f ) := � E f � � ξ f � p := � � K δ − E f � K δ − f � p , K δ � p . The analysis of the first term b δ ( f ) is linked to the approximation properties of the family E f � K δ . which are also often linked to regularity properties of the function f . It is standardly proved (see e.g. [7]), that in R d if K is a translation family with mild properties on K , then polynomial rates of approximation are obtained for functions with Besov regularity .

  17. Regularity spaces Hence, an important issue becomes finding spaces of regularity associated to a possibly complex set M . On a compact metric space ( M, ρ ) one can always define the scale of s -Lipschitz spaces defined by the following norm | f ( x ) − f ( y ) | � f � Lip s := � f � ∞ + sup , 0 < s ≤ 1 . (4) ρ ( x, y ) s x � = y In Euclidian spaces a function can be much more regular than Lipschitz , for instance differentiable at different orders, or belong to some Sobolev or Besov spaces. When M is a set where there is no obvious notion of differentiability , one can make the observation that in R d or Riemannian manifolds, regularity properties can also be expressed via the associated Laplacian . The Laplacian itself is an operator of order 2, but its square root is of order 1 and can be interpreted as a substitute for derivation.

  18. Smooth operator Our main assumption is that the space ( M , ρ, µ ) is complemented by an essentially self-adjoint non-negative operator L on L 2 ( M , µ ) , mapping real-valued to real-valued functions, such that the associated semigroup P t = e − tL consists of integral operators with the (heat) kernel p t ( x, y ) obeying the conditions mentioned in the sequel. � e − tλ k P k ( x, y ) , p t ( x, y ) := k with P k ( x, y ) = � i v λ k i ( x ) v λ k i ( y ) , where v λ k i ( x ) , i = 1 , . . . , dim( E λ k ) is an orthonormal basis of E λ k , when the spectrum is discrete.

  19. C3. Gaussian localization: There exist constants c 4 , c 5 > 0 such that � � − c 5 ρ 2 ( x,y ) c 4 exp t | p t ( x, y ) | ≤ for x, y ∈ M , t > 0 . (5) � √ √ � 1 / 2 | B ( x, t ) || B ( y, t ) | C4. H¨ older continuity: There exists a constant α > 0 such that � � − c 5 ρ 2 ( x,y ) � ρ ( y, y ′ ) � α exp � � t � p t ( x, y ) − p t ( x, y ′ ) � ≤ c 4 √ (6) � √ √ � 1 / 2 t | B ( x, t ) || B ( y, t ) | √ for x, y, y ′ ∈ M and t > 0 , whenever ρ ( y, y ′ ) ≤ t . C5. Markov property: � p t ( x, y ) dµ ( y ) = 1 for x ∈ M and t > 0 . (7) M

  20. Associated regularity spaces Regularity classes defined through approximation properties: � Σ u = H λ k √ λ k ≤ u B s p = { f ∈ L p , φ ∈ Σ u � f − φ � p ≤ Cu − s } inf (usually denoted B s p, ∞ ). Moreover, one can develop a theory of ”wavelet” which completely characterizes the previous regularity classes.

  21. Typical examples M = R d Here dµ is the Lebesgue measure and ρ is the Euclidean distance on R d . In this case we consider the operator d � ∂ 2 − L ( f )( x ) = ∆ f ( x ) = i f ( x ) = div( ∇ f )( x ) j =1 defined on the space D ( R d ) of C ∞ functions with compact support. As is well known the operator L is positive essentially self-adjoint. The associate semigroup e t ∆ is given by the operator with the Gaussian kernel: � � − | x − y | 2 p t ( x, y ) = (4 πt ) − d 2 exp . 4 t

  22. Periodic case on M = [ − 1 , 1] Here dµ is the Lebesgue measure and ρ is the Euclidean distance on the circle. The operator is L ( f ) = − f ′′ defined on the set on infinitely differentiable periodic functions. It has eigenvalues k 2 π 2 for k ∈ N 0 and eigenspaces � 1 � ker( L − k 2 π 2 ) = H k = span { cos kπx, sin kπx } ker( L ) = H 0 = span √ , 2

  23. Unit sphere M = S d − 1 in R d , d ≥ 3 This is the most famous Riemannian manifold with the induced structure from R d . Here dµ is the Lebesgue measure on S d − 1 , ρ is the geodesic distance on S d − 1 : ρ ( ξ, η ) = arccos( � ξ, η � R d ) , and L := − ∆ 0 with ∆ 0 being the Laplace-Beltrami operator on S d − 1 . The spectral decomposition of the operator L can be described as follows: � L 2 ( S d − 1 , µ ) = E λ k , E λ k = ker ( L − λ k I d ) , λ k = k ( k + d − 2) . Here E λ k is the restriction to S d − 1 of harmonic homogeneous polynomials of degree k (spherical harmonics). We have � d − 1 � � d − 1 � dim( E λ k ) = − . d + k − 1 d + k − 3

Recommend


More recommend