optimal transport for structured data with application on
play

Optimal Transport for structured data with application on graphs - PowerPoint PPT Presentation

Optimal Transport for structured data with application on graphs Titouan Vayer Joint work with Laetitia Chapel, Remi Flamary, Romain Tavenard and Nicolas Courty A novel distance between labeled graphs based on optimal transport Contributions:


  1. Optimal Transport for structured data with application on graphs Titouan Vayer Joint work with Laetitia Chapel, Remi Flamary, Romain Tavenard and Nicolas Courty A novel distance between labeled graphs based on optimal transport

  2. Contributions: • Di ff erentiable distance between labeled graphs. Jointly considers the features and the structures

  3. Contributions: • Di ff erentiable distance between labeled graphs. Jointly considers the features and the structures Optimal transport : soft assignment between the nodes Distance = 1.41

  4. Contributions: • Di ff erentiable distance between labeled graphs. Jointly considers the features and the structures Computing average 1 2 ( + ) = of labeled graphs

  5. Structured data as probability distribution

  6. Structured data as probability distribution Features ( a i ) i

  7. Structured data as probability distribution Features ( a i ) i nodes in the metric ( x i ) i space of the graph

  8. Structured data as probability distribution Features ( a i ) i nodes in the metric ( x i ) i space of the graph weighted by their masses ( h i ) i

  9. Optimal transport in a nutshell Compare two probability distributions by transporting one onto another ν Y μ A ν B μ X d ( a i , b j ) Gromov-Wasserstein distance Wasserstein distance

  10. Optimal transport in a nutshell Compare two probability distributions by transporting one onto another ν Y μ A ν B μ X d ( a i , b j ) Gromov-Wasserstein distance Wasserstein distance

  11. Optimal transport in a nutshell Compare two probability distributions by transporting one onto another ν Y μ A ν B μ X d ( a i , b j ) Gromov-Wasserstein distance Wasserstein distance

  12. Fused Gromov-Wasserstein distance ( (1 − α ) d ( a i , b j ) q + α | C 1 ( i , k ) − C 2 ( j , l ) | q ) π i , j π k , l π ∈Π ( μ , ν ) ∑ FGW q , α ( μ , ν ) = min i , j , k , l where is the soft assignment matrix π α is a trade-off features/structures

  13. Fused Gromov-Wasserstein distance Properties Interpolate between Wasserstein distance on features and Gromov-Wasserstein distance on the structures • Distance on labeled graph : vanishes iff graphs have same labels and weights at the same place up to a permutation • Optimization problem Non convex Quadratic Program: hard ! • Conditional Gradient Descent (aka Frank Wolfe) • Suitable for entropic regularization + Sinkhorn iteraterations •

  14. Applications Classification Graph Barycenter + k-means clustering of graphs Noiseless graph Sample 1 Sample 2 Sample 3 Sample 4 Sample 5 Sample 6 Bary n= 15 Bary n= 7

  15. Check out our poster at Pacific Ballroom #133!!

Recommend


More recommend