microscopic advances with large scale learning stochastic
play

Microscopic Advances with Large-Scale Learning: Stochastic - PowerPoint PPT Presentation

Microscopic Advances with Large-Scale Learning: Stochastic Optimization for Cryo-EM Ali Punjani, Marcus Brubaker University of Toronto Department of Computer Science Structure Determination } Macromolecules } Protein structure determines


  1. Microscopic Advances with Large-Scale Learning: Stochastic Optimization for Cryo-EM Ali Punjani, Marcus Brubaker University of Toronto Department of Computer Science

  2. Structure Determination } Macromolecules } Protein structure determines function } Traditional approaches: } X-ray Crystallography } NMR Spectroscopy

  3. Electron Cryo-Microscopy (Cryo-EM) Computational Task: Low dose electron beam Recover 3D Electron Density Particles in unknown 3D pose Ice Transfer Function Film/CCD Corrupted Noisy Integral Projections } No crystals needed, large molecules and complexes

  4. Cryo-EM Image Formation Low dose electron beam 2D Particle Images Corruption by CTF Particles in unknown 3D pose = Ice Transfer Function Film/CCD Corrupted Noisy Integral Projections } Challenges for reconstruction: } Destructive CTF } Low SNR } Unknown pose

  5. Cryo-EM Image Formation R t θ I V K p ( I| θ , R , t , V ) = N ( I| S t C θ P R V , σ 2 I )

  6. Cryo-EM Image Formation R t θ I V K Voxels p ( I| θ , R , t , V ) = N ( I| S t C θ P R V , σ 2 I ) Integral Projection Linear

  7. Cryo-EM Image Formation R t θ I V K Voxels p ( I| θ , R , t , V ) = N ( I| S t C θ P R V , σ 2 I ) Integral Projection Linear In Fourier Domain: Fourier Coefficients p (˜ I| θ , R , t , ˜ V ) = N (˜ I| ˜ S t ˜ C θ ˜ P R ˜ V , σ 2 I ) Slicing Diagonal

  8. Marginalization for Latent Variables R t θ I V K Z Z p (˜ I| θ , ˜ p (˜ I| θ , R , t , ˜ V ) = V ) p ( R ) p ( t ) d R d t R 2 SO (3)

  9. Marginalization for Latent Variables R t θ I V K Z Z p (˜ I| θ , ˜ p (˜ I| θ , R , t , ˜ V ) = V ) p ( R ) p ( t ) d R d t R 2 SO (3) M w j p (˜ I| θ , R j , t j , ˜ X V ) ≈ j =1 } Numerical Quadrature

  10. Maximum-a-Posteriori Estimation R t θ I V K K p (˜ I i | θ i , ˜ Y p ( V| D ) ∝ p ( V ) V ) i =1

  11. Optimization Problem R t θ I V K K p (˜ I i | θ i , ˜ Y p ( V| D ) ∝ p ( V ) V ) i =1 K ⇣ V ) + K − 1 log p ( V ) ⌘ log p (˜ I| θ , ˜ X arg min V − i =1

  12. Stochastic Optimization for Cryo-EM K V ) + K − 1 log p ( V ) ⇣ ⌘ X log p (˜ I| θ , ˜ arg min V − i =1 } Expensive to compute objective with large K } Stochastic Optimization: } Approximate objective with subset of images } Update based on approximate gradient } Various Algorithms (vary by update rule) } Advantages: speed, random initialization

  13. Experiments: Datasets } Real Dataset: } 46K Images of ATP Synthase from Thermus Thermophilius } Low SNR and known CTF parameters

  14. Experiments: Datasets } Synthetic Dataset: } 50,000 Projections of known artificial density } Low SNR and realistic CTF parameters

  15. Experiments: Seven Methods } Vanilla Stochastic Gradient Descent (SGD) } Momentum Methods: } Classical Momentum } Nesterov’s Accelerated Gradient } Adaptive Methods: } AdaGrad } TONGA } Quasi-Second Order Methods: } Online L-BFGS } Hessian Free

  16. Experiments: Results } Identical random initialization in all experiments

  17. Experiments: Results } Simplest Method

  18. Experiments: Results } Momentum Method

  19. Experiments: Results } Adaptive Step-size

  20. Experiments: Results } Quasi-second order

  21. Experiments: Results } Qualitatively Similar } Reasonable in one pass through data

  22. Experiments: Results

  23. Experiments: Results

  24. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 3 Hours – 1 Epochs

  25. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 24 Hours – 5 Epochs 24 Hours – 5 Epochs 3 Hours – 1 Epochs

  26. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 24 Hours – 5 Epochs 24 Hours – 5 Epochs 3 Hours – 1 Epochs

  27. Experiments: Comparison Projection Matching RELION (E-M) Proposed Approach 24 Hours – 5 Epochs 24 Hours – 5 Epochs 3 Hours – 1 Epochs } Random Initialization is difficult for other methods

  28. Conclusions } Introduced Cryo-EM Structure Determination } Stochastic Optimization solution } Simple methods are best } State of the art speed and robustness

  29. Recent Progress } Higher resolution reconstructions } Importance Sampling: 100,000x speedup

  30. Recent Progress } Higher resolution reconstructions } Importance Sampling: 100,000x speedup } Forward: } Heterogeneous mixtures of particles } Better priors } Video exposure

Recommend


More recommend