Structured Autoencoders for Operator-theoretic decomposition and - PowerPoint PPT Presentation

Structured Autoencoders for Operator-theoretic decomposition and Model reduction Karthik Duraisamy

Thanks to…

Motivation Decomposition & Reduced Order Modeling of Complex Multiscale Problems [K] [W/m3] Large scale simulations O(10 6 )- O(10 8 ) CPU hours / run Complex physics : Flow, turbulence, combustion, heat transfer, etc

The Autoencoder Structure: encoder (compression) + decoder (decompression) – Encoder 𝚾 𝐲; 𝛊 𝚾 – Decoder 𝛀 . ; 𝛊 𝛀 – POD: 𝚾 → 𝐕 ) · , 𝛀 → 𝐕 · – Trained as one single network, 𝛊 𝚾 and 𝛊 𝛀 are optimized jointly – Automatically separated into encoder and decoder by cutting at the “bottleneck” 𝚾 𝛀 x = Ψ ( · , θ Ψ ) � Φ ( x, θ Φ ) ˜ x “Applied Deep Learning” 4 https://towardsdatascience.com/

Embedding (in the right coordinates)

Part 1 Operator-theoretic Learning & Decomposition

Koopman operator and linear embedding

Connections of Koopman to other operators Liouville operator L := f · r x ∂ u Liouville PDE ∂ t = L u ; u ( · , 0) = h K t h = h � φ t = e t L h Generator Liouville generates Koopman Perron-Frobenius operator ρ ( · , t ) = P t � ρ Duality h h, P t ρ i = h K t h, ρ i Perron-Frobenius is adjoint of Koopman

Spectral expansion of Koopman operators

Koopman operators & “Deep” Learning Several works since 2018

Extracting a Koopman-invariant subspace Goal: Extracting the Koopman operator defined on Observation functionals We are also interested in retrieving the state x Arbabi & Mezic 2019

Enforcing structure for Learning : “Physics information”

Enforcing structure for Learning : Tractable optimization

“Data-free”, “Physics-informed” Trajectory data, ”Unknown physics”

Enforcing structure for Learning : “DMD ResNet”

Enforcing structure for Learning : Stability

Naïve “Autoencoders” x f(x)

Putting it all together (deterministic form) x f(x) Pan. S. & Duraisamy, K., Physics-Informed Probabilistic Learning of Linear Embeddings of Non-linear Dynamics With Guaranteed Stability , SIAM J. of Applied Dynamical Systems, 2020.

Bayesian Neural Networks & Variational Inference

Variational Inference

Verification on Model dynamical systems Duffing oscillator: Eigenfunctions (with uncertainty) Prediction and sensitivity to data 100 data points. 1000 data points 10000 data points

Flow over cylinder: Prediction with uncertainties • Gaussian white noise added

Flow over cylinder: Velocity magnitude Prediction with (mean) uncertainties Velocity magnitude (standard deviation) Physics-Informed Probabilistic Learning of Linear Embeddings of Non-linear Dynamics With Guaranteed Stability, Pan, S., and Duraisamy, K., SIADS, 2020

Multi-task learning framework to extract sparse Proposed framework Koopman-invariant subspaces Steps: I a-priori cross validation to choose an appropriate hyperparameter I mode-by-mode error analysis I choose a trade-o ff between reconstruction error and linear evolving error I sparse reconstruction of system with multi-task learning Sparsity-promoting algorithms for the discovery of informative Koopman invariant subspaces , Pan, S., N. A-M and Duraisamy, K., arXiv:2002.10637 4 / 9

Turbulent Ship Airwake I transient behavior is accurately reconstructed I stable modes are successfully extracted from strongly nonlinear transient data I left mode: due to side edge of superstructure. right mode: due to funnel Sparsity-promoting algorithms for the discovery of informative Koopman invariant subspaces , Pan, S., N. A-M and Duraisamy, K., arXiv:2002.10637 6 / 9

Summary Many opportunities to enforce structure in Autoencoders è flexible and powerful tools 26

Part 2 Learning Reduced Order Models of Parametric Spatio-temporal dynamics

Non-intrusive data-driven ROMs q n +1 = f ( q n +1 l , .... q n − l , q n , B ( u n +1 ) , µ ) l l l Some recent works: B. Kramer, K. E. Willcox, AIAA Journal ,2019 • M. Guo, J. S. Hesthaven, CMAME, 2018. • A. Mohan, D. Daniel, M. Chertkov, D. Livescu, arXiv, 2019 • S. Lee, D. You, arXiv, 2019. • Q. Wang, J. S. Hesthaven, D. Ray, JCP, 2019. •

Basic Component: Convolutional Layer Convolutional layers preserve complex • spatio-temporal “information” Convolutional operation on a local window w • 26 28 𝑦 ∗ 𝑥 ./ = ∑ ∑ 𝑦 .23,/25 𝑥 3,5 – 378 576 Ideal for “localized” feature identification • Rotation and translation invariant, if • properly constructed : “Applied Deep Learning” https://towardsdatascience.com/ http://cs231n.github.io/convolutional-networks/ 29

Temporal Convolutional Performs dilated 1D convolutional operation in temporal/sequential direction • :2; 𝑦 ∗ 9 𝑥 . = ∑ 𝑦 .293 𝑥 3 – 37< Exponential increase in reception field è an increasingly popular alternative to • RNN/LSTM Example: k = 2 , d = 2 i Output/Conv-4: d =8 reception field: 16 Conv-3: d =4 reception field: 8 Conv-2: d =2 reception field: 4 Conv-1: d =1 reception field: 2 Input sequence Source of image: github.com/philipperemy/keras-tcn

Training Multi-level convolutional AE networks

Example CAE architecture 32

Prediction using Multilevel AE networks

Example TCAE architecture

Prediction using Multilevel AE networks

Time stepping

Example TCN architecture Example TCN architecture 0 1 0 1 0 1 0 0 0 0 1 1 1 1 1 0 0 1 0 0 1 0 0 0 0 1 1 1 0 0 0 0 0 1 1 0 0 0 0 0 0 1 𝑜 > input 𝑜 ? channels steps * 𝑜 > channels 𝑜 ? steps * Input Jumping Jumping Jumping Dense Input Non-strided Non-strided 1D convolution .,? .A; .,? .A; 𝐫 > 𝐫 > dilated 1D dilated 1D dilated 1D dilated 1D dilated 1D 𝐑 > 𝐑 > convolution convolution convolution convolution convolution *: Convolution direction *: Convolution direction Many-to-one Many-to-many 37

Numerical Tests: Discontinuous compressible flow CAE reconstruction CAE + TCN Component (final step) Training Testing Pressure 0.04% 0.1% 0.12% Density 0.01% 0.04% 0.14% 38 Velocity 0.04% 0.08% 0.13%

Discontinuous compressible flow : Impact of data 39

Numerical Tests : 3D Ship Airwake – Incompressible Navier-Stokes – 576k DOF, 400 time snapshots – Global parameter: sliding angle 𝛽 – Training: 𝛽 = 5 ° :5 ° :20 ° – Prediction: 𝛽 = 12.5 40 𝛽 = 5 ° 𝛽 = 20 °

Numerical Test: 3D Ship Airwake Truth Latent variable Prediction vs Truth Prediction CAE reconstruction Component MLP + TCAE + CAE Training Testing U 0.12% 0.30% 0.51% V 0.09% 0.38% 0.89% W 0.08% 0.29% 0.62% Relative absolute error 41 Manuscript “Multi-level Convolutional Autoencoder Networks for Parametric Prediction of Spatio-temporal Dynamics,” Submitted CMAME

Summary Fully Data-driven framework • • Multi-level neural network architecture • Convolutions in space & time Non-linear manifolds • Fast training, faster prediction • • Up to 6 orders of reduction in DoF • Total training time: 3.6 hours on one NVIDIA Tesla P100 GPU for 3D ship air wake • Prediction time: Seconds for a new parameter or hundreds of future steps Caveats • Require large amounts of data • No indicator for choice of latent dimensions è use singular values to find an upper bound Manuscript “Multi-level Convolutional Autoencoder Networks for Parametric Prediction of Spatio-temporal Dynamics” to be submitted to ArXiv in a week 42

Acknowledgments DARPA Physics of AI program (Technical Monitor: Dr. Ted Senator ) • Air Force Center of Excellence grant (Program Managers: Dr. Mitat Birkan and Dr. • Fariba Fahroo) Office of Naval Research (Program manager: Dr. Brian Holm-Hansen) • Computational infrastructure : NSF-MRI (Program manager: Dr. Stefan Robila) • 43

Numerical Tests 𝛽 = 5 ° 44

Structured Autoencoders for Operator-theoretic decomposition and - PowerPoint PPT Presentation

Structured Autoencoders for Operator-theoretic decomposition and Model reduction Karthik Duraisamy Thanks to Motivation Decomposition & Reduced Order Modeling of Complex Multiscale Problems [K] [W/m3] Large scale simulations O(10 6 )-

A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE

A linear operator-theoretic approach to nonlinear systems Alexandre Mauroy University of Namur

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

CSC321 Lecture 20: Autoencoders Roger Grosse Roger Grosse CSC321 Lecture 20: Autoencoders 1 /

Lecture 25: Autoencoders Kernel PCA Aykut Erdem January 2017 Hacettepe University Today

Shuffle algebra perspective on operator valued probability theory 30 mars 2020 1/25 Operator

Scaling Log-Structured KV-Stores featuring Monkey and Dostoevsky SIGMOD17 / SIGMOD18 Niv Dayan

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Adversarially Regularized Autoencoders Junbo (Jake) Zhao, Yoon Kim, Kelly Zhang, Alexander M.

Understanding Geometric Attributes with Autoencoders Alasdair Newson Tlcom ParisTech

Educating Text Autoencoders: Latent Representation Guidance via Denoising Tianxiao Shen Jonas

Variational Autoencoders Tom Fletcher March 25, 2019 Talking about this paper: Diederik Kingma

Hierarchical Importance Weighted Autoencoders Chin-Wei Huang Kris Sankaran Eeshan Dhekane

CSC421/2516 Lecture 17: Variational Autoencoders Roger Grosse and Jimmy Ba Roger Grosse and

Variational Laplace Autoencoders Yookoon Park, Chris Dongjoo Kim and Gunhee Kim Vision and

Loss Landscapes of Regularized Linear Autoencoders Daniel Kunin Jonathan M. Bloom Aleksandrina

Extreme Data-Intensive Scientific Computing Alex Szalay JHU Big Data in Science Data

What if trees had ? ratings in kN? John Morton Snohomish County SAR Everett Mountain Rescue

Second Quarter Report 2004 I am pleased to present BMO Financial Groups Second Quarter 2004

Cryptography I Exercises Luca Vigan` o Institut f ur Informatik

Image Stitching Ali Farhadi CSE 576 Several slides from Rick Szeliski, Steve Seitz, Derek Hoiem,

& Stereo Tues Oct 20 Last time: How to stitch a panorama? Basic Procedure Take a

S cience Case 5: Planning a Deep NIRCam S urvey Building the APT file Martha Boyer S TS cI,

Topics in Philosophy of Science Week 2: Laws of Nature Niels Martens Slides, reading list &

Sambuz

Useful Links

Newsletter

Mail Us

Structured Autoencoders for Operator-theoretic decomposition and - PowerPoint PPT Presentation

Structured Autoencoders for Operator-theoretic decomposition and Model reduction Karthik Duraisamy Thanks to Motivation Decomposition & Reduced Order Modeling of Complex Multiscale Problems [K] [W/m3] Large scale simulations O(10 6 )-

A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE A STRUCTURED L IFE

A linear operator-theoretic approach to nonlinear systems Alexandre Mauroy University of Namur

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

CSC321 Lecture 20: Autoencoders Roger Grosse Roger Grosse CSC321 Lecture 20: Autoencoders 1 /

Lecture 25: Autoencoders Kernel PCA Aykut Erdem January 2017 Hacettepe University Today

Shuffle algebra perspective on operator valued probability theory 30 mars 2020 1/25 Operator

Scaling Log-Structured KV-Stores featuring Monkey and Dostoevsky SIGMOD17 / SIGMOD18 Niv Dayan

Machine Learning Fall 2017 Structured Prediction (structured perceptron, HMM, structured SVM)

Adversarially Regularized Autoencoders Junbo (Jake) Zhao, Yoon Kim, Kelly Zhang, Alexander M.

Understanding Geometric Attributes with Autoencoders Alasdair Newson Tlcom ParisTech

Educating Text Autoencoders: Latent Representation Guidance via Denoising Tianxiao Shen Jonas

Variational Autoencoders Tom Fletcher March 25, 2019 Talking about this paper: Diederik Kingma

Hierarchical Importance Weighted Autoencoders Chin-Wei Huang Kris Sankaran Eeshan Dhekane

CSC421/2516 Lecture 17: Variational Autoencoders Roger Grosse and Jimmy Ba Roger Grosse and

Variational Laplace Autoencoders Yookoon Park, Chris Dongjoo Kim and Gunhee Kim Vision and

Loss Landscapes of Regularized Linear Autoencoders Daniel Kunin Jonathan M. Bloom Aleksandrina

Extreme Data-Intensive Scientific Computing Alex Szalay JHU Big Data in Science Data

What if trees had ? ratings in kN? John Morton Snohomish County SAR Everett Mountain Rescue

Second Quarter Report 2004 I am pleased to present BMO Financial Groups Second Quarter 2004

Cryptography I Exercises Luca Vigan` o Institut f ur Informatik

Image Stitching Ali Farhadi CSE 576 Several slides from Rick Szeliski, Steve Seitz, Derek Hoiem,

&amp; Stereo Tues Oct 20 Last time: How to stitch a panorama? Basic Procedure Take a

S cience Case 5: Planning a Deep NIRCam S urvey Building the APT file Martha Boyer S TS cI,

Topics in Philosophy of Science Week 2: Laws of Nature Niels Martens Slides, reading list &amp;

Sambuz

Useful Links

Newsletter

Mail Us

& Stereo Tues Oct 20 Last time: How to stitch a panorama? Basic Procedure Take a

Topics in Philosophy of Science Week 2: Laws of Nature Niels Martens Slides, reading list &