Uniform sampling of a feasible set of model parameters Wenyu Li - - PowerPoint PPT Presentation

uniform sampling of a feasible set of model parameters
SMART_READER_LITE
LIVE PREVIEW

Uniform sampling of a feasible set of model parameters Wenyu Li - - PowerPoint PPT Presentation

Uniform sampling of a feasible set of model parameters Wenyu Li Arun Hegde Jim Oreluk Andrew Packard Michael Frenklach SIAM UQ18 APRIL 17, 2018 Acknowledgements This work is supported as a part of the CCMSC at the University of Utah,


slide-1
SLIDE 1

APRIL 17, 2018 SIAM UQ18

Uniform sampling of a feasible set of model parameters

Wenyu Li Arun Hegde Jim Oreluk Andrew Packard Michael Frenklach

slide-2
SLIDE 2

APRIL 17, 2018 SIAM UQ18

Acknowledgements

This work is supported as a part of the CCMSC at the University

  • f Utah, funded through PSAAP by the National Nuclear Security

Administration, under Award Number DE-NA0002375.

slide-3
SLIDE 3

APRIL 17, 2018 SIAM UQ18

Bound-to-Bound Data Collaboration (B2BDC)

Prior Uncertainty

Feasible set Model:

Data 1 Data 2 Data 3 Data n Data n Data n

slide-4
SLIDE 4

APRIL 17, 2018 SIAM UQ18

Uniform sampling

Goal: uniform sampling of feasible set

  • Sampling is useful in providing information about
  • B2BDC makes NO distribution assumptions, but as far as taking

samples, uniform distribution of is reasonable

  • Applying Bayesian analysis with specific prior assumptions also

leads to uniform distribution of as posterior[1]

slide-5
SLIDE 5

APRIL 17, 2018 SIAM UQ18

Procedure: Pros & Cons

  • find a bounding box
  • available from B2BDC
  • generate uniformly distributed samples in

the box as candidates

  • reject the points outside of feasible set
  • provably uniform in the feasible set
  • candidates can be drawn very efficiently
  • efficiency drops quickly with increased dimension

Bounding box

Rejection sampling method with a box

Procedure: Pros & Cons

  • find a bounding box
  • available from B2BDC
  • generate uniformly distributed samples in

the box as candidates

  • reject the points outside of feasible set

Circumscribed box Feasible set

  • provably uniform in the feasible set
  • candidates can be drawn very efficiently
  • efficiency drops quickly with increased dimension
slide-6
SLIDE 6

APRIL 17, 2018 SIAM UQ18

Procedure:

  • start from a feasible point
  • available from B2BDC
  • select a random direction, calculate extreme

points and choose the next point uniformly

  • repeat the process

Pros & Cons

  • NOT limited by problem dimensions
  • NOT necessarily uniform in the feasible set

Random walk[2] (RW)

Procedure:

Feasible set

Moving direction Extreme point Extreme point Starting point New moving direction Next point

  • start from a feasible point
  • available from B2BDC
  • select a random direction, calculate extreme

points and choose the next point uniformly

  • repeat the process

Pros & Cons

  • NOT limited by problem dimensions
  • NOT necessarily uniform in the feasible set
slide-7
SLIDE 7

APRIL 17, 2018 SIAM UQ18

Procedure:

  • find a convex bounding polytope
  • available from B2BDC
  • generate candidate points by random walk
  • reject the points outside of feasible set

Pros & Cons

  • provably uniform in the feasible set
  • increased efficiency with more polytope facets
  • limited by computational resource

Rejection sampling method with a polytope

Procedure:

  • find a convex bounding polytope
  • available from B2BDC
  • generate candidate points by random walk
  • reject the points outside of feasible set

Pros & Cons

  • provably uniform in the feasible set
  • increased efficiency with more polytope facets

Circumscribed polytope Feasible set Bounding polytope

  • limited by computational resource
slide-8
SLIDE 8

APRIL 17, 2018 SIAM UQ18

Effect of polytope complexity

  • Polytopes with

different complexity are tested

  • 5 million

candidates are generated to calculate the efficiency and CPU time

slide-9
SLIDE 9

APRIL 17, 2018 SIAM UQ18

Effect of polytope complexity

  • Sampling

efficiency increases with more complex polytope

  • The

improvement is more significant at higher dimensions

slide-10
SLIDE 10

APRIL 17, 2018 SIAM UQ18

Truncation strategy

Motivations

  • difference between a bounding and circumscribed polytope
  • existence of low-density tails along most of the directions

Procedure

  • start with a bounding polytope and shrink the polytope bounds
  • recommended to stop when a practical efficiency is obtained
slide-11
SLIDE 11

APRIL 17, 2018 SIAM UQ18

Effect of truncation

  • A polytope is defined as:
  • The s are calculated

from B2BDC and represents a bounding polytope

  • s vary gradually to

generate smaller polytopes

[3]

slide-12
SLIDE 12

APRIL 17, 2018 SIAM UQ18

Effect of truncation

  • Region of no truncation,

improved sampling efficiency.

  • Region of increased

truncation, improved efficiency but acceptable approximation

  • Region of unacceptable

approximation

[3]

slide-13
SLIDE 13

APRIL 17, 2018 SIAM UQ18

Check of directional histograms

  • This is observed along all the directions defining the polytope
  • The distribution has zero-density regions
  • The distribution has low-density tail regions

Due to the conservative estimation in Low-density tails

slide-14
SLIDE 14

APRIL 17, 2018 SIAM UQ18

Procedure:

  • collect RW samples from the feasible set
  • conduct PCA on RW samples
  • find a subspace based on PCA result
  • generate uniform samples in the subspace

Pros & Cons

  • improves sampling efficiency significantly
  • works only if feasible set approximates a

lower-dimensional manifold/subspace

Principal component analysis (PCA)

Procedure:

Feasible set Lower-dimensional subspace

  • collect RW samples from the feasible set
  • conduct PCA on RW samples
  • find a subspace based on PCA result
  • generate uniform samples in the subspace

Pros & Cons

  • improves sampling efficiency significantly
  • works only if feasible set approximates a

lower-dimensional manifold/subspace

slide-15
SLIDE 15

APRIL 17, 2018 SIAM UQ18

Principal direction index

Effect of dimension reduction

  • efficiency is affected mostly by problem dimension the (2.96e-5 in full dimension)
  • returned samples approximate the desired distribution with acceptable accuracy
  • nly when the smallest principal direction is truncated
slide-16
SLIDE 16

APRIL 17, 2018 SIAM UQ18

Summary

  • We developed methods to generate uniformly distributed

samples of a feasible set

  • Truncation strategy and PCA further improves the sampling

efficiency of the method

  • Numerical results support an advantageous efficiency-

accuracy trade-off of the proposed approximation techniques

slide-17
SLIDE 17

APRIL 17, 2018 SIAM UQ18

Reference

[1] Frenklach, Michael, et al. "Comparison of statistical and deterministic frameworks of uncertainty quantification." SIAM/ASA Journal on Uncertainty Quantification 4.1 (2016): 875-901. [2] Smith, Robert L. "Efficient Monte Carlo procedures for generating points uniformly distributed over bounded regions." Operations Research 32.6 (1984): 1296-1308. [3] Gretton, Arthur, et al. "A kernel two-sample test." Journal of Machine Learning Research 13.Mar (2012): 723-773.