COMPUTATIONAL INFRASTRUCTURE SIMONS ELECTRON MICROSCOPY CENTER - - PowerPoint PPT Presentation


Project: Computational Infrastructure: Handling the Challenges for Cryo-EM Processing. Assignee: Edward T. Eng, Simons Electron Microscopy Center. Date: November 3, 2017. Client: Structural Biologists.


SLIDE 1

PROJECT: COMPUTATIONAL INFRASTRUCTURE

HANDLING THE CHALLENGES FOR CRYO-EM PROCESSING

DATE: NOVEMBER 3, 2017

CLIENT: STRUCTURAL BIOLOGISTS

ASSIGNEE: EDWARD T ENG, SIMONS ELECTRON MICROSCOPY CENTER

SLIDE 2

SIMONS ELECTRON MICROSCOPY CENTER

SLIDE 3

SIMONS ELECTRON MICROSCOPY CENTER

  • Bridget Carragher, Director
  • Clint Potter, Director
  • Bill Rice, EM Manager
  • Ed Eng, Staff Scientist
  • Anchi Cheng, Res. Staff Scientist
  • Zhening Zhang, Res. Scientist
  • Giovanna Scapin, Embedded Scientist
  • Sargis Dallakyan, Res. Programmer
  • Carl Negro, Res. Programmer
  • Laura Kim, Research Associate
  • Venkat Dandey, Post Doc.
  • Kotaro Kelly, Post Doc.
  • Alex Noble, Post Doc.
  • Priyamvada Acharya, Embedded Post Doc.
  • Julia Brasch, Embedded Post Doc.
  • Yong Zi Tan, Grad. Student
  • Micah Rapp, Grad. Student
  • Ashleigh Raczkowski, Senior Technician
  • Alex Wei, Technician
  • Kelsey Jordan, Technician
  • Daija Bobe, Technician
  • Crystal Premo, Administrator

Simons Electron Microscopy Center, NEW YORK STRUCTURAL BIOLOGY CENTER

National Resource for Automated Molecular Microscopy http://nramm.nysbc.org

SLIDE 4

“LIFE IS REALLY SIMPLE, BUT WE INSIST ON MAKING IT COMPLICATED.”

–CONFUCIUS (551-479 BCE)

What is possible?

SLIDE 5

What is possible today?

2.5Å within a day

SLIDE 6

What is the timeline?

Timeline axis: 0h, 1h, 2h, 4h, 8h, 12h

2.5Å within a day

SLIDE 7

Is this routinely done?

  • Aldolase: D2 symmetry, ~150kDa, rabbit muscle
  • Glutamate dehydrogenase: D3 symmetry, 334kDa, cow liver
  • Apoferritin: O symmetry, 443kDa, horse spleen
  • 20S proteasome: D7 symmetry, 750kDa, Thermoplasma or Mycoplasma
  • 60S/80S ribosome: C1 symmetry, ~2-4MDa, human

Workflow validation/testing

SLIDE 8

What type of computing challenge do you have?

  • Infrastructure to do cryo-EM processing for a research project/lab
  • Infrastructure to support a multi-user/instrument EM facility

SLIDE 9

CryoEM Infrastructure for a lab

Infrastructure to do cryo-EM processing for a research project/lab

Computation / Storage / Software

SLIDE 10

CryoEM Infrastructure for a lab

Computation / Storage / Software

Vendors:
  • singleparticle.com
  • exxactcorp.com
  • linuxvixion.com
  • thinkmate.com

SLIDE 11

The challenge of cryo-EM computation

  • Infrastructure to do cryo-EM processing for a research project/lab
  • Infrastructure to support a multi-user/instrument EM facility

SLIDE 12

The challenge of cryo-EM computation

Infrastructure to support a multi-user/instrument EM facility

  • How many instruments?
  • How many users?
  • What do they want to do?
  • What support would you like to provide?

Baldwin, et al. Current Opinion in Microbiology, Vol 43, 2017 (in press)

SLIDE 13

What do your users want to do?

Infrastructure to support a multi-user/instrument EM facility

Breakdown of projects

  • Single particle: 67%
  • FIB-SEM: 17%
  • Tomography: 8%
  • 2dx/helical: 7%
  • Other: 1%

400 registered users, 150 active Krios users
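As a quick sanity check on the figures above, the following sketch confirms that the project shares add up to 100% and works out what fraction of registered users are active Krios users. The variable names are illustrative only; the numbers are the ones quoted on the slide.

```python
# Project shares (percent) and user counts as quoted on the slide.
project_share_percent = {
    "Single particle": 67,
    "FIB-SEM": 17,
    "Tomography": 8,
    "2dx/helical": 7,
    "Other": 1,
}
registered_users = 400
active_krios_users = 150

# The shares should cover all projects.
total_percent = sum(project_share_percent.values())

# Fraction of registered users who are active on the Krios instruments.
active_fraction = active_krios_users / registered_users

# Project type with the largest share.
largest_project_type = max(project_share_percent, key=project_share_percent.get)

print(total_percent, active_fraction, largest_project_type)
```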

SLIDE 14

What software do they request to use?

What is asked

  • Single Particle Analysis: RELION / FREALIGN / cryoSPARC / EMAN2 / etc…
  • Tomography: IMOD / Protomo / Dynamo / PEET / PyTom / etc…
  • Segmentation/Annotation: Amira / IMOD / Dragonfly / etc…

Support required

Experience

  • Beginner/Novice: 83%
  • Intermediate: 14%
  • Expert: 3%

Breakdown of projects

  • Single particle: 67%
  • FIB-SEM: 17%
  • Tomography: 8%
  • 2dx/helical: 7%
  • Other: 1%

SLIDE 15

How many instruments are used?

Data generation at SEMC circa 2017

  • FEI Titan Krios #1 / #2 / #3: Falcon3 x3, K2 x3
  • FEI Tecnai F20: DE20, TVIPS 4K CMOS
  • FEI Tecnai Biotwin: TVIPS 4K CMOS
  • JEOL 1230: Gatan US4000 CCD
  • FEI Helios 650: ETD, TLD, ICE

Cameras

  • 7 direct detectors on 4 TEMs
  • 5 CMOS/CCDs on 3 TEMs + 1 SEM

SLIDE 16

How much data is generated?

[Plot: exposures vs. months; series: Krios & DD cameras, Other scopes & CMOS/CCD]

  • # TEM exposure images in 2015 & 2016: 1,069,315
  • # TEM exposure images in 2017: 2,689,276
  • # TEM exposure images (total): 3,758,591*

*Total number of saved images since 2015: 766,329,392
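The exposure counts above can be cross-checked with a few lines of arithmetic. This is a minimal sketch using only the numbers quoted on the slide; the variable names are illustrative, and the derived ratios are not figures stated in the presentation.

```python
# Exposure counts as quoted on the slide.
exposures_2015_2016 = 1_069_315
exposures_2017 = 2_689_276

# The slide's total should equal the sum of the two periods.
total_exposures = exposures_2015_2016 + exposures_2017

# Share of all exposures that were recorded in 2017 alone.
share_2017 = exposures_2017 / total_exposures

# 2017 output relative to the two preceding years combined.
growth_factor = exposures_2017 / exposures_2015_2016

print(f"total exposures: {total_exposures:,}")
print(f"share recorded in 2017: {share_2017:.1%}")
print(f"2017 vs. 2015-2016 combined: {growth_factor:.2f}x")
```

The sum reproduces the 3,758,591 total, and 2017 alone accounts for roughly two and a half times the exposures of the two previous years together.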

SLIDE 17

The challenge of cryo-EM computation

  • How many instruments? Not enough and getting more
  • How many users? Growing exponentially
  • What do they want to do? Everything
  • What support would you like to provide? As much as possible (with "scalable" inserted as a caret annotation)

SLIDE 18

SEMC solutions

  • The overall mission of NRAMM is to develop, test and apply technology for automating and streamlining cryo-electron microscopy (cryoEM) for structural biology.

Spotiton

SLIDE 19

SEMC solutions

On-the-fly data pipeline c. 2017

  • Camera
  • Buffer server
  • File system / cluster
  • Web portal
  • Leginon workstation
  • User
  • Data transfer station
  • GLOBUS: remote data transfer, cloud computing

SLIDE 20

SEMC solutions

HPC server and storage (DDN):

■ 2 x 42U rack enclosures
■ DDN GRIDScaler GS7K appliance with 1.1PB GPFS parallel file system
■ 420TB DDN WOS object storage for archival
■ 1056 x CPU cores: 44 x SuperMicro nodes, each with 24 x CPU cores and 256GB RAM
■ 4 x GPU nodes, each with one GPU and 128GB RAM; one GPU server with 8 x GPUs and 512GB RAM; 2 x GPU servers, each with 4 x GPUs and 512GB RAM
■ 4 buffer servers, each with 51TB local storage, 2 x GPUs, 128GB RAM and 10G fiber network cards
■ 5 x 36 QSFP port 56Gb/s FDR InfiniBand switches
■ Bright Cluster Manager
■ Basic onsite support; 7x24 remote support
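The hardware list above implies a few aggregate figures that are easy to tally. The sketch below uses only the counts quoted on the slide; the variable names and the derived GPU and storage totals are illustrative and are not stated in the presentation, apart from the 1056-core figure.

```python
# CPU nodes: 44 SuperMicro nodes with 24 cores and 256 GB RAM each.
cpu_nodes = 44
cores_per_node = 24
ram_per_cpu_node_gb = 256
total_cores = cpu_nodes * cores_per_node
total_cpu_node_ram_gb = cpu_nodes * ram_per_cpu_node_gb

# GPU machines as (number of machines, GPUs per machine).
gpu_machines = [
    (4, 1),  # 4 GPU nodes with one GPU each
    (1, 8),  # one GPU server with 8 GPUs
    (2, 4),  # 2 GPU servers with 4 GPUs each
]
cluster_gpus = sum(count * gpus for count, gpus in gpu_machines)

# Buffer servers: 4 machines with 51 TB local storage and 2 GPUs each.
buffer_servers = 4
buffer_storage_tb = buffer_servers * 51
buffer_gpus = buffer_servers * 2

print(f"CPU cores: {total_cores}")
print(f"RAM across CPU nodes: {total_cpu_node_ram_gb} GB")
print(f"GPUs in GPU nodes/servers: {cluster_gpus}")
print(f"Buffer servers: {buffer_storage_tb} TB local storage, {buffer_gpus} GPUs")
```

The core count matches the 1056 quoted on the slide (44 x 24).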

SLIDE 21

SEMC solutions

Central MySQL database and web server

Since 2015:

  • Size of images: 158.06 TB
  • # DB records: 766,329,392
  • Size of database: 7.44 GB
  • 3,064 tilt series
  • 3,758,591 images

3/4 of users who use Leginon also have Appion sessions

SLIDE 22

Example: Single-particle workflow

During EM session

Leginon session
  • Setting up workflow
  • Data acquisition
  • Workflow optimization if needed

Appion session
  • Frame alignment
  • CTF estimation
  • Particle picking
  • Micrograph/particle curating

SEMC computing
  • Initial 2D classification
  • Initial model generation
  • Micrograph/particle sorting
  • 3D refinement

After EM session (home institution/cloud/SEMC)
  • 2D classification
  • 3D classification
  • 3D refinement
  • Model building

SLIDE 23

Example: Single-particle workflow

Software options by step

  • Frame alignment: MotionCor2 / Unblur / alignframes_lmbfgs / DE frame alignment / etc…
  • CTF estimation: CTFFind4 / gCTF / ACE / etc…
  • Particle picking: DOG picker / Gautomatch / FindEM / EMAN2 / etc…
  • 2D classification: RELION / cryoSPARC / EMAN2 / Xmipp / SPIDER / IMAGIC / sparx / etc…
  • Initial model generation: VIPER / SIMPLE / SPARX / cryoSPARC / RELION / Optimod / EMAN2 / etc…
  • 3D classification and 3D refinement: RELION / FREALIGN / cryoSPARC / EMAN2 / Xmipp / IMAGIC / spider / etc…

Other steps shown: Setting up workflow, Data acquisition, Workflow optimization if needed, Model building

Labels shown: During EM session, After EM session, Leginon session, Appion session, Appion, SEMC computing, Home institution/cloud/SEMC
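One way to keep track of the options on this slide is a simple lookup from processing step to candidate packages. The sketch below is illustrative only: the package names are transcribed from the slide, but the pairing of each package list with a step is an assumption based on what these tools are commonly used for, and `WORKFLOW_SOFTWARE` and `steps_supported_by` are hypothetical names, not part of Leginon or Appion.

```python
# Step -> candidate packages, transcribed from the slide.
# ASSUMPTION: which list belongs to which step is inferred, not stated.
WORKFLOW_SOFTWARE = {
    "frame alignment": ["MotionCor2", "Unblur", "alignframes_lmbfgs", "DE frame alignment"],
    "CTF estimation": ["CTFFind4", "gCTF", "ACE"],
    "particle picking": ["DOG picker", "Gautomatch", "FindEM", "EMAN2"],
    "2D classification": ["RELION", "cryoSPARC", "EMAN2", "Xmipp", "SPIDER", "IMAGIC", "sparx"],
    "initial model generation": ["VIPER", "SIMPLE", "SPARX", "cryoSPARC", "RELION", "Optimod", "EMAN2"],
    "3D refinement": ["RELION", "FREALIGN", "cryoSPARC", "EMAN2", "Xmipp", "IMAGIC", "spider"],
}


def steps_supported_by(package):
    """Return the steps (in workflow order) that list the given package.

    The comparison is case-insensitive because the slide itself mixes
    spellings such as "SPIDER"/"spider" and "SPARX"/"sparx".
    """
    wanted = package.lower()
    return [
        step
        for step, packages in WORKFLOW_SOFTWARE.items()
        if wanted in (name.lower() for name in packages)
    ]


for name in ("RELION", "FREALIGN", "EMAN2"):
    print(name, "->", steps_supported_by(name))
```

A table like this makes it easy to see which packages span several steps and which cover only one.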

SLIDE 24

What is the timeline?

Timeline axis: 0h, 2h, 4h, 12h, 24h, 48h, …96h

[Plot: FSC vs. resolution [1/Å]; 3.1Å at the 0.143 threshold (unpublished)]
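The 3.1Å figure is read off the Fourier shell correlation (FSC) curve at the 0.143 threshold. As a minimal sketch of that read-out, the function below finds the first point where a sampled FSC curve drops below the threshold, interpolates linearly between the neighbouring samples, and converts the spatial frequency to a resolution in Å. The function name and the sample curve are made up for illustration; they are not the data behind the slide, and real packages report this value themselves.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.143):
    """Return the resolution (in Å) where the FSC first falls below threshold.

    spatial_freqs: increasing spatial frequencies in 1/Å.
    fsc_values: FSC value at each frequency.
    Returns None if the curve never drops below the threshold.
    """
    points = list(zip(spatial_freqs, fsc_values))
    for (f0, c0), (f1, c1) in zip(points, points[1:]):
        if c0 >= threshold > c1:
            # Linear interpolation between the two samples around the crossing.
            fraction = (c0 - threshold) / (c0 - c1)
            crossing_freq = f0 + fraction * (f1 - f0)
            return 1.0 / crossing_freq
    return None


# Synthetic example curve (hypothetical values, not from the presentation).
freqs = [0.1, 0.2, 0.3, 0.4]
fsc = [1.0, 0.9, 0.243, 0.043]
resolution = resolution_at_threshold(freqs, fsc)
print(f"Resolution at FSC=0.143: {resolution:.2f} Å")
```

In this synthetic curve the crossing falls halfway between 0.3 and 0.4 1/Å, so the reported resolution is the reciprocal of 0.35 1/Å.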

SLIDE 25

What is the timeline?

Timeline axis: 0h, 2h, 4h, 12h, 24h, 48h, …96h

Software: CTFFIND4, DoG picker, MotionCor2, gCTF, Template picker, Chimera, Coot

SLIDE 26

What is the timeline?

Timeline axis: 0h, 2h, 4h, 12h, 24h, 48h, …96h

Software: CTFFIND4, DoG picker, MotionCor2, gCTF, Template picker, Chimera, Coot

Buffer server
  • 2 x GeForce GTX 1080 GPU
  • 9 x 8TB 7.2K SATA drives, 1 x 120GB SSD drive

cryoSPARC workstation
  • 4 x GeForce GTX 1070 GPU
  • 2 x Ten-Core 2.20GHz 25MB Cache
  • 8 x 32GB 2400MHz DDR4
  • 1 x 180GB SATA SSD, 1 x 750GB SATA SSD

RELION workstation
  • 4 x NVIDIA GeForce GTX TITAN X Pascal
  • 2 x Ten-Core 2.20GHz 25MB Cache
  • 8 x 32GB 2400MHz DDR4
  • 1 x 180GB SATA SSD, 1 x 750GB SATA SSD

SLIDE 27

The challenge of cryo-EM computation

  • Infrastructure to do cryo-EM processing for a research project/lab
  • Infrastructure to support a multi-user/instrument EM facility

SLIDE 28

“It does not matter how slowly you go as long as you do not stop.”

–Confucius (551-479 BCE)