SLIDE 1

Database Services at CERN for the Physics Community

Luca Canali, CERN

Orcan Conference, Stockholm, May 2010

slide-2
SLIDE 2

Outline

  • Overview of CERN and computing for LHC
  • Database services at CERN
  • DB service architecture
  • DB service operations and monitoring
  • Service evolution

SLIDE 3

What is CERN?

CERN is:

  • ~ 2500 staff scientists (physicists, engineers, …)
  • Some 6500 visiting scientists (half of the world's particle physicists), coming from 500 universities representing 80 nationalities
  • The world's largest particle physics centre

Particle physics is about:

  • elementary particles and fundamental forces

Particle physics requires special tools to create and study new particles:

  • ACCELERATORS, huge machines able to speed up particles to very high energies before colliding them into other particles
  • DETECTORS, massive instruments which register the particles produced when the accelerated particles collide

SLIDE 4

LHC: a Very Large Scientific Instrument

[Aerial map: the LHC ring, 27 km long and 100 m underground, near downtown Geneva, with the ATLAS, ALICE and CMS (+TOTEM) experiment sites; Mont Blanc (4810 m) in the background.]

SLIDE 5

… Based on Advanced Technology

27 km of superconducting magnets cooled in superfluid helium at 1.9 K

SLIDE 6

The ATLAS experiment

7000 tons, 150 million sensors generating data 40 million times per second, i.e. a petabyte/s

SLIDE 7

7 TeV Physics with LHC in 2010

SLIDE 8

The LHC Computing Grid

SLIDE 9

A collision at LHC

SLIDE 10

The Data Acquisition

SLIDE 11

Tier 0 at CERN: Acquisition, First-pass Processing, Storage & Distribution

[Diagram: data flow of 1.25 GB/sec (ions) into Tier 0.]

SLIDE 12

The LHC Computing Challenge

  • Signal/Noise: 10^-9
  • Data volume
  • High rate * large number of channels * 4 experiments
  • 15 PetaBytes of new data each year
  • Compute power
  • Event complexity * Nb. events * thousands of users
  • 100 k of (today's) fastest CPUs
  • 45 PB of disk storage
  • Worldwide analysis & funding
  • Computing funding locally in major regions & countries
  • Efficient analysis everywhere
  • GRID technology
  • Bulk of data stored in files, a fraction of it in databases (~30TB/year)

SLIDE 13

LHC data

LHC data correspond to about 20 million CDs each year!

Where will the experiments store all of these data?

[Scale comparison: balloon (30 km), CD stack with 1 year of LHC data (~20 km), Concorde (15 km), Mt. Blanc (4.8 km).]

SLIDE 14

Tier 0 – Tier 1 – Tier 2

Tier-0 (CERN):

  • Data recording
  • Initial data reconstruction
  • Data distribution

Tier-1 (11 centres):

  • Permanent storage
  • Re-processing
  • Analysis

Tier-2 (~130 centres):

  • Simulation
  • End-user analysis

SLIDE 15

Databases and LHC

  • Relational DBs play today a key role in the LHC production chains
  • online acquisition, offline production, data (re)processing, data distribution, analysis
  • SCADA, conditions, geometry, alignment, calibration, file bookkeeping, file transfers, etc.
  • Grid Infrastructure and Operation services
  • Monitoring, Dashboards, User-role management, …
  • Data Management Services
  • File catalogues, file transfers and storage management, …
  • Metadata and transaction processing for the custom tape storage system of physics data
  • Accelerator logging and monitoring systems

SLIDE 16

DB Services and Architecture

SLIDE 17

CERN Databases in Numbers

  • CERN database services – global numbers
  • Global user community of several thousand users
  • ~ 100 Oracle RAC database clusters (2 – 6 nodes)
  • Currently over 3300 disk spindles providing more than 1PB raw disk space (NAS and SAN)
  • Some notable DBs at CERN
  • Experiment databases – 13 production databases
  • Currently between 1 and 9 TB in size
  • Expected growth between 1 and 19 TB / year
  • LHC accelerator logging database (ACCLOG) – ~30 TB
  • Expected growth up to 30 TB / year
  • … Several more DBs in the 1-2 TB range

SLIDE 18

Service Key Requirements

  • Data Availability, Scalability, Performance and Manageability
  • Oracle RAC on Linux: building-block architecture for CERN and Tier1 sites
  • Data Distribution
  • Oracle Streams: for sharing information between databases at CERN and 10 Tier1 sites
  • Data Protection
  • Oracle RMAN on TSM for backups
  • Oracle Data Guard: for additional protection against failures (data corruption, disaster recoveries, …)

SLIDE 19

Hardware architecture

  • Servers
  • “Commodity” hardware (Intel Harpertown and Nehalem based mid-range servers) running 64-bit Linux
  • Rack mounted boxes and blade servers
  • Storage
  • Different storage types used:
  • NAS (Network-attached Storage) – 1Gb Ethernet
  • SAN (Storage Area Network) – 4Gb FC
  • Different disk drive types:
  • high capacity SATA (up to 2TB)
  • high performance SATA
  • high performance FC

SLIDE 20

High Availability

  • Resiliency from HW failures
  • Using commodity HW
  • Redundancies with software
  • Intra-node redundancy
  • Redundant IP network paths (Linux bonding, see the sketch after this list)
  • Redundant Fibre Channel paths to storage
  • OS configuration with Linux’s device mapper
  • Cluster redundancy: Oracle RAC + ASM
  • Monitoring: custom monitoring and alarms to on-call DBAs
  • Service Continuity: Physical Standby (Data Guard)
  • Recovery operations: on-disk backup and tape backup
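For illustration only, a minimal sketch of an active-backup bonding setup with RHEL-style network scripts of that era; device names and the IP address are placeholders:

# /etc/sysconfig/network-scripts/ifcfg-bond0 -- the bonded interface
DEVICE=bond0
IPADDR=192.0.2.10
NETMASK=255.255.255.0
BONDING_OPTS="mode=active-backup miimon=100"
ONBOOT=yes

# /etc/sysconfig/network-scripts/ifcfg-eth0 -- enslaved NIC (same for eth1)
DEVICE=eth0
MASTER=bond0
SLAVE=yes
ONBOOT=yes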

SLIDE 21

DB clusters with RAC

  • Applications are consolidated on large clusters per customer (e.g. experiment)
  • Load balancing and growth: leverages Oracle services (see the sketch below)
  • HA: cluster survives node failures
  • Maintenance: allows scheduled rolling interventions

[Diagram: a RAC cluster hosting several database services (Shared_1, Shared_2, TAGS, Integration, COOL, Prodsys); each node runs a listener, a DB instance and an ASM instance on top of Clusterware.]
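As an illustration of how a service is mapped onto preferred and available instances, a hypothetical definition with srvctl (database, instance and service names are made up):

srvctl add service -d SHARED1 -s COOL -r shared11,shared12 -a shared13
srvctl start service -d SHARED1 -s COOL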

SLIDE 22

Oracle’s ASM

  • ASM (Automatic Storage Management)
  • Cost: Oracle’s cluster file system and volume manager for Oracle databases
  • HA: online storage reorganization/addition
  • Performance: stripe and mirror everything
  • Commodity HW: Physics DBs at CERN use ASM normal redundancy (similar to RAID 1+0 across multiple disks and storage arrays)

[Diagram: DATA and RECOVERY disk groups striped and mirrored across storage arrays 1-4.]

SLIDE 23

Storage deployment

  • Two diskgroups created for each cluster
  • DATA – data files and online redo logs – outer part of the disks
  • RECO – flash recovery area destination – archived redo logs and on-disk backups – inner part of the disks
  • One failgroup per storage array (see the sketch below)

[Diagram: DATA_DG1 and RECO_DG1 disk groups, each with failgroups 1-4, one per storage array.]
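A minimal sketch of declaring such a disk group in ASM, assuming four storage arrays exposed under hypothetical device paths:

CREATE DISKGROUP DATA_DG1 NORMAL REDUNDANCY
  FAILGROUP failgroup1 DISK '/dev/mpath/rstor1_*'
  FAILGROUP failgroup2 DISK '/dev/mpath/rstor2_*'
  FAILGROUP failgroup3 DISK '/dev/mpath/rstor3_*'
  FAILGROUP failgroup4 DISK '/dev/mpath/rstor4_*';

With normal redundancy, ASM mirrors each extent across two different failgroups, so the loss of a whole array leaves a valid copy on another one.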

SLIDE 24

Physics DB HW, a typical setup

  • Dual-CPU quad-core DELL 2950 servers, 16GB memory, Intel 5400-series “Harpertown”, 2.33GHz clock
  • Dual power supplies, mirrored local disks, 4 NICs (2 private / 2 public), dual HBAs, “RAID 1+0 like” with ASM

SLIDE 25

ASM scalability test results

  • Big Oracle 10g RAC cluster built with 14 mid-range servers
  • 26 storage arrays connected to all servers and a big ASM diskgroup created (>150TB of raw storage)
  • Data-warehouse-like workload (parallelized query on all test servers, see the sketch below)
  • Measured sequential I/O
  • Read: 6 GB/s
  • Read-Write: 3+3 GB/s
  • Measured 8 KB random I/O
  • Read: 40 000 IOPS
  • Result – “commodity” hardware can scale on Oracle RAC
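For illustration, the test workload had this general shape, a full scan parallelized across all cluster nodes (table name and degree of parallelism are made up):

SELECT /*+ FULL(t) PARALLEL(t, 112) */ COUNT(*)
FROM test_big_table t;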

SLIDE 26

Tape backups

  • Main ‘safety net’ against failures
  • Despite the associated cost they have many advantages:
  • Tapes can be easily taken offsite
  • Backups, once properly stored on tapes, are quite reliable
  • If configured properly they can be very fast

[Diagram: RMAN and the media manager (MM) client on the database server send backup metadata and payload to the media manager server, which drives the tape drives of the library.]

SLIDE 27

Oracle backups

  • Oracle RMAN (Recovery Manager)
  • Integrated backup and recovery solution
  • Backups to tape (over LAN)
  • The fundamental way of protecting databases against failures
  • Downside – takes days to backup/restore multi-TB databases
  • Backups to disk (RMAN)
  • Daily updates of the copy using incremental backups (see the sketch below)
  • On-disk copy kept at least one day behind – can be used to address logical corruptions
  • Very fast recovery when primary storage is corrupted – switch to image copy or recover from copy
  • Note: this is a ‘cheap’ alternative/complement to a standby DB
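A minimal sketch of the incrementally updated on-disk image copy (the tag is illustrative); the UNTIL TIME clause keeps the copy about one day behind the live database:

run {
  backup incremental level 1 for recover of copy with tag 'ondisk_copy' database;
  recover copy of database with tag 'ondisk_copy' until time 'sysdate-1';
}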

SLIDE 28

Tape B&R strategy

  • Incremental backup strategy example:
  • Full backups every two weeks

backup force tag 'full_backup_tag' incremental level 0 check logical database plus archivelog;

  • Incremental cumulative every 3 days

backup force tag 'incr_backup_tag' incremental level 1 cumulative for recover of tag 'last_full_backup_tag' database plus archivelog;

  • Daily incremental differential backups

backup force tag 'incr_backup_tag' incremental level 1 for recover of tag 'last_full_backup_tag' database plus archivelog;

  • Hourly archivelog backups

backup tag 'archivelog_backup_tag' archivelog all;

  • Monthly automatic test restore

SLIDE 29

Backup & Recovery

  • On-tape backups: fundamental for protecting data, but recoveries run at ~100MB/s (~30 hours to restore the datafiles of a 10TB DB)
  • Very painful for an experiment in data-taking
  • Put in place on-disk image copies of the DBs: able to recover to any point in time of the last 48 hours of activity
  • Recovery time independent of DB size

SLIDE 30

CERN implementation of MAA

[Diagram: users and applications connect over the WAN/Intranet to the primary RAC database; RMAN backups and a physical standby RAC database complete the setup.]

SLIDE 31

Service Continuity

  • Data Guard
  • Based on proven physical standby technology
  • Protects from corruption of critical production DBs (disaster recovery)
  • Standby DB apply delayed 24h for protection from logical corruption (see the sketch after this list)
  • Other uses of standby DBs
  • Standby DBs can be temporarily activated for testing
  • Oracle Flashback allows simple re-instantiation of the standby after a test
  • Standby DB copies used to minimize time for major changes
  • Standby allows creating and keeping up-to-date a mirror copy of production
  • HW migrations – physical standby provides a fall-back solution after migration
  • Release upgrades – physical standby broken after intervention
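A minimal sketch of a 24-hour apply delay (DELAY is in minutes; the service name is a placeholder):

-- on the primary: delay redo apply for this standby destination
ALTER SYSTEM SET log_archive_dest_2 = 'SERVICE=standby_db DELAY=1440';

-- on the standby: managed recovery without NODELAY honours the delay
ALTER DATABASE RECOVER MANAGED STANDBY DATABASE DISCONNECT FROM SESSION;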

SLIDE 32

Software Technologies – replication

  • Oracle Streams – data replication technology
  • CERN -> CERN replication
  • Provides production systems’ isolation
  • CERN -> Tier1s replication
  • Enables data processing in the Worldwide LHC Computing Grid

SLIDE 33

Downstream Capture

  • Downstream capture to de-couple Tier 0 production databases from destination or network problems
  • source database availability is the highest priority
  • Optimizing redo log retention on the downstream database to allow for a sufficient re-synchronisation window – we use 5 days retention to avoid tape access
  • Dump a fresh copy of the dictionary to redo periodically (see the sketch after this list)
  • 10.2 Streams recommendations (Metalink note 418755)
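A minimal sketch of the periodic dictionary dump, run on the source database:

-- writes a fresh copy of the data dictionary into the redo stream, returning
-- the SCN from which a (re)created downstream capture process can start
DECLARE
  scn NUMBER;
BEGIN
  DBMS_CAPTURE_ADM.BUILD(first_scn => scn);
  DBMS_OUTPUT.PUT_LINE('dictionary build at SCN ' || scn);
END;
/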

[Diagram: redo transport ships redo logs from the source database to the downstream database, where the capture process runs; changes are propagated to and applied on the target database.]

SLIDE 34

Monitoring and Operations

SLIDE 35

Application Deployment Policy

  • Policies for hardware, DB versions, applications testing
  • Application release cycle:

Development service -> Validation service -> Production service

  • Database software release cycle:

Production service version n -> Validation service version n+1 -> Production service version n+1

SLIDE 36

Patching and Upgrades

  • Databases are used by a world-wide community: arranging for scheduled interventions (s/w and h/w upgrades) requires quite some effort
  • Services need to be operational 24x7
  • Minimize service downtime with rolling upgrades and use of stand-by databases
  • 0.04% service unavailability = 3.5 hours/year
  • 0.12% server unavailability = 9.5 hours/year (patch deployment, hardware)

SLIDE 37

DB Services Monitoring

  • Grid Control extensively used for performance tuning
  • By DBAs and application ‘power users’
  • Custom applications
  • Measure of service availability
  • Integrated with email and SMS to the on-call DBA
  • Streams monitoring
  • Backup job scheduling and monitoring
  • ASM and storage failures monitoring
  • Other ad-hoc alarms created and activated when needed
  • For example when a repeated bug hits production and several parameters need to be checked as a work-around
  • Weekly report on the performance and capacity used in production DBs sent to ‘application owners’

SLIDE 38

Oracle EM and Performance Troubleshooting

  • Our experience: it simplifies tasks and leads to a correct methodology for most tuning tasks

SLIDE 39

3D Streams Monitor

SLIDE 40

AWR repository for capacity planning

  • We keep a repository from AWR of the metrics of interest (IOPS, CPU, etc.), as sketched below
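A minimal sketch of the kind of query that feeds such a repository; the metric names exist in AWR, the selection is illustrative:

SELECT snap_id, metric_name, average, maxval
FROM dba_hist_sysmetric_summary
WHERE metric_name IN ('Physical Reads Per Sec', 'Host CPU Utilization (%)')
ORDER BY snap_id, metric_name;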

SLIDE 41

Storage monitoring

  • ASM instance level monitoring
  • Storage level monitoring

[Example alerts: “new failing disk on RSTOR614”, “new disk installed on RSTOR903 slot 2”.]

SLIDE 42

Security

  • Schemas set up with ‘least required privileges’
  • account owner only used for application upgrades
  • reader and writer accounts used by applications
  • password verification function to enforce strong passwords (see the sketch after this list)
  • Firewall to filter DB connectivity
  • CERN firewall and local iptables firewall
  • Oracle CPU patches, more recently PSUs
  • Production up-to-date after a validation period
  • Policy agreed with users
  • Custom development
  • Audit-based log analysis and alarms
  • Automatic password cracker to check password weakness
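A minimal sketch of attaching a verification function to a profile; verify_function here stands for a site-specific PL/SQL function (Oracle ships a sample in utlpwdmg.sql):

-- enforce the strong-password check for all users on the DEFAULT profile
ALTER PROFILE DEFAULT LIMIT PASSWORD_VERIFY_FUNCTION verify_function;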

SLIDE 43

DBAs and Data Service Management

  • CERN DBAs
  • Activities and responsibilities cover a broad range of the technology stack
  • Comes naturally with Oracle RAC and ASM on Linux
  • In particular leveraging the lower complexity of commodity HW
  • The most important part of the job is still the interaction with the customers
  • Know your data and applications!
  • Advantage: DBAs can have a full view of the DB service, from application to servers

SLIDE 44

Evolution of the Services and Lessons Learned

SLIDE 45

Upgrade to 11gR2

  • Next ‘big change’ to our services
  • Currently waiting for the first patchset to open the development and validation cycle
  • Production upgrades to be scheduled with customers
  • Many new features of high interest
  • Some already present in 11gR1
  • Active Data Guard
  • Streams performance improvements
  • ASM manageability improvements for normal redundancy
  • Advanced compression

SLIDE 46

Active Dataguard

  • Oracle standby databases can be used for read-only operations (see the sketch below)
  • Opens many new architectural options
  • We plan to use Active Data Guard instead of Streams for online-to-offline replication
  • Offload production DBs for read-only operations
  • Comment: Active Data Guard and RAC have a considerable overlap when planning a HA configuration
  • We are looking forward to putting this in production
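A minimal sketch of enabling real-time query on an 11g physical standby (run on the standby):

-- stop redo apply, open read-only, then resume apply
ALTER DATABASE RECOVER MANAGED STANDBY DATABASE CANCEL;
ALTER DATABASE OPEN READ ONLY;
ALTER DATABASE RECOVER MANAGED STANDBY DATABASE USING CURRENT LOGFILE DISCONNECT;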

SLIDE 47

ASM Improvements in 11gR2

  • Rebalancing tests showed big performance improvements (a factor four gain)
  • Excessive re-partnering of 10g fixed in 11gR2
  • Integration of CRS and ASM
  • Simplifies administration
  • Introduction of Exadata, which uses ASM in normal redundancy
  • Development benefits ‘standard configs’ too
  • ACFS (cluster file system based on ASM)
  • Performance: faster than ext3

SLIDE 48

Streams 11gR2

  • Several key improvements:
  • Throughput and replication performance have improved considerably
  • 10x improvements in our production-like tests
  • Automatic split and merge procedure
  • Compare and Converge procedures

SLIDE 49

Architecture and HW

  • Server cost/performance keeps improving
  • Multicore CPUs and large amounts of RAM
  • CPU-RAM throughput and scalability also improving
  • Ex: 64 cores and 64 GB of RAM are in the commodity HW price range
  • Storage and interconnect technologies less straightforward in the ‘commodity HW’ world
  • Topics of interest for us
  • SSDs
  • SAN vs. NAS
  • 10gbps Ethernet, 8gbps FC

SLIDE 50

Backup challenges

  • Backup/recovery over LAN becoming a problem with databases exceeding tens of TB
  • Days required to complete backup or recovery
  • Some storage managers support so-called LAN-free backup
  • Backup data flows to tape drives directly over SAN
  • Media management server used only to register backups
  • Very good performance observed during tests (FC saturation, e.g. 400MB/s)
  • Alternative – using 10Gb Ethernet

[Diagram: LAN-free backup – backup data flows from the database server to the tape drives over FC, while only metadata travels over 1GbE to the media manager server.]

SLIDE 51

Data Life Cycle Management

  • Several Physics applications generate very large data sets and have the need to archive data
  • Performance-based: online data more frequently accessed
  • Capacity-based: old data can be read-only, rarely accessed, and in some cases can be put online ‘on demand’
  • Technologies:
  • Oracle Partitioning: mainly range partitioning by time (see the sketch after this list)
  • Application-centric: tables split and metadata maintained by the application
  • Oracle compression
  • Archive DB initiative: offline old partitions/chunks of data in a separate ‘archive DB’
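A minimal sketch of time-based range partitioning; the table and column names are made up:

CREATE TABLE detector_conditions (
  valid_from  DATE NOT NULL,
  channel_id  NUMBER,
  value       NUMBER
)
PARTITION BY RANGE (valid_from) (
  PARTITION p2009 VALUES LESS THAN (TO_DATE('2010-01-01','YYYY-MM-DD')),
  PARTITION p2010 VALUES LESS THAN (TO_DATE('2011-01-01','YYYY-MM-DD'))
);

-- old, read-only partitions can later be compressed or moved to an 'archive DB'
ALTER TABLE detector_conditions MOVE PARTITION p2009 COMPRESS;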

SLIDE 52

Conclusions

  • We have set up a world-wide distributed database infrastructure for the LHC Computing Grid
  • The enormous challenges of providing robust, flexible and scalable DB services to the LHC experiments have been met using a combination of Oracle technology and operating procedures
  • Notable Oracle technologies: RAC, ASM, Streams, Data Guard
  • Relevant monitoring and procedures developed in-house
  • Going forward:
  • Challenge of fast-growing DBs
  • Upgrade to 11.2
  • Leveraging new HW technologies

SLIDE 53

Acknowledgments

  • CERN-IT DB group and in particular:
  • Jacek Wojcieszuk, Dawid Wojcik, Eva Dafonte Perez, Maria Girone

More info:

  • http://cern.ch/it-dep/db
  • http://cern.ch/canali