320454 big data project a
play

320454 Big Data Project A Instructor: Peter Baumann email: - PowerPoint PPT Presentation

320473 Databases & Web Applications Lab 320454 Big Data Project A Instructor: Peter Baumann email: p.baumann@jacobs-university.de tel: -3178 office: room 88, Research 1 320302 Databases & Web Applications (P. Baumann) Big Science


  1. 320473 Databases & Web Applications Lab 320454 Big Data Project A Instructor: Peter Baumann email: p.baumann@jacobs-university.de tel: -3178 office: room 88, Research 1 320302 Databases & Web Applications (P. Baumann)

  2. Big Science Data [OGC Ocean Science Interoperability Experiment; image source: Mbari] Databases & Web Applications Lab | Big Data Project (P. Baumann) 2

  3. OGC Coverage Types «FeatureType»  Coverage = digital representation of Abstract Coverage space/time varying phenomenon Grid Coverage • n-D MultiSolid Coverage Rectified GridCoverage MultiSurface Referenceable Coverage MultiCurve GridCoverage Coverage MultiPoint Coverage Databases & Web Applications Lab | Big Data Project (P. Baumann) 3

  4. Facing the Coverage Deluge sensor feeds [OGC SWE] coverage server Databases & Web Applications Lab | Big Data Project (P. Baumann) 4 4

  5. Taming the Coverage Deluge sensor feeds [OGC SWE] coverage server Databases & Web Applications Lab | Big Data Project (P. Baumann) 5 5

  6. Let’s Take a Closer Look... t  Divergent access patterns for ingest and retrieval  Server must mediate between access patterns Databases & Web Applications Lab | Big Data Project (P. Baumann) 6

  7. Our Research  Large-Scale Scientific Information Services (L-SIS) Research Group  flexible, scalable services on massive multi-dimensional scientific data • Particular focus: n-D arrays • Massive = multi- TB … multi -PB per object  Results: • rasdaman array DBMS (www.rasdaman.org), demo at www.earthlook.org • Geoservice standards: OGC WCS suite, http://external.opengeospatial.org/twiki_public/CoveragesDWG/WebHome • ISO 9075 SQL Part 15: MDA (under work) Databases & Web Applications Lab | Big Data Project (P. Baumann) 7

  8. rasdaman: Scalable Array Analytics  „ raster data man ager“: Array Database = SQL + n-D arrays • select img.green[x0:x1,y0:y1] > 130 from LandsatArchive as img  “tile streaming” architecture: scaling from laptop to cloud rasdaman Web visitors www.rasdaman.org Databases & Web Applications Lab | Big Data Project (P. Baumann) 8

  9. Use Case: Satellite ImageTime Series [Diedrich et al 2001] Databases & Web Applications Lab | Big Data Project (P. Baumann) 9

  10. Ea Eart rthServe Server  Big Earth Data Analytics  Up to 130 TB databases for all Earth sciences + planetary science • EU FP7-INFRA, 3 years, 5.85 mEUR  Platform: rasdaman; strictly open standards Cryospheric Airborne Atmospheric Geology Oceanography Planetary Science Science Science marine model runs + Science geological models in-situ data landcover mapping high-altitude drones climate variables Mars geology Databases & Web Applications Lab | Big Data Project (P. Baumann) 10

  11. Database Visualization select encode( struct { red: (char) s.image.b7[x0:x1,x0:x1], green: (char) s.image.b5[x0:x1,x0:x1], blue: (char) s.image.b0[x0:x1,x0:x1], alpha: (char) scale( d.elev, 20 ) }, "image/png" ) from SatImage as s, DEM as d [JacobsU, Fraunhofer 2012; [data courtesy BGS, ESA] [JacobsU, Fraunhofer; data courtesy BGS, ESA] Databases & Web Applications Lab | Big Data Project (P. Baumann) 11

  12. Parallel / Distributed Query Processing  ad-hoc federation  mixed hardware Dataset D select max((A.nir - A.red) / (A.nir + A.red)) - max((B.nir - B.red) / (B.nir + B.red)) Dataset C - max((C.nir - C.red) / (C.nir + C.red)) - max((D.nir - D.red) / (D.nir + D.red)) from A, B, C, D Dataset A Dataset B Databases & Web Applications Lab | Big Data Project (P. Baumann) 12

  13. Secured Archive Integration First-ever direct, ad-hoc mix from protected NASA & ESA services in OGC WCS/WCPS Web client (EarthServer + CobWeb) Databases & Web Applications Lab | Big Data Project (P. Baumann) 13

  14. Demo Databases & Web Applications Lab | Big Data Project (P. Baumann) 14

  15. Next: On-Board Query Intelligence [OPS-SAT: ESA CubeSat] Democratize direct data access [imagery courtesy ESA, NASA] Databases & Web Applications Lab | Big Data Project (P. Baumann) 15

  16. Summary  Project work • embedded in international projects & collaborations  Present  Publish Databases & Web Applications Lab | Big Data Project (P. Baumann) 16

  17. Big Picture  320302 Databases and Web Applications • Fall lecture, undergrad + grad • Advanced course in spring: Information Architectures  320473 Databases and Web Applications Lab • Lab, grad  320454 Big Data Project A • Project, grad New meeting slot: Tue 09:45, Research 1, room 88 Databases & Web Applications Lab | Big Data Project (P. Baumann) 17

  18. Project Task  Pick a topic • http://www.faculty.jacobs-university.de/pbaumann/iu- bremen.de_pbaumann//Courses/ResearchTopics/  Perform task – planful:  Spec document 20% -- Sep 26 Oct 03  Prototype 1: breakthrough implementation 20% -- Oct 17  Prototype 2: ready for benchmark 20% -- Oct 31  Benchmark results 20% -- Nov 14  Publication 10% -- Nov 28  Prototype 3: ready for handover 10% -- Dec 05 Databases & Web Applications Lab | Big Data Project (P. Baumann) 18

  19. Resources  rasdaman website • www.rasdaman.org  demo • www.earthlook.org  Our publications • http://www.faculty.jacobs-university.de/pbaumann/iu-bremen.de_pbaumann//pubs.php  Instructor: • p.baumann@...  ...and the rasdaman team Databases & Web Applications Lab | Big Data Project (P. Baumann) 19

  20. Main Evaluation Criteria  complete wrt. requirements  Solid engineering • bug-free, project & code documentation, coding quality, ...  user-friendliness and appealing look&feel  complexity (in absolute terms and in comparison to other teams' work)  Good writeup • Specification, documentation, paper  (no particular order) Databases & Web Applications Lab | Big Data Project (P. Baumann) 20

  21. Project/Lab Topics  http://www.faculty.jacobs-university.de/pbaumann/iu- bremen.de_pbaumann/teaching.php  -> course list -> list of topics Databases & Web Applications Lab | Big Data Project (P. Baumann) 21

Recommend


More recommend