Computing in the Cloud (CiC): GIS Vector Data Overlay Computation - PowerPoint PPT Presentation

Computing in the Cloud (CiC): GIS Vector Data Overlay Computation on Windows Azure Platform Sushil Prasad Xuan Shi

Research Challenge • How to improve the performance of vector overlay computation over large scale spatial data by utilizing Windows Azure Cloud platform?

Spatial Computation in the Cloud ??? Task(s) accomplished in a single desktop/standalone GIS

Concepts in Windows Azure Cloud Web Role(s) Worker Role(s)

Processing single files Dispatch Monitor Aggregate . . . . . . . . . Reprojection, create index, build pyramid, etc.

Raster data modeling Partition/Dispatch Monitor Aggregate

Vector overlay computation equal, touch, contain, within, intersect, difference, union, etc. Partition/Dispatch Monitor Aggregate How? Oops! Help ?

Partitioning two sets of data • Partitioning binary streams  Where to cut??? • Partitioning based on the order of input features  Within a layer, the order of input is meaningless  Between layers, the random orders generate more chaos

Uniform grid vs. tiled processing • Split [sequential ?] – compute [parallel] – merge [sequential?] • Smaller cells vs. more overhead • Load balance, monitor mechanism, etc.

Partitioning upon spatial index • Spatial data have build-in spatial index [R- tree, Quad-tree, etc.] • No APIs to manipulate data based on spatial index • Building spatial index over two large scale datasets for data partitioning in Web role is time consuming

Partitioning vs. spatial relationship • Data partitioning is determined by the potential relationship, i.e. the bounding box relationship • Overlay computation determines the true spatial relationship • No silver bullet for all kinds of spatial relationships

Data preparation and I/O streaming • Computing nodes in cloud/grid/GPU may not be able to utilize proprietary modules  Shapefile or spatial database: looping through 500,000+ features to partition two datasets into cloud seems another process of spatial overlay computation  GML: before read through the whole file, nobody knows 1) how many features is has; 2) for each feature, what the bounding box is; 3) for each feature, whether it is a multi-polygon; 4) how many holes each exterior ring has; 5) how many vertices each ring has  New data schema designed to enable efficient data partitioning and processing  Stored in Azure tables

The general workflow Web Role Parse XML Sort Add Wait for De- and store Polygons Link Base Serialize messages Output serialize as objects in parallel Layer to and store to work Queue to and write defined in based on Overlay into Azure queue for be to output Layer table the new bounding each job populated file schema boxes Worker Role Wait for Serialize Read from Feed work and store Populate table and Polygon to queue to the output de- GPC be output queue serialize library populated into Table

Processing in the cloud

Aggregation • Aggregation may be simplified in case of intersect, touch, contain, within operations – the Web roles only collects and write out the results without any further processing. • Aggregation can be a challenge in other spatial operations, such as union, which may need a different partitioning solution

Project under development Questions?

Computing in the Cloud (CiC): GIS Vector Data Overlay Computation - PowerPoint PPT Presentation

Computing in the Cloud (CiC): GIS Vector Data Overlay Computation on Windows Azure Platform Sushil Prasad Xuan Shi Research Challenge How to improve the performance of vector overlay computation over large scale spatial

CIC Edmonton RIN Presentation - October 10th 2019 Introductions Program Director, CIC CIC

Cloud Computing & Cloud Models Cloud Models Topics Defining cloud computing

cloud computing Ridwaan Boda Director | Technology, Media and Telecommunications Overview

Cloud Computing and Cloud Storage By: Maurice Kelly History of Internet and Cloud Computing

Chapter 4 Cloud Computing Applications and Paradigms Cloud Computing: Theory and Practice. 1

Storage Deduplication in Cloud Computing Joo Paulo and Jos Pereira University of Minho July

Patterns for Cloud Computing Simon Guest Senior Director, Technical Strategy Microsoft

Introduction to PaaS and IaaS Cloud Computing Roberto Beraldi Models for Cloud Computing

Climate Innovation and Center (CIC) Business Plan: Ethiopia January 2012 Recap: Why a CIC in

Design of CIC of CIC Compensators Compensators With With SPT SPT Design Coefficients Based

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Superferric 3T CIC Dipole R&D 2016/17 Project Report Peter McIntyre Texas A&M University

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Ciberseguridad Fusin de Sensores Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Private Sponsorship of Refugees Program Sponsor Engagement Meetings Resettlement Operations

APIC 43 rd Annual Conference June 11-13 Charlotte, NC Rochelle Neilson, RN, BS N, MS HCA,

OBJECTIVES Provide an Overview of the FIN Program Highlight Success to date Next steps

NAVIGATING FROM EXPRESS ENTRY TO PERMANENT RESIDENCY: CRITICAL RISK AND COMPLIANCE ISSUES

Privacy and Sharing Data for Social Good July 15, 2019 Background Communities grapple with

Our next steps: Coproduce Care CIC @coprocare www.coproducecare.com hi@coproducecare.com Here

Crace, Lightsview and Googong Clive Alcock, Urban Design Manager This presentation. Covers

a n a c t a 001852 ___ Aboriginal Affairs arid I Affaires autochtones et , Dveloppement

Computing in the Cloud (CiC): GIS Vector Data Overlay Computation - PowerPoint PPT Presentation

Computing in the Cloud (CiC): GIS Vector Data Overlay Computation on Windows Azure Platform Sushil Prasad Xuan Shi Research Challenge How to improve the performance of vector overlay computation over large scale spatial

CIC Edmonton RIN Presentation - October 10th 2019 Introductions Program Director, CIC CIC

Cloud Computing &amp; Cloud Models Cloud Models Topics Defining cloud computing

cloud computing Ridwaan Boda Director | Technology, Media and Telecommunications Overview

Cloud Computing and Cloud Storage By: Maurice Kelly History of Internet and Cloud Computing

Chapter 4 Cloud Computing Applications and Paradigms Cloud Computing: Theory and Practice. 1

Storage Deduplication in Cloud Computing Joo Paulo and Jos Pereira University of Minho July

Patterns for Cloud Computing Simon Guest Senior Director, Technical Strategy Microsoft

Introduction to PaaS and IaaS Cloud Computing Roberto Beraldi Models for Cloud Computing

Climate Innovation and Center (CIC) Business Plan: Ethiopia January 2012 Recap: Why a CIC in

Design of CIC of CIC Compensators Compensators With With SPT SPT Design Coefficients Based

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Superferric 3T CIC Dipole R&amp;D 2016/17 Project Report Peter McIntyre Texas A&amp;M University

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Ciberseguridad Cyber security A-15 Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Ciberseguridad Fusin de Sensores Dr. Ponciano Jorge Escamilla Ambrosio pescamilla@cic.ipn.mx

Private Sponsorship of Refugees Program Sponsor Engagement Meetings Resettlement Operations

APIC 43 rd Annual Conference June 11-13 Charlotte, NC Rochelle Neilson, RN, BS N, MS HCA,

OBJECTIVES Provide an Overview of the FIN Program Highlight Success to date Next steps

NAVIGATING FROM EXPRESS ENTRY TO PERMANENT RESIDENCY: CRITICAL RISK AND COMPLIANCE ISSUES

Privacy and Sharing Data for Social Good July 15, 2019 Background Communities grapple with

Our next steps: Coproduce Care CIC @coprocare www.coproducecare.com hi@coproducecare.com Here

Crace, Lightsview and Googong Clive Alcock, Urban Design Manager This presentation. Covers

a n a c t a 001852 ___ Aboriginal Affairs arid I Affaires autochtones et , Dveloppement

Cloud Computing & Cloud Models Cloud Models Topics Defining cloud computing

Superferric 3T CIC Dipole R&D 2016/17 Project Report Peter McIntyre Texas A&M University