obtain original web presentation here https slides com
play

Obtain original web presentation here: - PowerPoint PPT Presentation

Obtain original web presentation here: https://slides.com/odineidolon/chym2017- 8/fullscreen#/ This PDF version is of lower quality 1 . 1 HOW TO DEAL WITH HOW TO DEAL WITH OBSERVATIONAL DATA OBSERVATIONAL DATA FOR (HYDROLOGICAL) FOR


  1. Obtain original web presentation here: https://slides.com/odineidolon/chym2017- 8/fullscreen#/ This PDF version is of lower quality 1 . 1

  2. HOW TO DEAL WITH HOW TO DEAL WITH OBSERVATIONAL DATA OBSERVATIONAL DATA FOR (HYDROLOGICAL) FOR (HYDROLOGICAL) MODELLING PURPOSES MODELLING PURPOSES ADRIANO FANTINI ICTP, Trieste, Italy afantini@ictp.it Online presentation: 1 . 2 https://bit.ly/2MaOjrP

  3. Which observations do you Which observations do you need for hydrology? need for hydrology? Precipitation (possibly hourly, esp. for small basins) Temperature Snow Elevation data Land use data Discharge / water stage 2

  4. Dense or sparse? Dense or sparse? Gridded: In-situ: Precipitation Precipitation Temperature Temperature Elevation Discharge Land use Water stage 3

  5. Gridded Gridded Advantages Uniform availability, often global Compare easily with models Generally straighforward formats (e.g. NetCDF) Efficient processing Different variables on the same grid Usually quality-controlled Disadvantages Heavily dependent on gridding method Not suitable for comparison over specific points Usually derived from in-situ data Dataset resolution != actual resolution (!!!) 4 . 1

  6. Gridding methods Gridding methods Basic categories: Inverse Distance Weighting Kriging Spline Interpolation Surface polygons CAN HAVE DIFFERENT RESULTS! 4 . 2 Mohr, 2008

  7. Gridding methods Gridding methods Basic categories: Inverse Distance Weighting Kriging Spline Interpolation Surface polygons CAN HAVE DIFFERENT RESULTS! Hofstra, 2008 4 . 3 Bostan, 2012

  8. In-situ In-situ Advantages No gridding/smoothing -> good for extremes Easy to compare with models (e.g. discharge at a given point) Do not hide anything from the user Dataset resolution == actual resolution Metadata! Disadvantages Scarse data availability Often in very weird formats Often lacking quality control Hard to compare with gridded (e.g. climate) models (PR, T) 5 . 1

  9. Common problems with Common problems with in-situ measurements in-situ measurements Temporal and spatial problems: Short timescale Missing periods Low station density Missing timesteps Data quality problems: Breaks and inhomogeneities Manual measurement errors Equipment errors and failures Weather-related measurement errors 5 . 2

  10. Temporal and spatial problems Temporal and spatial problems Short timescale Low station density Missing timesteps Missing periods HISTALP database, Bohm et al., 2007 5 . 3

  11. Data quality problems Data quality problems Manual measurement errors Equipment errors and failures 5 . 4

  12. Inhomogeneities Inhomogeneities Changes in measurement time Station relocations Instrumentation upgrades Incorrect maintainance 5 . 5 Hewaarachchi et al., 2016

  13. Data quality problems Data quality problems Measurement errors due to: Sensor icing Lack of power Vandalism Lack of maintenance Gauge undercatch 5 . 6

  14. Data acquisition problems Data acquisition problems EXAMPLE: PRECIPITATION GAUGE UNDERCATCH Nespor and Sevruk, 1999 on average ~30% ? on average ~30% ? ~80% for extreme winter solid events? ~80% for extreme winter solid events? 5 . 7

  15. Data acquisition problems Data acquisition problems EXAMPLE: PRECIPITATION GAUGE UNDERCATCH Macdonald and Pomeroy, 2008 5 . 8

  16. An in-situ example An in-situ example Precipitation Hourly From different institutions ~2200 stations on average uneven spatial coverage 2000-2016 6 . 1

  17. 6 . 2

  18. 6 . 3

  19. ? 6 . 4

  20. ? 6 . 5

  21. What can we do based on this? What can we do based on this? Cut outliers over a given fixed threshold Variable threshold based on SD or IQR Remove consecutive suspicious values 6 . 6

  22. What can we do based on this? What can we do based on this? Cut outliers over a given fixed threshold Variable threshold based on SD or IQR Remove consecutive suspicious values Timeseries are usually not enough to Timeseries are usually not enough to identify inhomogeneities and errors identify inhomogeneities and errors 6 . 6

  23. What can we do based on this? What can we do based on this? Cut outliers over a given fixed threshold Variable threshold based on SD or IQR Remove consecutive suspicious values Timeseries are usually not enough to Timeseries are usually not enough to identify inhomogeneities and errors identify inhomogeneities and errors Metadata Metadata 6 . 6

  24. What can we do based on this? What can we do based on this? Cut outliers over a given fixed threshold Variable threshold based on SD or IQR Remove consecutive suspicious values Timeseries are usually not enough to Timeseries are usually not enough to identify inhomogeneities and errors identify inhomogeneities and errors Metadata Metadata Spatial analysis Spatial analysis 6 . 6

  25. Metadata Metadata 6 . 7

  26. Metadata Metadata all the information that is all the information that is not data itself not data itself Gauge type and characteristics Station history (relocations, upgrades...) Recorded changes in station environment News about extreme events (hard to find for old data) 6 . 7

  27. Metadata Metadata all the information that is all the information that is not data itself not data itself Gauge type and characteristics Station history (relocations, upgrades...) Recorded changes in station environment News about extreme events (hard to find for old data) WE OFTEN DO NOT HAVE ACCESS TO THIS, AND IT'S EXTREMELY TIME CONSUMING 6 . 7

  28. Spatial analysis Spatial analysis 6 . 8

  29. Spatial analysis Spatial analysis Maps + comparison to Maps + comparison to neighbouring stations neighbouring stations Can be automated, once a criterion is chosen Possibilities for choosing reference stations: nearest neighbours, distance radius, height range, high correlation... 6 . 8

  30. Spatial analysis Spatial analysis Maps + comparison to Maps + comparison to neighbouring stations neighbouring stations Can be automated, once a criterion is chosen Possibilities for choosing reference stations: nearest neighbours, distance radius, height range, high correlation... REQUIRES HIGH ENOUGH STATION DENSITY HARD TO DO ON HIGHLY SPATIALLY VARIABLE FIELDS (e.g. PRECIPITATION) OR REGIONS (e.g. MOUNTAINS) 6 . 8

  31. 6 . 9

  32. 6 . 10

  33. ! 6 . 11

  34. Even after correction... Even after correction... JJA Prein et al., 2017 7 . 1

  35. DJF Prein et al., 2017 7 . 2

  36. DJF MAM JJA SON 2001-2016 mean precip 7 . 3

  37. DJF MAM JJA SON 2001-2016 mean precip 7 . 3

  38. DJF MAM JJA SON 2001-2016 mean precip 7 . 3

  39. 2001-2016 Precipitation probability density function (Northern Italy only) 7 . 4

  40. A few remarks A few remarks The best approach to correct data is heavily dependent on: Application Variable (e.g. precipitation > discharge > temperature) Availability of metadata Station density Length of the records Manual resources available 7 . 5

  41. A few remarks A few remarks The best approach to correct data is heavily dependent on: Application Variable (e.g. precipitation > discharge > temperature) Availability of metadata Station density Length of the records Manual resources available A CORRECTION A CORRECTION OBSERVATIONAL OBSERVATIONAL WILL OFTEN NOT WILL OFTEN NOT DATA WITH DATA WITH VERY VERY BE POSSIBLE BE POSSIBLE HIGH HIGH UNCERTAINTY UNCERTAINTY 7 . 5

  42. But... what about other data But... what about other data sources? sources? RADAR SATELLITE Only for precipitation Precipitation, temperature Depends heavily on location ~ Worldwide Can be shielded by topography The same algorithm is not Can be shielded by intense necessarily good everywhere rain Resolution is generally poor Frequent downtime (0.25° max) 8 . 1

  43. Requirement to They are just proxies! choose an algorithm TRMM ALGORITHMS CORR Liu, 2014 8 . 2

  44. Requirement to They are just proxies! choose an algorithm TRMM ALGORITHMS CORR Liu, 2014 But they are getting better and better! But they are getting better and better! 8 . 2

  45. DEMs DEMs Digital Elevation Models ASTER (30m) SRTM (30/90m) HydroSHEDS (90m) JAXA ALOS (30m) GTOPO (1km) WorldDEM (12m) Local, national DEMs ... Usually satellite based, sometimes LIDAR 9 . 1

  46. ~100km 9 . 2

  47. Another example: comparison over a small area High resolution Italian official DEM: 20m resolution Obtained from military contour maps Comparison with: ~30km ASTER HS-c HS-vf JAXA SRTM TINITALY01 All remapped on a 100m grid 9 . 3

  48. ASTER HS-c HS-vf JAXA SRTM TINITALY01 9 . 4

  49. Average height differences of several meters 5th-95th bias percentile range up to 115 meters Standard deviation of bias up to 60m Even the best alternative DEM has a 5th-95th bias percentile range of 35 meters, more than enough to introduce issues in river routing! CHOICE OF DEM MATTERS! 9 . 5

  50. Take home message Take home message Do not underestimate observational Do not underestimate observational uncertainty uncertainty Choose your data source based on Choose your data source based on your your application application Never-ever blindly trust un-checked Never-ever blindly trust un-checked obs data!!! obs data!!! 10

Recommend


More recommend