visualising real time traffic data using elasticsearch
play

VISUALISING REAL TIME TRAFFIC DATA USING ELASTICSEARCH AND C3JS @ - PowerPoint PPT Presentation

VISUALISING REAL TIME TRAFFIC DATA USING ELASTICSEARCH AND C3JS @ jettroCoenradie Trifork Amsterdam Case Study ANWB (Royal Dutch Automobile Association) FACT SHEET Software engineer @ Trifork Jettro Coenradie specialised in search


  1. VISUALISING REAL TIME TRAFFIC DATA USING ELASTICSEARCH AND C3JS @ jettroCoenradie Trifork Amsterdam Case Study ANWB (Royal Dutch Automobile Association)

  2. FACT SHEET Software engineer @ Trifork Jettro Coenradie specialised in search @jettroCoenradie Twitter @gridshore Gihub https://github.com/jettro Linkedin https://www.linkedin.com/in/jettro http://www.gridshore.nl Blogs http://blog.trifork.com/author/jettro/

  3. GOAL Ideas for combining (open) data Evaluate options and performance

  4. WHAT IS ANWB? • Dutch Automobile Driver Assistance • Sister from: FDM (Danmark) ADAC (Germany) AA (England)

  5. Founded in 1883 as Algemene Nederlandse Wieler Bond General Dutch Bicycle Association

  6. WHAT IS ELASTICSEARCH • Distributed / Scalable search • Structured and full-text • Data analytics • Log analysis

  7. (OPEN) DATA Real time traffic data Weather data Automobile Assistance data

  8. GOAL FOR THE PROJECT Amount of cars on the roads Wrong data Traffic intensity on the roads

  9. FLOW OF THE PROJECT • Get to know the data: Logstash / Kibana • Start improving data quality • Present data using our own charts

  10. TECHNICAL OVERVIEW Tomcat - Spring mvc - c3js Data view Spring Integration Data xml / csv integration Data Store elasticsearch

  11. DEMO

  12. Index A Index B Index C Shard 1 Shard R 1 Shard 2 Shard R 2 Lucene Lucene Lucene Lucene

  13. TIME BASED INDICES Strings Numbers NDW Dates Geo points

  14. TIME BASED INDICES Alias NDW-2014-09-15 NDW NDW-2014-09-16 NDW-2014-09-17 mapping-template

  15. SCHEMA-LESS Dynamic schema • There is always a schema • The schema can be dynamic • Often you want to be specific Dates / Numbers / Geo locations

  16. SEARCH Full text search Versus Structured search

  17. STRUCTURED SEARCH Filters • Can be cached most of the time • No scoring • Fast

  18. FILTERS WE USED • Range filters • Term filters • Composite (bool) filters

  19. Date Range Filter Range Filter Term Filter

  20. AGGREGATIONS Two types of aggregations • Create buckets of data • Compute Metrics

  21. Set of documents Doc Doc Doc Term: red, blue, green, yellow Range: 0-10, 10-20, 20-30, 30-40 Condition Bucket Bucket Bucket Bucket

  22. D D Set of documents

  23. AGGREGATIONS WE USED • Date histogram aggregations • Terms aggregations • AVG aggregations

  24. Date Histogram Aggregation + AVG metric Aggregation

  25. Terms Aggregation

  26. GEO LOCATIONS Two types of locations • Using latitude and longitude • Using geohash (creating a grid)

  27. GEO LAT/LON • Used for distance based queries • Used for distance based aggregations

  28. GEO HASH • Uses a hash te represent a square • More characters means more precision

  29. GEOHASH http://www.bigdatamodeling.org/2013/01/intuitive-geohash.html

  30. PERCOLATOR “The opposite of executing a query and finding results”

  31. PERCOLATOR “Match an (existing) document against stored queries.”

  32. PERCOLATOR Zuid-West Geo polygon Noord-West filter Noord-Oost Zuid { location: [ Zuid-West 3.5123, 46.3412 ] }

  33. QUESTIONS @jettroCoenradie

Recommend


More recommend