next generation data discovery
play

Next Generation Data Discovery Fusing Structured and Unstructured - PowerPoint PPT Presentation

Next Generation Data Discovery Fusing Structured and Unstructured Content from Multiple Repositories Chris Meredith UDOT Dan Quinn PTFS Questions Shared Drives Vision: Enterprise-wide Data Discovery Information Transparency


  1. Next Generation Data Discovery Fusing Structured and Unstructured Content from Multiple Repositories Chris Meredith – UDOT Dan Quinn – PTFS

  2. Questions Shared Drives

  3. Vision: Enterprise-wide Data Discovery • Information Transparency • UDOT information should be discoverable to the entire department and its partners • Sometimes, information is desired but the question is unknown

  4. What Have We Done? • You don’t need to know where the information resides • You don’t need to know what information exists R2 Shared Drive

  5. What Stays the Same? The Index... ● References data from the source system. Nothing is copied! ● Utilizes source system credentials for document access. ○ If you’re required to log into the source system, the index does not provide a way around that. ● Allows data owners to remain data owners and continue to collect and maintain their information. So you can more easily use the information already provided by the department without duplication!

  6. Basic Knowvation operation

  7. Basic Search screen has several options

  8. Knowvation supports a browse hierarchy

  9. Browse enables navigating a folder structure

  10. Knowvation can support optimal UDOT hierarchy

  11. There are many search options from this interface

  12. The full text search box can start easy searches

  13. Items presented as dots on base map

  14. Various layers can be easily turned on/off

  15. Now mileposts are off

  16. Base map options can be switched with a click

  17. Select Open Street Map

  18. Open Street Map used to start search, display data

  19. A geospatial search is a common starting point

  20. The search can now be limited to Route 48

  21. And further limited with a PIN

  22. And further limited with a Document Type

  23. Data can be presented in different views: Grid View

  24. Data can be presented in different views: List View

  25. Data can be presented in different views: Thumbnail View

  26. An Esri widget enables Knowvation searches in ArcGIS

  27. An Esri widget enables Knowvation searches in ArcGIS

  28. A drop down makes selecting target route easy

  29. A Search simply on route returns 4,282,556 files

  30. Adding full text search on “ramp” narrows to 146,763 filesPIN narrows it down to five files

  31. Adding PIN 10711 narrows list to five files

  32. Selecting “all” brings back all records = 685 records

  33. Pattern Search is a fuzzy text search

  34. Correct/incorrect spellings are highlighted.

  35. What Else? • Improving data attributes/metadata improves searchability • Aligning data standards across the Department • The index reflects how well data governance functions within the department. With data governance improvements, the index improves.

  36. Moving Forward • Findability Study using machine learning to help make documents more findable • Power user testing – Region Designers • Training • Incorporate additional data sources

  37. Chris Meredith Utah Department of Transportation Central Right of Way GIS Administrator cmeredith@utah.gov Dan Quinn PTFS VP, Sales & Marketing dquinn@ptfs.gov

  38. How Can You Do That? You can search by… Location on a map

  39. How Can You Do That? You can search by… Address Route and Milepost Source system Project information (PIN) • PIN • Route • Name

  40. How Can You Do That? You can search by… Metadata categories The picture can't be displayed.

  41. How Can You Do That? You can search by… Full text across metadata and text in files using Boolean, Exact, Concept and Pattern search techniques

Recommend


More recommend