timeplot
play

timePlot Text mining support for analysis of aeronautical incident - PowerPoint PPT Presentation

timePlot Text mining support for analysis of aeronautical incident reports Showcase of Electronic Tools (SET13) September 23 rd 2013 Safety-Data by CFH Group p. 2 Outline I. Partners and clients II. Context and Stakes III. Proposed


  1. timePlot Text mining support for analysis of aeronautical incident reports Showcase of Electronic Tools (SET13) September 23 rd 2013 Safety-Data  by CFH Group

  2. p. 2 Outline I. Partners and clients II. Context and Stakes III. Proposed approach IV. Demo V. TimePlot analysis platform VI. Other services VII. About us Safety-Data  by CFH Group 2013/09/23

  3. p. 3 I - Partners • Tools designed and tuned in close collaboration with experts in the field of aviation safety: – Administrations: • ICAO • KOTSA • DSNA (ATC) – Manufacturers: • ATR • Airbus • The timePlot platform was designed in collaboration with: • DGAC (tool currently deployed and in service  200 users) • Air France (currently testing and validation) • EASA (ECCAIRS integration, currently testing) Safety-Data  by CFH Group 2013/09/23

  4. p. 4 II - Context & Stakes • Context: incident report databases ‒ Large and constantly growing repositories of safety related data. ‒ A lot of information is still in the form of free text. ‒ Coding/classification based strategies are complex and reductive. ‒ Processing this information by human experts is time-consuming and costly. Safety-Data  by CFH Group 2013/09/23

  5. p. 5 II - Context & Stakes • The stakes: ‒ How to make sense of the data as a whole? – How fully process information-rich content? – How to identify and extract recurrent events, without relying on coded data? – How to visualize and specific, rare or complex situations ?  Our approach: text mining tools based on linguistic analysis of all textual content in safety reports. Safety-Data  by CFH Group 2013/09/23

  6. p. 6 III - Proposed approach • Issue: Use text-mining techniques to process and organize narrative data in incident reports. • Basic principle : model inter-occurrence similarity based on the narrative content and visualized as a function of time ‒ Content similarity: occurrences describing the same phenomenon. ‒ Context similarity: Events occurring in similar contexts (airport/aircraft model/route) based on available coded data. ‒ Custom factors: Occurrences sharing similar causes (crew fatigue, situational awareness…). Safety-Data  by CFH Group 2013/09/23

  7. p. 7 IV - Demo • Online demo with an ASRS dataset: – From 1988 to 2012, – ~ 167 000 occurrences. https://services.safety-data.com/timeplot/prod/asrs/login.php Login: demouser Password: demoASRS2013 Safety-Data  by CFH Group 2013/09/23

  8. p. 8 V.1 - Overview • Example: For a given report, visualization of similar reports through time Safety-Data  by CFH Group 2013/09/23

  9. p. 9 V.2 - Features (1): Select the pivot report • Selection of the reference report in the reference database ‒ Search by query (keywords, type of aircraft, date, etc.), ‒ Via report id. • Report given by the user via a free text field. Safety-Data  by CFH Group 2013/09/23

  10. p. 10 V.2 - Features (2): Visualization • “Post - similarity” ordering Safety-Data  by CFH Group 2013/09/23

  11. p. 11 V.2 - Features (3): Analysis assistance • Selection of reports via customized factors Safety-Data  by CFH Group 2013/09/23

  12. p. 12 V.2 - Features (4): Alerts • Principle: Automatic analysis of incoming reports in order to identify specific incidents. • Alert selection: ‒ via a report describing an incident that one is looking to trace; ‒ via keywords; ‒ via the frequency of reports received for a particular type of equipment. • Basics: – When the database is updated, a similarity analysis is automatically run through the new reports to identify those corresponding to the alerts set up; – The user receives an alert message indicating the alert concerned and the corresponding reports. Safety-Data  by CFH Group 2013/09/23

  13. p. 13 V.3 - timePlot platform description • Hosted web service (accessible over the internet, used in a web-browser, logging via a user management module). • Available for English and French data, more languages to come. • Data exchange interfaces with other environments (ECCAIRS, ASRS-like databases, custom in-house solutions). • User-side integration with the ECCAIRS Browser. Safety-Data  by CFH Group 2013/09/23

  14. p. 14 VI.1 - Other services (1) • Data migration: – All source data format conversion (doc, xls, xml …) to the format of a target database (e.g. ECCAIRS e4f/e5f). – Taxonomy migration (e.g. ASRS  ADREP). • Databases Quality & Coherence Analysis: – Factual data completion based on the narratives and/or external resources (e.g. Type of aircraft  Mass group). – Duplicates identification through a content analysis. – Integration of a data flow by checking the quality of the data. Safety-Data  by CFH Group 2013/09/23

  15. p. 15 VI.1 - Other services (2) • Categorization assistance: ‒ Analysis of the taxonomy used and learning of the codification rules thanks to the existing data. ‒ Categorization assistance features integration in the user environment for entering/managing reports. ‒ Report internal coherence verification (text/categorization). ‒ Codification coherence and quality analysis in a report database (batch mode). Safety-Data  by CFH Group 2013/09/23

  16. p. 16 VI.2 - Database life cycle, e.g. ECCAIRS Phases Activities CFH-SD Supports • Environment initialization  ECCAIRS deployment Database initialization • Taxonomy migration set-up  ECCAIRS/ADREP proficiency • Existing database translation  Specific software development • Database quality check  Database quality check tools • New reports integration:  Categorization assistance: Database  Expert manpower – with SD ECCAIRS Add-in update – in batch mode • Experts check-up:  Quality & completion analysis Database Quality Data quality check software Check software for large databases from EASA • ECCAIRS user activities Database  timePlot environment for: Usage – Sorting and exporting data – Similarity analysis Safety-Data  by CFH Group 2013/09/23

  17. p. 17 VII - About us • CFH-Safety Data is a French SME: – We are based in Toulouse, – We are an Aerospace Valley member (World Competitiveness Cluster in Aeronautics & Space), – We are developing tools and applications and providing services related to aviation safety data. Website: www.safety-data.com • Contacts: – Mika Andreou: mika@safety-data.com – Eric Hermann: hermann@safety-data.com – Céline Raynal: raynal@safety-data.com Safety-Data  by CFH Group 2013/09/23

Recommend


More recommend