timePlot
Text mining support for analysis of aeronautical incident reports
Showcase of Electronic Tools (SET13), September 23rd, 2013
Safety-Data by CFH Group
Outline
I. Partners and clients
II. Context and Stakes
III. Proposed approach
IV. Demo
V. TimePlot analysis platform
VI. Other services
VII. About us
I - Partners
• Tools designed and tuned in close collaboration with experts in the field of aviation safety:
  – Administrations:
    • ICAO
    • KOTSA
    • DSNA (ATC)
  – Manufacturers:
    • ATR
    • Airbus
• The timePlot platform was designed in collaboration with:
  • DGAC (tool currently deployed and in service, 200 users)
  • Air France (currently in testing and validation)
  • EASA (ECCAIRS integration, currently in testing)
II - Context & Stakes
• Context: incident report databases
  – Large and constantly growing repositories of safety-related data.
  – A lot of information is still in the form of free text.
  – Coding/classification-based strategies are complex and reductive.
  – Processing this information by human experts is time-consuming and costly.
II - Context & Stakes
• The stakes:
  – How to make sense of the data as a whole?
  – How to fully process information-rich content?
  – How to identify and extract recurrent events without relying on coded data?
  – How to visualize specific, rare or complex situations?
Our approach: text mining tools based on linguistic analysis of all textual content in safety reports.
III - Proposed approach
• Issue: Use text-mining techniques to process and organize narrative data in incident reports.
• Basic principle: model inter-occurrence similarity based on the narrative content and visualize it as a function of time.
  – Content similarity: occurrences describing the same phenomenon.
  – Context similarity: events occurring in similar contexts (airport/aircraft model/route), based on available coded data.
  – Custom factors: occurrences sharing similar causes (crew fatigue, situational awareness…).
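The slides do not say how similarity is computed beyond "linguistic analysis", so the following is only a minimal sketch of the basic principle using a swapped-in, generic technique: TF-IDF weighting with cosine similarity for content, and simple agreement on coded fields for context. All function names, field names and sample reports are hypothetical and are not taken from timePlot.

```python
import math
import re
from collections import Counter

def tokenize(text):
    """Lowercase and split on non-letters (a crude stand-in for linguistic analysis)."""
    return re.findall(r"[a-z]+", text.lower())

def tfidf_vectors(narratives):
    """Return one sparse TF-IDF vector (a dict term -> weight) per narrative."""
    docs = [Counter(tokenize(t)) for t in narratives]
    n = len(docs)
    df = Counter(term for d in docs for term in d)  # document frequency
    # Smoothed IDF so that every weight stays strictly positive.
    return [
        {t: tf * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, tf in d.items()}
        for d in docs
    ]

def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def context_similarity(a, b, fields=("airport", "aircraft", "route")):
    """Fraction of coded fields on which two occurrences agree (missing = no match)."""
    shared = [f for f in fields if a.get(f) is not None and a.get(f) == b.get(f)]
    return len(shared) / len(fields)

# Hypothetical sample occurrences, for illustration only.
reports = [
    {"id": 1, "airport": "TLS", "aircraft": "A320",
     "narrative": "Bird strike on takeoff, engine vibration, returned to field"},
    {"id": 2, "airport": "TLS", "aircraft": "ATR72",
     "narrative": "Engine vibration after bird strike during takeoff roll"},
    {"id": 3, "airport": "CDG", "aircraft": "A320",
     "narrative": "Runway incursion by ground vehicle during taxi"},
]
vectors = tfidf_vectors([r["narrative"] for r in reports])
content_12 = cosine(vectors[0], vectors[1])  # two bird-strike narratives
content_13 = cosine(vectors[0], vectors[2])  # unrelated narratives
context_12 = context_similarity(reports[0], reports[1])
```

In this sketch the two bird-strike narratives score higher on content than the unrelated pair, and content and context scores are kept separate so they can be combined or filtered independently.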
IV - Demo
• Online demo with an ASRS dataset:
  – From 1988 to 2012,
  – ~167 000 occurrences.
https://services.safety-data.com/timeplot/prod/asrs/login.php
Login: demouser
Password: demoASRS2013
V.1 - Overview
• Example: for a given report, visualization of similar reports through time.
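As a rough illustration of "similar reports through time", the sketch below takes reports that have already been scored against a pivot report and counts, per year, those above a similarity threshold. The function name, the threshold value and the sample scores are assumptions made for this example; the actual timePlot display is not described in enough detail to reproduce.

```python
from collections import Counter
from datetime import date

def similarity_timeline(scored_reports, threshold=0.5):
    """Count reports scoring at or above the threshold, grouped by year.

    scored_reports: iterable of (occurrence_date, similarity_to_pivot) pairs.
    Returns a list of (year, count) tuples sorted by year.
    """
    counts = Counter(d.year for d, score in scored_reports if score >= threshold)
    return sorted(counts.items())

# Hypothetical similarity scores against one pivot report.
scored = [
    (date(1999, 3, 2), 0.82),
    (date(1999, 11, 20), 0.40),  # below the threshold, ignored
    (date(2004, 6, 15), 0.67),
    (date(2004, 7, 1), 0.91),
    (date(2011, 1, 9), 0.55),
]
timeline = similarity_timeline(scored)
```

The resulting (year, count) pairs can be fed to any plotting tool to show when occurrences resembling the pivot report cluster in time.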
V.2 - Features (1): Select the pivot report
• Selection of the reference report in the reference database:
  – Search by query (keywords, type of aircraft, date, etc.),
  – Via report id.
• Report given by the user via a free text field.
V.2 - Features (2): Visualization
• “Post-similarity” ordering
V.2 - Features (3): Analysis assistance
• Selection of reports via customized factors
V.2 - Features (4): Alerts
• Principle: automatic analysis of incoming reports in order to identify specific incidents.
• Alert selection:
  – via a report describing an incident that one is looking to trace;
  – via keywords;
  – via the frequency of reports received for a particular type of equipment.
• Basics:
  – When the database is updated, a similarity analysis is automatically run on the new reports to identify those corresponding to the alerts set up;
  – The user receives an alert message indicating the alert concerned and the corresponding reports.
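The alert mechanism above can be sketched as a function run on each batch of new reports. This is an illustration under stated assumptions, not the timePlot implementation: the alert dictionaries, the `run_alerts` name, the rule that all keywords must be present, and the use of word-set Jaccard overlap as the similarity measure are all hypothetical choices.

```python
import re

def words(text):
    """Set of lowercase words in a narrative."""
    return set(re.findall(r"[a-z]+", text.lower()))

def jaccard(a, b):
    """Jaccard overlap between two word sets (stand-in similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def run_alerts(new_reports, alerts):
    """Return {alert name: [ids of matching new reports]} for triggered alerts."""
    triggered = {}
    for alert in alerts:
        kind = alert["kind"]
        if kind == "report":  # trace incidents resembling a reference report
            ref = words(alert["reference"])
            ids = [r["id"] for r in new_reports
                   if jaccard(ref, words(r["narrative"])) >= alert["threshold"]]
        elif kind == "keywords":  # assumption: every keyword must appear
            wanted = set(alert["keywords"])
            ids = [r["id"] for r in new_reports if wanted <= words(r["narrative"])]
        elif kind == "frequency":  # enough reports on one type of equipment
            ids = [r["id"] for r in new_reports
                   if r.get("equipment") == alert["equipment"]]
            if len(ids) < alert["min_count"]:
                ids = []
        else:
            raise ValueError(f"unknown alert kind: {kind}")
        if ids:
            triggered[alert["name"]] = ids
    return triggered

# Hypothetical batch of incoming reports and configured alerts.
new_reports = [
    {"id": 101, "narrative": "Smoke in cabin after pushback", "equipment": "APU"},
    {"id": 102, "narrative": "APU fault light on ground", "equipment": "APU"},
    {"id": 103, "narrative": "Laser illumination on final approach", "equipment": None},
]
alerts = [
    {"name": "laser", "kind": "keywords", "keywords": ["laser", "approach"]},
    {"name": "apu-frequency", "kind": "frequency", "equipment": "APU", "min_count": 2},
    {"name": "cabin-smoke", "kind": "report",
     "reference": "smoke in the cabin during pushback", "threshold": 0.4},
]
messages = run_alerts(new_reports, alerts)
```

Each entry of `messages` corresponds to one alert message: the alert concerned and the new reports that triggered it.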
V.3 - timePlot platform description
• Hosted web service (accessible over the internet, used in a web browser, login via a user management module).
• Available for English and French data, more languages to come.
• Data exchange interfaces with other environments (ECCAIRS, ASRS-like databases, custom in-house solutions).
• User-side integration with the ECCAIRS Browser.
VI.1 - Other services (1)
• Data migration:
  – Conversion of any source data format (doc, xls, xml …) to the format of a target database (e.g. ECCAIRS e4f/e5f).
  – Taxonomy migration (e.g. ASRS → ADREP).
• Database quality and coherence analysis:
  – Factual data completion based on the narratives and/or external resources (e.g. type of aircraft → mass group).
  – Identification of duplicates through content analysis.
  – Integration of a data flow with data quality checks.
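Identification of duplicates through content analysis can be illustrated with a generic near-duplicate technique: compare reports by the overlap of their word shingles (runs of consecutive words). This is an assumed stand-in, not the method used by Safety-Data; `shingles`, `find_duplicates`, the threshold and the sample narratives are hypothetical.

```python
import re
from itertools import combinations

def shingles(text, k=3):
    """Set of k-word shingles (tuples of consecutive words) in a narrative."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    if len(tokens) < k:
        return {tuple(tokens)} if tokens else set()
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}

def find_duplicates(reports, threshold=0.6, k=3):
    """Return pairs of report ids whose shingle sets overlap above the threshold.

    reports: dict mapping report id -> narrative text.
    Compares all pairs, which is fine for a sketch but quadratic in the
    number of reports.
    """
    sets = {rid: shingles(text, k) for rid, text in reports.items()}
    pairs = []
    for a, b in combinations(sorted(sets), 2):
        union = sets[a] | sets[b]
        if union and len(sets[a] & sets[b]) / len(union) >= threshold:
            pairs.append((a, b))
    return pairs

# Hypothetical narratives: R2 is R1 with a few extra words.
sample = {
    "R1": "Crew reported a hydraulic leak on the left main gear after landing.",
    "R2": "Crew reported a hydraulic leak on the left main gear after landing at night.",
    "R3": "Passenger fell ill during cruise and the flight diverted.",
}
duplicates = find_duplicates(sample)
```

For a repository of the size quoted for the ASRS demo, an all-pairs comparison would be too slow; an indexing scheme would be needed, which is outside the scope of this sketch.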
VI.1 - Other services (2)
• Categorization assistance:
  – Analysis of the taxonomy used and learning of the codification rules from the existing data.
  – Integration of categorization assistance features into the user environment for entering and managing reports.
  – Verification of a report's internal coherence (text vs. categorization).
  – Analysis of codification coherence and quality in a report database (batch mode).
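To make the idea of learning from already-coded data concrete, here is a minimal sketch that suggests the category of the most similar coded report and flags a report whose assigned category disagrees with that suggestion. It uses a deliberately simple nearest-neighbour rule on word overlap as a stand-in; the function names, the sample narratives and the category labels are illustrative assumptions, not the rules learned by the actual tool.

```python
import re

def words(text):
    """Set of lowercase words in a narrative."""
    return set(re.findall(r"[a-z]+", text.lower()))

def suggest_category(narrative, coded_reports):
    """Return (category, score) of the most similar already-coded report.

    coded_reports: iterable of (narrative, category) pairs.
    Returns (None, 0.0) when no coded report shares any word.
    """
    target = words(narrative)
    best = (None, 0.0)
    for text, category in coded_reports:
        other = words(text)
        union = target | other
        score = len(target & other) / len(union) if union else 0.0
        if score > best[1]:
            best = (category, score)
    return best

def is_coherent(narrative, assigned_category, coded_reports):
    """True if the assigned category matches the suggestion (or none exists)."""
    suggested, _ = suggest_category(narrative, coded_reports)
    return suggested is None or suggested == assigned_category

# Hypothetical coded reports with illustrative category labels.
coded = [
    ("Bird strike on departure, engine ingestion suspected", "BIRD"),
    ("Aircraft entered runway without clearance", "RI"),
    ("Severe turbulence encountered in cruise, cabin crew injured", "TURB"),
]
new_text = "Flock of birds on departure, bird strike on the nose"
suggestion, score = suggest_category(new_text, coded)
```

Run over a whole database, the same check gives a batch-mode list of reports whose text and categorization may disagree and deserve an expert's look.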
VI.2 - Database life cycle, e.g. ECCAIRS
Phases | Activities | CFH-SD Supports
Database initialization
• Environment initialization | ECCAIRS deployment
• Taxonomy migration set-up | ECCAIRS/ADREP proficiency
• Existing database translation | Specific software development
• Database quality check | Database quality check tools
Database update
• New reports integration: Expert manpower | Categorization assistance:
  – with SD ECCAIRS Add-in
  – in batch mode
Database Quality Check
• Experts check-up: Quality & completion analysis | Data quality check software for large databases from EASA
Database Usage
• ECCAIRS user activities | timePlot environment for:
  – Sorting and exporting data
  – Similarity analysis
VII - About us
• CFH-Safety Data is a French SME:
  – We are based in Toulouse,
  – We are an Aerospace Valley member (World Competitiveness Cluster in Aeronautics & Space),
  – We develop tools and applications and provide services related to aviation safety data.
Website: www.safety-data.com
• Contacts:
  – Mika Andreou: mika@safety-data.com
  – Eric Hermann: hermann@safety-data.com
  – Céline Raynal: raynal@safety-data.com