discovering functional dependencies and
play

Discovering Functional Dependencies and Association Rules by - PowerPoint PPT Presentation

1 Discovering Functional Dependencies and Association Rules by Navigating in a Lattice of OLAP Views Team LIS IRISA (Rennes, France) Pierre Allard, pierre.allard@irisa.fr Sbastien Ferr, ferre@irisa.fr Olivier Ridoux, ridoux@irisa.fr 2


  1. 1 Discovering Functional Dependencies and Association Rules by Navigating in a Lattice of OLAP Views Team LIS – IRISA (Rennes, France) Pierre Allard, pierre.allard@irisa.fr Sébastien Ferré, ferre@irisa.fr Olivier Ridoux, ridoux@irisa.fr

  2. 2 Introduction : photos database Name Date Place Theme … • Rules extraction from multi- valued set T1 ▫ Set of described photos T2 … Date  Place • Traditional systems extract a Date, Camera  Place list of rules (Date  Camera, Tp) Place = Marseille  Event = Conference … • Many extracted rules • Difficulty in finding relevant information

  3. 3 Problem: Rules browsing • Excessive number of rules ▫ Difficulty in finding relevant information • Do not present all the rules together ▫ Show rules in subsets (views) • Need interactivity ▫ Navigation ▫ Exploration

  4. 4 Rule extraction : background • Multi-valued context ▫ Attr(r) = {Name, Date, Place, Theme , Event, Camera, …} ▫ Set of items described in each attribute • Functional Dependencies ▫ Date  Place • Conditional Functional Dependencies ▫ (Date  Camera, {Topic = Holiday}) • Association Rules ▫ (Place = Marseille)  (Event = Conference JIGOT) • Hierarchy between rules with help of FCA [Medina09] ▫ Most general to most specific ▫ AR < CFD < FD

  5. 5 OLAP : background • On-Line Analytical Processing (OLAP) [Codd93] • Business intelligence (statisticians) • Extract trends, charts from a relation • Cube: view of the relation ▫ Fixed dimensions and measure ▫ Aggregation ▫ Granularity levels ▫ Navigation from cube to cube

  6. 6 Projection of a relation into an OLAP cube • Cube = aggregations of measures, by each value of each dimension • Dimensions  measure ▫ Dim(c) = ( Orientation , Date ), Meas(c) = Camera • Premises  conclusion ▫ Orientation , Date  Camera ? • Classic cube with no aggregation ▫ Multisets at each cell ▫ Keep all the values of the measure  Support, confidence • Subset of rules, regarding dimensions and measure

  7. 7 View : Photo database Projection (Orient, Date) Landscape Portrait Dim(c) = (Orientation, Date) 16 jul {{Nikon, Nikon}} {{Nikon}} Meas(c) = Camera 21 aug {{Nikon, Apple}} Name Date Orientation Camera t_1 DSC01 16 jul Landscape Nikon t_2 DSC02 16 jul Landscape Nikon t_3 DSC03 16 jul Portrait Nikon t_4 DSC04 21 aug Portrait Nikon t_5 IMG05 21 aug Portrait Apple

  8. 8 View : Rules in projection (Orient, Date) Landscape Portrait 16 jul {{Nikon, Nikon}} {{Nikon}} 21 aug {{Nikon, Apple}} • Cell with a unique item value ▫ Association Rule (Orientation = Landscape), (Date = 16 jul)  (Camera = Nikon) • Subset of cube with empty or unique item values cells ▫ Conditional Functional Dependency (Orientation, Date  Camera, {Orientation = Landscape}) • Cube with empty or unique item values cells ▫ Functional Dependency

  9. 9 View : Rules in projection • Respected hierarchy regarding number of cells ▫ AR < CFD < FD • Display of all rules type in one view • Accessible support et confidence • Must navigation to access other cubes

  10. 10 View : Granularity • Dimension values : taxonomy organized in levels • New possibilities of cube creation, regarding new dimensions and measures ▫ e.g. date month , date year , place country date year 2010 date month 2010/01 2010/02 date day 2010/01/09 2010/01/16 2010/02/06 2010/02/07

  11. 11 View : Granularity 16 jul 09 17 jul 09 28 jan 10 • Date  Place {{Marseille}} {{Marseille}} {{Hammamet}} jul 09 jan 10 • Date month  Place {{Marseille, Marseille}} {{Hammamet}} • Add new relevant information • Need navigation links, to go from a cube to another

  12. 12 Navigation • Use of standard OLAP navigation links ▫ Add / Delete a dimensions ▫ Drill-down / Roll-up a dimension  Delete is a specific case of roll-up • Introduction of new navigation links ▫ Drill-down / Roll-up the measure ▫ Change measure

  13. 13 Navigation : predictable consequences • Rules appearance  Unic item value   Multiple item values • Rules disappearance  Cube with FD  Cube without FD • Rules preservation

  14. 14 Abilis : Prototype • Based on Logical Information System (LIS) ▫ Camelis kernel with web interface [Ferre09] ▫ Query and navigate context with logical formulas • Current query ▫ Allows you to select a subset of items (conjonctions, disjonctions, negations, etc) • Navigation tree ▫ Summary of the dimensions values according to the current selection • OLAP navigation links ▫ Granularity created by LIS

  15. 15 Initial view : query, navigation tree and extension

  16. 16 Navigation tree = index

  17. 17 Add « Theme » partition

  18. 18 Create complex selections

  19. 19 Add « event » as measure

  20. 20 Set « date » by day as partition : FD

  21. Roll-up date by day to date by month : CFD 21

  22. 22 Add « Camera model » as partition : CFD

  23. 23 Remove birthday events : FD

  24. 24 Conclusion and perspectives • Show FD, CFD and AR in one view • Predict some consequences on rules with navigation links • Manage more complex data (relational, with zero or multiple attributes) • Add indicator values for navigation links • Allow user to create cubes from aggregated data

  25. 25 Thank your for your attention Team LIS – IRISA (Rennes, France) Pierre Allard, pierre.allard@irisa.fr Sébastien Ferré, ferre@irisa.fr Olivier Ridoux, ridoux@irisa.fr

Recommend


More recommend