challenges in ubiquitous data mining
play

Challenges in Ubiquitous Data Mining Jo ao Gama LIAAD-INESC Porto, - PowerPoint PPT Presentation

Motivation Illustrative Example Clustering Sensor Networks Final Comments Challenges in Ubiquitous Data Mining Jo ao Gama LIAAD-INESC Porto, University of Porto, Portugal jgama@fep.up.pt Jo ao Gama Challenges in Ubiquitous Data Mining


  1. Motivation Illustrative Example Clustering Sensor Networks Final Comments Challenges in Ubiquitous Data Mining Jo˜ ao Gama LIAAD-INESC Porto, University of Porto, Portugal jgama@fep.up.pt Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  2. Motivation Illustrative Example Clustering Sensor Networks Final Comments Motivation 1 Illustrative Example 2 Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks 3 Motivation Distributed Grid Clustering Clustering Data Sources Final Comments 4 Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  3. Motivation Illustrative Example Clustering Sensor Networks Final Comments Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  4. Motivation Illustrative Example Clustering Sensor Networks Final Comments Problem Formulation: Network Data Model Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  5. Motivation Illustrative Example Clustering Sensor Networks Final Comments Querying Model Query = Q ( � n i =0 S i ) One-shot queries: What is the state of the network? Continuous queries: Track and monitor the state of network at any time Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  6. Motivation Illustrative Example Clustering Sensor Networks Final Comments Network topologies Star Topology arrange peers around a central hub (coordinator). Mesh Network every peer is connected to nearest peers. The main purpose is fault tolerance. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  7. Motivation Illustrative Example Clustering Sensor Networks Final Comments Routing schemes unicast: delivers a message to a single specific node; broadcast: delivers a message to all nodes in the network; anycast: delivers a message to a group of nodes, typically the ones nearest to the source. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  8. Motivation Illustrative Example Clustering Sensor Networks Final Comments Limitations of existing techniques Machine learning so far has mostly centered on one-shot data analysis from homogeneous and stationary data, and on centralized algorithms. We are faced with tremendous amount of distributed data. In most cases, this data is transient , and may not be stored in permanent relations. The theory of machine learning relies on the assumption that the data points are independent and identically distributed, meaning that the underlying generative process is stationary. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  9. Motivation Illustrative Example Clustering Sensor Networks Final Comments Requirements for Mining Sensor Data Streams Vertically distributed data Single pass: process each observation once; Small space: constant space; Small processing time; Reduced communications. Local Approaches: ✓ Privacy and Security preserving ✗ Synchronization Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  10. Motivation Illustrative Example Clustering Sensor Networks Final Comments The Demand for Learning Requirements for adaptive smart devices: be able to sense their environment, receive data from other devices, and make sense of the gathered data. be able to adapt continuously to changing environmental conditions and evolving user habits and needs. be capable of predictive self-diagnosis . be resource-aware because of the real-time constraint and of limited computer, battery power and communication resources. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  11. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Illustrative Example: Renewable Power Prediction Analog Method for Collaborative very-short-term Forecasting of Power Generation from Photovoltaic Systems , V.Gomez, G. Hebrail, NGDM 2011 EC recommendation: in 2020 the penetration of renewable energies should be 20% Renewable Power Prediction: Predict the power produced by a photovoltaic panel for each quarter in a short-term time horizon. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  12. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Collaborative Forecasting: Main Idea 1 Local Step: Find past states nearest to current state; 2 Collaboration: Broadcast time-stamps of past nearest states; 3 Local Search: Inferring the Global Context; 4 Prediction: Using the global context. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  13. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Collaboration Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  14. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local Search Past Future Local Site ? ? Reference Window Size W Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  15. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local Search T1 Past Future Local Site ? ? Reference Window Compute the distance from the time- Size W series starting at time-stamp T1 to the reference window Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  16. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local Search T5 Past Future Local Site ? ? Reference Window Compute the distance from the time- Size W series starting at time-stamp T5 to the reference window Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  17. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local Search T8 Past Future Local Site ? ? Reference Window Compute the distance from the time- Size W series starting at time-stamp T8 to the reference window Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  18. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Collaboration: broadcast time-stamps of similar contexts Past Future ? Local Site Neighbor 1 Neighbor 2 Neighbor 3 Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  19. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local search: Inferring the Global Context Past Future ? Local Site Neighbor 1 Neighbor 2 Neighbor 3 Matches: 3 Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  20. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local search: Inferring the Global Context Past Future ? Local Site Neighbor 1 Neighbor 2 Neighbor 3 Matches: 0 Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  21. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local search: Inferring the Global Context Past Future ? Local Site Neighbor 1 Neighbor 2 Neighbor 3 Matches: 1 Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  22. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Local search: Inferring the Global Context Past Future ? Local Site Neighbor 1 Neighbor 2 Neighbor 3 Matches: 1 Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  23. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments The Global Context Past Future ? Local Site Neighbor 1 Neighbor 2 Neighbor 3 Best Matching: 3 Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  24. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Prediction Past Future ? Local Site Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  25. Motivation Illustrative Example Very-short-term Forecasting in Photovoltaic Systems Clustering Sensor Networks Final Comments Lessons Learned Using local information to infer global context by collaboration with neighbors; Preserves privacy while collaborating with other systems; Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  26. Motivation Motivation Illustrative Example Distributed Grid Clustering Clustering Sensor Networks Clustering Data Sources Final Comments Clustering Distributed Data Streams Sensors are small, low-cost devices capable of sensing and communicating with other sensors. Continuously maintain a cluster structure over the network. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

  27. Motivation Motivation Illustrative Example Distributed Grid Clustering Clustering Sensor Networks Clustering Data Sources Final Comments Clustering Distributed Data Streams Continuously maintain a cluster structure of the data points generated by sensors. A Cluster is a set of data points: Information about dense regions of the sensor data space. Jo˜ ao Gama Challenges in Ubiquitous Data Mining

Recommend


More recommend