staging user feedback toward rapid conflict resolution in
play

Staging User Feedback toward Rapid Conflict Resolution in Data - PowerPoint PPT Presentation

Staging User Feedback toward Rapid Conflict Resolution in Data Fusion Romila P Pradhan* , Siarhei Bykau , Sunil Prabhakar* *Purdue University, Bloomberg L.P. 1 Fusing data from multiple sources Data It Item S 1 S 2 S 3 S 4 Zootopia Howard


  1. Staging User Feedback toward Rapid Conflict Resolution in Data Fusion Romila P Pradhan* , Siarhei Bykau , Sunil Prabhakar* *Purdue University, Bloomberg L.P. 1

  2. Fusing data from multiple sources Data It Item S 1 S 2 S 3 S 4 Zootopia Howard Spencer Spencer Kung Fu Panda Stevenson Nelson Inside Out leFauve Docter Finding Dory Stanton Minions Coffin Renaud Rio Jones Saldanha 2

  3. Data fusion systems ACCU 1 Data It Item S 1 S 2 S 3 S 4 Zootopia Howard Spencer Spencer Kung Fu Panda Stevenson Nelson Source iterative Correctness Inside Out leFauve Docter computation accuracy of claims Finding Dory Stanton Minions Coffin Renaud Rio Jones Saldanha Data It Data It Item Item Correctness o Correctness o of c of c claims claims So Source Ac Accuracy Zootopia Zootopia Howard (0.000) Howard (0.000) Spencer (1.000) Spencer ( (1.0 .000) S 1 0.317 Kung Fu Panda Kung Fu Panda Stevenson (0.015) Stevenson (0.015) Nelson (0.985) Nelson ( (0.9 .985) S 2 0.027 Inside Out Inside Out leFauve (0.001) leFauve (0.001) Docter (0.999) Do Docter (0.9 .999) S 3 0.992 Finding Dory Finding Dory Stanton (1.000) Stanton ( (1.0 .000) S 4 1.000 Minions Minions Coffin (0.921) Coffin ( (0.9 .921) Renaud (0.079) Renaud (0.079) Rio Rio Jones (0.015) Jones (0.015) Saldanha (0.9 Sa Saldanha (0.985) .985) [1] Xin Luna Dong, Laure Berti-Equille, Divesh Srivastava. Data Fusion: Resolving Conflicts from Multiple Sources. WAIM 2013. 3

  4. Comparison with ground truth ACCU 1 Data It Item S 1 S 2 S 3 S 4 Zootopia Howard Spencer Spencer Kung Fu Panda Stevenson Nelson Source iterative Correctness Inside Out leFauve Docter computation accuracy of claims Finding Dory Stanton Minions Coffin Renaud Rio Jones Saldanha Data It Item Tr Truth Data It Item Correctness o of c claims So Source Ac Accuracy Zootopia Howard Zootopia Howard (0.000) Spencer ( (1.0 .000) S 1 0.317 Kung Fu Panda Stevenson Kung Fu Panda Stevenson (0.015) Nelson ( (0.9 .985) S 2 0.027 Inside Out Docter Inside Out leFauve (0.001) Docter (0.9 Do .999) S 3 0.992 Finding Dory Stanton Finding Dory Stanton ( (1.0 .000) S 4 1.000 Minions Coffin Minions Coffin ( (0.9 .921) Renaud (0.079) Rio Saldanha Rio Jones (0.015) Sa Saldanha (0.9 .985) [1] Xin Luna Dong, Laure Berti-Equille, Divesh Srivastava. Data Fusion: Resolving Conflicts from Multiple Sources. WAIM 2013. 4

  5. Involve the User Validate data item Data Fusion Correctness D Model of claims User feedback to fusion model Labels 5

  6. How to be most effective with user feedback? 6

  7. This talk 4 ranking strategies Query-by-committee Maximum Expected Utility Item-level ranking Holistic ranking Approximate MEU Uncertainty Sampling Feedback Errors Evaluation ∆ distance_to_ground_truth (%) 0 Random QBC US • Confidence ApproxMEU -20 MEU GUB • Error-rate -40 • Conflicting -60 Non-expert feedback -80 0 20 40 60 80 100 data items validated (%) 7

  8. item-level ranking holistic ranking feedback errors evaluation Query-by-committee (QBC) most sources agree Data It Item S 1 S 2 S 3 S 4 Zootopia Howard Spencer Spencer Zootopia Howard Spencer Spencer Kung Fu Panda Stevenson Nelson Inside Out leFauve Docter sources disagree Finding Dory Stanton Minions Coffin Renaud Rio Jones Saldanha Rio Jones Saldanha 8

  9. item-level ranking holistic ranking feedback errors evaluation Uncertainty Sampling (US) Data It Item Correctness o of c f claims Data It Item Correctness o of c f claims Zootopia Howard (0.000) Spencer (1.000) Zootopia Howard (0.000) Spencer (1.000) Kung Fu Panda Stevenson (0.015) Nelson (0.985) Kung Fu Panda Stevenson (0.015) Nelson (0.985) Kung Fu Panda Stevenson (0.015) Nelson (0.985) Inside Out leFauve (0.001) Docter (0.999) Inside Out leFauve (0.001) Docter (0.999) Finding Dory Stanton (1.000) Finding Dory Stanton (1.000) Minions Coffin (0.921) Renaud (0.079) Minions Coffin (0.921) Renaud (0.079) Minions Coffin (0.921) Renaud (0.079) Rio Jones (0.015) Saldanha (0.985) Rio Jones (0.015) Saldanha (0.985) 9

  10. item-level ranking holistic ranking feedback errors evaluation Implication of a validation S 2 S 3 S 4 Data It Data Item Item S 1 S 1 S 2 S 2 S 3 S 3 S 4 S 4 Zootopia Zootopia Howard Howard Spencer Spencer Spencer Spencer Kung Fu Panda Kung Fu Panda Stevenson Stevenson Nelson Nelson Nelson Inside Out Inside Out leFauve leFauve Docter Docter leFauve Docter Finding Dory Finding Dory Stanton Stanton Stanton Minions Minions Coffin Coffin Renaud Renaud Renaud Rio Rio Jones Jones Saldanha Saldanha Saldanha 10

  11. item-level ranking holistic ranking feedback errors evaluation Implication of a validation Data Item S 1 S 2 S 3 S 4 S 4 Zootopia Howard Spencer Spencer Spencer Kung Fu Panda Stevenson Nelson Inside Out leFauve Docter Finding Dory Stanton Minions Coffin Renaud Rio Jones Saldanha 11

  12. item-level ranking holistic ranking feedback errors evaluation Ideal utility function truth function ? truth function fusion model data average correctness Utility Function of true claims 12

  13. item-level ranking holistic ranking feedback errors evaluation Practical utility function over correctness of all claims entropies of all data items Entropy Utility Function 13

  14. item-level ranking holistic ranking feedback errors evaluation Maximum Expected Utility (MEU) § Value of perfect information entropy utility if claim is true Best alternative in the absence of ground truth 14

  15. item-level ranking holistic ranking feedback errors evaluation Approximate-MEU • Key idea: Propagation of changes correctness of correctness of claims accuracies of claims of validated of unvalidated data sources data item items no need to fuse for every claim! removed bottleneck iterative computation of MEU 15

  16. item-level ranking holistic ranking feedback errors evaluation Users can be wrong o Honest but unsure user 80% certain about a claim o Error-rate of user user is correct 85% of the time o Conflicting feedback from a crowd of workers Claim1 Claim2 Claim3 6/10 3/10 1/10 16

  17. item-level ranking holistic ranking feedback errors evaluation Real-world datasets Books 1 FlightsDay 2 Population 3 Flights 2 Items 1263 5836 40696 121567 Sources 894 38 2545 38 Claims 24303 80452 46734 1931701 Feedback Simulation o Books: silver standard provided in [4] o Flight information: data provided by carrier websites considered ground truth o Population: manually identified the true claim for data items having multiple claims 1. X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: The role of source dependence. PVLDB, 2009 2. X. Li, X. L. Dong, K. Lyons, W. Meng, and D. Srivastava. Truth finding on the deep web: Is the problem solved? PVLDB, 2012 3. J. Pasternack and D. Roth. Knowing what to believe(when you already know something). COLING, 2010 4. http://lunadong.com/fusionDataSets.htm 17

  18. item-level ranking holistic ranking feedback errors evaluation Competing methods o It Item-level r ranking m methods § QBC / US o De Decision-theoretic r ranking m methods § MEU / Approx-MEU § Greedy Upper Bound (GUB) ground-truth-utility-based o Ra Random § all data items equally beneficial 18

  19. item-level ranking holistic ranking feedback errors evaluation Large number of sources, few claims: holistic ranking ∆ distance_to_ground_truth (%) 0 Random QBC US ApproxMEU -20 MEU GUB -40 -60 -80 0 20 40 60 80 100 data items validated (%) Books 19

  20. item-level ranking holistic ranking feedback errors evaluation Large number of sources, few claims: holistic ranking ∆ distance_to_ground_truth (%) 0 -3 -6 QBC -9 US ApproxMEU -12 MEU 0 20 40 60 80 100 120 # data items validated Population 20

  21. item-level ranking holistic ranking feedback errors evaluation Large number of claims, few sources: either QBC/holistic ∆ distance_to_ground_truth (%) 0 Random QBC US -20 ApproxMEU MEU GUB -40 -60 0 20 40 60 80 100 data items validated (%) FlightsDay 21

  22. item-level ranking holistic ranking feedback errors evaluation Large number of claims, few sources: either QBC/holistic ∆ distance_to_ground_truth (%) 0 -2 -4 -6 QBC US ApproxMEU 10 0 0.5 1 1.5 2 data items validated (%) Flights 22

  23. Contributions o Integrating user feedback to improve the performance of existing data fusion systems o Designed strategies to generate an effective ordering for validating claims o scalable decision-theoretic solution for iterative fusion o explored imperfect feedback scenarios o Evaluation on real-world datasets confirmed that guided feedback rapidly increases the effectiveness of data fusion 23

Recommend


More recommend