Progressive Interaction for Autonomous Entity Matching Ben - PowerPoint PPT Presentation

Progressive Interaction for Autonomous Entity Matching Ben McCamish, Arash Termehchy Oregon State University I nformation & D ata Manag e ment and A nalytics Laboratory (IDEA)

User interacts with local data source DBMS A DBMS B Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results • User interacts with DBMS A by using some query interface ‣ They express their intents, what they are looking for • Then the results are presented to the user

DBMS A not able to satisfy query Store DBMS A DBMS B selling Soda Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results • User queries its local data source, DBMS A • DBMS A does not have the desired information • Must find the desired information in external data source, DBMS B

DBMS A cannot query Store DBMS A DBMS B selling Soda ? Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results • DBMS A needs to submit queries to DBMS B • DBMS B schema and representation of entities is di ff erent • DBMS A does not know schema or representation ‣ Cannot properly formulate queries

DBMS A queries DBMS B Store DBMS A DBMS B selling Soda Mapping Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results • Traditionally a mapping between two DBMSs • However this is costly ‣ Needs to be updated when the schema changes, manually ‣ Manually develop this mapping, takes time

What if DBMS A learns through interactions? “Soda” Store DBMS A DBMS B selling Soda Keyword Query Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results • DBMS A wants to find similar entities in other DBMS, sends some query • There is often a common query language ‣ Keyword Queries • Other DBMSs understand this, but results are not very e ff ective

Results are returned “Soda” Store DBMS A DBMS B selling Soda Keyword Query Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results Results Soda Hamburger 7/11 • Results are returned to the user • User gives some feedback on the results ‣ This is not what the user is looking for

Results are returned “Soda” Store DBMS A DBMS B selling Soda Keyword Query Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results Results Soda Pop Kroger • Results are returned to the user • User gives some feedback on the results ‣ This is the answer the user wanted

Utilize the feedback and learn Store DBMS A DBMS B selling Soda Keyword Query Products Sellers Queries ID Name ID Name Store 1 Soda 3 Hamburger 7/11 2 Beef 4 Pop Kroger … … … … … Results Results • Can build the mapping over time through interaction and feedback • Our Goal: Learn this mapping between DBMS A and DBMS B • Method: Establish a common language or means of communication between the two DBMSs

Our Framework Mapping Local External Query • Local and External DBMS Results • Communicate via keyword Feedback queries and results Offline User Training Data Feedback

Intents Products ID Name 1 Soda 2 Beef Mapping Local External Query Local DBMS Intents Intent # Intent e1 1 Soda e2 2 Beef Results • Local DBMS has intents Feedback • Defined by the user Offline User • Doesn’t require user Training Data Feedback however

Mapping Queries DBMS A Queries Query # Query s1 1 soda Mapping Local External Query s2 2 beef s3 soda s4 beef Strategy Results s1 s2 s3 s4 e1 0.5 0.1 0.4 0 Feedback e2 0 0.4 0.3 0.3 • Sends keyword queries Offline User Training Data Feedback • Called Mapping Queries

Returned Results Mapping Local External Query Sellers ID Name Store 3 Hamburger 7/11 Results 4 Pop Kroger … … … • External DBMS returns some Feedback results Offline User • External DBMS can also learn Training Data Feedback Results Soda Pop Kroger Local Intent External Result

Feedback • Feedback on whether the Mapping Local External Query returned results are correct • Can come from user, but doesn’t have to Results • Can use a model built on Feedback previous user feedback Offline User Training Data Feedback

Local DBMS Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.5 0.1 0.4 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • Local DBMS has a strategy to send queries for intents • External DBMS may also have a strategy

Local DBMS Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.5 0.1 0.4 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • Suppose local DBMS has the intent e1

Local DBMS Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.5 0.1 0.4 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • Consults strategy to see what mapping query to send • Sends s3 with 0.4 probability

Local DBMS Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.5 0.1 0.4 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • When results are returned and feedback given, strategy is updated • Uses reinforcement learning method

Reinforcement Learning • Select a query based on past success, i.e., exploitation • Explore and try new/less successful queries to gain new knowledge, i.e., exploration ‣ Sacrifice immediate success for more success in the long run

Reinforcing Local Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.5 0.1 0.4 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • The probabilities of queries allow for exploration and exploitation

Reinforcing Local Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.5 0.1 0.4 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • Suppose the feedback given for this query was positive • Then the strategy is reinforced as such

Reinforcing Local Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.5 0.1 0.45 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • Increase probability for mapping query sent

Reinforcing Local Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.45 0.09 0.45 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • Implicitly decreases probability for others

Reinforcing Local Strategy Local DBMS Intents Intent # Intent External DBMS Strategy e1 1 Soda s1 s2 s3 s4 Sellers e2 2 Beef ID Name Store Mapping Queries e1 0.45 0.09 0.45 0 3 Hamburger 7/11 Query # Query e2 0 0.4 0.3 0.3 4 Pop Kroger s1 1 soda … … … Products s2 2 beef ID Name s3 soda 1 Soda s4 beef 2 Beef • External DBMS may also learn, but we don’t focus on that here • In both cases when the external DBMS learns and doesn’t learn, it will converge, based on our previous results

Our experiments • Use two databases, each containing information on products ‣ One is an Amazon database and the other a Google database • Approximately 1400 tuples in the Amazon and 3200 tuples in the Google dataset • We have the ground truth, which is used as simulated user feedback • Single tuples are used as intents and they have single match • The receiver does not learn • Cache simulated user feedback

Results for learning every time

Progressive Interaction for Autonomous Entity Matching Ben - PowerPoint PPT Presentation

Progressive Interaction for Autonomous Entity Matching Ben McCamish, Arash Termehchy Oregon State University I nformation & D ata Manag e ment and A nalytics Laboratory (IDEA) User interacts with local data source DBMS A DBMS B Products

7.5 Bipartite Matching Matching Matching. Input: undirected graph G = (V, E). M E

Entity Matching for Semistructured Data in the Cloud Marcus Paradies IBM F2CE Workshop December

PROGRESSIVE DELIVERY PROGRESSIVE DELIVERY CONTINUOUS DELIVERY THE RIGHT WAY CONTINUOUS DELIVERY

Mobile Cross-Platform Development from a Progressive Perspective Web App Progressive Web App N

Matching of Matrix Elements and Parton Showers CKKW matching in e + e collisions Lecture 2:

Global Shape Matching Section 3.3: Articulated Matching using Graph Cuts Global Shape Matching:

Matching Bipartite Matching Input Given a (undirected) graph G = ( V , E ) Input Given a bipartite

Design Challenges for Entity Linking Xiao Ling , Sameer Singh, Daniel S. Weld Entity Linking

the interaction The Interaction interaction models translations between user and system

the interaction physical characteristics of interaction interaction styles the

Adaptive Progressive Web Apps PWA Progressive Web Apps are just great websites that can behave

Progressive Energy Governance Welcome Introduction to IGov Progressive Energy Governance

Impedance Matching of 640 GHz SIS Mixer Impedance Matching of 640 GHz SIS Mixer of 640 GHz SIS

String Matching Inge Li Grtz CLRS 32 String Matching String matching problem: string

Outline Morning program Preliminaries Text matching I Text matching II Afternoon program

CSE182-L7 Dicitionary matching Pattern matching October 09 CSE182 Dictionary Matching

Intro to BeEF Chad Hollman Analyst, County of Sacramento Department of Technology What is BeEF?

Webinar Information 2020 Housing Inventory Count All participants Use the This webinar This

Compressed Sensing and Bayesian Experimental Design or Optimal Sensing and Reconstruction of N -

What about larger-scale representations? Challenges for traditional theories of schemas How to

The Panel Nicole Day Director of Programming, Communications, CADE nicole@cadefarms.org Chris

Working w ith more than one time series VISU AL IZIN G TIME SE R IE S DATA IN P YTH ON Thomas

OpenTag : Open Attribute Value Extraction From Product Profiles Guineng Zheng*, Subhabrata

GROUND CUTTING, DEVOURING AND DIGESTING THE LEGS OFF A BROWSER MICHELE ANTISNATCHOR