A Preference-Based Bandit Framework for Personalized Recommendation
Maryam Tavakol and Ulf Brefeld
Paderborn, Nov 8, 2016
Introduction
• Personalized Recommendation
• Preference Learning
• Multi-armed bandits
Recommendation
[illustrative figures]
Preference Model
• Item i: {Shirt, Blue, Women, Cheap}
• Item k: {Polo shirt, White, Women, Expensive}
• Item i ≻ Item k: {Shirt-Polo shirt, Blue-White, Women-Women, Cheap-Expensive}
• Pairwise feature: $z_{i \succ k} := z_i - z_k$
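A minimal sketch (not part of the slides) of how the pairwise feature $z_{i \succ k}$ could be built: items are encoded as binary attribute vectors, and subtracting the less preferred item's vector from the preferred one's gives the pairwise representation. The attribute vocabulary and the one-hot encoding below are illustrative assumptions.

```python
import numpy as np

# Hypothetical attribute vocabulary; the actual feature encoding is an assumption.
VOCAB = ["Shirt", "Polo shirt", "Blue", "White", "Women", "Cheap", "Expensive"]

def encode(attributes):
    """Encode an item's attribute set as a binary vector z over the vocabulary."""
    z = np.zeros(len(VOCAB))
    for a in attributes:
        z[VOCAB.index(a)] = 1.0
    return z

# Item i is preferred over item k: the pairwise feature is the difference z_i - z_k.
z_i = encode(["Shirt", "Blue", "Women", "Cheap"])
z_k = encode(["Polo shirt", "White", "Women", "Expensive"])
z_pref = z_i - z_k
```

Note that shared attributes (here, Women) cancel out, so the pairwise feature only carries the attributes that distinguish the two items.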
Payoff Model
• Personalized model + average component
• One personalized model per user (User 1, User 2, …, User m) plus a shared average component
• $\mathbb{E}[r_{t, i \succ k} \mid u_t = u_j] = \beta_t^\top z_{i \succ k} + \theta^\top z_{i \succ k}$
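The expected payoff on a preference pair combines a per-user (personalized) component and a shared (average) component. A hedged sketch, where beta_user stands for the parameter vector of the active user and theta for the average component; the variable names and example values are assumptions.

```python
import numpy as np

def expected_payoff(z_pref, beta_user, theta):
    """Personalized component plus shared average component for a preference pair."""
    return beta_user @ z_pref + theta @ z_pref

# Illustrative usage: one weight vector per user plus one shared vector theta.
d = 7
rng = np.random.default_rng(0)
beta = {j: rng.normal(size=d) for j in range(3)}   # per-user parameters
theta = rng.normal(size=d)                          # average component
r_hat = expected_payoff(rng.normal(size=d), beta[1], theta)
```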
Personalized Recommendation with Qualitative Bandit
• For t = 1, …, T:
  1. The world generates some context
  2. The learner chooses an action
  3. The world reacts with a reward
• Choose the arm with the highest mean reward + confidence interval (a generalization of LinUCB)
Unified Optimization
• Solving the objective function in dual space
• With an arbitrary loss function
• Using the Fenchel-Legendre conjugate
Squared Loss

$$\max_{\alpha} \; -\frac{1}{2}\,\alpha^\top \Big[\, Z Z^\top + \frac{1}{\mu} \sum_j \big(\varphi_j \otimes \varphi_j^\top\big) \odot Z Z^\top \Big]\,\alpha \;-\; \frac{1}{2C}\,\alpha^\top \alpha \;+\; r^\top \alpha$$

• The problem reduces to a standard quadratic optimization
• Model parameters $(\theta, \beta_j)$ are obtained from $\alpha$
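Because the loss is squared, the dual objective above is an unconstrained concave quadratic in $\alpha$, so setting the gradient to zero gives the closed form $\alpha = (K + I/C)^{-1} r$ with $K = ZZ^\top + \frac{1}{\mu}\sum_j (\varphi_j \varphi_j^\top) \odot ZZ^\top$. The sketch below makes assumptions: Z holds the pairwise features row-wise, r the observed rewards, user_ids the user index of each row, $\varphi_j$ is the 0/1 membership indicator of user j, and the recovery of $\theta$ and $\beta_j$ from $\alpha$ follows the usual kernel-ridge stationarity conditions rather than the authors' exact derivation.

```python
import numpy as np

def solve_dual_squared_loss(Z, r, user_ids, C=1.0, mu=1.0):
    """Solve the quadratic dual in closed form for the squared loss.

    Maximizing -0.5 a^T K a - (1/2C) a^T a + r^T a gives a = (K + I/C)^{-1} r,
    with K = Z Z^T + (1/mu) * sum_j (phi_j phi_j^T) o Z Z^T (o = element-wise product).
    """
    n = Z.shape[0]
    G = Z @ Z.T
    K = G.copy()
    for j in np.unique(user_ids):
        phi = (user_ids == j).astype(float)          # 0/1 indicator of user j's rows
        K += (1.0 / mu) * np.outer(phi, phi) * G     # per-user mask of the Gram matrix
    alpha = np.linalg.solve(K + np.eye(n) / C, r)

    # Recover primal parameters from alpha (assumed form, by analogy with kernel ridge).
    theta = Z.T @ alpha
    beta = {j: (1.0 / mu) * Z[user_ids == j].T @ alpha[user_ids == j]
            for j in np.unique(user_ids)}
    return alpha, theta, beta
```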
Squared Loss
• In the contextual bandit framework:
• Mean: $\beta_t^\top z_{i \succ k} + \theta^\top z_{i \succ k}$
• Confidence bound: $c \sqrt{z_{i \succ k}^\top \big(Z^\top Z + \lambda I\big)^{-1} z_{i \succ k}}$
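A small sketch of the resulting UCB-style score for a candidate preference pair, with lam as the ridge parameter and c as the exploration constant (both parameter names are assumptions).

```python
import numpy as np

def ucb_score(z_pref, beta_user, theta, Z, lam=1.0, c=1.0):
    """Mean payoff plus an exploration bonus, following the confidence bound on the slide."""
    mean = beta_user @ z_pref + theta @ z_pref
    A_inv = np.linalg.inv(Z.T @ Z + lam * np.eye(Z.shape[1]))
    bonus = c * np.sqrt(z_pref @ A_inv @ z_pref)
    return mean + bonus
```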
Algorithm
[algorithm pseudocode figure]
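The algorithm figure on this slide did not survive extraction. The following is a hedged reconstruction of the round-by-round loop implied by the previous slides, not the authors' exact pseudocode; it reuses ucb_score and solve_dual_squared_loss from the sketches above, and candidate_pairs and observe_reward are hypothetical callbacks supplied by the environment.

```python
import numpy as np

def preference_bandit(T, candidate_pairs, observe_reward, d, C=1.0, mu=1.0, lam=1.0, c=1.0):
    """Hedged reconstruction of the learning loop: score candidate preference pairs
    with mean + confidence, recommend the best one, then refit the dual model."""
    Z_hist, r_hist = np.zeros((0, d)), np.zeros(0)
    u_hist = np.zeros(0, dtype=int)
    theta, beta = np.zeros(d), {}
    for t in range(T):
        user, pairs = candidate_pairs(t)                # context: user id and pairwise features
        beta_u = beta.get(user, np.zeros(d))
        Z_reg = Z_hist if len(Z_hist) else np.zeros((1, d))
        scores = [ucb_score(z, beta_u, theta, Z_reg, lam, c) for z in pairs]
        best = int(np.argmax(scores))                   # arm with highest mean + confidence
        r = observe_reward(t, best)                     # the world reacts with a reward
        Z_hist = np.vstack([Z_hist, pairs[best]])
        r_hist = np.append(r_hist, r)
        u_hist = np.append(u_hist, user)
        _, theta, beta = solve_dual_squared_loss(Z_hist, r_hist, u_hist, C, mu)
    return theta, beta
```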
Summary
• Personalized recommendation
• Pairwise learning in a bandit framework
• Optimization in dual space
• Learning algorithm for squared loss
Thanks for your attention
Questions? Email: tavakol@leuphana.de