CS 4803 / 7643: Deep Learning
Zsolt Kira Georgia Tech
Topics:
– (Continue) Low-label ML Formulations
CS 4803 / 7643: Deep Learning Topics: (Continue) Low-label ML - - PowerPoint PPT Presentation
CS 4803 / 7643: Deep Learning Topics: (Continue) Low-label ML Formulations Zsolt Kira Georgia Tech Administrative Projects! Poster details out on piazza Note: No late days for anything project related! Also note: Keep
– (Continue) Low-label ML Formulations
– Poster details out on piazza – Note: No late days for anything project related! – Also note: Keep track of your GCP usage and costs! Set limits on spending
(C) Dhruv Batra & Zsolt Kira 31
– Assume: We have larger labeled dataset for a different set of categories (base classes)
– N-way k-shot test – k: Number of examples in support set – N: Number of “confusers” that we have to choose target class among
(C) Dhruv Batra & Zsolt Kira 32
Target Query Set
– Train classifier on base classes – Freeze features – Learn classifier weights for new classes using few amounts
(C) Dhruv Batra & Zsolt Kira 33
A Closer Look at Few-shot Classification, Wei-Yu Chen, Yen-Cheng Liu, Zsolt Kira, Yu-Chiang Frank Wang, Jia-Bin Huang
(C) Dhruv Batra & Zsolt Kira 34
– Can optionally pre-train features on held-out base classes (not typical)
(C) Dhruv Batra & Zsolt Kira 35
(C) Dhruv Batra & Zsolt Kira 36
– Take inspiration from a known learning algorithm
MAML (Finn et al. 2017)
– Derive it from a black box neural network
(C) Dhruv Batra & Zsolt Kira 37
Slide Credit: Hugo Larochelle
– Take inspiration from a known learning algorithm
MAML (Finn et al. 2017)
– Derive it from a black box neural network
(C) Dhruv Batra & Zsolt Kira 38
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 39
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 40
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 41
Slide Credit: Hugo Larochelle
– Parameter initialization and update rules
(C) Dhruv Batra & Zsolt Kira 42
(C) Dhruv Batra & Zsolt Kira 43
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 44
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 45
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 46
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 47
(C) Dhruv Batra & Zsolt Kira 48
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 49
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 50
(C) Dhruv Batra & Zsolt Kira 51
Slide Credit: Sergey Levine
(C) Dhruv Batra & Zsolt Kira 52
Slide Credit: Sergey Levine
(C) Dhruv Batra & Zsolt Kira 53
Slide Credit: Sergey Levine
– Take inspiration from a known learning algorithm
MAML (Finn et al. 2017)
– Derive it from a black box neural network
(C) Dhruv Batra & Zsolt Kira 54
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 55
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 56
Slide Credit: Hugo Larochelle
(C) Dhruv Batra & Zsolt Kira 57
Slide Credit: Hugo Larochelle A Closer Look at Few-shot Classification, Wei-Yu Chen, Yen-Cheng Liu, Zsolt Kira, Yu-Chiang Frank Wang, Jia-Bin Huang
(C) Dhruv Batra & Zsolt Kira 58
(C) Dhruv Batra & Zsolt Kira 59
(C) Dhruv Batra & Zsolt Kira 60
(C) Dhruv Batra & Zsolt Kira 61
(C) Dhruv Batra & Zsolt Kira 62
(C) Dhruv Batra & Zsolt Kira 63
– varying number of classes / examples per class (meta- training vs. meta-testing) ? – semantic differences between meta-training vs. meta-testing classes ? – overlap in meta-training vs. meta-testing classes (see recent “low-shot” literature) ?
– how should this impact how we generate episodes ? – meta-active learning ? (few successes so far)
Slide Credit: Hugo Larochelle