DeepType: On-Device Deep Learning for Input Personalization Service with Minimal Privacy Concern
Mengwei Xu (1), Feng Qian (2), Qiaozhu Mei (3), Kang Huang (4), Xuanzhe Liu (1), Yun Ma (1)
(1) Peking University, (2) University of Minnesota, (3) University of Michigan, (4) Kika Tech
Everyone types a lot every day
• Per day on earth: 2M Reddit posts, 5M tweets, 100B instant messages, and 200B emails
• A large portion of this typing happens on mobile devices, which makes:
  - Input method application (IMA): a killer app
  - Next-word prediction: a killer feature for productivity
DL-powered next-word prediction
• Next-word prediction techniques have evolved toward deep learning (DL):
  - dictionary lookup: cheap but inaccurate
  - traditional ML algorithms (e.g., n-gram): more accurate
  - deep learning (LSTM): the most accurate, but also more expensive for both training and prediction
LSTM model for next-word prediction
[Figure: pipeline for input "health", "is", "p", "r": vocabulary lookup (words/chars -> ids) -> embedding lookup (ids -> vectors) -> LSTM cells -> hidden states -> softmax -> predicted word: "priceless"]
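A minimal sketch of such an LSTM next-word predictor in TensorFlow/Keras (not the authors' actual architecture; vocabulary size, embedding and hidden dimensions are assumed for illustration):

```python
import tensorflow as tf

VOCAB_SIZE = 20_000   # assumed; matches the global vocabulary size cited later in the talk
EMBED_DIM = 128       # assumed
HIDDEN_DIM = 256      # assumed

# Pipeline from the figure: word/char ids -> embedding vectors -> LSTM hidden
# states -> softmax over the vocabulary for the next word.
model = tf.keras.Sequential([
    tf.keras.layers.Embedding(VOCAB_SIZE, EMBED_DIM),
    tf.keras.layers.LSTM(HIDDEN_DIM),
    tf.keras.layers.Dense(VOCAB_SIZE, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
# At serving time, the 3 highest-probability words become the candidates shown to the user.
```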
Personalizing prediction models
• Can we further improve the accuracy of DL models?
  - "Tomorrow I will go to the party" vs. "Tomorrow I will go to the class": different users complete the same prefix differently
• The models need to be personalized and adapt to diverse users
• Train one model for one user using his/her own data
On-cloud personalization is not a good idea
• Privacy concern: users' input data has to be uploaded to the cloud
• Scalability issue: GPUs are expensive; personalizing 1M users takes 36,000 GPU-hours. Too expensive!
Can we personalize (train) the DL model on mobile devices?
Challenges of on-device personalization
• Limited data volume: is it enough to make the model converge?
• Limited computational resources: can we train the model without compromising user experience?
Challenges of on-device personalization
• Limited data volume: is it enough to make the model converge?
  - Key idea 1: use public corpora to pre-train a global model before on-device personalization
• Limited computational resources: can we train the model without compromising user experience?
  - Key idea 2: compress, customize, and fine-tune the model
DeepType: on-device personalization
[Workflow] CLOUD: public corpora -> cloud training -> fresh global model; DEVICE: global model + private corpora -> offline on-device training -> personal model -> serve & online training
• Good privacy: input data never leaves the mobile device
• Good flexibility: the model can be updated anytime with small cost
Reducing on-device computations
1. SVD-based model compression (on cloud)
2. Vocabulary compression
3. Fine-tune training
4. Reusing inference results
[Figure: layer compression: the weights between layer L_i and L_{i+1} are factorized into two smaller matrices] (see the sketch below)
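A sketch of the low-rank factorization idea behind SVD-based compression (a generic illustration; the rank and matrix sizes are assumptions, not the paper's settings):

```python
import numpy as np

def svd_compress(W: np.ndarray, rank: int):
    """Factorize a dense weight matrix W (m x n) into two thin matrices
    W1 (m x rank) and W2 (rank x n) so that W1 @ W2 approximates W.
    Parameter count drops from m*n to rank*(m + n)."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    W1 = U[:, :rank] * S[:rank]   # (m, rank), singular values folded in
    W2 = Vt[:rank, :]             # (rank, n)
    return W1, W2

# Hypothetical example: compress a 256 x 20000 output projection to rank 64.
W = np.random.randn(256, 20_000).astype(np.float32)
W1, W2 = svd_compress(W, rank=64)
rel_error = np.linalg.norm(W - W1 @ W2) / np.linalg.norm(W)
```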
Reducing on-device computations
1. SVD-based model compression
2. Vocabulary compression (on device)
3. Fine-tune training
4. Reusing inference results
To cover 95% of word occurrences, the global vocabulary needs 20,000 words, while a personal vocabulary needs only 6,000 words (see the sketch below).
[Figure: vocabulary size used by 1M users within 6 months (Jul. 2017 to Dec. 2017); mean: 6214, median: 5911]
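One way to realize vocabulary compression is to slice the embedding and softmax layers down to the words a given user actually types. The helper below is hypothetical and uses assumed layer shapes:

```python
import numpy as np

def compress_vocabulary(embedding, softmax_w, softmax_b, user_word_ids):
    """Keep only the rows/columns that correspond to the user's ~6,000 words.
    embedding: (V_global, E); softmax_w: (H, V_global); softmax_b: (V_global,)."""
    keep = np.asarray(sorted(user_word_ids))
    user_embedding = embedding[keep]       # (V_user, E)
    user_softmax_w = softmax_w[:, keep]    # (H, V_user)
    user_softmax_b = softmax_b[keep]       # (V_user,)
    # Map global word ids to compact per-user ids for lookup at typing time.
    id_remap = {int(g): i for i, g in enumerate(keep)}
    return user_embedding, user_softmax_w, user_softmax_b, id_remap
```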
Reducing on-device computations
1. SVD-based model compression
2. Vocabulary compression
3. Fine-tune training (on device)
4. Reusing inference results
[Figure: forward and backward passes during fine-tune training; only part of the model is updated on the device] (see the sketch below)
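A common way to implement fine-tune training is to freeze the lower layers and update only the top of the network on the user's private corpus. Which layers DeepType actually updates is not spelled out on this slide, so the layer choice below is an assumption:

```python
import tensorflow as tf

def fine_tune_on_device(model: tf.keras.Model, private_dataset: tf.data.Dataset, epochs: int = 1):
    # Assumption: freeze everything except the final softmax layer,
    # so the backward pass only touches a small fraction of the weights.
    for layer in model.layers[:-1]:
        layer.trainable = False
    model.compile(optimizer=tf.keras.optimizers.SGD(learning_rate=1e-2),
                  loss="sparse_categorical_crossentropy")
    model.fit(private_dataset, epochs=epochs)
    return model
```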
Reducing on-device computations
1. SVD-based model compression
2. Vocabulary compression
3. Fine-tune training
4. Reusing inference results (on-device online training)
[Figure: the forward pass already computed to serve a prediction is reused for online training, so only the backward pass adds extra work] (see the sketch below)
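A sketch of how reusing inference results could look in TensorFlow: the forward pass that produces the on-screen candidates is recorded once and reused for the gradient step after the user commits a word. This is an interpretation of the slide, not the authors' code; true_next_id_fn is a hypothetical callback that returns the word the user actually typed:

```python
import tensorflow as tf

optimizer = tf.keras.optimizers.SGD(learning_rate=1e-2)

def predict_and_learn(model, context_ids, true_next_id_fn):
    x = tf.constant([context_ids])
    with tf.GradientTape() as tape:
        probs = model(x, training=True)              # forward pass, also used for serving
        top3 = tf.math.top_k(probs[0], k=3).indices  # candidates shown to the user
        y_true = tf.constant([true_next_id_fn()])    # the word the user actually typed
        loss = tf.reduce_mean(
            tf.keras.losses.sparse_categorical_crossentropy(y_true, probs))
    # Only the backward pass adds extra work; the forward activations are reused.
    grads = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return top3.numpy()
```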
Implementation and Evaluation
• Extension to TensorFlow
• Dataset: half-year input data from 1M real users of our collaborating company
  - IRB-approved, fully anonymized
  - Over 10 billion messages in English
• Metrics:
  - Input efficiency (accuracy): how many characters the user has to input to get the correct prediction
  - On-device overhead (latency & energy)
• Top-3-efficiency example (target word "will"):
  User input    User wants    Model output (top 3)
  "I"           "will"        ["am", "have", "don't"]
  "I", "w"      "will"        ["was", "would", "wish"]
  "I", "wi"     "will"        ["wish", "will", "with"]
  The correct word appears in the top-3 after 2 typed characters; top-3-efficiency normalizes the saved keystrokes by the length of the output word "will".
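A hedged sketch of how the top-3-efficiency metric could be computed per word; the exact normalization used in the paper is not fully recoverable from the slide, so the "saved keystrokes divided by word length" form below is an assumption:

```python
def top3_efficiency(target_word, predict_top3):
    """Return the fraction of the target word's characters the user did NOT
    have to type because the word appeared among the top-3 predictions.
    predict_top3(prefix) returns the model's top-3 candidates for that prefix."""
    for typed in range(len(target_word) + 1):
        if target_word in predict_top3(target_word[:typed]):
            return (len(target_word) - typed) / len(target_word)
    return 0.0

# Toy run of the slide's example: "will" enters the top-3 after the user types "wi".
fake_predictions = {"": ["am", "have", "don't"],
                    "w": ["was", "would", "wish"],
                    "wi": ["wish", "will", "with"]}
efficiency = top3_efficiency("will", lambda prefix: fake_predictions.get(prefix, []))
```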
DeepType improves model accuracy

  pre-train dataset            personalization (✓ = personal model, ✘ = global model)    top-3-efficiency
  Twitter corpora (DeepType)   ✓                                                         0.616
  Twitter corpora              ✘                                                         0.513
  Wikipedia corpora            ✓                                                         0.508
  Wikipedia corpora            ✘                                                         0.325
  private corpora              ✓                                                         0.624
  private corpora              ✘                                                         0.568
  no pre-train                 ✓                                                         0.331

  (Pre-training on private corpora is ideal but impractical: bad user privacy.)
DeepType reduces on-device overhead
• 91.6% reduction of training time
  - Less than 1.5 hours to personalize the model on a half-year input history
• 90.3% reduction of energy consumption
[Figures: training time on different Android devices; training energy with and without optimization]
DeepType reduces on-device overhead
• Offline training runs only when the device is in a "favored state":
  1. the device is idle
  2. the screen is turned off
  3. the device is being charged and has high remaining battery
• More than 50% of users spend around 2.7 hours per day in favored states -> enough for offline training!
DeepType reduces on-device overhead
• On-device online training typically takes only 20–60 ms
  - Unnoticeable to users
DeepType improves the user experience
• A field study: 34 voluntary subjects at Indiana University, 3 weeks
  - DeepType embedded into a commercial keyboard app
  - Procedure: recruit volunteers -> install apps -> collect traces -> answer questionnaire
• Quantitative analysis
  - Prediction: 25 ms, online training: 86 ms, both well below the 264 ms inter-keystroke interval
• Qualitative analysis (feedback)
  - 78% of users reported improved accuracy
  - 93.7% of users reported good responsiveness
  - 100% of users reported no battery impact
Summary
• On-cloud personalization vs. on-device personalization
  - Privacy and scalability matter
• DeepType: an on-device personalization framework
  - Cloud pre-training + on-device fine-tuning -> ensures both privacy and accuracy
  - Model compression and customization -> reduces computation overhead
Thank you for your attention!