Categorical Feature Compression via Submodular Optimization
Mohammad Hossein Bateni, Lin Chen, Hossein Esfandiari, Thomas Fu, Vahab Mirrokni, and Afshin Rostamizadeh
Pacific Ballroom #142
Why Vocabulary Compression?
● The embedding layer is huge: a Video ID feature can take ~7 billion values.
● The embedding layer can make up 99.9% of the neural net.
How to Compress Vocabulary?
● Group similar feature values into one, e.g., U.S. and Canada → U.S./Canada; China, Japan, and Korea → Chn/Jpn/Kor (sketched below).
● Good compression preserves most of the information about the labels (supervised).
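To make the grouping concrete, here is a minimal sketch (not from the poster); the country-to-region map below is hypothetical and only mirrors the slide's example.

    # Vocabulary compression is a many-to-one map f over feature values.
    # The grouping below is illustrative, mirroring the slide's example.
    group = {
        "U.S.": "U.S./Canada", "Canada": "U.S./Canada",
        "China": "Chn/Jpn/Kor", "Japan": "Chn/Jpn/Kor", "Korea": "Chn/Jpn/Kor",
    }

    raw_values = ["Canada", "Japan", "U.S.", "Korea"]
    compressed = [group[v] for v in raw_values]
    print(compressed)  # ['U.S./Canada', 'Chn/Jpn/Kor', 'U.S./Canada', 'Chn/Jpn/Kor']

    # The vocabulary shrinks from 5 raw values to 2 compressed values,
    # so the embedding table shrinks by the same factor.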
Problem Formulation

Maximize I(f(X); C) subject to f(X) taking at most m values, where
● X is the feature (a random variable), e.g., X ∈ {Afghanistan, Albania, …, Zimbabwe}
● C is the label (a random variable), e.g., C ∈ {pear, apple, …, mango} (favorite fruit)
● f(X) is the compressed feature, e.g., f(X) ∈ {China/Japan/Korea, Brazil/Argentina, …, U.S./Canada}

Example:
User ID   Feature   Compressed feature   Favorite fruit (label)
#1843     China     China/Japan/Korea    …
#429      Japan     China/Japan/Korea    …
…         …         …                    …
#9077     Brazil    Brazil/Argentina     …
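The objective can be evaluated directly from empirical counts. The sketch below is not from the paper; the function names and toy counts are illustrative. It computes I(f(X); C) for a candidate compression map f; the cardinality constraint says the image of f has at most m values.

    import math
    from collections import defaultdict

    def mutual_information(joint_counts):
        """I(A; B) in nats, from a dict {(a, b): count}."""
        total = sum(joint_counts.values())
        pa, pb = defaultdict(float), defaultdict(float)
        for (a, b), c in joint_counts.items():
            pa[a] += c / total
            pb[b] += c / total
        return sum((c / total) * math.log((c / total) / (pa[a] * pb[b]))
                   for (a, b), c in joint_counts.items())

    def compressed_objective(xc_counts, f):
        """I(f(X); C): pool the counts of feature values that f maps together."""
        pooled = defaultdict(float)
        for (x, c), cnt in xc_counts.items():
            pooled[(f[x], c)] += cnt
        return mutual_information(pooled)

    # Toy (country, fruit) counts; f groups countries into regions.
    counts = {("China", "pear"): 40, ("Japan", "pear"): 35,
              ("Brazil", "mango"): 50, ("Argentina", "mango"): 45}
    f = {"China": "China/Japan", "Japan": "China/Japan",
         "Brazil": "Brazil/Argentina", "Argentina": "Brazil/Argentina"}
    print(compressed_objective(counts, f))  # equals I(X; C) here: no label info lost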
Our Results
● There is a quasi-linear (O(n log n)) algorithm that achieves 63% of OPT for max I(f(X); C) s.t. f(X) takes at most m values, when the label is binary (a generic greedy sketch follows below).
  ● Key idea: design a new submodular function after reparametrization.
● There is a log(n)-round distributed algorithm that achieves 63% of OPT with O(n/k) space per machine, where k is the number of machines.
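The 63% factor is the standard 1 - 1/e guarantee for greedily maximizing a monotone submodular set function under a cardinality constraint. The sketch below is that generic greedy, shown only to illustrate where the constant comes from; it is not the poster's O(n log n) algorithm, which exploits the reparametrized structure to reach quasi-linear time.

    def greedy_submodular(ground_set, f, m):
        """Pick at most m elements by largest marginal gain of set function f.
        For monotone submodular f this attains >= (1 - 1/e) * OPT."""
        selected = set()
        for _ in range(m):
            best, best_gain = None, 0.0
            for e in ground_set - selected:
                gain = f(selected | {e}) - f(selected)
                if gain > best_gain:
                    best, best_gain = e, gain
            if best is None:  # no element has positive marginal gain
                break
            selected.add(best)
        return selected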
Reparametrization for Submodularity
● Sort feature values x according to P(X=x | C=0).
● Compression becomes a problem of placing separators in this sorted order.
● I(f(X); C) is a function of the set of separators (see the sketch below).
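A minimal sketch of this reparametrization for a binary label (names, helpers, and toy counts are illustrative, not from the paper): once the feature values are sorted as on the slide, a compression into m buckets is just a set of m - 1 separator positions, and I(f(X); C) can be evaluated from the per-bucket label counts.

    import math

    def entropy(ps):
        return -sum(p * math.log(p) for p in ps if p > 0)

    def mi_of_separators(sorted_counts, separators):
        """sorted_counts: list of (n0, n1) label counts per feature value,
        already sorted by the slide's criterion. separators: cut indices."""
        n = len(sorted_counts)
        cuts = [0] + sorted(separators) + [n]
        total0 = sum(c0 for c0, _ in sorted_counts)
        total1 = sum(c1 for _, c1 in sorted_counts)
        total = total0 + total1
        h_c = entropy([total0 / total, total1 / total])   # H(C)
        h_c_given_f = 0.0                                  # H(C | f(X))
        for lo, hi in zip(cuts, cuts[1:]):
            b0 = sum(c0 for c0, _ in sorted_counts[lo:hi])
            b1 = sum(c1 for _, c1 in sorted_counts[lo:hi])
            b = b0 + b1
            if b == 0:
                continue
            h_c_given_f += (b / total) * entropy([b0 / b, b1 / b])
        return h_c - h_c_given_f                           # I(f(X); C)

    # 5 sorted feature values; separators at 2 and 4 give m = 3 buckets.
    vals = [(9, 1), (8, 2), (5, 5), (2, 8), (1, 9)]
    print(mi_of_separators(vals, {2, 4}))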
Experiment Results
Pacific Ballroom #142. See you this evening!