Weight Agnostic Neural Networks

Jul 07, 2023

Adam Gaier 1,2, David Ha 1
1 Google Brain, 2 Inria / CNRS / Université de Lorraine / Bonn-Rhein-Sieg University of Applied Sciences
Innate abilities in animals
Architecture is a Powerful Prior
Deep Image Prior
● A ConvNet with randomly initialized weights can still perform many image processing tasks
● Without learning, the network structure alone is a strong enough prior
Ulyanov, D., Vedaldi, A., & Lempitsky, V. (2018). Deep image prior. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Innate abilities in machines Ulyanov, D., Vedaldi, A., & Lempitsky, V. (2018). Deep image prior. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
To what extent can neural net architectures alone encode solutions to tasks?
Neural Architecture Search
Searching for trainable networks
● Architectures, once trained, outperform hand-designed networks
● Expensive: the network must be trained to judge its performance
● Solution is still encoded in the weights of the network, not in the architecture
Searching for Architectures Elsken, T., Metzen, J. H., & Hutter, F. (2018). Neural architecture search: A survey. arXiv preprint arXiv:1808.05377
How can we search for architectures... not weights?
Search without Training
Assume weights are drawn from a particular distribution
● Search for an architecture that performs given weights from this distribution
Replace inner loop training with sampling
● Draw new weights from the distribution at each rollout
● Judge network on zero-shot performance
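A minimal sketch of the idea on this slide, under stated assumptions: the toy task (target |x|), the two candidate architectures `abs_net` and `linear_net`, and the uniform weight distribution are all hypothetical and not taken from the talk. Each candidate has a single weight for brevity. The inner training loop is replaced by drawing a fresh weight at each rollout, and candidates are ranked by mean zero-shot score.

```python
import random
import statistics

XS = [-2.0, -1.0, 1.0, 2.0]  # toy inputs (assumption); the target is |x|

def abs_net(x, w):
    """Hypothetical candidate whose structure already encodes |.|."""
    return abs(w * x)

def linear_net(x, w):
    """Hypothetical candidate with no useful structural bias for this task."""
    return w * x

def rollout(net, w):
    """Zero-shot score for one weight draw: negative mean squared error."""
    return -statistics.mean([(net(x, w) - abs(x)) ** 2 for x in XS])

def zero_shot_score(net, n_rollouts=16, seed=0):
    """Mean score over rollouts; a new weight is sampled each time, no training."""
    rng = random.Random(seed)
    scores = [rollout(net, rng.uniform(-2.0, 2.0)) for _ in range(n_rollouts)]
    return statistics.mean(scores)

def search(candidates):
    """Pick the candidate name with the best mean zero-shot score."""
    return max(candidates, key=lambda name: zero_shot_score(candidates[name]))

best = search({"abs": abs_net, "linear": linear_net})
```

With the same seed both candidates see the same weight draws, so the comparison reflects the architecture rather than the luck of the sample.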
Weight Sharing
Single shared weight value used for all connections
● Weight value selected from distribution at each rollout
● Reduces number of parameters of network to 1
  ○ Reliable expected reward of topology
Architecture search
● Explore space of network topologies
● Judge network architecture based on performance over a series of rollouts
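Weight sharing can be sketched as follows. This is an illustration only: the topology encoding, the activation set, the sample data, and the particular series of weight values are assumptions, not the authors' implementation. Every connection uses the same weight, so a topology is scored by sweeping that single value over a series of rollouts.

```python
import math

# Hypothetical encoding: each entry is (activation, [indices of source nodes]).
# Nodes 0 and 1 are the inputs; appended nodes are numbered from 2; the last is the output.
TOPOLOGY = [
    ("tanh", [0, 1]),  # node 2: hidden node fed by both inputs
    ("abs", [2, 0]),   # node 3: output node fed by node 2 and input 0
]
ACTIVATIONS = {"tanh": math.tanh, "abs": abs, "linear": lambda v: v}

def forward(topology, inputs, shared_weight):
    """Feed-forward pass in which every connection uses the same weight."""
    values = list(inputs)
    for activation, sources in topology:
        pre = sum(shared_weight * values[i] for i in sources)
        values.append(ACTIVATIONS[activation](pre))
    return values[-1]

def evaluate_topology(topology, samples,
                      weight_values=(-2.0, -1.0, -0.5, 0.5, 1.0, 2.0)):
    """Return (mean reward, best reward) over one rollout per shared weight value."""
    rewards = []
    for w in weight_values:
        err = sum((forward(topology, x, w) - y) ** 2 for x, y in samples)
        rewards.append(-err / len(samples))
    return sum(rewards) / len(rewards), max(rewards)

mean_reward, best_reward = evaluate_topology(TOPOLOGY, [([1.0, 0.0], 1.0)])
```

Tracking both the mean and the best reward separates topologies that work across many weight values from those that depend on one lucky value.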
Topology Search
WANNs find solutions in variety of RL tasks
WANNs perform with and without training
ANN Bipedal Walker (2760 connections, weights)
WANN Bipedal Walker (44 connections, 1 weight)
Can we find WANNs outside of reinforcement learning domains?
Searching for Building Blocks
First steps toward a different kind of architecture search
● Network architectures with innate biases can perform a variety of tasks
● ...and these biases can be found through search
Weight tolerance as a heuristic for new building blocks
● ConvNets and LSTMs can work even untrained
● Finding novel building blocks is at least as important as new arrangements of those which already exist
Interactive article: weightagnostic.github.io
Poster: Wednesday 10:45
