Improving Reproducible Deep Learning Workflows with DeepDIVA
M. Alberti 1*, V. Pondenkandath 1*, L. Vögtlin 1, M. Würsch 1,2, R. Ingold 1, M. Liwicki 1,3
*Equal contribution
1 DIVA Group, University of Fribourg, Switzerland
2 IIT, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Switzerland
3 EISLAB Machine Learning, Luleå University of Technology, Sweden
Reproducibility Crisis: Trust or Verify?
Joelle Pineau, "Reproducible, Reusable, and Robust Reinforcement Learning", invited talk @ NeurIPS 2018, Montreal, Canada
Why Is This a Problem?
- No possibility to verify
- No possibility to extend
- Lots of overhead created
- Leads to no trust in scientific results
How To Make Steps Forward?
- Ensure reproducibility
  - Of your own experiments
  - Of other people's experiments
- Promote open-source code
- Make it easy to have "good enough" code
- Enable code trustworthiness
How We Contribute: DeepDIVA
- Open-source Python framework built on top of PyTorch
- Makes your life easier for reproducing your own and other people's experiments
- Provides boilerplate code for:
  - Common deep learning scenarios
  - Handling time-consuming everyday problems
- Documentation & tutorials available
Reproducing Your Own Experiments
- Short-term, or work in progress
- Long-term, or finished work
Short-term Reproducibility Dangers
- Kilometres of poor or incomplete log files
- Stochasticity in the process
How DeepDIVA Ensures Short-term Reproducibility
- Meaningful logging
  - Saving all run parameters and command-line arguments
  - Providing concise coloured logs
- Deterministic runs
  - Seeding the pseudo-random number generators: Python, NumPy and PyTorch
  - Disabling CuDNN (NVIDIA Deep Neural Network library) when necessary
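The seeding idea above can be sketched in plain Python. The `seed_everything` helper below is a hypothetical illustration, not DeepDIVA's actual API; it seeds only the standard-library generator, with the NumPy/PyTorch equivalents noted in comments for environments where those libraries are installed.

```python
import random

def seed_everything(seed):
    """Seed the pseudo-random number generators for a deterministic run.

    Hypothetical helper for illustration, not DeepDIVA's actual API.
    """
    random.seed(seed)  # Python's built-in generator
    # With NumPy installed:    numpy.random.seed(seed)
    # With PyTorch installed:  torch.manual_seed(seed)
    # To make CuDNN deterministic (at a speed cost):
    #   torch.backends.cudnn.deterministic = True
    #   torch.backends.cudnn.benchmark = False

# Two runs with the same seed draw identical random sequences.
seed_everything(42)
first = [random.random() for _ in range(3)]
seed_everything(42)
second = [random.random() for _ in range(3)]
assert first == second
```

Re-seeding every generator at the start of a run is what makes two executions of the same experiment bit-for-bit comparable.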
Long-term Reproducibility Dangers
- Poor (or non-existent!) use of version control
- Bad programming habits that die hard
- Silent data modifications
How DeepDIVA Ensures Long-term Reproducibility
- Git status
  - Linking every run to a specific commit in Git
  - Allowing this feature to be disabled for development purposes
- Code copy
  - Copying the entire running code into the output folder
- Data integrity management
  - Footprint of the data in a JSON file using SHA-1 hashes
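The data-footprint idea can be sketched with the standard library alone. The function below is a hypothetical illustration of the concept, not DeepDIVA's exact format: it maps every file under a dataset root to its SHA-1 hash and serializes the result as JSON.

```python
import hashlib
import json
from pathlib import Path

def dataset_footprint(root):
    """Return a JSON footprint mapping each file under `root` to its SHA-1 hash.

    Hypothetical sketch of the data-integrity idea, not DeepDIVA's exact format.
    """
    hashes = {}
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            digest = hashlib.sha1(path.read_bytes()).hexdigest()
            hashes[str(path.relative_to(root))] = digest
    return json.dumps(hashes, indent=2, sort_keys=True)
```

Comparing the footprint stored with a run against a freshly computed one reveals silent data modifications before they invalidate a comparison.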
Reproducing Other People’s Experiments
- Given a paper, try to replicate the results and observations
Reproducing Other People’s Experiments
In order to reproduce an experiment, one needs:
- The Git repository URL
- The Git commit identifier (full SHA)
- The list of command-line arguments used
- The data
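The four ingredients listed above can be bundled into a single machine-readable record. The sketch below is hypothetical (the field names and `run_manifest` helper are illustrative, not DeepDIVA's format), but it shows how little information is actually needed to hand a run to someone else.

```python
import json
import sys

def run_manifest(repo_url, commit_sha, args):
    """Bundle the ingredients needed to reproduce a run into one JSON record.

    Hypothetical sketch; field names are illustrative, not DeepDIVA's format.
    """
    return json.dumps({
        "repository": repo_url,   # Git repository URL
        "commit": commit_sha,     # full SHA identifying the exact code version
        "arguments": args,        # the command line used for the run
    }, indent=2)

# Example: record the current process's own arguments.
# The repository URL and commit SHA below are placeholders.
manifest = run_manifest(
    "https://example.com/user/repo.git",
    "0" * 40,
    sys.argv[1:],
)
```

The data itself is referenced rather than embedded; its SHA-1 footprint ties the manifest to the exact files used.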
Productivity Out-of-the-box
Making your life easier: do not reinvent the wheel!
“One Click Away” Deep Learning Scenarios
Prepare Your Data
"When the data is ready, the task is solved."
- Download a dataset with a click
  - Natural images, medical images, historical documents, …
- Split your dataset
  - Train, validation and test splits
- Analyse the data
  - Mean/std and class distributions
- Ensure data integrity
  - Compare the footprints
Real-time Visualizations
- Confusion matrix
- Weight histograms
- TensorBoard (from TensorFlow)
- Feature visualization
- Performance evaluation
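The data behind the first of these visualizations is simple to compute. The pure-Python sketch below (an illustration, not DeepDIVA's implementation) counts (true class, predicted class) pairs into the grid that a confusion-matrix plot renders.

```python
def confusion_matrix(targets, predictions, num_classes):
    """Count (true class, predicted class) pairs into a square grid.

    Minimal pure-Python sketch of the data behind a confusion-matrix plot;
    row index = true class, column index = predicted class.
    """
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(targets, predictions):
        matrix[t][p] += 1
    return matrix
```

Off-diagonal cells reveal which classes the model confuses; watching them evolve during training is the point of plotting the matrix in real time.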
Automatic Hyper-Parameter Optimization
- Let machine learning find the best values
- No expensive grid or random search
Be A Part Of It
Getting Started With DeepDIVA
How To Use It
- No setup time
  - From source on Ubuntu (or other flavours of Linux)
  - Docker image coming soon
- Documentation
  - Online and in the code
- Tutorials
  - Learn new features efficiently
- Fork it
  - Extensible and modular for easy modifications
Make Your Experiment Reproducible
bit.ly/DeepDIVA