Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt

Aug 15, 2023

Repeatable, Reproducible, or Useful? Amer Diwan and Robert Hundt Google
Repeatable
● I conduct the experiment twice using the same setup and get the same results
● Why should we care?
  – If even I don't get consistent results from my experiment, then my experiment is doomed!
● Challenge: inter-run variation
  – Page mappings, interference with other jobs, ...

What can we do?
● Repeat experiments as many times as needed to obtain tight confidence intervals
  – T-test, …
● Report/record results with confidence intervals

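A minimal sketch of "repeat until the confidence interval is tight", assuming a 95% two-sided interval based on Student's t. The names `confidence_interval`, `repeat_until_tight`, `run_once`, and the 2% default threshold are hypothetical and not from the slides; for more than 10 degrees of freedom the sketch falls back to the normal quantile, which slightly understates the interval.

```python
import math
import statistics
from typing import Callable, List, Tuple

# Two-sided 95% critical values of Student's t for 1..10 degrees of freedom.
T_95 = [12.706, 4.303, 3.182, 2.776, 2.571, 2.447, 2.365, 2.306, 2.262, 2.228]


def t_critical(df: int) -> float:
    """95% two-sided critical value; normal approximation beyond the table."""
    if df < 1:
        raise ValueError("need at least 1 degree of freedom")
    if df <= len(T_95):
        return T_95[df - 1]
    # Assumption: for larger df the normal quantile is close enough.
    return statistics.NormalDist().inv_cdf(0.975)


def confidence_interval(samples: List[float]) -> Tuple[float, float]:
    """Return (mean, half_width) of the 95% confidence interval."""
    n = len(samples)
    if n < 2:
        raise ValueError("need at least 2 samples")
    mean = statistics.fmean(samples)
    sem = statistics.stdev(samples) / math.sqrt(n)  # standard error of the mean
    return mean, t_critical(n - 1) * sem


def repeat_until_tight(
    run_once: Callable[[], float],
    rel_half_width: float = 0.02,
    min_runs: int = 3,
    max_runs: int = 30,
) -> Tuple[float, float, int]:
    """Rerun the experiment until the half-width is within rel_half_width of the mean."""
    if min_runs < 2 or max_runs < min_runs:
        raise ValueError("need 2 <= min_runs <= max_runs")
    samples: List[float] = []
    while len(samples) < max_runs:
        samples.append(run_once())
        if len(samples) >= min_runs:
            mean, half = confidence_interval(samples)
            if half <= rel_half_width * abs(mean):
                break
    mean, half = confidence_interval(samples)
    return mean, half, len(samples)
```

Here `run_once` would wrap one full run of the experiment and return the measured metric (for example, wall-clock seconds). The returned half-width is what gets reported alongside the mean.
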
Reproducible
● My friend and I conduct the same experiment using the “same” setup and get the same results
● Why should we care?
  – If others cannot reproduce our experiments then are they actually correct?
● Challenge: bias

Biases hiding under every rock...
● The setting of irrelevant environment variables can lead to contradictory conclusions

What can we do?
● Account and control for all sources of bias
  – … yeah, right!
● Account and control for all known sources of bias
  – Try to interactively discover sources of bias by repeatedly submitting to the archive

Sources of bias
● Anything that affects memory layout
  – Environment variables, link order, heap size (Java), …
● Benchmarks
  – What exactly does the benchmark test?
● Software and hardware components (e.g., microprocessors)
● etc.
● If we control for all sources of bias, we should get reproducible results

Useful
● Real users should get results consistent with our experiments
● Why should we care?
  – If our results only apply to lab settings, then they are irrelevant!
● Challenge: “Controlling” bias is not a solution

The problem with controlling bias
● Repeating an experiment with the “same” bias gives reproducible but not useful results
  – e.g., every time anyone asks my wife, she predicts the same winner for the election; this is repeatable but always has the same bias!
● Need randomized trials

Randomized trials
● Randomly pick values for variables that cause bias
● Run an experiment
● Repeat
● Use statistical methods to summarize the trials

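The steps above can be sketched as follows. This is an illustration under stated assumptions, not the authors' tooling: `randomized_trials`, `summarize`, the bias variables `env_padding_bytes` and `link_order`, and the stand-in `toy_experiment` are all hypothetical names chosen for the example.

```python
import random
import statistics
from typing import Any, Callable, Dict, List, Optional, Sequence, Tuple

Setup = Dict[str, Any]


def randomized_trials(
    run_experiment: Callable[[Setup], float],
    bias_space: Dict[str, Sequence[Any]],
    trials: int = 20,
    seed: Optional[int] = None,
) -> List[Tuple[Setup, float]]:
    """Run the experiment `trials` times, each with a randomly drawn setup."""
    rng = random.Random(seed)  # a recorded seed lets the draw itself be replayed
    results = []
    for _ in range(trials):
        # Randomly pick a value for every known source of bias.
        setup = {name: rng.choice(values) for name, values in bias_space.items()}
        results.append((setup, run_experiment(setup)))
    return results


def summarize(results: List[Tuple[Setup, float]]) -> Tuple[float, float]:
    """Return (mean, sample standard deviation) over all trials."""
    values = [value for _, value in results]
    return statistics.fmean(values), statistics.stdev(values)


def toy_experiment(setup: Setup) -> float:
    """Hypothetical stand-in: a 'speedup' that shifts with the setup."""
    layout_effect = 0.01 * ((setup["env_padding_bytes"] // 64) % 3 - 1)
    link_effect = 0.005 if setup["link_order"] == "cba" else 0.0
    return 1.05 + layout_effect + link_effect


if __name__ == "__main__":
    space = {
        "env_padding_bytes": range(0, 4096, 64),  # size of the environment
        "link_order": ["abc", "cba"],             # order of object files
    }
    mean, spread = summarize(randomized_trials(toy_experiment, space, seed=1))
    print(f"speedup: mean={mean:.3f}, stdev={spread:.3f}")
```

In a real study `run_experiment` would launch the benchmark with the drawn setup applied. Recording each drawn setup next to its result keeps the trials auditable.
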
The vision for an archival system
● Self-contained script for running experiment
● Repeatable: repeat every experiment multiple times and use t-test
● Reproducible: control for known sources of bias
  – Sources of bias (benchmarks, environment variables...)
● Useful: randomized trials for known sources of bias
