comparing repositories visually with repograms
play

Comparing Repositories Visually with RepoGrams http://repograms.net - PowerPoint PPT Presentation

Comparing Repositories Visually with RepoGrams http://repograms.net Daniel Rozenberg, Ivan Beschastnikh, Fabian Kosmale, Valerie Poser, Heiko Becker, Marc Palyart, Gail C. Murphy University of British Columbia Saarland University Big (SE)


  1. Comparing Repositories Visually with RepoGrams http://repograms.net Daniel Rozenberg, Ivan Beschastnikh, Fabian Kosmale, Valerie Poser, Heiko Becker, Marc Palyart, Gail C. Murphy University of British Columbia Saarland University

  2. Big (SE) data • Huge opportunity for • Millions of projects researchers • Open APIs • Each open source • Meticulously tracked project is a potential and archived activity evaluation target! 2

  3. How many projects do paper authors use in their evaluation? • Experiment: selected 114 papers from ICSE, FSE, ASE, MSR, ESEM (years 2012-2014) • Recorded number of targets that the authors claim to evaluate 3

  4. How many projects do paper authors use in their evaluation? Number of papers Number of evaluation targets 4

  5. How many projects do paper authors use in their evaluation? Finding: 75% of papers use Number of papers 8 or fewer evaluation targets Number of evaluation targets 5

  6. Existing tools focus on supporting scalable analysis Number of papers Focus of existing tools/ methods: proper sampling, infrastructure.. • Number of evaluation targets 6

  7. Existing tools focus on supporting scalable analysis Number of papers RepoGrams Focus of existing tools/ methods: proper sampling, infrastructure.. Number of evaluation targets 7

  8. RepoGrams: Qualitative repository analysis Presents data in a way that can be observed but not measured 8

  9. RepoGrams: Qualitative repository analysis Presents data in a way that can be observed but not measured • Goal is not to provide an answer, but to surface relevant information • Help the user think critically/contrast relevant features of a (small number of) projects • Support curation of a small number of project ( 8) ≤ Visualization: a natural fit for qualitative analysis & nuance 9

  10. Core abstraction in RepoGrams: Repository “footprint” Block : commit Color : commit metric value Project : A B C Length : commit size Time 10

  11. Demo: the basics Constant commit Commit author metric: block width one unique color per author 11

  12. Demo: comparing two metrics Branches used metric: one unique color per branch; master is always red 12

  13. Demo: we can represent many things with a footprint Commit age metric: elapsed time between commit and its parent 13

  14. Demo: block width can denote magnitude of change Block width: linear in the LOC changed in commit 14

  15. Demo: multiple projects • wren has more commits than any other projects • wren, faker, pronto , use master initially • All projects eventually use a diversity of branches 15

  16. Demo: multiple projects • wren and PHPMailer have much larger commits • PHPMailer has huge commits in the purple and yellow branches 16

  17. Evaluation questions RQ1: Can SE researchers use RepoGrams to understand and compare characteristics of a project’s source repository? RQ2: Will SE researchers consider using RepoGrams to select evaluation targets for experiments and case studies? RQ3: How much effort is required to add metrics to RepoGrams? 17

  18. Methodology RQ1: Can SE researchers use RepoGrams to understand and compare characteristics of a project’s • 14 authors from MSR’14 source repository? • Tasks using RepoGrams • Semi-struct. interviews RQ2: Will SE researchers consider using RepoGrams to select evaluation targets for experiments and case studies? RQ3: How much effort is required to • 2 developers add metrics to RepoGrams? • Each implemented 3 metrics 18

  19. Evaluation highlights RQ1: Can SE researchers use ✦ Successfully used RepoGrams to understand and RepoGrams for complex compare characteristics of a project’s tasks source repository? ✦ Tools is of immediate use RQ2: Will SE researchers consider using RepoGrams to select evaluation ✦ Researchers want custom targets for experiments and case metrics studies? ✦ Setup: 1.5 hours RQ3: How much effort is required to ✦ Metric: avg/max = 40/52 min add metrics to RepoGrams? ✦ < 40 LOC total 19

  20. Related work • Helping researchers with the selection process • Tools/Datasets: GHTorrent, Boa, MetricMiner • Methods: “Diversity in software engineering research”, FSE13 • Visualization • Tools: CVSgrab, ConcernLines, Fractal Figures, Chronos, RelVis, Chronia, Evolution radar 20

  21. ✴ Lots of data, many potential evaluation targets! ✴ But, proper project selection is complex ✴ Researcher must be highly aware of the features of the project that may influence the study results ✦ RepoGrams: supports qualitative analysis of software repositories ✦ Presents data in a way that can be observed but not measured Try our public deployment! http://repograms.net 21

Recommend


More recommend