for dune fd
play

for DUNE FD 28 th October 2019 Mousam Rai Supervisor: Prof John - PowerPoint PPT Presentation

Track/Shower Discrimination Refresh in Pandora for DUNE FD 28 th October 2019 Mousam Rai Supervisor: Prof John Marshall 1 Roadmap for this presentation The Problem MicroBooNE Variables Current Approach In Pandora BDT1, BDT2, and


  1. Track/Shower Discrimination Refresh in Pandora for DUNE FD 28 th October 2019 Mousam Rai Supervisor: Prof John Marshall 1

  2. Roadmap for this presentation • The Problem • MicroBooNE Variables • Current Approach In Pandora • BDT1, BDT2, and BDT3 • The End. 2

  3. The Problem Track vs Shower discrimination 3

  4. W View U View V View Argon nucleus fragments Track-like from deep Shower-like ( 𝜈 − ) inelastic ( γ ) scattering Track-like ( 𝑞 ) γ , p, 𝜌 − , Ar- Multiple fragments tracks merged as merged into one one 4

  5. MicroBooNE Variables What are they? 5

  6. Types of MicroBooNE Variables • 8 Topological Variables: • 2 Calorimetric Variables: • length • charge1 • diff • charge2 • gap • rms • vertexDistance • diffAngle • pca1 • pca2 6

  7. length – 3D length of the PFO length (cm) diff – Mean difference between the position of the hits and a straight line, divided by the straight line length 7 diff (N/A)

  8. gap – Average max gap distance, divided by straight line length gap (N/A) rms – Average root mean square of linear sliding fit, divided by straight line length 8 rms (N/A)

  9. vertexDistance – Distance between the PFO vertex and the primary vertex vertexDistance (cm) diffAngle – Difference between the opening and closing angles calculated over 50% of the pfo closest and furthest from the vertex. 9 diffAngle (rad)

  10. pca1 – Ratio between the second largest and the largest PCA eigenvalue pca1 (N/A) pca2 – Ratio between the third largest and the largest PCA eigenvalue 10 pca2 (N/A)

  11. charge1 – Ratio between sigmaCharge (( 𝑑ℎ𝑏𝑠𝑕𝑓 − 𝑛𝑓𝑏𝑜𝐷ℎ𝑏𝑠𝑕𝑓 2 ሻ and the mean charge in collection plane. ሻ charge1 (N/A) charge2 – Ratio of charge in the last 10% of the PFO and the mean charge in the collection plane 11 charge2 (N/A)

  12. Current Approach In Pandora Cut flow approach to track/shower characterisation 12

  13. Cut-flow approach • Does some basic cuts based on topological variables and then decide if a PFO is track-like or shower-like • For anyone interested (PfoCharacterisationBaseAlgorithm.cc) 13

  14. BDT1, BDT2, and BDT3 14

  15. BDT setup BDT1 BDT2 BDT3 • Only uBooNE variables • Same as BDT1 • Same as BDT2 • Sensible fiducial cuts (20cm in x and y • Completeness >= 0.8 && Purity >= 0.8 • uBooNE variables + hierarchy direction, and 200 cm in z direction) variables (will talk about it later as • Completeness >= 0.1 && Purity >= 0.5 • After cuts -> 52,000 signals (track-like) well) and 23,000 backgrounds (shower-like) • 50% numu and 50% nue MCC11 samples • Veto mischaracterised pfo (will talk about it in the next slide) • After cuts -> 137,000 signals (track- like) and 52,000 backgrounds (shower-like) • kBDT, Ntrees = 800, MinNodeSize=5%, MaxDepth=3, BoostType=AdaBoost, AdaBoostBeta=0.5, BaggedSampleFraction=0.6, SeparationType=GiniIndex, nCuts=20 15

  16. mischaracterisedPfo Motivation Description • For a given PFO, work out what MCParticle each hit maps to • Workout showerProbability where, • 𝑡ℎ𝑝𝑥𝑓𝑠𝑄𝑠𝑝𝑐𝑏𝑐𝑗𝑚𝑗𝑢𝑧 = 𝑢𝑝𝑢𝑏𝑚 𝑡ℎ𝑝𝑥𝑓𝑠−𝑚𝑗𝑙𝑓 ℎ𝑗𝑢𝑡 𝑗𝑜 𝑏𝑚𝑚 𝑤𝑗𝑓𝑥 𝑢𝑝𝑢𝑏𝑚 ℎ𝑗𝑢𝑡 𝑗𝑜 𝑏𝑚𝑚 𝑤𝑗𝑓𝑥 • Flag a PFO as mischaracterised if showerProbability >= 0.5 but called as track-like or vice versa • Use this flag to improve BDT training • Essentially, don’t train on wrong topology NCDIS_P_P_P_PIZERO 16

  17. Examples of vetoed and remaining PFOs Remaining Showers Mischaracterised Tracks Remaining Tracks Mischaracterised Showers 17

  18. BDT value distribution for BDT1 18 Tracks BDT value

  19. BDT value distribution for BDT2 19 Tracks BDT value

  20. Hierarchy Variables What are they? 20

  21. Hierarchy variables • Indicates hadronic activities • 3 new variables • nAllDaughter • nHits3DDaughterTotal • daughterParentNhitsRatio • Aim is to push protons and pions towards the track-like region to improve track/shower separation 21

  22. nAllDaughter – total number of all downstream daughter pfos nAllDaughter nHits3DDaughterTotal – total number of 3D hits in all downstream daughter pfos 22 nHits3DDaughterTotal

  23. daughterParentNhitsRatio – 3D hits ratio between all daughterParentNHitsRatio downstream daughter pfos and parent pfo. 23 daughterParentNHitsRatio

  24. BDT value distribution for BDT3 24 Tracks BDT value

  25. Comparing DifferentBDTs 100 95 CorrectID 90 85 80 75 70 PIPLUS - CCDIS_MU_PIPLUS - PIMINUS - NCDIS_P_PIMINUS - E L E C T R O N - CCQ EL_E_P - (3319) PHOTON - NCRES_PIZERO - (1184) PROTON - CCRES_E_P_PIPLUS- MUON - CCQEL_MU_P - (3386) (2047) (988) (1956) Particle - Interaction -(#events) Current Pandora BDT1 BDT2 BDT3 25

  26. Validation Output - Electron Current Pandora BDT3 CCQEL_E CCQEL_E -ELECTRON: nEvents: 3419, correctId 92.478% -ELECTRON: nEvents: 3419, correctId 92.205% CCQEL_E_P CCQEL_E_P -ELECTRON: nEvents: 3319, correctId 91.1111% -ELECTRON: nEvents: 3319, correctId 92.9938% CCRES_E_P_PIPLUS CCRES_E_P_PIPLUS -ELECTRON: nEvents: 1956, correctId 89.6534% -ELECTRON: nEvents: 1956, correctId 91.0189% CCRES_E_P_PIZERO CCRES_E_P_PIZERO -ELECTRON: nEvents: 822, correctId 92.2094% -ELECTRON: nEvents: 822, correctId 92.848% 26

  27. Validation Output - Photon Current Pandora BDT3 CCRES_E_P_PIZERO CCRES_E_P_PIZERO -PHOTON1: nEvents: 822, correctId 77.4576Z -PHOTON1: nEvents: 822, correctId 80.5085% CCDIS_MU_P_PIZERO CCDIS_MU_P_PIZERO -PHOTON1: nEvents: 981, correctId 87.6457% -PHOTON1: nEvents: 981, correctId 90.9091% NCRES_PIZERO NCRES_PIZERO -PHOTON1: nEvents: 1184, correctId 75.5474% -PHOTON1: nEvents: 1184, correctId 71.2591% NCRES_P_PIZERO NCRES_P_PIZERO -PHOTON1: nEvents: 834, correctId 79.3829% -PHOTON1: nEvents: 834, correctId 81.9074% 27

  28. Validation Output - Muon Current Pandora BDT3 CCQEL_MU CCQEL_MU -MUON: nEvents: 3161, correctId 99.6145% -MUON: nEvents: 3161, correctId 99.8394% CCRES_MU_P CCQEL_MU_P -MUON: nEvents: 3386, correctId 99.3714% -MUON: nEvents: 3386, correctId 99.8503% CCRES_MU_P_PIPLUS CCRES_MU_P_PIPLUS -MUON: nEvents: 1875, correctId 98.75% -MUON: nEvents: 1875, correctId 99.1848% CCDIS_MU_PIPLUS CCDIS_MU_PIPLUS -MUON: nEvents: 2047, correctId 97.6744% -MUON: nEvents: 2047, correctId 98.8878% 28

  29. Validation Output - Proton Current Pandora BDT3 CCQEL_MU_P CCQEL_MU_P -PROTON1: nEvents: 3386, correctId 94.0557% -PROTON1: nEvents: 3386, correctId 98.7307% CCQEL_E_P CCQEL_E_P -PROTON1: nEvents: 3319, correctId 94.2895% -PROTON1: nEvents: 3319, correctId 98.4903% CCRES_E_P_PIPLUS CCRES_E_P_PIPLUS -PROTON1: nEvents: 1956, correctId 95.2663% -PROTON1: nEvents: 1956, correctId 98.4615% CCRES_MU_P_PIPLUS CCRES_MU_P_PIPLUS -PROTON1: nEvents: 1875, correctId 95.3071% -PROTON1: nEvents: 1875, correctId 98.5516% 29

  30. Validation Output - PiPlus Current Pandora BDT3 CCRES_MU_PIPLUS CCRES_MU_PIPLUS -PIPLUS: nEvents: 1867, correctId 83.4203% -PIPLUS: nEvents: 1867, correctId 92.6377% CCRES_E_P_PIPLUS CCRES_E_P_PIPLUS -PIPLUS: nEvents: 1956, correctId 78.7336% -PIPLUS: nEvents: 1956, correctId 91.3979% CCDIS_MU_PIPLUS CCDIS_MU_PIPLUS -PIPLUS: nEvents: 2047, correctId 86.2016% -PIPLUS: nEvents: 2047, correctId 91.6796% CCDIS_MU_P_PIPLUS CCDIS_MU_P_PIPLUS -PIPLUS: nEvents: 825, correctId 85.3247% -PIPLUS: nEvents: 825, correctId 90.6494% 30

  31. Validation Output - PiMinus Current Pandora BDT3 NCDIS_P_PIMINUS NCDIS_P_PIMINUS -PIMINUS: nEvents: 988, correctId 89.8361% -PIMINUS: nEvents: 988, correctId 90.7104% NCRES_P_PIMINUS NCRES_P_PIMINUS -PIMINUS: nEvents: 574, correctId 88.417% -PIMINUS: nEvents: 574, correctId 95.1737% 31

  32. What’s next? • Replicate these results inside Pandora using SKLearn • Do more tests • Release • Deep Learning approaches 32

  33. The End Thank you. Any questions? 33

  34. Backup Slides 34

  35. Small Note On Completeness VS Purity 35

  36. The GIF below cycles from Completeness && Purity >= 0.0 to 1.0. 36

  37. The GIF below cycles from Purity >= 0.5 && Completeness >= 0.0 to 1.0. 37

  38. The GIF below cycles from Completeness >= 0.5 && Purity >= 0.0 to 1.0. 38

  39. Correlation Matrix for 13 variables in BDT3 39

  40. How removing nHits3DDaughterTotal affects Track/Shower ID BDT3 w/o nHit3DDaughterTotal BDT3 40

  41. nHits3DDaughterTotal for Track-like particles 41

Recommend


More recommend