cms data transfer tests towards lhc data taking cms data
play

CMS Data Transfer tests towards LHC data taking CMS Data Transfer - PowerPoint PPT Presentation

CMS Data Transfer tests towards LHC data taking CMS Data Transfer tests towards LHC data taking D Bonacorsi CMS Facilities Infrastructure Operations INFN CNAF Bologna Italy On behalf of the CMS experiment


  1. CMS Data Transfer tests towards LHC data taking CMS Data Transfer tests towards LHC data taking D � Bonacorsi � CMS Facilities � Infrastructure Operations ��� INFN � CNAF � Bologna � Italy � On behalf of the CMS experiment *Thanks* to all site people and operators.

  2. Main CMS work � ows � simpli � ed � Prompt reco reco Prompt Data Transfer Data Transfer Skims, re-reco re-reco Skims, Calibration Calibration Data Transfer Data Transfer Data Transfer Data Transfer MC prod upload Skims download MC prod upload Skims download In this talk: focus on distributed transfer only D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  3. PhEDEx Physics Experiment Data Export reliable � scalable data replication system for HEP experiments CMS is exercising the data placement system since ���� PhEDEx fully interfaced also with gLite FTS since yrs CMS data transfers in a nutshell: � T �� ����� T � � T �� ����� T �� � s � �� T � s � �� T �� � s � �� s � �� T � T �� � s s in current PhEDEx transfer topology ���� of CMS Institutes Data transfers operated as if experiment was already running we have had � service outages exceeding �� hrs in the last � yrs Current transfers at ��� ��� GB � s GB � s � � ��� TB � day ��� TB � day � � � PB � month � PB � month global average rate average CMS � les size likely to be �� GB � � � �� k � les � day �� k � les � day D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  4. Evolution of a LoadTest � from early ���� on � A � exible infrastructure to generate data transfer tra � c among CMS Tiers Start moving � fake � but � real � No real physics � les; fully PhEDEx � compliant � though ����� activity ����� activity since mid � February ���� full cycles ��� weeks each � before Jun �� � then extended into DDT � CSA �� preparation activities � T � � T �� tape �� T � � T �� T � � T � � regional �� T � � T � � non � regional � Jan 2007 Jan 2007 mid-March 2007 mid-March 2007 D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  5. Start walking Jan 2007 mid-March 2007 Jan 2007 mid-March 2007 Cycle-1 Cycle-2 Jan 2007 mid-April 2007 Jan 2007 mid-April 2007 D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  6. Walk better and faster Jan 2007 mid-April 2007 Jan 2007 mid-April 2007 Sep 2006 Today Sep 2006 Today D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  7. CMS CSA06 : CMS LoadTest LoadTest 2007 2007 CMS ~1 PB in ~1 month to participating Tiers >12 PB in ~6 months among all PhEDEx Tiers joining the LoadTest D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  8. [ courtesy of L.Tuura, CHEP07 ] 2007/Q12 D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  9. Data transfers: grandview Status � as of mid ������ : With � yr to LHC start � up � CMS approaches the real transfers in scale � but not yet the full complexity From reliable transfers over the full transfer mesh � to multi � VO exercises… The progress is evident � though � Main sources of this are: A well � designed � robust � scalable transfer system A remarkable manpower investment to commission the transfer system Continuing e � orts are needed on debugging data transfers debugging data transfers more D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �

  10. [ courtesy of L.Tuura ] Screenshot as of Sep 2007 D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

  11. Quality of transfers Clear improvement in Tiers participation to test transfers since LT started Still not evident improvement in quality � though It � s not one problem that lacks solution � there is a wide span of them Greenish quality plots � successful transfers with fewer retrials � only when storage at both ends � network � PhEDEx set � up � site con � g � operators work � ne… simultaneously! 30 20 # Tiers making some successful transfers # Tiers making transfers at >50% quality Jan06 Jun07 LoadTest LoadTest CSA06 CSA06 Positive note: now stably at a ~challenge traffic load Need to: Keep sites exercised + Debug and improve quality more D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

  12. DDT Debugging Data Transfers A CMS program to maintain a high � quality transfer network to be handed over to CMS Data Operations Started in July 07 Debug � commission transfer links transfer links among CMS Tiers a Task Force � DDT � TF � is in charge since July ���� Joined e � ort with CMS Facilities � Commissioning � T � liasons � PhEDEx � Central Ops � FTS � SRM experts � network experts � site admins � … Troubleshooting � by Tiers � work by milestones focus on watching logs � ping site admins � � x problems commission T � � T �� T � downlinks to T �� T � uplinks to T �� … Gain confidence Infrastructural issues � by Facilities � Network projects Overall activity � DDT � TF � work on deliverables E � g � a real � time status map � with reasons � of all Tier � X � Tier � Y links E � g � a number of documented � success stories � in troubleshooting D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

  13. DDT The � rst DDT metric First step was to de � ne and implement a metric by which links can become � commissioned � and subsequently handed over to Data Operations There are several stages through which a link can get commissioned: � � NOT � TESTED: links never actually tested NOT � TESTED i � e � showing no successful transfer attempts within PhEDEx � � PENDING � COMMISSIONING: links that have transferred successfully at PENDING � COMMISSIONING least � � les in PhEDEx � but have not yet passed the reqs below � � COMMISSIONED: links that are demonstrated to work � * �� and can be COMMISSIONED delivered to Data Ops � Transfer ��� GB � day for � out of � consecutive days � and transfer a total of ��� TB during that same �� day period � For links involving an endpoint at a T �� this req is relaxed to � out of � days � and a total transfer volume of ��� TB � to match service business hrs support at T �� s � � � PROBLEM � RATE: links that were working but whose rate has dropped o � PROBLEM � RATE � To remain COMMISSIONED � a link must transfer at least ��� GB � day for a single day at least once every � days � Otherwise � the link must be re � commissioned by following the procedure above � � * � i � e � this does not imply that the link or the site has met the reqs of the CMS Computing Model � but simply that the link has passed some passed minimum reqs to be considered usable for Data Operations � namely: production � quality activity � not tests � D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

  14. DDT The DDT status in late ���� Steady in � ux of new links… number of COMMISSIONED links that were in danger of decommissioning within the next two days Legenda: T1 � T1 ROW � COLUMN: upper half of box T1 � T2 COLUMN � ROW: lower half of box States: [ … plus many more … ] D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

  15. DDT Going beyond The requirements � thresholds in this metric: � were developed with the idea of having a higher threshold to commission than to decommission the link � can be increased in time as networks and sites develop the rates implied by ��� GB � day are of the order of ��� MB � s per link � far below the commitments envisioned in the computing model T �� s being able to download a total of � up to � TB � day from T � sites � or over �� MB � s sustained downloads � � match the idea that transfers would be at continuous rate over several days � The Computing Model actually envisions that transfers will occur in bursts the metric used during CSA �� deviated from this model to prove the stability of data transfer links It worthed a metric revision later � But � before � see what happened with this one metric! D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

  16. Before DDT started (1 month, May 2007) The plots show the fraction successes/attempts in file transfers. A clear improvement in the number and quality of data transfer links is seen soon after DDT started (Jul 07). After DDT started (1 month, Oct 2007, during CSA07) D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

  17. In the meantime…. CCRC’08 /phase-1 (WLCG C ommon-VO C omputing R eadiness C hallenge) D � Bonacorsi Bonacorsi ISGC Symposium � Taipei � ���� April ���� �� ��

Recommend


More recommend