cray sv1 supercluster resiliency
play

CRAY SV1 SuperCluster Resiliency Mike Wolf I/O development SGI - PowerPoint PPT Presentation

CRAY SV1 SuperCluster Resiliency Mike Wolf I/O development SGI 41st Cray User Group Conference Minneapolis, Minnesota Resiliency Goals Maintain cluster operations after a panic Ring Resiliency Auto-Recovery Failover SuperCluster


  1. CRAY SV1 SuperCluster Resiliency Mike Wolf I/O development SGI 41st Cray User Group Conference Minneapolis, Minnesota

  2. Resiliency Goals Maintain cluster operations after a panic ¥ Ring Resiliency ¥ Auto-Recovery ¥ Failover

  3. SuperCluster Resiliency Ring Resiliency ¥ Operating System resets client chip ¥ Check xxx commands resetting client chip ¥ Proxy locking ¥ Dring monitor

  4. SuperCluster Resiliency Auto-Recovery ¥ Foundation / Monitoring ¥ User exits in check xxx commands ¥ Recovery ¥ Notification

  5. SuperCluster Resiliency Failover ¥ NFS ¥ UDB ¥ DCE/DFS ¥BDS

  6. Resiliency Example 1 SV1 SuperCluster Basic Building Block Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

  7. Resiliency Example 1 Mainframe 1 panics Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

  8. Resiliency Example 1 Mainframe 2 has packet backup Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

  9. Resiliency Example 1 Mainframe 2 hangs Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

  10. Resiliency Example 1 Mainframes 3 and 4 hang Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

  11. Resiliency Example 2 Mainframe 1 panics Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

  12. Resiliency Example 2 SWS stabilizes ring Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

  13. Resiliency Example 2 Mainframe 1 is back in service Mainframe 1 MPN Mainframe 2 MPN FCN MPN Mainframe 3 MPN Mainframe 4 GigaRing Ethernet SWS

Recommend


More recommend