Light-path Monitor System of TWAREN Optical Network National Center for High-Performance Computing Speaker: Ming-Chang Liang 1 1
2 2 INTRODUCTION
TWAREN phase 2 � TWAREN was adapted for more protection methods and better availability at the end of 2006, called TWAREN phase 2. � Tens of optical switches and hundreds of lightpaths were then served as the foundation of the layer 2 VLAN services and the layer 3 IP routing services. � In 2008, tens of VPLS switches were further incorporated to provide additional Multi-point VPLS VPN service. � The layer 1 lightpaths can be protected by SNCP, layer 2 VLAN by spanning tree recalculation and layer 2 VPLS by fast reroute technology. � All these improvements transform TWAREN phase 2 into a true hybrid network capable of providing multiple layers of services and high availability . 3 3
4 4 Architecture of Optical Network
Architecture of TWAREN phase 2 NCCU NIU NTU ASCC 6509 15454 6509 7609 7609 7609 NDHU 6509 15454 15454 15454 7609 3750 6509 15600 NCU 15454 12816 12816 15454 7609 NCNU MOEcc 7609C NHLTC Taipei 7609 12816 12816 3750 6509 6509 NCTU Hsinchu Taichung 7609C 15454 15454 15454 7609C 15600 7609 NCHC NCHC 12816 12816 6509 Tainan 15454 NCHU NCHC 7609C 7609 NTTU 6509 12816 15454 12816 NTHU 15454 3750 6509 15600 7609 15454 15454 15454 STM64 STM16 6509 7609 7609 6509 7609 6509 10GE GE NSYSU NCKU CCU 5 5
CTC is not enough for us � TWAREN phase 2 inherently has the ability to guard against a single point of hardware or circuit failure, so the failure is less likely to affect the actual service provisioning. � When port or circuit is fault, we must determine which lightpaths are affected and then correlate with services of upper layers. 6 6
7 7 DESIGN OF NMS
1 st Stage Architecture of NMS GUI & Monitor Objs Control API Ticket System Traps Fault Detection Data Collectors Fault Location MIBs Current Status DB Syslogs Threshold DB Net flows Long Term DB Telnet/SSH Case/Action DB Auto Action TL1 Threshold Analyzer Mirror Report System Interactive Passive 8 8
Lightpaths Monitor System Monitor System in NOC Alarm trigger Alarm Email Database Trap parser Light-path data Parser ONS TWAREN Optical Network 9 9
10 10 Frequent incident case 1
11 11 Frequent incident case 2
Important information in a trap Value index Value Description 1 Trap agent host name The hostname which sent this trap 2 Trap agent IP address The IP address of host which sent this trap 3 sysUpTime The system uptime of host which sent this trap 4 snmpTrapOID The mapped OID of this trap The ONS clock time 5 Cerent454NodeTime (YYYYMMDDhhmmss) 1.3.6.1.4.1.3607.6.10.20.30.20.1.80 6 cerent454AlarmState The severity level of this warning (defined by Cisco CerentNotificationClass) 12 12
cerent454AlarmState Alarm State Traps that use this alarm Meaning Number stats 31 diagnostic All traps of ONS All traps of ONS will sent this 40 cleared state when fault be solved minorNonServiceAffecting 1. carrierLossOnTheLAN 50 2. transportLayerFailure 80 minorServiceAffecting All traps of ONS 1. carrierLossOnTheLAN 90 majorServiceAffecting 2. transportLayerFailure 1. lossOfSignal 100 criticalServiceAffecting 2. lossOfFrame 13 13
The OID of ONS Trap ONS-15600 Trap Name MIB OID lossOfSignal .1.3.6.1.4.1.3607.2.20.0.430 lossOfFrame .1.3.6.1.4.1.3607.2.20.0.390 carrierLossOnTheLAN .1.3.6.1.4.1.3607.2.20.0.220 transportLayerFailure .1.3.6.1.4.1.3607.2.20.0.3540 ONS-15454 Trap Name MIB OID lossOfSignal .1.3.6.1.4.1.3607.6.10.30.0.430 lossOfFrame .1.3.6.1.4.1.3607.6.10.30.0.390 carrierLossOnTheLAN .1.3.6.1.4.1.3607.6.10.30.0.220 transportLayerFailure .1.3.6.1.4.1.3607.6.10.30.0.3540 14 14
15 15 DESIGN OF DATABASE
Relationship of Data Tables Basic Data Tables Relationship Tables Component Circuit People VLAN Services Location VPLS Services ONS Unit Light Path ONS Vendor Cross Connection …., etc …., etc 16 16
Basic Data Tables Component Data Table Component_ID Parent_C_ID Name Vendor Data Table 1 0 TN7609P ID Name 12 1 Slot_1 1 CHT 2 0 TP15454 2 APBT 16 2 Slot_3 3 RingLine 135 12 Port_9 People Data Table ID Name Phone Address Service_Time Service_WeekDay 1 John 0939123123 xxxxxxx 8-17 1,3,5 2 Mary 0958123123 xxxxxxx ALL ALL Unit Data Table Location Data Table ID Name ID Name Address 1 NCKU 1 MOEcc xxxxx 18 THU 2 NTU xxxxx 17 17
Port Table Field Type Description PortID int Port Component ID Port type: Type int Ethernet(0) SDH(1) CardName varchar Line Card type name Bandwidth int Bandwidth Status int Defined by us 18 18
Topology Link Table Field Type Description NodeA int The component IDs of the equipments connected with the link. NodeB int PortA int The component IDs of the ports connected with the link. PortB int 19 19
Cross Connection Table Field Type Description CRS int Cross Connection unique number SNCP int If SNCP protection? (0: No, 1:Yes) PortFrom1 int Port Component ID - From 1 PortFrom2 int Port Component ID - From 2 PortTo1 int Port Component ID - To 1 PortTo2 int Port Component ID - To 2 ChannelFrom1 int Port From 1 – Channel ID ChannelFrom2 int Port From 2 – Channel ID ChannelTo1 int Port To 1 - Channel ID ChannelTo2 int Port To 2 - Channel ID SNCPPathFrom int Current SNCP selector uses WORKING(1) or PROTECTION(2) SNCPPathTo int Size int VC bandwidth size, unit is VC1 (155mbps) Status int Defined by us CKTID varchar Circuit Identification string 20 20
Light-Path Table Field Type Description LightPath int Light Path unique number Name varchar Light Path name (CKTID) PortFrom int Port – From PortTo int Port - To SNCP int If SNCP protection? (0: No, 1:Yes) Size int VC bandwidth size, unit is VC1 (155mbps) Cross-connection path string that a light-path TraceCRS varchar passed through Configured ports path (port Component ID string) TraceConfigured varchar that a light-path should pass through. Actual ports path (port Component ID string) that TraceCurrent varchar a light-path pass through currently. Status int Defined by us 21 21
Alarm Table Field Type Description Id Int Alarm serial number EventName Varchar Alarm identify name HostName Varchar The name of the host which sent this alarm AgentIP Varchar The IP address of the host which sent this alarm Category Varchar The class of this alarm Severity Varchar The severity level of this alarm (defined by us) Varchar The uptime of the equipment which sent this UpTime alarm TrapTime Varchar The time that this alarm be generated Interface Varchar The port be affected AlarmStatus Varchar The severity level that be defined by CISCO LightPath Varchar The light-path names be affected 22 22
23 23 IMPLEMENTATION
Working steps � Build Port table by reading from Component table. � Send some TL1 commands to all ONS. � Build TopologyLink and CrossConnection tables by parsing the responses of TL1. � Build LightPath table by aggregating Port, TopologyLink, CrossConnection tables. � Determine the affected ports When receive traps from ONS, and then correlate the affected services with database. 24 24
TL1 commands Command Description ACT-USER::username:123::password; Login RTRV-NE-IPMAP:::123; Get information about topology link neighbors RTRV-CRS::ALL:123; Get information about cross connections RTRV-VC::ALL:123; Get information about all VC statuses, including SNCP selector status CANC-USER::username:123:; logout 25 25
Partial SNCP (1) ONS-B ONS-A RA ONS-C ONS-D RB Configured working path Configured protection path 26 26
Partial SNCP (2) ONS-B ONS-A RA ONS-C ONS-D RB Actual working path Configured protection path 27 27
28 28
29 29 WEB-BASED ALARM LOG SYSTEM
30 30 Example 1
31 31 Example 1
32 32 Example 2
33 33 INTEGRATED VISUAL INTERFACE
34 34
35 35
36 36
Recommend
More recommend