Integration of Enterprise Monitoring Product, Senlin and Mistral for Auto-healing Xinhui Li(VMware), Wolf Peter(Nokia), Qiming Teng(IBM)
Motivation • Compute Controller • Service Resiliency • Network Controller • Quality of Service Application • Database • Cost • Message Queue • Transparency OpenStack • Storage • Data Integrity • … • ... VM • Virtual Machine • Physical nodes • Incl. Container • Physical network • Virtual Network • Physical storage • Virtual Storage Host • Hypervisor • VM Mobility • Host OS • Ease of Management • … • ...
Auto-Healing by Senlin Application Events Nova Server 2 app monitor notification VM Events 2 os-collect-config nova MQ Polling 4 “Restart” Ceilometer/Aodh Recover Action: 2 1 Alarm • reboot Cluster • rebuild Senlin-engine • migrate 3 Receiver MQ • remote-restart A notion of VM groups for A native signal / alarm • VM/Application redundancy • HA policies
Gap to an industry-level Solution 4 Monitoring/Alarm/Notification is the foundation of Day2 operation. reliable and scalable monitoring across different sources Support users’ own custom workflow Long-run task Error handling and flexible workflow Integrate with VRops and Mistral
Agenda 5 Senlin overview Mistral as recover action Integration with vRops for alert and notification Demo
Senlin – Resource Pool Manager Scaling Load-Balancing Senlin CLI Senlin API Senlin Engine OSC plug-in Region Placement Health Senlin Deletion Zone Placement Senlin Engine Dashboard Affinity ...... Python/Java Bindings docker-py openstacksdk nova heat docker containers heat stacks VM servers
How to Use Senlin - Cluster 7 Rich primitives and operations Cluster membership management Cluster scale/capacity management Scale in/out Resize Policy management Auto-Scaling Aut-Healing Placement Load-balancing Create Standby/Active Cluster by move node among different clusters Support Blue/Green mode
Senlin Polices 8 Placement policy Health Policy health_policy: type: senlin.policy.affinity type: OS::Senlin::Policy version: 1.0 properties: properties: type: senlin.policy.health-1.0 servergroup: bindings: name: my_server_group - cluster: {get_resource: cluster} policies: affinity properties: availability_zone: nova detection: enable_drs_extension: false type: VM_LIFECYCLE_EVENTS recovery: actions: - REBUILD
Agenda 9 Senlin overview Mistral as recover action Integration with vRops for alert and notification Demo
Workflow as Recover Action 10
Agenda 11 Senlin overview Mistral as recover action Integration with vRops for alert and notification Demo
vRops - Policy driven Performance, Capacity, 12 Configuration and Compliance analytics
Managing Data Collection from 13 Different Sources Adapter Instance Status list Solution Status
Sample Heat Template – vROps Alert Resources 14 vROps-notification
Sample Heat Template – Hook Back to 15 OpenStack CONFIDENTIAL
Demo 16 VIO OpenStack Custom group: Heat type: name Template value: cluster-name rule: contains Notifications Senlin cluster 1 Senlin cluster 2 receiver Recover: vm vm Mistral vm vm based resize vm
Future Work Event correlation Event collection across layers Correlate events to find root cause and react Sequence of Actions Workflow comprised of more than one nova/heat action
Q & A
Recommend
More recommend