click to edit master title style
play

Click to edit Master title style SCALING NETWORK MONITORING IN A - PowerPoint PPT Presentation

Click to edit Master title style SCALING NETWORK MONITORING IN A LARGE ENTERPRISE BroCon 2016 Austin, TX Click to edit Master title style Who am I? I work for Amazons Worldwide Consumer Information Security group What are we going to


  1. Click to edit Master title style SCALING NETWORK MONITORING IN A LARGE ENTERPRISE BroCon 2016 – Austin, TX

  2. Click to edit Master title style Who am I? I work for Amazon’s Worldwide Consumer Information Security group

  3. What are we going to talk about? Click to edit Master title style How we scaled our network monitoring solution while the network is continuously growing

  4. Click to edit Master title style Why do we even do this? Understanding what is occurring on our corporate network is important to us

  5. Click to edit Master title style In the beginning… http://spaceflight.nasa.gov/gallery/images/station/crew-7/html/iss007e10807.html

  6. Click to edit Master title style How do we approach this? We originally decided on using vendor network sensors to get visibility in to what was occurring on our network

  7. Click to edit Master title style How we started off • Decided a vendor appliance was an effective way of gathering the data we needed • We can buy network sensors, right? • So we bought network sensors and plugged them into our network

  8. Click to edit Master title style Vendor network sensor Life was much simpler back then... • 1Gb/s capable firewalls • SPAN sessions from our routers to vendor network sensors • Small number of firewalls to monitor • We got layer 3 and layer 4 header information from this network sensor

  9. Click to edit Master title style It looked something like this Authorized users Vendor appliance SPAN session Corporate The Internet network Router Firewall Netflow export Netflow collector

  10. Click to edit Master title style What is a SPAN port? http://www.cisco.com/c/en/us/support/docs/switches/catalyst-6500-series-switches/10570-41.html

  11. Click to edit Master title style Where do we go from here? Our network traffic volume kept growing • Our sensor vendor stopped selling and • supporting the platform we were using Increased internal maturity about using this data • Vendor Management platform can’t scale • Driven by API usage by internal customers • Started getting close to the limit of network sensors • the management platform could handle

  12. Click to edit Master title style Future proofing? • We have a vendor’s system we’re starting to push the limits on • What features do we need? • Do we continue to buy or do we look at building instead?

  13. Click to edit Master title style Build vs Buy Build Buy Speed of execution   Control   Vendor support   Logistics   Performance  

  14. Click to edit Master title style Pushing for the next level • My co-workers evaluated various options nProbe • Snort • Suricata • Bro •

  15. Click to edit Master title style Bro Generation One • Ran on a single host • Connected to our router via a 10G fiber link • SPAN session from the router to our Bro host

  16. Bro Generation One looks like… Click to edit Master title style Authorized users Log store Bro SPAN session Corporate The Internet network Router Firewall Netflow export Netflow collector

  17. The challenges of Generation One Click to edit Master title style • The Bro host was a single point of failure • Individual host installs have high operational costs • High traffic volumes on our SPAN sessions caused our router to reboot • Will this continue to scale with the growth of our network?

  18. Click to edit Master title style Scorecard Vendor solution Generation One Single point of   failure? Data collected via SPAN SPAN Control   Scalability   Logistics / Install   effort Cost per Gb/s $$$ $

  19. Click to edit Master title style And we are done! Or so we thought….

  20. Click to edit Master title style Along came Seth… • Seth spotted everything in the history field was in upper case – Turned out to be a trivial configuration change • We started off with 32GB of RAM in our hosts and ended up upgrading to 128GB

  21. Scaling to infinity and beyond! Click to edit Master title style • Capture loss levels (as reported by Bro) started rising beyond acceptable levels once we were past 3Gb/s of traffic on our existing hardware platform • We knew that traffic levels were going to continue to increase so our design needed to evolve as well

  22. Introducing Bro Generation 1.5 Click to edit Master title style • We migrated to optical taps over SPAN sessions – SPAN sessions were good for speed of deployment but not for long term use • Introduced a method to allow us load balance traffic among physical hosts – Similar outcome to the work done by LBNL – Eliminated the SPOF with our Bro host https://commons.lbl.gov/download/attachments/120063098/100GIntrusionDetection.pdf –

  23. Click to edit Master title style Bro horizontal scaling • While we do run Bro in a cluster, it is limited to a single physical host • We don’t want to share state across hosts • The Bro manager process being a single point of failure isn’t all that appealing to us • Keep the hosts simple and consistent

  24. Click to edit Master title style And here is how it looks Bro host #3 Bro host #1 Bro host #2 Load balancer Corporate Optical The Internet tap network Router Firewall Netflow export Netflow collector

  25. Click to edit Master title style Scorecard Vendor Generation Generation solution One 1.5 Single point of    failure? Data collected SPAN SPAN Optical taps via Control    Scalability    Logistics/    Install effort Cost per Gb/s $$$ $ $

  26. Click to edit Master title style Optical taps overview RX TX Optical tap RX TX TX Router Firewall 10Gb/s TX 10Gb/s Load balancer

  27. Click to edit Master title style Still some work to do • This was a great step forward, but it was only an incremental improvement • We can now scale out but it is still time consuming to get individual hosts deployed • Migrating to an integrated solution would help solve these challenges

  28. Click to edit Master title style Bro Generation 2.0 • Combined our hosts, load balancers and optical taps into a “cookie cutter” rack design • We now just order a small, medium or large rack depending expected traffic volumes

  29. Bro Generation 2.0 physical layout Click to edit Master title style Network rack Bro rack Load balancer Load balancer Optical tap Router Firewall Bro host #n Bro host #1 Bro host #2

  30. Scaling Bro Generation 2.0 footprint Click to edit Master title style Bro rack #1 Load balancer Network rack Load balancer Bro host #1Bro host #2 Bro host #n Bro rack #2 Optical tap Load balancer Router Firewall Bro host #1Bro host #2 Bro host #n

  31. Click to edit Master title style Scorecard Vendor Generation Generation Generation solution One 1.5 2 Single point     of failure? Data SPAN SPAN Optical Optical collected via taps taps Control     Scalability     Logistics/ Install effort     Cost per Gb/s $$$ $ $ $

  32. What do we do with all this data? Click to edit Master title style We stream the logs to our central log store

  33. Click to edit Master title style Central log storage

  34. Learn from some of our mistakes… Click to edit Master title style Our original ETL jobs were based on the Bro 2.3 field order (output in TSV) – Bro 2.4 changed the ordering of some of the fields – Use JSON if you’re loading this data elsewhere • One line configuration change!

  35. Click to edit Master title style Wrapping up http:// www.nasa.gov/image-feature/sunset-from-the-international-space-station

  36. Click to edit Master title style Lessons learnt • Scale horizontally and not vertically • Stateless sensors • Decouple dependencies • Plan up-front • Lab testing is never overrated • Get experts on-site to validate • Document wins • Know your customers

  37. Click to edit Master title style Thanks to… Industry peers – Thanks to LBNL, Mozilla and the others who responded to our queries and everyone who has publicly spoke or documented their install

  38. Click to edit Master title style Thank you!

Recommend


More recommend