farms fabrics and clouds
play

Farms, Fabrics and Clouds Steve Loughran Julio Guijarro HP - PowerPoint PPT Presentation

Farms, Fabrics and Clouds Steve Loughran Julio Guijarro HP Laboratories, Bristol, UK December 2007 steve.loughran@hpl.hp.com julio.guijarro@hpl.hp.com Julio Guijarro Researcher at HP Laboratories Area of interest: Deployment In charge of


  1. Farms, Fabrics and Clouds Steve Loughran Julio Guijarro HP Laboratories, Bristol, UK December 2007 steve.loughran@hpl.hp.com julio.guijarro@hpl.hp.com

  2. Julio Guijarro Researcher at HP Laboratories Area of interest: Deployment In charge of OSS release http://smartfrog.org/ Farms, Fabrics and Clouds www.smartfrog.org Page 2

  3. Steve Loughran Researcher at HP Laboratories Area of interest: Deployment Author of Ant in Action Farms, Fabrics and Clouds www.smartfrog.org Page 3

  4. Our research • How to host big applications across distributed resources – Automatically – Repeatably – Dynamically – Correctly – Securely • How to manage them from installation to removal • How to make dynamically allocated servers useful Farms, Fabrics and Clouds www.smartfrog.org Page 4

  5. Question Who had breakfast this morning? Farms, Fabrics and Clouds www.smartfrog.org Page 5

  6. Question Who harvested wheat or corn, or killed an animal for that breakfast? Farms, Fabrics and Clouds www.smartfrog.org Page 6

  7. Farms provide food. It is somebody else's problem Farms, Fabrics and Clouds www.smartfrog.org Page 7

  8. Question Who is wearing clothes they wove or knitted themselves? Farms, Fabrics and Clouds www.smartfrog.org Page 8

  9. Provisioning of clothing -fabrics- is outsourced It is somebody else's problem Farms, Fabrics and Clouds www.smartfrog.org Page 9

  10. All new applications are on the Web • Web Browser, AJAX clients • Richer: Flash, XUL, Silverlight • "… as a Service " ⇒ Lots of code running in the server ⇒ Data mining/analysis problems ⇒ Unpredictable demand Farms, Fabrics and Clouds www.smartfrog.org Page 10

  11. Old world installation: single server ������������������ �������� Single web server, Single DB ��������� RAID filestore ������ -SPOF -limitations of scale Farms, Fabrics and Clouds www.smartfrog.org Page 11

  12. yesterday: clustering ��������� ������ Multiple web servers, Replicated DB ������������ ������� RAID Network filestore Load-balancing router -Cost -Complexity ������������ �������� -Limitations of scale ��������� �������� Maintains the illusion of a single server Farms, Fabrics and Clouds www.smartfrog.org Page 12

  13. Now: server farms 500 web servers, Distributed filestore Rented storage & CPU Scales up No capital outlay Agile infrastructure Farms, Fabrics and Clouds www.smartfrog.org Page 13

  14. tomorrow? grid fabric. 50000 servers Farms, Fabrics and Clouds www.smartfrog.org Page 14

  15. Application architectures and deployment problems change radically in this world Farms, Fabrics and Clouds www.smartfrog.org Page 15

  16. Application architectures • ROA/REST • Virtualized • Map/Reduce • Shards • Tuple-spaces • Grid Farms, Fabrics and Clouds www.smartfrog.org Page 16

  17. Virtualization Farms, Fabrics and Clouds www.smartfrog.org Page 17

  18. Why? • Save on hardware (and power, space) • Dynamically move running servers • Demand creation of new images • Testing complex system configurations • Redistributing entire machine image • 'virtual appliance' Farms, Fabrics and Clouds www.smartfrog.org Page 18

  19. Assumptions that are now invalid • Systems have a long lifespan • It is slow/expensive to create a new system • It is expensive to duplicate one • Systems can/should be managed by hand • Clocks proceed at the same rate • Physical RAM doesn’t get swapped out • Running machines can't be moved/cloned Farms, Fabrics and Clouds www.smartfrog.org Page 19

  20. Server Farms Farms, Fabrics and Clouds www.smartfrog.org Page 20

  21. Assumptions that are now invalid • System failure is an unusual event • 100% availability can be achieved • Data is always near the server • You need physical access to the severs • Databases are the best storage form • You need millions of $/£/€ to play Farms, Fabrics and Clouds www.smartfrog.org Page 21

  22. Who has the servers? • Yahoo, Google, MSN, eBay: services • MMORPG Game Vendors: Word of Warcraft, Second Life • EU Grid: Scientists • HP, IBM, Sun: rent to companies -focus on CPU performance • Amazon: rent to anyone with an Amazon account -focus on startups Farms, Fabrics and Clouds www.smartfrog.org Page 22

  23. Amazon EC2 Public Internet $ $ $ $ Host Host AMI AMI AMI AMI $ (Xen VM) (Xen VM) (Xen VM) (Xen VM) /mnt /mnt /mnt /mnt Fast (free) network free access; slow initial read time pay per GET; per megabyte S3 Storage Farms, Fabrics and Clouds www.smartfrog.org Page 23

  24. Amazon EC2 • Pay as you go Virtual Machine Hosting • No persistent storage other than S3 filestore - uses HTTP GET/PUT/DELETE operations • $0.10 per CPU/hour • S3 Storage has own billing (by MB & by access -cheaper in bulk) Farms, Fabrics and Clouds www.smartfrog.org Page 24

  25. Demo Farms, Fabrics and Clouds www.smartfrog.org Page 25

  26. Map/Reduce Run code near the data, then merge the results Farms, Fabrics and Clouds www.smartfrog.org Page 26

  27. Assumptions that are now invalid • Terabyte datasets are hard to work with • Code runs on a single machine • Sequential code is better than parallel code • RAID hardware is the best way to store data • Databases are better than filesystems Farms, Fabrics and Clouds www.smartfrog.org Page 27

  28. Shards Farms, Fabrics and Clouds www.smartfrog.org Page 28

  29. Assumptions that are now invalid • A single farm needs to scale to infinity • You need to provide 100% availability to 100% of users • You have to roll out simultaneous updates to the application, changes to the DB schema, globally Farms, Fabrics and Clouds www.smartfrog.org Page 29

  30. Changes for developers • Many classic assumptions are invalid • Design for scale • Rent servers from the outset —every developer can have their own set • Cover your server costs from the outset and you are in the black from day 1 Farms, Fabrics and Clouds www.smartfrog.org Page 30

  31. Problems for us farmers • Power management • Predictive disk failure management • Load balancing for availability, power • Data cache management • Billing • Security/Isolation • How this will change server hardware • Managing/Configuring Machine Images • Diagnostics when things go wrong Farms, Fabrics and Clouds www.smartfrog.org Page 31

  32. Topic for discussion Where is all this heading? Farms, Fabrics and Clouds www.smartfrog.org Page 32

Recommend


More recommend