how stranger things can
play

How Stranger Things can happen with Visual Analytics Jason Flittner - PowerPoint PPT Presentation

#NetflixData How Stranger Things can happen with Visual Analytics Jason Flittner Senior Analytics Engineer / Manager Netflix - Content Data Engineering and Analytics About Netflix Tableau + Big Data Lessons Learned Where


  1. #NetflixData How “Stranger Things” can happen with Visual Analytics Jason Flittner Senior Analytics Engineer / Manager Netflix - Content Data Engineering and Analytics

  2. ● About Netflix ● Tableau + Big Data ○ Lessons Learned ○ Where we are today ● Analytics and Iterating Quickly

  3. What is Netflix?

  4. ● 93+ million members Metrics ● 190 countries ● 1,000+ devices ● 10B hours/qtr We plan on spending ~$6B in 2017 on content for our members

  5. ● ~60 PB DW on S3 ● ~1400 Tableau users ● Live & extract connections ● Analytics on billions of rows

  6. Compute Storage Data Interface Data Access, Analytics and Visualization AWS (Hadoop S3 clusters)

  7. ● About Netflix ● Tableau + Big Data ○ Lessons Learned ○ Where we are today ● Analytics and Iterating Quickly

  8. Choosing a source Hive ● Spark ● Presto ● Redshift ● Published Data Source ● etc... ●

  9. ● Powerful and scalable backend ● “Slower” 1,000,000,000/hr ● Hive + Tableau ○ Thrift Servers ○ Custom SQL vs Tables ○ Metadata ○ ODBC Optimization

  10. ● Scalable ● Faster than Hive in many cases ● Spark + Tableau ○ Thrift Servers ○ Long running job on Cluster ○ Query reliability

  11. ● Fast query engine ● Great for experimenting and “smaller” data sets ● Connecting to Tableau ○ Web data connector ○ ODBC

  12. ● About Netflix ● Tableau + Big Data ○ Lessons Learned ○ Where we are today ● Analytics and Iterating Quickly

  13. Tableau Extract API Tableau Data Extract Publish to Server

  14. Distributed Tableau Extract API Publish to Server Issues Command Create Extract Provision Container Resource Create Tableau Data Extract

  15. Amazon ● Very fast loads from S3 Redshift ● Native Tableau connector ● Quick Tableau Iteration ● Live or Extract ● Concurrency

  16. ● Too big to extract? BIG Data ● Optimized live connections ○ SQL ● Custom data viz with Druid ● Tableau + Hyper!?

  17. ● About Netflix ● Tableau + Big Data ○ Lessons Learned ○ Where we are today ● Analytics and Iterating Quickly

  18. Analytics Engineer Analytics: Binge Analysis ● Viewing Patterns ● Hours Viewed ● Customer Joy ● Content Quality ● Business users

  19. ● Content analytics Bringing it all ● Iterate quickly together ● Move between backend sources ● Strong user adoption

  20. Merci Thank you Jason Flittner -

Recommend


More recommend