From Big Data Management to Big Data Science 1
What is next? Real big data is widely available Only a few people know how to deal with it You’re now one of them Applications The project is a start Keep your hands dirty Consider using the public cloud (e.g., AWS, Google Cloud, or Microsoft Azure) 2
Job Market 3 https://www.techicy.com/5-best-programming-languages-to-watch-out-in-2019-for-data-science.html
Data Science Credits: Drew Conway 4
Data Science 5 https://mashimo.wordpress.com/2016/05/28/big-data-data-science-and-machine-learning-explained/
Data Scientist 6
Next Steps CS Big data tools Python/R/Scala Math/Stats Linear algebra Correlation analysis Hypothesis tests Collaboration with domain experts Visualization Prototyping 7
CS 8 https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize
CS/Big Data 9 https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize
Math/Stats 10 https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize
Online Courses 11 https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize
Data Analytics 12 https://www.slideshare.net/galvanizeHQ/how-to-become-a-data-scientist-by-ryan-orban-vp-of-operations-and-expansion-galvanize
Big Data Landscape Big data MLlib GraphX SparkR packages High level Pig Spark HBase Algebricks APIs Latin SQL Query Map RDD Hyracks Processing Reduce Distributed KV LSM Column HDFS Storage stores trees stores 13
Recommend
More recommend