machine learning apis comm mmon n appli pplications
play

Machine Learning APIs Comm mmon n appli pplications cations - PowerPoint PPT Presentation

Machine Learning APIs Comm mmon n appli pplications cations Autonomous vehicles Optical character recognition Automatic image tagging Language translation Speech to text Video recommendations Portland State University CS 430P/530


  1. Machine Learning APIs

  2. Comm mmon n appli pplications cations Autonomous vehicles Optical character recognition Automatic image tagging Language translation Speech to text Video recommendations Portland State University CS 430P/530 Internet, Web & Cloud Systems

  3. AI/ML: /ML: an en enormou rmous s sp space ce  Many different models and approaches  Expert systems / decision trees / knowledge engineering (medical diagnosis)  Supervised learning (Bayesian filters for spam detection)  Unsupervised learning (Clustering algorithms for Google News)  Combinatorial search (Chess)  Reinforcement learning (NPC in games)  Evolutionary/genetic algorithms (Smart fuzzing)  Often combined with each other  But recently…neural networks with supervised learning  Convolutional neural networks (CNNs) Portland State University CS 430P/530 Internet, Web & Cloud Systems

  4. Neu eural ral netw etwor orks ks app pproach oach  Model the way the brain works via large collections of simulated neurons  Formation and pruning of selected connections encodes information Portland State University CS 430P/530 Internet, Web & Cloud Systems

  5. Simu mulat lated ed in comput puter ers  Selected weights within a simulated network of neurons encodes information Portland State University CS 430P/530 Internet, Web & Cloud Systems

  6. Everything old…  Neural Networks circa 1950s-1960s (perceptrons)  Multi-layer neural networks 1970s-1980s  Ran on original Macintosh!  Why the renaissance?  Video cards with massive numbers of processing units (thanks to gaming)  Massive storage capacities for data  Crowd-sourced platforms to provide labeling (to learn by example) Portland State University CS 430P/530 Internet, Web & Cloud Systems

  7. Ima mageNe geNet t (2009) 09)  Pioneered "Deep Learning"  Use convolutional neural networks with massive sets of labeled data to solve the computer vision problem  Approach  Ignore academic skeptics and take risk on ancient algorithm  Collect image data from the Internet  Hire humans to label it via Mechanical Turk  Send through enormous neural network  Profit?  Fei Fei Li https://youtu.be/40riCqvRoMs?t=2m46s Portland State University CS 430P/530 Internet, Web & Cloud Systems

  8. Now…“The of x” SpaceNet MusicNet Medical ImageNet DigitalGlobe, CosmiQ Works, NVIDIA J. Thickstun et al, 2017 Stanford Radiology, 2017 ShapeNet EventNet ActivityNet A.Chang et al, 2015 G. Ye et al, 2015 F. Heilbron et al, 2015  But, not all sunshine and rainbows… Portland State University CS 430P/530 Internet, Web & Cloud Systems

  9. Corpus rpus can n have e iss ssues ues  US vs. Russian tanks in 1980s with early NN  US images crisp marketing shots on sunny days  Russian images grainy and on cloudy days  ML trains on wrong feature (crisp/sunny vs. grainy/cloudy) Portland State University CS 430P/530 Internet, Web & Cloud Systems

  10.  SE Asian workers used to label ideal vacation photos  Label conference reception photos in hotels as the ideal vacation!  Perhaps the beach is hard work if they fish for a living?  Labeling is a relative task!  What happens when one uses ImageNet photos taken by humans on high-quality cameras for drone applications with poor imaging hardware and no human? Portland State University CS 430P/530 Internet, Web & Cloud Systems

  11. Bi Bias as in labeli eling ng  Institutionalize bias behind facade of an "objective" algorithm  Racial, socio-economic bias in data being labeled and used  TED talk  GCP podcast #114 (2/2018)  FAT* conference  Example: ML for domestic violence  Most common way is via neighbor complaint  ML incorrectly learns that only those who live in row-homes and apartments commit domestic violence!  Hard problem  "It’s not always so obvious ahead of time what the bad outcomes might be" Portland State University CS 430P/530 Internet, Web & Cloud Systems

  12.  MM's NY Times article (11/2018) and book… Portland State University CS 430P/530 Internet, Web & Cloud Systems

  13. Rare e pr problematic blematic cases ses  Example: Self-driving cars  Kangaroos, white trucks against a white sky  Need millions of miles trained to get enough anomalous conditions  Must also train in winter/spring, in snow/rain/clear conditions Portland State University CS 430P/530 Internet, Web & Cloud Systems

  14. Res esistance stance to adver ersaries saries  Both in training and in inference  Small changes fool classifier Portland State University CS 430P/530 Internet, Web & Cloud Systems

  15. Requi uires res la large-scale scale particip icipat ation/da ion/data ta  In thousands of entries 0.28 0.66 172 157 123 81 0.03 0.23 35 29 2010 2011 2012 2013 2014 2015 2016 Average Precision Number of Entries Classification Errors (top-5) For Object Detection Portland State University CS 430P/530 Internet, Web & Cloud Systems

  16. “Datasets— not algorithms — might be the key limiting factor to development of human- level artificial intelligence.” A L E X A N D E R W I S S N E R - G R O S S Edge.org, 2016 Portland State University CS 430P/530 Internet, Web & Cloud Systems

  17. Towar ards ds crowd wd-sour sourced ced ML We’re passing the Why? democratizing image-net.org baton to Kaggle : a data is vital to remains live at community of more democratizing AI. Stanford. than 1M data scientists. Portland State University CS 430P/530 Internet, Web & Cloud Systems

  18. An Ex Expl plosion osion of Da Datase tasets ts 1627 276 1919 1MM 4MM Hosted Datasets Commercial Student Data Scientists ML Models Competitions Competitions Submitted Portland State University CS 430P/530 Internet, Web & Cloud Systems

  19. Qu Ques estio tion  Who on this planet has the largest, most interesting, data-sets?  Monetize data via ML models Portland State University CS 430P/530 Internet, Web & Cloud Systems

  20. Softw tware's are's new w ap applic lication ation bu buildin lding bl blocks cks  Don't build your own when you can use pre-trained models done by experts on massive datasets  Part of an emerging API development platform  Accessed via REST APIs with results sent back in JSON  Abstraction raised, “Hello world” will never be the same Portland State University CS 430P/530 Internet, Web & Cloud Systems

  21. Cloud ud Visi sion on API  Image recognition  Image labeling on thousands of labels  Face detection  Sentiment analysis (Emotobooth)  Text detection (optical character recognition)  SafeSearch content identification (adult/violent content)  Logo identification  Landmark identification Portland State University CS 430P/530 Internet, Web & Cloud Systems

  22. Cloud ud Spe peech ech APIs Is  Speech-to-Text  Word recognition  Context-aware transcription  Automated punctuation  Offensive content detection  Text-to-Speech (speech synthesis)  Both in 120 languages Portland State University CS 430P/530 Internet, Web & Cloud Systems

  23. Cloud ud Translation anslation API  Language detection and translation  100+ languages  Python, Java, Ruby, Objective-C bindings via Google API client libraries Cloud ud Natural tural Lang nguage uage API  Language analysis  Syntax analysis  Semantic analysis, entity recognition  Sentiment analysis  Common e-mail responses Portland State University CS 430P/530 Internet, Web & Cloud Systems

  24. Cloud ud Video eo Intelligence elligence API  Summary and information extraction from video  Autotag objects in video to enable searching  Scene detection for thumbnail generation  Automated highlights (RedZone!)  Trained on massive set of labeled YouTube videos Portland State University CS 430P/530 Internet, Web & Cloud Systems

  25. Cloud ud Video eo Intelligence elligence API  Demo code similar to image blurring lab  Flow chart triggered when new video placed in bucket Portland State University CS 430P/530 Internet, Web & Cloud Systems

  26. ML as s a se service ice  Use ML to generate custom ML models so regular users can build their own ML models  AmazonML  http://cloudacademy.com/blog/aws-machine-learning/  AzureML  http://cloudacademy.com/blog/azure-machine-learning/  Watson Analytics  http://www.ibm.com/analytics/watson-analytics/  Google AutoML Portland State University CS 430P/530 Internet, Web & Cloud Systems

  27. Cloud ud Aut utoML ML for Visi sion on (2018) 8)  ML services trained on general datasets, but custom images and domain labels often needed  AutoML  Apply transfer learning to re-use trained models to quickly learn new domains  Custom models via  Labeled data (done by user)  Unlabeled data (done by humans at Google)  Evaluate many different ML models and pick best one (> 13)  Example: Custom models to recognize machine parts  Democratizes ML for the masses  All that is required is for you to upload your data with labels or exemplars for Google to label Portland State University CS 430P/530 Internet, Web & Cloud Systems

Recommend


More recommend