Time-aware API Popularity Prediction via Heterogeneous - PowerPoint PPT Presentation

ICWS 2015 New York, USA June 27- July 2 2015 Time-‑aware ¡API ¡Popularity ¡Prediction ¡via ¡ Heterogeneous ¡Features Yao ¡Wan 1 , ¡Liang ¡Chen 1 , ¡Jian ¡Wu 1 and ¡Qi ¡Yu 2 1 Zhejiang ¡University, ¡Hangzhou, ¡China 2 Rochester ¡Institute ¡of ¡Technology, ¡Rochester, ¡USA {wanyao, ¡cliang, ¡wujian2000}@zju.edu.cn, ¡qi.yu@rit.edu June ¡28, ¡2015

Outline ¨ Introduction ¨ Dataset ¡Characteristics ¨ Popularity ¡Prediction ¡Models ¨ Experiments ¨ Conclusion ¡& ¡Future ¡Work 2 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Introduction ¨ The ¡spring ¡up ¡of ¡web ¡APIs ¡ ¡ ¤ 10634 ¡APIs ¡and ¡6049 ¡mashups ¡in ¡ProgrammableWeb until ¡Nov. ¡2014 (www.programmableweb.com) ¨ The ¡advantages ¡of ¡web ¡APIs ¤ Make ¡web ¡programmable ¤ Increase ¡ecnomic ¡transactions ¡from ¡web ¡browsers ¡to ¡API-‑driven ¡interactions ¤ eBay, ¡APIs ¡drive ¡over ¡60% ¡of ¡its ¡sellers; ¡Twitter, ¡13 ¡billion ¡calls ¡per ¡day ¡through ¡ APIs 3 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Introduction ¨ Framework ¡of ¡API ¡market: more ¡and ¡more ¡Web ¡sites ¡ dedicated ¡to ¡API ¡market ¡are ¡emerging (e.g.Mashape) Manage 1. ¡Recommend r 2. ¡Ranking 3. ¡Monitoring Provider Developer Profit User ¡ 4 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Introduction Motivation 1. ¡With ¡the ¡increasing ¡APIs, ¡the ¡need ¡to ¡search ¡and ¡manage ¡APIs ¡becomes ¡urgent 2. ¡Programmable ¡manage ¡APIs ¡through ¡the ¡followers ¡number ¡at ¡one ¡monent • For ¡manager Manage • 1. ¡Find ¡and ¡recommend ¡the ¡APIs ¡ ¡ ¡ 1. ¡Recommend r 2. ¡Ranking that ¡may ¡be ¡popular ¡in ¡the ¡future 3. ¡Monitoring • 2. ¡Give ¡a ¡more ¡reasonable ¡ ranking ¡to ¡APIs • 3. ¡Monitor ¡the ¡trend ¡of ¡API ¡growth Provider Developer • For ¡API ¡providers Profit • Give ¡a ¡guide ¡to ¡API ¡providers ¡that ¡ • For ¡developers how ¡to ¡publish ¡their ¡APIs ¡effectively ¡ • Developers ¡can ¡select ¡the ¡ and ¡do ¡marketing latest ¡released ¡APIs ¡whose ¡ User ¡ popularities ¡may ¡be ¡low 5 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Introduction Challenges ¨ The ¡growth ¡patterns ¡of ¡two ¡APIs Popularity ¡of ¡Twilio ¡API Popularity ¡of ¡Youtube ¡API Note : ¡In ¡our ¡paper, ¡the ¡popularity ¡of ¡API ¡is ¡defined ¡as ¡the ¡number ¡of ¡mashups ¡that ¡are ¡ composed ¡of ¡this ¡API. ¨ Heterogeneous ¡features 6 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Dataset ¡Characterstics ¨ An ¡overview ¡of ¡ProgrammableWeb ¡data Number ¡of ¡APIs 10,634 Number ¡of ¡mashups 6,049 Number ¡of ¡users 52,512 ¨ Top-‑20 ¡API ¡categories • Popularity ¡Ranking • 1. ¡Mapping • 2. ¡Photos • 3. ¡Video • 4. ¡Search • 5. ¡Social 8 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Dataset ¡Characterstics ¨ Statistical ¡analysis Rank ¡of ¡provider ¡is ¡obtained ¡from ¡Alexa.com API ¡information ¡integrity ¡means ¡the ¡degree ¡of ¡ details ¡providers ¡describe ¡the ¡API • Observations • 1. ¡The ¡lower ¡the ¡rank ¡of ¡provider ¡the ¡lower ¡of ¡its ¡ popularity • 2. ¡The ¡provider ¡of ¡API ¡that ¡with ¡a ¡higher ¡information ¡ integrity ¡is ¡more ¡prudent ¡to ¡release ¡the ¡API, ¡the ¡API ¡ may ¡be ¡of ¡higher ¡quality • 3. ¡The ¡category ¡of ¡API ¡has ¡an ¡effect ¡on ¡it's ¡ popularity 9 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Popularity ¡Prediction ¡Models ¨ Evaluation ¡metrics ¨ Szabo-‑Huberman ¡(S-‑H) ¡model ¤ Szabo ¡and ¡Huberman ¡found ¡a ¡strong ¡linear ¡correlation ¡ between ¡the ¡early ¡and ¡future ¡popularity 11 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Popularity ¡Prediction ¡Models ¨ Linear ¡regression ¡(LR) ¡model ¤ Feature ¡vector ¡of ¡API ¤ Optimization ¡problem ¤ Let ¡ ¤ Can ¡be ¡solved ¡by ¡ordinary ¡linear ¡squares 12 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Popularity ¡Prediction ¡Models ¨ Linear ¡regression ¡with ¡heterogeneous ¡features ¤ Heterogeneous ¡features n Time ¡features n Numerical ¡features n Categorical ¡features n Textual ¡features ¨ Optimization ¡problem where ¡ ¡ ¡ ¡ ¡ ¡ ¡is ¡the ¡feature ¡vector ¡and ¡ ¡ ¡ ¡ ¡is ¡the ¡tunning ¡parameter, is ¡the ¡L 2 ¡ penality ¡item. 13 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Experiments Setup ¨ Dataset ¡extraction ¤ The ¡target ¡date ¡ t t varies ¡from ¡24 ¡months ¡to ¡48 ¡months. ¤ We ¡extract ¡the ¡APIs ¡whose ¡releasing ¡date ¡is ¡between ¡2005 ¡ and ¡Nov. ¡2010 ¤ Discard ¡those ¡APIs ¡whose ¡popularity ¡are ¡alway ¡zero ¤ We ¡get ¡613 ¡APIs ¨ Experimental ¡datasets ¤ Full ¡dataset ¤ Popular ¡dataset: ¡APIs ¡whose ¡popularities ¡are ¡greater ¡than ¡5 ¤ Junk ¡dataset: ¡APIs ¡whose ¡popularities ¡are ¡smaller ¡than ¡5 15 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Experiments ¨ Feature ¡extraction ¤ Numerical ¡feagure n Normalization ¤ Category ¡feature n Encoded ¡into ¡binary ¡code ¤ Textual ¡feature ¡extraction n Case-‑folding ¡and ¡tokenization: ¡tokenized ¡by ¡white ¡space n Pruning: ¡filter ¡stopwords ¡(e.g. is, ¡very, ¡should ), ¡keep ¡ adjectives ¡using ¡a ¡part-‑of-‑speech ¡tagger n Stemming: ¡strip ¡word ¡to ¡obtain ¡the ¡stem ¡word n Spell ¡correcting ¨ Training ¡and ¡testing ¤ KFold ¡cross ¡validation ¤ 50% ¡traing ¡+ ¡25% ¡crossvalidation ¡+25% ¡testing 16 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Experiments ¨ Performance ¡evaluation ¡on ¡full ¡dataset 17 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Experiments ¨ Performance ¡evaluation ¡on ¡popular ¡dataset • Observations • 1. ¡Our ¡model ¡has ¡and ¡obviously ¡better ¡performance ¡over ¡S-‑H ¡and ¡LR ¡ model, ¡especially ¡when ¡the ¡reference ¡date ¡is ¡small • 2. ¡With ¡the ¡reference ¡time ¡becoming ¡closer ¡to ¡the ¡target ¡time, ¡the ¡ mRSE ¡decreases ¡for ¡all ¡models • 3. ¡With ¡the ¡reference ¡time ¡becoming ¡closer ¡to ¡the ¡target ¡time, ¡the ¡ advantages ¡of ¡our ¡model ¡shrink, ¡S-‑H ¡and ¡LR ¡model ¡is ¡enough 18 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Experiments ¨ Performance ¡evaluation ¡on ¡junk ¡dataset • Observations • 1. ¡Our ¡LR-‑HF ¡model ¡still ¡has ¡an ¡ ¡better ¡performance • 2. ¡The ¡improvement ¡is ¡small ¡when ¡just ¡introducing ¡API's ¡textual ¡features • 3. ¡Our ¡model ¡is ¡particularly ¡suitable ¡for ¡predicting ¡the ¡popularity ¡of ¡APIs ¡ that ¡not ¡changes ¡frequently 19 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Experiments ¨ Impact ¡of ¡lambda • Observations • 1. ¡The ¡value ¡of ¡lambda ¡really ¡has ¡an ¡impact ¡on ¡the ¡prediction ¡accuracy • 2. ¡The ¡most ¡appropriate ¡value ¡of ¡lambda ¡is ¡between ¡0.2 ¡and ¡0.4 20 4/25/16 Time-‑aware ¡API ¡popularity ¡prediction

Time-aware API Popularity Prediction via Heterogeneous - PowerPoint PPT Presentation

ICWS 2015 New York, USA June 27- July 2 2015 Time-aware API Popularity Prediction via Heterogeneous Features Yao Wan 1 , Liang Chen 1 , Jian Wu 1 and Qi Yu 2 1 Zhejiang University,

RESTFUL API BEST PRACTICES By Malwina Nowakowska STX NEXT talented developers | flexible teams

How kids perceive popularity? Sha Lana Clinton Outline u Popularity u Experiment u

API Ruby on Rails UI ES API Hedtek Wijiti API API Elasticsearch Depositing user Build

API Connect Arnauld Desprets - arnauld_desprets@fr.ibm.com Technical Sale 0 Agenda 1. API

Spock Data driven testing RESTful API What is a RESTful API ? A RESTful API is an application

Introduction to the SAGA API Outline SAGA Standardization API Structure and Scope (C++)

Study of an API Migration for two XML APIs Thiago Bartholomei Krzysztof Czarnecki Ralf Lmmel

Mining API Popularity 40 35 # projects using an API element 30 junit.framework.TestSuite

Branch Prediction Branch Prediction vs vs Execution Time Execution Time Prediction

Stability, Popularity, and Lower Quotas Meghana Nasre IIT Madras CAALM 2019 Chennai

API Gateway API Gateway Gateway ESB At present tooling for API

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Data sharing through an API platform API AGRO Erik Rehben, Batrice Balvay , Theo Paul

Preparing a REST API Rules of REST APIs, API patterns, Typical CRUD operations Rules for a REST

Exploring API Embedding for API Usages and Applications Yi Chang Trong Duc Nguyen, Anh Tuan

Using the Library for your Final Year Project Laura Woods, Computing & Engineering Librarian

Post-quantum cryptography Tanja Lange 07 October 2015 SPACE 2015 In the long term, all

Building the Kamus Besar Bahasa Indonesia (KBBI) Database and Its Online Application David

k -fingerprinting: a Robust Scalable Website Fingerprinting Technique George Danezis Jamie Hayes

Network security (DNS caching and DoS) CS 161: Computer Security Prof. Raluca Ada Popa March 1,

Big Data for Data Science Data streams and low latency processing event.cwi.nl/lsde DATA STREAM

IT-SDC : Support for Distributed Computing 1 The problem Pick a number of generic

2D Face Image Analysis Probabilistic Morphable Models Summer School, June 2017 Sandro Schnborn

Time-aware API Popularity Prediction via Heterogeneous - PowerPoint PPT Presentation

ICWS 2015 New York, USA June 27- July 2 2015 Time-aware API Popularity Prediction via Heterogeneous Features Yao Wan 1 , Liang Chen 1 , Jian Wu 1 and Qi Yu 2 1 Zhejiang University,

RESTFUL API BEST PRACTICES By Malwina Nowakowska STX NEXT talented developers | flexible teams

How kids perceive popularity? Sha Lana Clinton Outline u Popularity u Experiment u

API Ruby on Rails UI ES API Hedtek Wijiti API API Elasticsearch Depositing user Build

API Connect Arnauld Desprets - arnauld_desprets@fr.ibm.com Technical Sale 0 Agenda 1. API

Spock Data driven testing RESTful API What is a RESTful API ? A RESTful API is an application

Introduction to the SAGA API Outline SAGA Standardization API Structure and Scope (C++)

Study of an API Migration for two XML APIs Thiago Bartholomei Krzysztof Czarnecki Ralf Lmmel

Mining API Popularity 40 35 # projects using an API element 30 junit.framework.TestSuite

Branch Prediction Branch Prediction vs vs Execution Time Execution Time Prediction

Stability, Popularity, and Lower Quotas Meghana Nasre IIT Madras CAALM 2019 Chennai

API Gateway API Gateway Gateway ESB At present tooling for API

Coverage in Heterogeneous Coverage in Heterogeneous Networks Xiaoli Chu King s College

Structured Prediction Introduction What is structured prediction? CS 6355: Structured Prediction

Data sharing through an API platform API AGRO Erik Rehben, Batrice Balvay , Theo Paul

Preparing a REST API Rules of REST APIs, API patterns, Typical CRUD operations Rules for a REST

Exploring API Embedding for API Usages and Applications Yi Chang Trong Duc Nguyen, Anh Tuan

Using the Library for your Final Year Project Laura Woods, Computing &amp; Engineering Librarian

Post-quantum cryptography Tanja Lange 07 October 2015 SPACE 2015 In the long term, all

Building the Kamus Besar Bahasa Indonesia (KBBI) Database and Its Online Application David

k -fingerprinting: a Robust Scalable Website Fingerprinting Technique George Danezis Jamie Hayes

Network security (DNS caching and DoS) CS 161: Computer Security Prof. Raluca Ada Popa March 1,

Big Data for Data Science Data streams and low latency processing event.cwi.nl/lsde DATA STREAM

IT-SDC : Support for Distributed Computing 1 The problem Pick a number of generic

2D Face Image Analysis Probabilistic Morphable Models Summer School, June 2017 Sandro Schnborn

Using the Library for your Final Year Project Laura Woods, Computing & Engineering Librarian