Discovering Information Explaining API Types Using Text Classification
Course Instructor: Dr. Jin Guo
Presented by: Sunyam Bagga
TEXT CLASSIFICATION: Relevant/Irrelevant [API type, Section fragment]
Source: https://www.python-course.eu/text_classification_introduction.php
Technical Concepts
1. RecoDoc tool
2. LOOCV
3. Maximum Entropy
4. Cosine similarity with tf-idf weighting
5. Kappa
RecoDoc “Recovering Traceability Links between an API and Its Learning Resources” 1
Aim:
- Find API types referenced in a tutorial:
- Identify code-like terms (CLTs)
- Link these CLTs to the exact API type
“DateTime… such as year() or monthOfYear().”
- Precisely link code-like terms (e.g., year()) to specific code elements (e.g., DateTime.year())
Ambiguity
▪ Declaration Ambiguity: CLTs are rarely fully qualified.
▪ Overload Ambiguity: CLTs do not indicate the number/type of parameters when a method is overloaded.
▪ External Reference Ambiguity: CLTs may refer to code elements in external libraries.
▪ Language Ambiguity: human errors such as typos (HtttpClient), case errors, forgotten parameters, etc.
Parsing Artifacts and Recovering Traceability Links
- Linking types: given a CLT, find all types in the codebase whose name matches the term.
- Disambiguate and filter the candidates.
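The linking step above can be sketched as a simple name match against an index of qualified names. This is only an illustrative sketch, not RecoDoc's actual implementation; the codebase index and matching rule here are assumptions.

```python
# Hypothetical sketch of the "linking types" step: match a code-like term
# (CLT) against fully qualified names in a codebase index.

def candidate_links(clt, qualified_names):
    """Return all qualified names whose last segment matches the CLT."""
    term = clt.rstrip("()")                 # drop call parentheses: "year()" -> "year"
    return [q for q in qualified_names if q.split(".")[-1] == term]

# Toy index (assumed names); two candidates illustrate declaration ambiguity.
codebase = ["DateTime.year", "DateTime.monthOfYear", "Interval.year"]
print(candidate_links("year()", codebase))  # → ['DateTime.year', 'Interval.year']
```

Because two types declare `year`, the result set has more than one candidate, which is exactly why the disambiguation and filtering step follows.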
LOOCV “Evaluating a classifier’s performance” 2
Leave-one-out Cross Validation Source: https://towardsdatascience.com/train-test-split-and-cross-validation-in-python-80b61beca4b6
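Leave-one-out cross-validation can be sketched in pure Python: train on all but one example, test on the held-out one, and repeat for every example. The toy 1-D data and the 1-NN classifier below are illustrative assumptions (the paper's own classifier is MaxEnt).

```python
# Minimal sketch of leave-one-out cross-validation (LOOCV) with a
# 1-nearest-neighbour classifier on assumed toy data.

def loocv_accuracy(xs, ys):
    """Hold out each point in turn, train on the rest, and score the prediction."""
    correct = 0
    for i in range(len(xs)):
        train = [(x, y) for j, (x, y) in enumerate(zip(xs, ys)) if j != i]
        # 1-NN prediction: label of the closest remaining training point
        pred = min(train, key=lambda t: abs(t[0] - xs[i]))[1]
        correct += (pred == ys[i])
    return correct / len(xs)

xs = [1.0, 1.2, 0.9, 5.0, 5.3, 4.8]
ys = ["rel", "rel", "rel", "irrel", "irrel", "irrel"]
print(loocv_accuracy(xs, ys))  # → 1.0 (each held-out point is nearest its own class)
```

With n examples this trains n models, which is why LOOCV is common for small corpora like an annotated tutorial dataset.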
MaxEnt Classifier “Using Maximum Entropy for Text Classification” by Nigam et al. 3
Maximum Entropy:
- Technique for estimating probability distributions from data
- Principle: without external knowledge, pick the distribution that has the maximum entropy (the most uniform one)
- Labeled training data puts constraints on the distribution
Example Source: NLP by Dan Jurafsky and Chris Manning
Add Noun feature: f1 = {NN, NNS, NNP, NNPS} Add Proper Noun feature: f2 = {NNP, NNPS} Source: NLP by Dan Jurafsky and Chris Manning
Constraints and Features
- Constrain the model distribution to have the same expected value for each feature as observed in the training data, D.
- Features for text classification: word-presence indicators per class.
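A binary maximum-entropy classifier is equivalent to logistic regression, and the constraint above shows up directly in training: the gradient is the empirical feature value minus the model's expected feature value. The toy documents and binary word features below are illustrative assumptions, not the paper's setup.

```python
# A maximum-entropy (logistic regression) text classifier trained by
# gradient ascent on assumed toy relevant(1)/irrelevant(0) documents.
import math

docs = [({"api", "method"}, 1), ({"api", "class"}, 1),
        ({"intro", "history"}, 0), ({"history", "overview"}, 0)]
vocab = sorted(set(w for ws, _ in docs for w in ws))

def p_relevant(words, w):
    """P(label=1 | doc) under the exponential (max-ent) model."""
    score = sum(w[i] for i, v in enumerate(vocab) if v in words)
    return 1.0 / (1.0 + math.exp(-score))

w = [0.0] * len(vocab)
for _ in range(200):                    # gradient ascent on the log-likelihood
    for words, y in docs:
        err = y - p_relevant(words, w)  # empirical minus expected feature value
        for i, v in enumerate(vocab):
            if v in words:
                w[i] += 0.5 * err

print(p_relevant({"api"}, w) > 0.5)     # "api" only occurs in relevant docs
```

At convergence the updates vanish, i.e. expected feature counts match the training data, which is the constraint the slide states.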
Cosine Similarity with tf-idf “Comparison with Information Retrieval” 4
Tf-Idf
- Technique to vectorise text data
- Term Frequency is a simple frequency count of a term in a document
- Inverse Document Frequency gives more weight to rare words
Cosine Similarity - Measures the cosine of the angle between the vectors: - They consider a section relevant if the similarity value is higher than a certain threshold.
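The tf-idf and cosine-similarity pipeline can be sketched in a few lines. The tiny corpus, the query, and the 0.1 relevance threshold below are illustrative assumptions; the paper's actual threshold is tuned, not fixed.

```python
# Sketch of tf-idf vectorisation plus cosine similarity with a
# relevance threshold, on an assumed toy corpus.
import math
from collections import Counter

corpus = ["datetime year month", "parse date string", "http client request"]
docs = [d.split() for d in corpus]

def tfidf(doc):
    """Map each term to tf * idf, with idf = log(N / document frequency)."""
    n = len(docs)
    return {t: c * math.log(n / sum(t in d for d in docs))
            for t, c in Counter(doc).items()}

def cosine(a, b):
    dot = sum(a[t] * b.get(t, 0.0) for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

query = tfidf("datetime year".split())
sims = [cosine(query, tfidf(d)) for d in docs]
relevant = [s > 0.1 for s in sims]      # assumed similarity threshold
print(relevant)                         # → [True, False, False]
```

Only the first document shares terms with the query, so it is the only one whose similarity clears the threshold.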
KAPPA score “Annotating the Experimental Corpus” 5
Kappa formula - Measures inter-annotator agreement. ▪ Po: observed agreement among annotators ▪ Pe: hypothetical probability of chance agreement ▪ More robust than simple percent agreement calculation
Kappa Example:
▪ Po = (20+15) / 50 = 0.7
▪ P(Yes) = 0.5 * 0.6 = 0.3
▪ P(No) = 0.5 * 0.4 = 0.2
▪ Pe = P(Yes) + P(No) = 0.5
▪ Kappa = (0.7 - 0.5) / (1 - 0.5) = 0.4
Source: https://en.wikipedia.org/wiki/Cohen%27s_kappa
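The worked example above corresponds to a 2x2 agreement table (rows: annotator A, columns: annotator B) with 50 items; the counts below are the ones from the cited Wikipedia example that produce Po = 0.7 and Pe = 0.5.

```python
# Cohen's kappa computed from the slide's example agreement table.

def cohens_kappa(matrix):
    n = sum(sum(row) for row in matrix)
    p_o = sum(matrix[i][i] for i in range(len(matrix))) / n    # observed agreement
    p_e = sum((sum(matrix[i]) / n) * (sum(row[i] for row in matrix) / n)
              for i in range(len(matrix)))                     # chance agreement
    return (p_o - p_e) / (1 - p_e)

counts = [[20, 5],    # A=Yes: B said Yes 20 times, No 5 times
          [10, 15]]   # A=No:  B said Yes 10 times, No 15 times
print(cohens_kappa(counts))  # ≈ 0.4, as in the worked example
```

Kappa of 0.4 despite 70% raw agreement shows why chance correction matters: half of the agreement is expected by chance alone.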
Thanks! Any questions?