Presented by - Karan Kurani and Jason Marcell (Some slides adapted - PowerPoint PPT Presentation

Nov 21, 2022 •5 likes •165 views

Presented by - Karan Kurani and Jason Marcell (Some slides adapted from presentation on 12 th November) Karan Jason Theo Kiyan Bistra Goal Datasets Software Engineering Latent Dirichlet Allocation Methodology Results

Presented by - Karan Kurani and Jason Marcell (Some slides adapted from presentation on 12 th November)
Karan Jason Theo Kiyan Bistra
 Goal  Datasets  Software Engineering  Latent Dirichlet Allocation  Methodology  Results  Future Work
 Find people who are doing Comp Sust. But who are not aware about it or we don’t know about them.  Techniques –  Citation Network Analysis (Not implemented yet)  Similarity Measure  Combination of both.
 CS Based - DBLP, arnetminer.org, CiteSeerX.  Multidisciplinary – BASE, Bioone, ChemSeerX, Crossref for citation.  Currently Used –
Revision Logging Unit Testing Control Object-Relational Mapping Integrated Development Environment
 DBLP Stats:  Total docs: 1632441  With abstract text: 653507  With references: 316559  Possible approaches included –  LSA, pLSA and LDA.  All of them make a bag of words model.
*From the review paper “Topic Models” - David M. Blei, Princeton University. John D. Lafferty, Carnegie  Mellon University
Images (Fei-Fei and Perona, 2005; Russell et al., 2006; Blei and Jordan, Population 2003; Barnard et genetics data al., 2003), (Pritchard et al., 2000), Survey data (Erosheva et al., 2007), Social ne l networks ks d data (Airoldi et al.,2007).
DBLP Data Set CompSust Stop Words Keyword Filter Filter MAHOUT LDA Extract corpus and seed paper topic distributions Squared Symmetric KL- Cosine Distance Euclidean divergence Distance distance
 Evolving results set can be browsed on the web: http://www.cs.cornell.edu/~kiyan/compsust- sn/
 Noisy but Encouraging (Most of the results are recent (2006-2010.) )  Reasons -  Many false positives because of alternate uses of keywords.  Over fitting because of sub optimal parameters for LDA.
Correlated Topic Models Dynamic Topic Models
 Add additional data sources.  Customized web crawler.  Incorporate network analysis (Author – topic model, Link- LDA)

Recommend

CSC373 Algorithm Design, Analysis & Complexity Karan Singh 373F19 Karan Singh 1

CSC373 Algorithm Design, Analysis & Complexity Karan Singh 373F19 Karan Singh 1 Introduction Instructors Karan Singh o dgp.toronto.edu/~karan, karan@dgp, BA 5258 o SEC 5101 and 5201 Nisarg Shah o cs.toronto.edu/~nisarg,

997 views • 65 slides

CSC418 Computer Graphics Im not Professor Karan Singh Course web site (includes course

CSC418 Computer Graphics Im not Professor Karan Singh Course web site (includes course information sheet and discussion board): http://www.dgp.toronto.edu/~karan/courses/418/ Instructors: L0101, T 6-8pm L0201, W 3-5pm Karan Singh David

1.27k views • 98 slides

Week 2: Greedy Algorithms Karan Singh 373F19 - Karan Singh 1 Recap Divide & Conquer

CSC373 Week 2: Greedy Algorithms Karan Singh 373F19 - Karan Singh 1 Recap Divide & Conquer Master theorem Counting inversions in ( log ) Finding closest pair of points in 2 in log Fast

653 views • 47 slides

January 28 2013 presentation by Karen Marcell Decisions. We make them every day. Some are Routine

January 28 2013 presentation by Karen Marcell Decisions. We make them every day. Some are Routine . Some are Reactive with unintended consequences. Some are Proactive, Well Thought out and can Result in Far Reaching Benefits. With respect to the

386 views • 3 slides

CSC418: Computer Graphics Some slides and figures courtesy of Karan Singh Some figures from Peter

CSC418: Computer Graphics Some slides and figures courtesy of Karan Singh Some figures from Peter Shirley, Fundamentals of Computer Graphics, 3rd Ed. Some video shots used from YouTube channel AlanBeckerTutorials Other images sourced

1.1k views • 87 slides

Orthogonal polynomials, zeros and electrostatics F. Marcell an Universidad Carlos III de

Orthogonal polynomials, zeros and electrostatics F. Marcell an Universidad Carlos III de Madrid (UC3M) and Instituto de Ciencias Matem aticas (ICMAT) OPCOP2017 - Universidad de Cantabria April 19-22, 2017 - (CIEM) Castro Urdiales

618 views • 32 slides

Observation of the drying process in secondary school kos Szeidemann, ron Bodor, Marcell

Observation of the drying process in secondary school kos Szeidemann, ron Bodor, Marcell Juhsz Teaching Physics Innovatively New Learning Environm ents and Methods in Physics Education 17-19 August 20 15 Extra curricular activity The

368 views • 20 slides

SOL SOLUTION UTION FO FOR R PR PRAWN WN FAR ARMING MING Marcell B. de Carvalho Ridley

REC RECIR IRCUL CULATION TION AQU QUACUL CULTURE TURE SY SYSTEM STEM (RAS) (RAS) - A BIOSECUR A BIOSECURITY ITY SOL SOLUTION UTION FO FOR R PR PRAWN WN FAR ARMING MING Marcell B. de Carvalho Ridley Aquafeed OVER VERVI

878 views • 17 slides

Mechanical integration of PANGEA Marcell Steinen Helmholtz-Institut Mainz Panda Coll. Meeting

Mechanical integration of PANGEA Marcell Steinen Helmholtz-Institut Mainz Panda Coll. Meeting 18/1, GSI, 03/06/17 PANGEA PAnda GErmanium Array 20 individual detectors, 3 crystals each Electro-mech. Cooling (~LN2 temperatures)

340 views • 11 slides

MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN

MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN]

841 views • 49 slides

Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides

Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides

105 views • 8 slides

Complexity 373F19 - Nisarg Shah & Karan Singh 1 Recap Linear Programming Standard

CSC373 Weeks 7 & 8: Complexity 373F19 - Nisarg Shah & Karan Singh 1 Recap Linear Programming Standard formulation Slack formulation Simplex Duality 373F19 - Nisarg Shah & Karan Singh 2 And Now

990 views • 84 slides

CSC373 Algorithm Design, Analysis & Complexity Nisarg Shah 373F19 - Nisarg Shah 1

CSC373 Algorithm Design, Analysis & Complexity Nisarg Shah 373F19 - Nisarg Shah 1 Introduction Instructors Karan Singh o dgp.toronto.edu/~karan, karan@dgp, BA 5258 o SEC 5101 and 5201 Nisarg Shah o cs.toronto.edu/~nisarg,

1.19k views • 65 slides

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah & Karan Singh 1 Randomized

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah & Karan Singh 1 Randomized Algorithms Input Deterministic Algorithm Output Input Randomized Algorithm Output Randomness 373F19 - Nisarg Shah & Karan Singh 2 Randomized

1.07k views • 57 slides

SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides

SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides

427 views • 10 slides

Jason OBeirne MARKETING PRESENTATION 2015 LOCAL EXPERTS. GLOBAL REACH. Jason OBeirne

...extraordinary homes with extraordinary lives. Jason OBeirne MARKETING PRESENTATION 2015 LOCAL EXPERTS. GLOBAL REACH. Jason OBeirne 2015 773.368.3421 JOBEIRNE@JAMESONSIR.COM JASON OBEIRNE Jason and his team provide a level of

423 views • 10 slides

Beta Presentation Open Source Intel The Capstone Experience Team GM Ben Buscarino Will

Beta Presentation Open Source Intel The Capstone Experience Team GM Ben Buscarino Will Crecelius Igli Ndoj Qiming Ren Taylor Zachar Department of Computer Science and Engineering Michigan State University From Students Spring 2019

388 views • 9 slides

Web crawler system for collecting malicious activities FIRST TC Mauritius 2016 Hisao Nashiwa

Web crawler system for collecting malicious activities FIRST TC Mauritius 2016 Hisao Nashiwa Internet Initiative Japan Inc. Who am I? Threat analyst at Internet Initiative Japan Inc. that is short for IIJ. IIJ is a Japanese

489 views • 24 slides

Session 6A - Big data sources: web scraping and smart meters Using Internet as a Data Source for

NTTS 2015 Session 6A - Big data sources: web scraping and smart meters Using Internet as a Data Source for Official Statistics: a Comparative Analysis of Web Scraping Technologies Giulio Barcaroli(*) (barcarol@istat.it), Monica Scannapieco (*)

441 views • 12 slides

Step by step guide Step 1: Purchasing an RSSeo! membership Step 2: Download RSSeo! 2.1 Download

Step by step guide Step 1: Purchasing an RSSeo! membership Step 2: Download RSSeo! 2.1 Download the component 2.2 Download RSSeo! language files Step 3: Installing RSSeo! 3.1 Installing the component 3.2 Minimum requirements 3.3 Installing

703 views • 53 slides

COMPARISON OF CATEGORICAL PROPERTIES OFFERED BY MULTIPLE MOOC PLATFORMS Using automated Web

COMPARISON OF CATEGORICAL PROPERTIES OFFERED BY MULTIPLE MOOC PLATFORMS Using automated Web Crawler in Python with Scrapy Bachelor Thesis - Introduction Presentation Louis Mbuyu Aufgabensteller: Prof. Dr. Franois Bry Betreuer: Prof. Dr.

421 views • 27 slides

SCORPION B SCAN CRAWLER What is the Scorpion B Scan Crawler? The Scorpion is a rugged

SCORPION B SCAN CRAWLER What is the Scorpion B Scan Crawler? The Scorpion is a rugged remote access ultrasonic crawler designed to allow cost Effective A and B-scan imaging on above ground ferro-magnetic structures without the need for

387 views • 3 slides

TechSEO360 SEO Spider Software for Windows and Mac. Complete technical SEO and sitemaps

TechSEO360 SEO Spider Software for Windows and Mac. Complete technical SEO and sitemaps tool. Flexible crawler and filtering of collected data. Can crawl and handle very large websites. History TechSEO360 merges features from A1

856 views • 33 slides

Making Sense of Massive Amounts of Scientific Publications: The Scientific Knowledge Miner

Making Sense of Massive Amounts of Scientific Publications: The Scientific Knowledge Miner Project Francesco Ronzano, Ana Freire, Diego Saez-Trumper, Horacio Saggion 20 seconds 1 paper The Rise of Open Access Science 04 Oct 2013 Vol. 342,

602 views • 28 slides

Presented by - Karan Kurani and Jason Marcell (Some slides adapted - PowerPoint PPT Presentation

Presented by - Karan Kurani and Jason Marcell (Some slides adapted from presentation on 12 th November) Karan Jason Theo Kiyan Bistra Goal Datasets Software Engineering Latent Dirichlet Allocation Methodology Results

CSC373 Algorithm Design, Analysis & Complexity Karan Singh 373F19 Karan Singh 1

CSC418 Computer Graphics Im not Professor Karan Singh Course web site (includes course

Week 2: Greedy Algorithms Karan Singh 373F19 - Karan Singh 1 Recap Divide & Conquer

January 28 2013 presentation by Karen Marcell Decisions. We make them every day. Some are Routine

CSC418: Computer Graphics Some slides and figures courtesy of Karan Singh Some figures from Peter

Orthogonal polynomials, zeros and electrostatics F. Marcell an Universidad Carlos III de

Observation of the drying process in secondary school kos Szeidemann, ron Bodor, Marcell

SOL SOLUTION UTION FO FOR R PR PRAWN WN FAR ARMING MING Marcell B. de Carvalho Ridley

Mechanical integration of PANGEA Marcell Steinen Helmholtz-Institut Mainz Panda Coll. Meeting

MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN

Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides

Complexity 373F19 - Nisarg Shah & Karan Singh 1 Recap Linear Programming Standard

CSC373 Algorithm Design, Analysis & Complexity Nisarg Shah 373F19 - Nisarg Shah 1

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah & Karan Singh 1 Randomized

SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides

Jason OBeirne MARKETING PRESENTATION 2015 LOCAL EXPERTS. GLOBAL REACH. Jason OBeirne

Beta Presentation Open Source Intel The Capstone Experience Team GM Ben Buscarino Will

Web crawler system for collecting malicious activities FIRST TC Mauritius 2016 Hisao Nashiwa

Session 6A - Big data sources: web scraping and smart meters Using Internet as a Data Source for

Step by step guide Step 1: Purchasing an RSSeo! membership Step 2: Download RSSeo! 2.1 Download

COMPARISON OF CATEGORICAL PROPERTIES OFFERED BY MULTIPLE MOOC PLATFORMS Using automated Web

SCORPION B SCAN CRAWLER What is the Scorpion B Scan Crawler? The Scorpion is a rugged

TechSEO360 SEO Spider Software for Windows and Mac. Complete technical SEO and sitemaps

Making Sense of Massive Amounts of Scientific Publications: The Scientific Knowledge Miner

Sambuz

Useful Links

Newsletter

Mail Us

Presented by - Karan Kurani and Jason Marcell (Some slides adapted - PowerPoint PPT Presentation

Presented by - Karan Kurani and Jason Marcell (Some slides adapted from presentation on 12 th November) Karan Jason Theo Kiyan Bistra Goal Datasets Software Engineering Latent Dirichlet Allocation Methodology Results

CSC373 Algorithm Design, Analysis &amp; Complexity Karan Singh 373F19 Karan Singh 1

CSC418 Computer Graphics Im not Professor Karan Singh Course web site (includes course

Week 2: Greedy Algorithms Karan Singh 373F19 - Karan Singh 1 Recap Divide &amp; Conquer

January 28 2013 presentation by Karen Marcell Decisions. We make them every day. Some are Routine

CSC418: Computer Graphics Some slides and figures courtesy of Karan Singh Some figures from Peter

Orthogonal polynomials, zeros and electrostatics F. Marcell an Universidad Carlos III de

Observation of the drying process in secondary school kos Szeidemann, ron Bodor, Marcell

SOL SOLUTION UTION FO FOR R PR PRAWN WN FAR ARMING MING Marcell B. de Carvalho Ridley

Mechanical integration of PANGEA Marcell Steinen Helmholtz-Institut Mainz Panda Coll. Meeting

MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN SLIDES [EN] MARKDOWN

Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides Needs Slides

Complexity 373F19 - Nisarg Shah &amp; Karan Singh 1 Recap Linear Programming Standard

CSC373 Algorithm Design, Analysis &amp; Complexity Nisarg Shah 373F19 - Nisarg Shah 1

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah &amp; Karan Singh 1 Randomized

SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides SBF AGM 2017 CEO Slides

Jason OBeirne MARKETING PRESENTATION 2015 LOCAL EXPERTS. GLOBAL REACH. Jason OBeirne

Beta Presentation Open Source Intel The Capstone Experience Team GM Ben Buscarino Will

Web crawler system for collecting malicious activities FIRST TC Mauritius 2016 Hisao Nashiwa

Session 6A - Big data sources: web scraping and smart meters Using Internet as a Data Source for

Step by step guide Step 1: Purchasing an RSSeo! membership Step 2: Download RSSeo! 2.1 Download

COMPARISON OF CATEGORICAL PROPERTIES OFFERED BY MULTIPLE MOOC PLATFORMS Using automated Web

SCORPION B SCAN CRAWLER What is the Scorpion B Scan Crawler? The Scorpion is a rugged

TechSEO360 SEO Spider Software for Windows and Mac. Complete technical SEO and sitemaps

Making Sense of Massive Amounts of Scientific Publications: The Scientific Knowledge Miner

Sambuz

Useful Links

Newsletter

Mail Us

CSC373 Algorithm Design, Analysis & Complexity Karan Singh 373F19 Karan Singh 1

Week 2: Greedy Algorithms Karan Singh 373F19 - Karan Singh 1 Recap Divide & Conquer

Complexity 373F19 - Nisarg Shah & Karan Singh 1 Recap Linear Programming Standard

CSC373 Algorithm Design, Analysis & Complexity Nisarg Shah 373F19 - Nisarg Shah 1

CSC373 Week 11: Randomized Algorithms 373F19 - Nisarg Shah & Karan Singh 1 Randomized