
Python for NLP August 26-30, 2019 LORIA, Nancy



  1. Python for NLP August 26-30, 2019 LORIA, Nancy https://synalp.loria.fr/python4nlp

  2. Organisation and Funding
     ● LIFT (C. Gardent), CNRS GDR
     ● OLKi Impact Project (C. Cerisara), LUE IDEX

  3. Audience
     ● Basic Python required
     ● Humanities students (linguists etc.) and researchers
     ● CS students and researchers
     ● Industry professionals

  4. Objective
     Learn to:
     ● Retrieve and store textual data from the web and from APIs (e.g., Gutenberg books, web pages, social network data)
     ● Apply linguistic processing (POS tagging, parsing, NER, etc.)
     ● Compute basic statistics and visualise them (number of sentences, of tokens, etc.)
     ● Apply basic Machine Learning techniques (classification, clustering, regression)
     ● Use word embeddings

  5. Program

  6. Collecting Text
     ● Interaction Web Server / Browser
     ● What’s in a web page
     ● Processing web pages
     ● What’s an API
     ● Extracting text from Wikipedia and Social Networks
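
As an illustration of the "Processing web pages" topic, here is a minimal sketch that pulls the visible text out of an HTML page using only the Python standard library. The `TextExtractor` class, the `extract_text` helper and the `SAMPLE_HTML` string are hypothetical names and data invented for this example; the course itself may rely on other tools.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def extract_text(html):
    """Return the visible text of an HTML string as one space-joined string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Hypothetical page content, used here instead of a real download.
SAMPLE_HTML = """
<html><head><title>Demo</title><style>p {color: red;}</style></head>
<body><h1>Python for NLP</h1><p>Collecting <b>text</b> from the web.</p>
<script>var x = 1;</script></body></html>
"""

page_text = extract_text(SAMPLE_HTML)
print(page_text)
```

In practice the HTML string would come from a web request or an API response rather than a literal; the extraction step stays the same.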

  7. Processing Text
     ● Sentence segmentation and tokenization
     ● Morphological analysis, stemming
     ● POS tagging
     ● Named Entity Recognition
     ● Parsing
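
To make the first topic concrete, the following is a rough sketch of rule-based sentence segmentation and tokenization with regular expressions. The function names and the sample text are assumptions made for this example. The rules are deliberately naive (they break on abbreviations such as "e.g."), and NLP libraries such as NLTK or spaCy provide more robust tools.

```python
import re


def split_sentences(text):
    """Split after '.', '!' or '?' when followed by whitespace and a capital."""
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Return words (keeping internal hyphens/apostrophes) and punctuation marks."""
    return re.findall(r"\w+(?:[-']\w+)*|[^\w\s]", sentence)


# Sample text invented for the illustration.
text = "The course runs in Nancy. It covers tagging, parsing and NER! Is Python required?"

sentences = split_sentences(text)
tokens = [tokenize(s) for s in sentences]

for sentence, sentence_tokens in zip(sentences, tokens):
    print(sentence, "->", sentence_tokens)
```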

  8. Analysing Text
     ● Descriptive statistics
     ● Univariate Analysis (distribution, dispersion)
     ● Bivariate Analysis (contingency, covariance)
     ● Visualisation (scatter plots, box plots, histograms, bar plots)
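
A small sketch of the descriptive statistics mentioned here (number of sentences, number of tokens, distribution and dispersion of sentence length), using only the standard library. The toy corpus is invented for this example and assumes the text is already split into sentences and tokens.

```python
from collections import Counter
import statistics

# Toy corpus, already segmented and tokenized (invented for the illustration).
sentences = [
    ["the", "cat", "sat"],
    ["the", "dog", "barked", "loudly"],
    ["a", "bird", "sang", "a", "song"],
]

lengths = [len(s) for s in sentences]

n_sentences = len(sentences)              # number of sentences
n_tokens = sum(lengths)                   # number of tokens
mean_length = statistics.mean(lengths)    # central tendency of sentence length
spread = statistics.pstdev(lengths)       # dispersion (population standard deviation)

# Frequency distribution of tokens over the whole corpus.
frequencies = Counter(tok for s in sentences for tok in s)

print(n_sentences, n_tokens, mean_length, round(spread, 3))
print(frequencies.most_common(3))
```

The `frequencies` counter is the kind of data a bar plot or histogram would display.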

  9. Classification and Clustering
     ● What is Machine Learning?
     ● Extracting Features
     ● Train/Dev/Test Data
     ● Supervised and unsupervised learning (classification, regression, clustering)
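
The "Extracting Features" and "Train/Dev/Test Data" topics can be sketched as follows: a bag-of-words feature extractor and a reproducible three-way split. The function names, the 80/10/10 ratios and the toy labelled examples are assumptions chosen for this example, not the course's prescribed setup.

```python
import random
from collections import Counter


def extract_features(text):
    """Bag-of-words features: lowercase word counts (whitespace tokenization)."""
    return Counter(text.lower().split())


def train_dev_test_split(items, dev_ratio=0.1, test_ratio=0.1, seed=0):
    """Shuffle with a fixed seed, then cut into train, dev and test parts."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n_dev = int(len(shuffled) * dev_ratio)
    n_test = int(len(shuffled) * test_ratio)
    dev = shuffled[:n_dev]
    test = shuffled[n_dev:n_dev + n_test]
    train = shuffled[n_dev + n_test:]
    return train, dev, test


# Toy labelled data (text, label), invented for the illustration.
examples = [(f"document number {i}", i % 2) for i in range(20)]

train, dev, test = train_dev_test_split(examples)
features = extract_features("The cat saw the other cat")

print(len(train), len(dev), len(test))
print(features)
```

A model would be fitted on `train`, tuned on `dev`, and evaluated once on `test`.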

  10. Word Embeddings
      ● What are word embeddings?
      ● Downloading and using word embeddings
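
Word embeddings map each word to a vector, and similar words tend to have vectors pointing in similar directions. The sketch below compares words by cosine similarity. The three-dimensional vectors are made up for the illustration; real pretrained embeddings typically have hundreds of dimensions and are loaded from a downloaded file.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Invented toy vectors; not taken from any real embedding model.
embeddings = {
    "king": [0.9, 0.8, 0.1],
    "queen": [0.85, 0.9, 0.15],
    "apple": [0.1, 0.2, 0.95],
}


def most_similar(word):
    """Return the other vocabulary word whose vector is closest to `word`."""
    candidates = (w for w in embeddings if w != word)
    return max(candidates, key=lambda w: cosine_similarity(embeddings[word], embeddings[w]))


print(most_similar("king"))
```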

  11. Registration
      ● UL Students: 0 euros
      ● Students (<500km): 300 euros
      ● Students (>500km): 100 euros
      ● Academics: 400 euros
      ● Private Sector: 800 euros
