Normalizing tweets with edit scripts and recurrent neural embeddings
Grzegorz Chrupała | Tilburg University
Normalizing tweets
Convert tweets to a canonical form that is easy for downstream applications to process
Examples
● I will c wat i can do → I will see what I can do
● imma jus start puttn it out there → I'm going to just start putting it out there
Approaches
● Noisy-channel-style
● Finite-state transducers
● Dictionary-based
○ Hand-crafted
○ Automatically constructed
Labeled vs unlabeled data
● Noisy-channel: P(target|source) ∝ P(source|target) × P(target), where P(source|target) is estimated from labeled data and P(target) from unlabeled data
● Dictionary lookup:
○ Induce dictionary from unlabeled data
○ Labeled data for parameter tuning
Discriminative model
● target* = argmax_target P(diff(source, target) | source)
● diff(·,·) transforms source to target
● P(·) is a Conditional Random Field
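To make the labeling setup concrete, here is a minimal sketch using the sklearn-crfsuite library; the library choice and the toy feature function are assumptions, not details from the talk. Each source character gets a feature dict and an edit-op label (here, the edit script for the c_wat → see_what example from the diff slide later in the deck).

```python
# Minimal sketch of the per-character CRF labeling setup, using
# sklearn-crfsuite as an assumed stand-in; the talk does not name
# a specific CRF implementation.
import sklearn_crfsuite

def char_features(source, i):
    # Toy feature dict for position i; the real model uses n-gram
    # and SRN features (see the Features slide).
    return {
        "char": source[i],
        "prev": source[i - 1] if i > 0 else "<s>",
        "next": source[i + 1] if i < len(source) - 1 else "</s>",
    }

# One training pair: source "c_wat" labeled with its edit script.
source = "c_wat"
labels = ["DEL", "INS(see)", "NIL", "INS(h)", "NIL"]

X_train = [[char_features(source, i) for i in range(len(source))]]
y_train = [labels]

crf = sklearn_crfsuite.CRF(algorithm="lbfgs", max_iterations=100)
crf.fit(X_train, y_train)
print(crf.predict(X_train))  # one edit-op label per character position
```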
Signal from raw tweets is included via learned text representations.
Architecture
Simple Recurrent Networks
Elman, J. L. (1990). Finding structure in time. Cognitive Science, 14(2), 179–211.
Recurrent neural embeddings
● SRN trained to predict next character
● Representation: the hidden-layer activation vector
● Embed string (at each position) in low-dimensional space
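A minimal numpy sketch of the idea, assuming a plain Elman network with tanh units and toy sizes (the talk's model used 400 hidden units and was trained on raw tweets; this forward-pass-only sketch omits training):

```python
# Elman-style SRN reading one character at a time; the hidden
# activation h after reading a prefix serves as its embedding.
# Sizes and initialization are illustrative, not the talk's settings.
import numpy as np

rng = np.random.default_rng(0)
chars = sorted(set("i will c wat i can do"))
idx = {c: i for i, c in enumerate(chars)}
V, H = len(chars), 16                     # vocabulary and hidden size

W_xh = rng.normal(0, 0.1, (H, V))         # input -> hidden
W_hh = rng.normal(0, 0.1, (H, H))         # hidden -> hidden (recurrence)
W_hy = rng.normal(0, 0.1, (V, H))         # hidden -> output

def embed(s):
    """Hidden activation after reading string s (the embedding)."""
    h = np.zeros(H)
    for c in s:
        x = np.zeros(V); x[idx[c]] = 1.0  # one-hot input character
        h = np.tanh(W_xh @ x + W_hh @ h)  # Elman recurrence
    return h

def next_char_probs(s):
    """Softmax distribution over the next character given prefix s."""
    y = W_hy @ embed(s)
    e = np.exp(y - y.max())
    return e / e.sum()

print(next_char_probs("i will c")[idx["w"]])
```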
Visualizing embeddings
String     Nearest neighbors in embedding space
should h   should d, will s, will m, should a
@justth    @neenu, @raven_, @lanae, @despic
maybe u    maybe y, cause i
wen i      when i
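A lookup like the one in this table can be sketched as cosine-similarity ranking over the SRN hidden states; this reuses embed() from the previous sketch, and the candidate strings are made up for illustration:

```python
# Rank candidate strings by cosine similarity of their SRN embeddings.
import numpy as np

def cosine(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12)

def nearest(query, candidates):
    q = embed(query)  # embed() from the SRN sketch above
    return sorted(candidates, key=lambda s: -cosine(q, embed(s)))

print(nearest("wat ca", ["wat cn", "will d", "i can "]))
```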
diff - Edit script
Input:   c     _         w     a        t
diff:    DEL   INS(see)  NIL   INS(h)   NIL
Output:        see_      w     ha       t
Each position in the string is labeled with an edit op.
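A simplified reconstruction of diff and its inverse, assuming a Levenshtein alignment restricted to match/delete/insert and the op inventory shown on the slide (NIL, DEL, INS); the exact scheme of the talk's system may differ, and trailing insertions after the last source character are not handled here.

```python
# Compute a character-level edit script (one label per source
# position) and apply it back to reconstruct the target.

def edit_script(source, target):
    """One edit-op label per source character."""
    n, m = len(source), len(target)
    INF = float("inf")
    d = [[INF] * (m + 1) for _ in range(n + 1)]
    d[0][0] = 0
    for i in range(n + 1):
        for j in range(m + 1):
            if i < n and j < m and source[i] == target[j]:
                d[i + 1][j + 1] = min(d[i + 1][j + 1], d[i][j])  # match
            if i < n:
                d[i + 1][j] = min(d[i + 1][j], d[i][j] + 1)      # delete
            if j < m:
                d[i][j + 1] = min(d[i][j + 1], d[i][j] + 1)      # insert
    # Trace back to an op sequence, preferring matches.
    ops, i, j = [], n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and source[i - 1] == target[j - 1]
                and d[i][j] == d[i - 1][j - 1]):
            ops.append(("MATCH", source[i - 1])); i -= 1; j -= 1
        elif i > 0 and d[i][j] == d[i - 1][j] + 1:
            ops.append(("DEL", source[i - 1])); i -= 1
        else:
            ops.append(("INS", target[j - 1])); j -= 1
    ops.reverse()
    # Fold: inserted text attaches to the next kept source character.
    labels, buf = [], ""
    for op, ch in ops:
        if op == "INS":
            buf += ch
        elif op == "DEL":
            labels.append("DEL")
        else:
            labels.append(f"INS({buf})" if buf else "NIL")
            buf = ""
    return labels

def apply_script(source, labels):
    """Invert the edit script: reconstruct the target from the source."""
    out = []
    for ch, lab in zip(source, labels):
        if lab.startswith("INS("):
            out.append(lab[4:-1])  # inserted text goes before the char
        if lab != "DEL":
            out.append(ch)         # keep the character unless deleted
    return "".join(out)

labels = edit_script("c_wat", "see_what")
print(labels)                         # ['DEL', 'INS(see)', 'NIL', 'INS(h)', 'NIL']
print(apply_script("c_wat", labels))  # see_what
```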
Features
● Baseline n-gram features (all substrings of the window c_wat):
c, _, w, a, t; c_, _w, wa, at; c_w, _wa, wat; c_wa, _wat; c_wat
● SRN features
○ Trained on 400 MB of raw Twitter feed
○ 400 hidden units
○ Activations discretized
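The n-gram feature extraction above, plus a hedged take on the discretized SRN activations, can be sketched as follows; the window construction and the discretization threshold are assumptions (the slide only says activations are discretized):

```python
# Character n-gram features around the position being labeled, here
# shown for the whole example window "c_wat", and a toy discretization
# of SRN hidden activations into indicator features.

def ngram_features(window, max_n=5):
    """All contiguous substrings of the window, shortest first."""
    return [window[i:i + n]
            for n in range(1, max_n + 1)
            for i in range(len(window) - n + 1)]

def srn_features(h, threshold=0.5):
    """Binary indicators for hidden units above a threshold
    (assumed discretization scheme)."""
    return [f"srn{i}" for i, a in enumerate(h) if a > threshold]

print(ngram_features("c_wat"))
# ['c', '_', 'w', 'a', 't', 'c_', '_w', 'wa', 'at',
#  'c_w', '_wa', 'wat', 'c_wa', '_wat', 'c_wat']
print(srn_features([0.9, 0.1, 0.7]))  # ['srn0', 'srn2']
```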
Dataset
● Han, B., & Baldwin, T. (2011). Lexical normalisation of short text messages: Makn sens a #twitter. In Proceedings of ACL.
● 549 tweets, with normalized versions
● Only lexical normalizations
Results
● No-op: make no changes
● Doc: train on and label whole tweets
● OOV: train on and label OOV words only
Compared to Han et al. (2012)
Method          WER (%)
No-op           11.2
S-dict           9.7
GHM-dict         7.6
HB-dict          6.6
Dict-combo       4.9
OOV NGRAM+SRN    4.7
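For reference, WER here is word-level edit distance between system output and reference, normalized by reference length; a minimal sketch:

```python
# Word error rate: Levenshtein distance over word sequences divided
# by the number of reference words.

def wer(hyp, ref):
    h, r = hyp.split(), ref.split()
    d = [[0] * (len(r) + 1) for _ in range(len(h) + 1)]
    for i in range(len(h) + 1):
        d[i][0] = i
    for j in range(len(r) + 1):
        d[0][j] = j
    for i in range(1, len(h) + 1):
        for j in range(1, len(r) + 1):
            sub = d[i - 1][j - 1] + (h[i - 1] != r[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(h)][len(r)] / len(r)

print(wer("I will c wat i can do", "I will see what I can do"))  # ≈ 0.43
```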
Where SRN features helped
Count  Source     Normalized
9      cont       continued
5      gon        gonna
4      bro        brother
4      congrats   congratulations
3      yall       you
3      pic        picture
2      wuz        what’s
2      mins       minutes
2      juss       just
2      fb         facebook
Conclusion
● A supervised discriminative model matches the state of the art with little training data
● Neural text embeddings effectively incorporate signal from raw tweets