Text-Denoising mit Go(lang) Bedcon 2015 – Dennis Kluge – Charité Berlin
dataflex-science.de
Berlin Expert Days 2015 | Dennis Kluge | Slide 3
Berlin Expert Days 2015 | Dennis Kluge | Folie 4
NLP Natural Language Processing Berlin Expert Days 2015 | Dennis Kluge | Slide 5
?
Die @bedcon ist die großartigste #Konferenz des Jahres. 😎 http://bedcon.org Berlin Expert Days 2015 | Dennis Kluge | Slide 7
Die bedcon ist die großartigste Konferenz des Jahres Berlin Expert Days 2015 | Dennis Kluge | Slide 8
die bedcon ist die großartigste konferenz des jahres Berlin Expert Days 2015 | Dennis Kluge | Slide 9
[“di”, “ie”, “e_”, “_b”, “be”, “ed”…] Berlin Expert Days 2015 | Dennis Kluge | Slide 10
GO(LANG) • 2009 - erschienen … 2012 - Version 1.0 • kompiliert, stark typisiert, imperativ, strukturiert • optimiert für Nebenläufigkeit • Garbage Collection • C angelehnte Syntax Berlin Expert Days 2015 | Dennis Kluge | Slide 12
STRINGS • Go source code is always UTF-8. • A string holds arbitrary bytes. • A string literal, absent byte-level escapes, always holds valid UTF-8 sequences. • Those sequences represent Unicode code points, called runes. • No guarantee is made in Go that characters in strings are normalized. • https://blog.golang.org/strings Berlin Expert Days 2015 | Dennis Kluge | Slide 13
Berlin Expert Days 2015 | Dennis Kluge | Slide 14
Boolean, Numeric, String, Array, Slice, Map, Interface, Map, Channel Berlin Expert Days 2015 | Dennis Kluge | Slide 15
Berlin Expert Days 2015 | Dennis Kluge | Slide 16
Type Inference Berlin Expert Days 2015 | Dennis Kluge | Slide 16
Berlin Expert Days 2015 | Dennis Kluge | Slide 17
Berlin Expert Days 2015 | Dennis Kluge | Slide 18
Package Berlin Expert Days 2015 | Dennis Kluge | Slide 19
Berlin Expert Days 2015 | Dennis Kluge | Slide 20
Großbuchstabe deklariert public Berlin Expert Days 2015 | Dennis Kluge | Slide 21
Berlin Expert Days 2015 | Dennis Kluge | Slide 22
C-Style Berlin Expert Days 2015 | Dennis Kluge | Slide 23
Berlin Expert Days 2015 | Dennis Kluge | Slide 24
Berlin Expert Days 2015 | Dennis Kluge | Slide 25
MATCHING URLS • mathiasbynens.be/demo/url-regex • stackoverflow.com/questions/161738/what-is-the-best-regular- expression-to-check-if-a-string-is-a-valid-url Berlin Expert Days 2015 | Dennis Kluge | Slide 26
Berlin Expert Days 2015 | Dennis Kluge | Slide 27
Berlin Expert Days 2015 | Dennis Kluge | Slide 28
Berlin Expert Days 2015 | Dennis Kluge | Slide 29
Berlin Expert Days 2015 | Dennis Kluge | Slide 30
Berlin Expert Days 2015 | Dennis Kluge | Slide 31
github.com/horstmumpitz/bedcon2015 Berlin Expert Days 2015 | Dennis Kluge | Slide 32
BAG OF BIGRAMS Bigram Tweet 1 Tweet 2 Tweet 3 di 1 2 0 ie 0 5 4 e_ 3 0 9 … Berlin Expert Days 2015 | Dennis Kluge | Slide 34
Berlin Expert Days 2015 | Dennis Kluge | Slide 35
D a n k e –dennis.kluge@charite.de – @HorstMumpitz
Recommend
More recommend