Phonotactic Reconstruction of Encrypted VoIP Conversations: Hookt - PowerPoint PPT Presentation

Phonotactic Reconstruction of Encrypted VoIP Conversations: Hookt on fon-iks Adam White, Austin Matthews, Kevin Snow, and Fabian Monrose Presented By Corly Leung

Introduction - Google Hangout, Skype, FaceTime - Encrypting VoIP Packets - Variable-Bit-Rate for speech encoding - Length-preserving stream ciphers - Determine language spoken, identity, and presence of known phrases

Background - Phonetic Models of Speech - Individual units of phones - Consonants vs Vowels - Characterize by articulatory processes - Alphabets for representing phones: International Phonetic Alphabet (IPA) - Voice over IP - Audio encoded with an audio codec - Code Excited Linear Prediction - Excitation signal and Shape Signal

Related Works - Traffic Analysis of Encrypted Network - Encrypted VoIP calls to infer language and match to known phrases - Silence suppression to identify speeches

Data and Adversarial Assumptions - TIMIT Acoustic-Phonetic Continuous Speech Corpus - Collection of Speech with time-aligned word and phonetic transcripts - Encoded to Speex encoded - Adversary - Sequence of Packet Lengths for an encrypted VoIP call - Knowledge of the language - Representative example of sequences for each phoneme - Phonetic dictionary

High Level Overview of Approach

Finding the Phoneme Boundaries - Identify which packets represent a portion of speech containing boundary between phonemes. - Maximum entropy modeling by maximizing p(w|v) - Evaluation: Cross Validation with about 0.85 accuracy for n=1 - n frames within boundary

Classifying the Phonemes - Classification problem of various phonemes - Context dependent - Maximum entropy modeling: model only parameters of interest - Context independent - Profile hidden Markov modeling: model entire distribution over examples - Bayesian inference to update posterior given by maximum entropy classifier with evidence by HMM - Enhancing Classification using Language Modeling - Evaluation: 77% context dependent, 67% context independent vs 69% human

Segmenting Phoneme Streams Into Words - Identify likely word boundaries - Insert potential word breaks into sequence of phonemes - Pronunciation dictionary to find valid word matches - Evaluation: Precision 73% and Recall 85%

Identifying Words via Phonetic Edit Distance - Convert Subsequences of Phonemes into English Words - Phonetically based alignment method - Distance between two vowels/ consonants by rounding, backness, height or voice, manner, and place of articulation - Phonetic distance between sequence and each pronunciation in dictionary - Homophones (eight vs ate) - Word and part of speech model

Overall Evaluation - Speaker independent model - Content-dependent - Multiple utterance of particular sentence - Scoring around 0.67 and 0.9 with 0.5 being understandable - Content-independent - All TIMIT utterances - 0.45 average

Measuring Confidence - Close pronunciation matches are more likely to be correct than distant matches - Mean of probability estimates of each word in hypothesized transcript - Forgoing less confidence words

Mitigations - Varying frame based per packet - Packets are observed in correct order - Relatively large block sizes - Constant bit-rate codecs - Drop or packets

Discussion - What are the key contributions of the paper? - How practical is the attack? - Are the mitigations sufficient?

Phonotactic Reconstruction of Encrypted VoIP Conversations: Hookt - PowerPoint PPT Presentation

Phonotactic Reconstruction of Encrypted VoIP Conversations: Hookt on fon-iks Adam White, Austin Matthews, Kevin Snow, and Fabian Monrose Presented By Corly Leung Introduction - Google Hangout, Skype, FaceTime - Encrypting VoIP Packets -

Learning Phonotactic Grammars from Surface Forms: Phonotactic Patterns are Neighborhood-distinct

3D RECONSTRUCTION Reconstruction method Reconstruction from images Reconstruction from video

Expressing (most of) Phonotactic Knowledge as Contrast Bruce Tesar Linguistics Dept. / Center

Improved Modeling of Cross-Decoder Phone Co-occurrences in SVM-based Phonotactic Language

Warm Welcome Matrix SETU VTEP Single Span VoIP to T1/E1 PRI Gateway Introduction VoIP to T1/E1

What is VoIP? 1 VoIP What is it and how can it help my business? - VoIP is an acronym for

VoIP switching and billing suite Break through your data VoIP switching and billing suite

Voice over IP (VoIP) Technology Services 875-4357 cchelpdesk@ccis.edu What is VoIP? VoIP

Delaunay Triangulation: Applications Reconstruction Meshing 1 Reconstruction From points 2 -

Improving QoS of VoIP Improving QoS of VoIP over Wireless Networks over Wireless Networks

Transporting Voice by Using IP NTP VoIP Testbed: A SIP-based VoIP Platform Department of CSIE,

VoIP/SMPP traffic sniffer Break through your data Traffic sniffer modules VoIP traffic sniffer

VoIP Hacking Lars Strand PhD student Norwegian Defence Research Establishment (FFI) Jely,

SIP- -based Prepaid Mechanism based Prepaid Mechanism SIP on NTP VoIP Platform in on NTP VoIP

Modern VoIP in Modern Infrastructures Designing and implementing VoIP architectures in the cloud

Overview Overview VoIP Introduction Basic PSTN Concepts and SS7 Old Private

Evaluation: Simulate Effect on Simulation Results on Internet Network Traces Simulate

4/9/2012 Spring Clean Your Documents David A. Ericksen, Esq. Severson & Werson, A Professional

Stop the Drop: Profiles of Innovative Medicaid Renewal Initiatives and Lessons for 2014 and

Question as You Arrive What is the ONE most urgent, nagging, burning issue or concern for you

WEBRTC, MOBILE CONSIDERATIONS AND VOICE OVER IP IETF e W3C 0 c . 1 r u Google C o T

Transport Layer over Wireless Networks + Voice over IP (VoIP) JP Hubaux With help from P.

Natural Language for Communication ( cont .) -- Speech Recognition Chapter 23.5 Automatic

Automated Speech Recognition in Controller Communications applied to Workload Measurement Third

Sambuz

Useful Links

Newsletter

Mail Us

Phonotactic Reconstruction of Encrypted VoIP Conversations: Hookt - PowerPoint PPT Presentation

Phonotactic Reconstruction of Encrypted VoIP Conversations: Hookt on fon-iks Adam White, Austin Matthews, Kevin Snow, and Fabian Monrose Presented By Corly Leung Introduction - Google Hangout, Skype, FaceTime - Encrypting VoIP Packets -

Learning Phonotactic Grammars from Surface Forms: Phonotactic Patterns are Neighborhood-distinct

3D RECONSTRUCTION Reconstruction method Reconstruction from images Reconstruction from video

Expressing (most of) Phonotactic Knowledge as Contrast Bruce Tesar Linguistics Dept. / Center

Improved Modeling of Cross-Decoder Phone Co-occurrences in SVM-based Phonotactic Language

Warm Welcome Matrix SETU VTEP Single Span VoIP to T1/E1 PRI Gateway Introduction VoIP to T1/E1

What is VoIP? 1 VoIP What is it and how can it help my business? - VoIP is an acronym for

VoIP switching and billing suite Break through your data VoIP switching and billing suite

Voice over IP (VoIP) Technology Services 875-4357 cchelpdesk@ccis.edu What is VoIP? VoIP

Delaunay Triangulation: Applications Reconstruction Meshing 1 Reconstruction From points 2 -

Improving QoS of VoIP Improving QoS of VoIP over Wireless Networks over Wireless Networks

Transporting Voice by Using IP NTP VoIP Testbed: A SIP-based VoIP Platform Department of CSIE,

VoIP/SMPP traffic sniffer Break through your data Traffic sniffer modules VoIP traffic sniffer

VoIP Hacking Lars Strand PhD student Norwegian Defence Research Establishment (FFI) Jely,

SIP- -based Prepaid Mechanism based Prepaid Mechanism SIP on NTP VoIP Platform in on NTP VoIP

Modern VoIP in Modern Infrastructures Designing and implementing VoIP architectures in the cloud

Overview Overview VoIP Introduction Basic PSTN Concepts and SS7 Old Private

Evaluation: Simulate Effect on Simulation Results on Internet Network Traces Simulate

4/9/2012 Spring Clean Your Documents David A. Ericksen, Esq. Severson &amp; Werson, A Professional

Stop the Drop: Profiles of Innovative Medicaid Renewal Initiatives and Lessons for 2014 and

Question as You Arrive What is the ONE most urgent, nagging, burning issue or concern for you

WEBRTC, MOBILE CONSIDERATIONS AND VOICE OVER IP IETF e W3C 0 c . 1 r u Google C o T

Transport Layer over Wireless Networks + Voice over IP (VoIP) JP Hubaux With help from P.

Natural Language for Communication ( cont .) -- Speech Recognition Chapter 23.5 Automatic

Automated Speech Recognition in Controller Communications applied to Workload Measurement Third

Sambuz

Useful Links

Newsletter

Mail Us

4/9/2012 Spring Clean Your Documents David A. Ericksen, Esq. Severson & Werson, A Professional