Creating a long-term "memory" for the global DNS
Mattijs Jonker
OpenINTEL, WIE-KISMET 2019, 2019-12-09
Introduction
● Almost five years ago, we started with an idea:
  – "Can we measure (large parts of) the global DNS on a daily basis?"
● In this talk, I will discuss:
  – The data we gather (nowadays)
  – How we perform our measurements
  – Which data we share
  – And planned improvements / additional data
How we perform our measurements
● OpenINTEL performs an active measurement, sending a fixed set of queries for all covered names, once every 24 hours
● We do this at scale, covering over 227 million domains per day:
  – gTLDs: .com, .net, .org, .info, .mobi, .aero, .asia, .name, .biz, .gov + almost 1200 "new" gTLDs (.xxx, .amsterdam, .berlin, …)
  – ccTLDs: .nl, .se, .nu, .ca, .fi, .at, .dk, .ru, .рф, .us, .na, .gt, .co
  – Various other sources: Alexa top 1M, Cisco Umbrella, diverse blacklists
How we perform our measurements
● The measurement process involves three stages:
  1. Extraction of names
  2. Active measurement
  3. Streaming and persisting data
Stage I: collecting names
● Extraction of names from zone files and other sources (at least once daily)
● Store state of the covered namespace in a "names to measure" DB
● Convert zone files to Avro (see the sketch below)
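To give a feel for this conversion step, here is a minimal sketch (not the actual OpenINTEL pipeline): it pulls NS delegations out of a zone file with dnspython and writes them as Avro records with fastavro. The zone file name, the schema, and the field names are illustrative assumptions.

    # Minimal sketch: extract delegations from a zone file and store them as Avro.
    # Schema and field names are illustrative, not the actual OpenINTEL schema.
    import dns.rdatatype
    import dns.zone
    from fastavro import parse_schema, writer

    schema = parse_schema({
        "name": "ZoneEntry", "type": "record",
        "fields": [
            {"name": "domain", "type": "string"},
            {"name": "nameserver", "type": "string"},
        ],
    })

    # Hypothetical zone file; real zone files are obtained from the registries.
    zone = dns.zone.from_file("example-zone.db", origin="example.")

    records = (
        {"domain": name.to_text(), "nameserver": rdata.target.to_text()}
        for name, ttl, rdata in zone.iterate_rdatas(dns.rdatatype.NS)
    )

    with open("zone.avro", "wb") as out:
        writer(out, schema, records)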
Stage II: main measurement
● Actively sending queries for all collected names (daily); see the worker sketch below
● Workers write results to files, chunked per 100k names
● Also track measurement performance (meta-data)
[Figure: Stage II – measurements / querying. A coordinator takes domain names from the names DB and hands them to a scalable set of workers per source; the workers send DNS queries to the Internet and write per-source measurement data (Avro) plus measurement meta-data.]
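As an illustration of what a worker does, the sketch below resolves a small, fixed set of query types for each name using dnspython. The query-type list is an illustrative subset; chunking per 100k names, the resolver setup, and the meta-data tracking of the real system are omitted.

    # Minimal sketch of a measurement worker: send a fixed set of queries per name.
    import dns.exception
    import dns.resolver

    QUERY_TYPES = ["SOA", "NS", "A", "AAAA", "MX", "TXT"]  # illustrative subset

    def measure(name: str) -> list[dict]:
        results = []
        for qtype in QUERY_TYPES:
            try:
                answer = dns.resolver.resolve(name, qtype)
                for rr in answer:
                    results.append({"query_name": name, "query_type": qtype,
                                    "answer": rr.to_text()})
            except dns.exception.DNSException:
                # Failures would be recorded as meta-data in the real pipeline.
                pass
        return results

    print(measure("example.com"))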
Stage III: storage and persistence
● We stream the data (measurement, meta-data, zone files) to a Kafka cluster
  – Allows near real-time stream-based analysis (WIP); see the consumer sketch below
● Data is persisted in HDFS
  – allowing batch-based, longitudinal analyses (many successes)
● Data is cloned off-site (archive on tape & CAIDA clone)
● We are adding additional data to our streaming system (e.g., CTLs, RPKI data, ...)
[Figure: Stage III – data streaming, enrichment & persistence. Components: Kafka cluster; measurement data & meta-data (Avro); zone file data (Avro); other data sources (RouteViews pfx2as, geolocation, more to be added); persistence in HDFS on a Hadoop cluster; off-site archival (Swift, tape); CAIDA/SDSC clone (com/net/org).]
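To illustrate the stream-based side, here is a minimal consumer sketch: it reads Avro-encoded messages from a Kafka topic with kafka-python and decodes them with fastavro. The broker address, topic name, and schema are hypothetical, not the actual OpenINTEL setup.

    # Minimal sketch of a stream consumer; broker, topic, and schema are hypothetical.
    import io
    from fastavro import parse_schema, schemaless_reader
    from kafka import KafkaConsumer

    schema = parse_schema({
        "name": "Measurement", "type": "record",
        "fields": [
            {"name": "query_name", "type": "string"},
            {"name": "query_type", "type": "string"},
            {"name": "answer", "type": "string"},
        ],
    })

    consumer = KafkaConsumer("openintel-measurements",            # hypothetical topic
                             bootstrap_servers="kafka.example.net:9092")
    for message in consumer:
        record = schemaless_reader(io.BytesIO(message.value), schema)
        print(record["query_name"], record["query_type"], record["answer"])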
What do we have, in simple numbers
● Started measuring in February 2015
● We collect over 2.4 × 10⁹ DNS records each day
● So far, we have collected over 3.6 × 10¹² results (3.6 trillion)
Which data do we share
● We share open data publicly
  – Open sources (e.g., .se, .nu, Alexa)
  – As Avro files on openintel.nl, with "light" docs (see the reader sketch below)
● We share closed data with other researchers
  – We typically require them to have registry operator contracts
● We share closed data with the respective registry operators
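Reading the shared Avro files is straightforward; a minimal sketch with fastavro is below. The file name is hypothetical and the record layout is not shown here — consult the documentation on openintel.nl for the actual schema.

    # Minimal sketch: iterate over records in a downloaded OpenINTEL Avro file.
    from fastavro import reader

    with open("openintel-alexa1m-20191209.avro", "rb") as fo:  # hypothetical file name
        for record in reader(fo):
            print(record)  # each record is a dict, one per DNS query/response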
Ongoing and planned improvements
● More and improved data sharing
  – Aggregate datasets
  – Public Kafka broker
  – Rolling stats & insights (openintel.nl)
  – Jupyter containers (Dockerfile) with example analyses (also for education purposes)
● Fusing more data into the streaming system
  – e.g.: certificate transparency logs, BGP events, outages, DoS attacks, …
● Reverse address space measurements (in-addr.arpa); see the sketch below
● Targeting additional authoritative(?) name servers
● Support distributed (multi-VP) measurement
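For the planned reverse address space measurements, the names to query can be derived mechanically from IP addresses; a minimal sketch using dnspython's reverse-name helper is shown below, with an example prefix (TEST-NET-1) standing in for the address space actually covered.

    # Minimal sketch: derive in-addr.arpa (PTR) query names from IPv4 addresses.
    import ipaddress
    import dns.reversename

    for addr in ipaddress.ip_network("192.0.2.0/29").hosts():  # example prefix
        print(dns.reversename.from_address(str(addr)))
        # e.g. 1.2.0.192.in-addr.arpa.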
Questions?