Publication Details
Collection of Datasets with DNS over HTTPS Traffic
Hynek Karel
Čejka Tomáš, doc. Ing., Ph.D. (SSDIT)
Ryšavý Ondřej, doc. Ing., Ph.D. (DIFS)
DNS over HTTPS, DNS, HTTPS, Computer, Network,Monitoring,Network Traffic
The DNS over HTTPS (DoH) is becoming a default option for domain resolution in modern privacy-aware software. Therefore, researchers have already focused on various aspects; however, a comprehensive dataset from an actual production network is still missing. This paper presents a collection of novel datasets comprising multiple PCAP files of DoH and HTTPS traffic. The captured traffic is generated towards multiple DoH providers to cover differences of various DoH server implementations and configurations. In addition to generated traffic, we also provide real network traffic captured on high-speed backbone lines of a large Internet Service Provider with around half a million users. Even though the network identifiers (excluding network identifiers of DoH resolvers) in the real network traffic (e.g., IP addresses and transmitted content) were anonymized, the essential characteristics of the traffic can still be obtained from the data. Therefore, the dataset can be used in whole network traffic analysis areas such as traffic classification research.
@article{BUT178119,
author="Kamil {Jeřábek} and Karel {Hynek} and Tomáš {Čejka} and Ondřej {Ryšavý}",
title="Collection of Datasets with DNS over HTTPS Traffic",
journal="Data in Brief (Online)",
year="2022",
volume="2022",
number="42",
pages="1--13",
doi="10.1016/j.dib.2022.108310",
issn="2352-3409",
url="https://www.sciencedirect.com/science/article/pii/S2352340922005121"
}