Publication Details
Collection of Datasets with DNS over HTTPS Traffic
Hynek Karel
Čejka Tomáš, doc. Ing., Ph.D. (SSDIT)
Ryšavý Ondřej, doc. Ing., Ph.D. (DIFS)
DNS over HTTPS, DNS, HTTPS, Computer, Network,Monitoring,Network Traffic
The DNS over HTTPS (DoH) is becoming a default option for domain resolution in
modern privacy-aware software. Therefore, researchers have already focused on
various aspects; however, a comprehensive dataset from an actual production
network is still missing. This paper presents a collection of novel datasets
comprising multiple PCAP files of DoH and HTTPS traffic. The captured traffic is
generated towards multiple DoH providers to cover differences of various DoH
server implementations and configurations. In addition to generated traffic, we
also provide real network traffic captured on high-speed backbone lines of
a large Internet Service Provider with around half a million users. Even though
the network identifiers (excluding network identifiers of DoH resolvers) in the
real network traffic (e.g., IP addresses and transmitted content) were
anonymized, the essential characteristics of the traffic can still be obtained
from the data. Therefore, the dataset can be used in whole network traffic
analysis areas such as traffic classification research.
@article{BUT178119,
author="Kamil {Jeřábek} and Karel {Hynek} and Tomáš {Čejka} and Ondřej {Ryšavý}",
title="Collection of Datasets with DNS over HTTPS Traffic",
journal="Data in Brief (Online)",
year="2022",
volume="2022",
number="42",
pages="1--13",
doi="10.1016/j.dib.2022.108310",
issn="2352-3409",
url="https://www.sciencedirect.com/science/article/pii/S2352340922005121"
}