TY - UNPB
T1 - Clusters in the Expanse
T2 - Understanding and Unbiasing IPv6 Hitlists
AU - Gasser, Oliver
AU - Scheitle, Quirin
AU - Foremski, Pawel
AU - Lone, Qasim
AU - Korczyński, Maciej
AU - Strowes, Stephen D.
AU - Hendriks, Luuk
AU - Carle, Georg
N1 - See https://ipv6hitlist.github.io for daily IPv6 hitlists, historical data, and additional analyses
PY - 2018/6/5
Y1 - 2018/6/5
N2 - Network measurements are an important tool in understanding the Internet. Due to the expanse of the IPv6 address space, exhaustive scans as in IPv4 are not possible for IPv6. In recent years, several studies have proposed the use of target lists of IPv6 addresses, called IPv6 hitlists. In this paper, we show that addresses in IPv6 hitlists are heavily clustered. We present novel techniques that allow IPv6 hitlists to be pushed from quantity to quality. We perform a longitudinal active measurement study over 6 months, targeting more than 50 M addresses. We develop a rigorous method to detect aliased prefixes, which identifies 1.5 % of our prefixes as aliased, pertaining to about half of our target addresses. Using entropy clustering, we group the entire hitlist into just 6 distinct addressing schemes. Furthermore, we perform client measurements by leveraging crowdsourcing. To encourage reproducibility in network measurement research and to serve as a starting point for future IPv6 studies, we publish source code, analysis tools, and data.
AB - Network measurements are an important tool in understanding the Internet. Due to the expanse of the IPv6 address space, exhaustive scans as in IPv4 are not possible for IPv6. In recent years, several studies have proposed the use of target lists of IPv6 addresses, called IPv6 hitlists. In this paper, we show that addresses in IPv6 hitlists are heavily clustered. We present novel techniques that allow IPv6 hitlists to be pushed from quantity to quality. We perform a longitudinal active measurement study over 6 months, targeting more than 50 M addresses. We develop a rigorous method to detect aliased prefixes, which identifies 1.5 % of our prefixes as aliased, pertaining to about half of our target addresses. Using entropy clustering, we group the entire hitlist into just 6 distinct addressing schemes. Furthermore, we perform client measurements by leveraging crowdsourcing. To encourage reproducibility in network measurement research and to serve as a starting point for future IPv6 studies, we publish source code, analysis tools, and data.
KW - cs.NI
U2 - 10.48550/arXiv.1806.01633
DO - 10.48550/arXiv.1806.01633
M3 - Working paper
BT - Clusters in the Expanse
PB - ArXiv.org
ER -