Center for Applied Internet Data Analysis
The Impact of Router Outages on the AS-level Internet
M. Luckie and R. Beverly, "The Impact of Router Outages on the AS-level Internet", in ACM SIGCOMM, Aug 2017, pp. 488--501.

Matthew Luckie (University of Waikato)
Robert Beverly (Naval Postgraduate School)

We propose and evaluate a new metric for understanding the dependence of the AS-level Internet on individual routers. Whereas prior work uses large volumes of reachability probes to infer outages, we design an efficient active probing technique that directly and unambiguously reveals router restarts. We use our technique to survey 149,560 routers across the Internet for 2.5 years. 59,175 of the surveyed routers (40%) experience at least one reboot, and we quantify the resulting impact of each router outage on global IPv4 and IPv6 BGP reachability.

Our technique complements existing data and control plane outage analysis methods by providing a causal link from BGP reachability failures to the responsible router(s) and multi-homing configurations. While we found the Internet core to be largely robust, we identified specific routers that were single points of failure for the prefixes they advertised. In total, 2,385 routers – 4.0% of the routers that restarted over the course of 2.5 years of probing – were single points of failure for 3,396 IPv6 prefixes announced by 1,708 ASes. We inferred that 59% of these routers were customer-edge border routers. 2,374 (70%) of the withdrawn prefixes were not covered by a less specific prefix, so 1,726 routers (2.9% of those that restarted) were single points of failure for at least one network. However, a covering route did not imply reachability during a router outage, as no previously-responsive address in a withdrawn more specific prefix responded during a one-week sample. We validate our reboot and single point of failure inference techniques with four networks, finding no false positive or false negative reboots, but find some false negatives in our single point of failure inferences.
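
To illustrate what "covered by a less specific prefix" means above, the following is a minimal sketch using Python's standard `ipaddress` module. It is not the authors' implementation: the function name `is_covered` and the sample prefixes (taken from documentation address ranges) are assumptions chosen only for illustration.

```python
import ipaddress

def is_covered(withdrawn, remaining):
    """Return True if any prefix in `remaining` is a strictly less
    specific prefix that contains `withdrawn` (hypothetical helper)."""
    w = ipaddress.ip_network(withdrawn)
    for prefix in remaining:
        c = ipaddress.ip_network(prefix)
        # Compare only within the same address family; subnet_of()
        # raises TypeError when IPv4 and IPv6 networks are mixed.
        if c.version != w.version:
            continue
        # A covering route must be shorter (less specific) and contain w.
        if c.prefixlen < w.prefixlen and w.subnet_of(c):
            return True
    return False

# Illustrative routing-table remainder after a withdrawal (made-up data).
remaining_routes = ["2001:db8::/32", "192.0.2.0/24"]
print(is_covered("2001:db8:1::/48", remaining_routes))
print(is_covered("2001:db8:1::/48", ["2001:db8:ff00::/40"]))
```

As the abstract notes, a covering route found this way indicates only that a less specific announcement exists; it does not by itself show that addresses in the withdrawn prefix remained reachable.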

Keywords: active data analysis, internet outages, measurement methodology, routing, topology
  Last Modified: Wed Oct-11-2017 17:04:09 PDT
  Page URL: http://www.caida.org/publications/papers/2017/impact_router_outages_as/index.xml