Systems for analyzing routing policies and localizing faults in the Internet

Loading...
Thumbnail Image

Authors

Jin, Yuchen

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Our reliance on the Internet continues to grow; however, Internet communication has seen little progress over the years because it typically spans multiple Autonomous Systems (ASes) that are operated by individual Internet Service Providers (ISPs) and organizations. This inherent autonomy of the Internet limits the visibility into other networks and the velocity of change. As a result, public Internet communication has become the weak link for Internet-based services. In this thesis, I design, build, and evaluate practical algorithms and systems that ISPs and cloud providers can use to analyze Internet routing policies and localize faults in the Internet. Knowledge of the business relationships between ASes is essential to understanding the behavior of the Internet routing system. I develop ProbLink, a probabilistic algorithm to infer business relationships between ASes in the Internet. By integrating noisy but useful features, it overcomes the challenges in inferring hard links such as routing violating the valley-free assumption, limited visibility, and non-conventional peering practices. I build three real-world applications on top of ProbLink and show that ProbLink has a significant impact when applied to practical applications compared to the state-of-the-art inference algorithms based on empirical rules. For Internet-based services such as video calls and online games, providing low latency is important. I design and build a system, BlameIt, that automatically localizes the faulty AS when there is latency degradation between clients and clouds. BlameIt employs a hybrid two-phased blame assignment, combining the best parts of passive analysis (low measurement overhead) and active probing (fine-grained fault localization). BlameIt has been in production deployment for 3 years at Microsoft Azure and produces results with high accuracy at low overheads.

Description

Thesis (Ph.D.)--University of Washington, 2021

Citation

DOI