007: Democratically Finding The Cause of Packet Drops

  • ,
  • Selim Ciraci ,
  • Luiz Chamon ,
  • Yibo Zhu ,
  • Hongqiang Liu ,
  • Jitu Padhye ,
  • Boon Thau Loo ,
  • Geoff Outhred

15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18) |

Related File

Network failures continue to plague datacenter operators as their symptoms may not have direct correlation with where or why they occur. We introduce 007, a lightweight, always-on diagnosis application that can find problematic links and also pinpoint problems for each TCP connection. 007 is completely contained within the end host. During its two month deployment in a tier-1 datacenter, it detected every problem found by previously deployed monitoring tools while also finding the sources of other problems previously undetected.

Téléchargements de publications

Vigil: Democratically Finding The Cause of Packet Drops

mai 9, 2018

The source code for the simulations in the paper, 007: Democratically Finding The Cause of Packet Drops (NSDI 2018). To run: use MultiFailureDriver or SingleLinkFailureDriver (these are the starting points/drivers of the code).