1

Replay without Recording of Production Bugs for Service Oriented Applications

Short time-to-localize and time-to-x for production bugs is extremely important for any 24x7 service-oriented application (SOA). Debugging buggy behavior in deployed applications is hard, as it requires careful reproduction of a similar environment …

LogLens: A Real-Time Log Analysis System

Administrators of most user-facing systems depend on periodic log data to get an idea of the health and status of production applications. Logs report information, which is crucial to diagnose the root cause of complex problems. In this paper, we …

An Analytics Approach to Traffic Analysis in Network Virtualization

Network virtualization has been propounded as a diversifying attribute of the future inter-networking paradigm. However, monitoring and troubleshooting operational virtual networks can be a daunting task, due to their size, distributed state, and …

Enabling Layer 2 Pathlet Tracing through Context Encoding in Software-Defined Networking

Troubleshooting Software-Defined Networks requires a structured approach to detect mistranslations between high-level intent (policy) and low-level forwarding behavior, and a flexible on-demand packet tracing tool is highly desirable on the data …

IntroPerf: Transparent Context-Sensitive Multi-Layer Performance Inference using System Stack Traces

Performance bugs are frequently observed in commodity software. While profilers or source code-based tools can be used at development stage where a program is diagnosed in a welldefined environment, many performance bugs survive such a stage and …

Uscope: A Scalable Unified Tracer from Kernel to User Space

Unified tracing is the process of collecting trace logs across the boundary of kernel and user spaces, and has been used to understand the in-depth correspondence between low level events and application program context for diagnosing system failures …

CLUE: System Trace Analytics for Cloud Service Performance Diagnosis

In this paper, we present CLUE, a system event analytics tool for black-box performance diagnosis in production Cloud Computing systems. CLUE provides an unified and extensible means of profiling service transactional behaviors, and builds structured …

Software System Performance Debugging with Kernel Events Feature Guidance

To diagnose performance problems in production systems, many OS kernel-level monitoring and analysis tools have been proposed. Using low level kernel events provides benefits in efficiency and transparency to monitor application software. On the …

DeltaPath: Precise and Scalable Calling Context Encoding

Calling context provides important information for a large range of applications, such as event logging, profiling, debugging, anomaly detection, and performance optimization. While some techniques have been proposed to track calling context …

HybNET: Network Manager for Hybrid Network Infrastructure

The emergence of Software-Defined Networking(SDN) has led to a paradigm shift in network management. SDN has the capability to provide clear and easy management of complex operational challenges in large scale networks. However, most of the existing …