Making every bit count in wide-area analytics

Ariel Rabkin, Matvey Arye, Siddhartha Sen, Vivek Pai, Michael J. Freedman

Research output: Contribution to conferencePaperpeer-review

10 Scopus citations


Many data sets, such as system logs, are generated from widely distributed locations. Current distributed systems often discard this data because they lack the ability to backhaul it efficiently, or to do anything meaningful with it at the distributed sites. This leads to lost functionality, efficiency, and business opportunities. The problem with traditional backhaul approaches is that they are slow and costly, and require analysts to define the data they are interested in up-front. We propose a new architecture that stores data at the edge (i.e., near where it is generated) and supports rich real-time and historical queries on this data, while adjusting data quality to cope with the vagaries of wide-area bandwidth. In essence, this design transforms a distributed data collection system into a distributed data analysis system, where decisions about collection do not preclude decisions about analysis.

Original languageEnglish (US)
StatePublished - 2013
Event14th Workshop on Hot Topics in Operating Systems, HotOS 2013 - Santa Ana Pueblo, United States
Duration: May 13 2013May 15 2013


Conference14th Workshop on Hot Topics in Operating Systems, HotOS 2013
Country/TerritoryUnited States
CitySanta Ana Pueblo

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems


Dive into the research topics of 'Making every bit count in wide-area analytics'. Together they form a unique fingerprint.

Cite this