Releasing search queries and clicks privately

Aleksandra Korolova, Krishnaram Kenthapadi, Nina Mishra, Alexandros Ntoulas

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution

163 Scopus citations

Abstract

The question of how to publish an anonymized search log was brought to the forefront by a well-intentioned but privacy-unaware AOL search log release. Since then, a series of ad hoc techniques has been proposed in the literature, though none is known to be provably private. In this paper, we take a major step towards a solution: we show how queries, clicks and their associated perturbed counts can be published in a manner that rigorously preserves privacy. Our algorithm is decidedly simple to state, but non-trivial to analyze. On the opposite side of privacy is the question of whether the data we can safely publish is of any use. Our findings offer a glimmer of hope: we demonstrate, via a collection of experiments on a real search log, that a non-negligible fraction of queries and clicks can indeed be safely published. In addition, we select an application, keyword generation, and show that the keyword suggestions generated from the perturbed data resemble those generated from the original data. Copyright is held by the International World Wide Web Conference Committee (IW3C2).
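The abstract does not spell out the algorithm, so the following is only a minimal sketch of what publishing queries with "perturbed counts" could look like, not the authors' exact procedure. It assumes a common recipe: cap each user's contribution, add Laplace noise to each query count, and release only queries whose noisy count exceeds a threshold. The function names and the parameters `max_per_user`, `threshold`, and `scale` are hypothetical, and the sketch makes no claim about which parameter values yield a formal privacy guarantee.

```python
import random
from collections import Counter


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # is Laplace-distributed with location 0 and that scale.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def release_queries(log, max_per_user, threshold, scale, seed=None):
    """Return {query: noisy_count} for queries that pass the threshold.

    `log` is an iterable of (user_id, query) pairs. All parameters are
    illustrative assumptions, not values taken from the paper.
    """
    rng = random.Random(seed)
    per_user = Counter()   # how many queries we have kept for each user
    counts = Counter()     # query counts after capping user activity

    for user, query in log:
        # Keep at most `max_per_user` queries per user so that no single
        # user can dominate any count.
        if per_user[user] < max_per_user:
            per_user[user] += 1
            counts[query] += 1

    released = {}
    for query, count in counts.items():
        noisy = count + laplace_noise(scale, rng)
        # Publish a query only if its perturbed count clears the threshold.
        if noisy > threshold:
            released[query] = noisy
    return released


if __name__ == "__main__":
    log = [(u, "weather") for u in range(1000)]
    log.append((1000, "rare personal query"))
    print(release_queries(log, max_per_user=5, threshold=100, scale=5.0, seed=0))
```

In this toy log, a query issued by many distinct users survives with a perturbed count, while a query issued by a single user falls far below the threshold and is suppressed.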

Original language: English (US)
Title of host publication: WWW'09 - Proceedings of the 18th International World Wide Web Conference
Pages: 171-180
Number of pages: 10
State: Published - 2009
Externally published: Yes
Event: 18th International World Wide Web Conference, WWW 2009 - Madrid, Spain
Duration: Apr 20 2009 to Apr 24 2009

Publication series

Name: WWW'09 - Proceedings of the 18th International World Wide Web Conference

Conference

Conference: 18th International World Wide Web Conference, WWW 2009
Country/Territory: Spain
City: Madrid
Period: 4/20/09 to 4/24/09

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications

Keywords

  • Algorithms
  • Experimentation
  • Human factors
  • Legal aspects
  • Measurement
  • Performance
  • Security
  • Theory
