Abstract
Local Differential Privacy (LDP) protects user privacy from the data collector. LDP protocols have been increasingly deployed in the industry. A basic building block is frequency oracle (FO) protocols, which estimate frequencies of values. While several FO protocols have been proposed, the design goal does not lead to optimal results for answering many queries. In this paper, we show that adding post-processing steps to FO protocols by exploiting the knowledge that all individual frequencies should be non-negative and they sum up to one can lead to significantly better accuracy for a wide range of tasks, including frequencies of individual values, frequencies of the most frequent values, and frequencies of subsets of values. We consider 10 different methods that exploit this knowledge differently. We establish theoretical relationships between some of them and conducted extensive experimental evaluations to understand which methods should be used for different query tasks.
Original language | English |
---|---|
Title of host publication | Network and Distributed Systems Security Symposium 2020 |
DOIs | |
Publication status | Published - 26 Feb 2020 |
Externally published | Yes |
Event | Network and Distributed System Security Symposium, NDSS 2020 - Catamaran Resort Hotel & Spa, San Diego, United States Duration: 23 Feb 2020 → 26 Feb 2020 https://www.ndss-symposium.org/ndss2020/ |
Conference
Conference | Network and Distributed System Security Symposium, NDSS 2020 |
---|---|
Abbreviated title | NDSS 2020 |
Country/Territory | United States |
City | San Diego |
Period | 23/02/20 → 26/02/20 |
Internet address |