Inside Dropbox: Understanding Personal Cloud Storage Services

Idilio Drago, Marco Mellia, Maurizio M. Munafò, Anna Sperotto, R. Sadre, Aiko Pras

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

231 Citations (Scopus)
1854 Downloads (Pure)

Abstract

Personal cloud storage services are gaining popularity. With a rush of providers to enter the market and an increasing offer of cheap storage space, it is to be expected that cloud storage will soon generate a high amount of Internet traffic. Very little is known about the architecture and the performance of such systems, and the workload they have to face. This understanding is essential for designing efficient cloud storage systems and predicting their impact on the network. This paper presents a characterization of Dropbox, the leading solution in personal cloud storage in our datasets. By means of passive measurements, we analyze data from four vantage points in Europe, collected during 42 consecutive days. Our contributions are threefold: Firstly, we are the first to study Dropbox, which we show to be the most widely-used cloud storage system, already accounting for a volume equivalent to around one third of the YouTube traffic at campus networks on some days. Secondly, we characterize the workload users in different environments generate to the system, highlighting how this reflects on network traffic. Lastly, our results show possible performance bottlenecks caused by both the current system architecture and the storage protocol. This is exacerbated for users connected far from storage data-centers. All measurements used in our analyses are publicly available in anonymized form at the SimpleWeb trace repository: http://traces.simpleweb.org/dropbox/
Original languageUndefined
Title of host publicationProceedings of the 2012 ACM Conference on Internet measurement, IMC 2012
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages481-494
Number of pages14
ISBN (Print)978-1-4503-1705-4
DOIs
Publication statusPublished - Nov 2012

Publication series

Name
PublisherACM

Keywords

  • METIS-289701
  • Cloud Storage
  • EWI-22286
  • Internet Measurement
  • Dropbox
  • IR-81818

Cite this

Drago, I., Mellia, M., Munafò, M. M., Sperotto, A., Sadre, R., & Pras, A. (2012). Inside Dropbox: Understanding Personal Cloud Storage Services. In Proceedings of the 2012 ACM Conference on Internet measurement, IMC 2012 (pp. 481-494). New York: Association for Computing Machinery (ACM). https://doi.org/10.1145/2398776.2398827
Drago, Idilio ; Mellia, Marco ; Munafò, Maurizio M. ; Sperotto, Anna ; Sadre, R. ; Pras, Aiko. / Inside Dropbox: Understanding Personal Cloud Storage Services. Proceedings of the 2012 ACM Conference on Internet measurement, IMC 2012. New York : Association for Computing Machinery (ACM), 2012. pp. 481-494
@inproceedings{140b466635a54601b5d4364f6c0fda11,
title = "Inside Dropbox: Understanding Personal Cloud Storage Services",
abstract = "Personal cloud storage services are gaining popularity. With a rush of providers to enter the market and an increasing offer of cheap storage space, it is to be expected that cloud storage will soon generate a high amount of Internet traffic. Very little is known about the architecture and the performance of such systems, and the workload they have to face. This understanding is essential for designing efficient cloud storage systems and predicting their impact on the network. This paper presents a characterization of Dropbox, the leading solution in personal cloud storage in our datasets. By means of passive measurements, we analyze data from four vantage points in Europe, collected during 42 consecutive days. Our contributions are threefold: Firstly, we are the first to study Dropbox, which we show to be the most widely-used cloud storage system, already accounting for a volume equivalent to around one third of the YouTube traffic at campus networks on some days. Secondly, we characterize the workload users in different environments generate to the system, highlighting how this reflects on network traffic. Lastly, our results show possible performance bottlenecks caused by both the current system architecture and the storage protocol. This is exacerbated for users connected far from storage data-centers. All measurements used in our analyses are publicly available in anonymized form at the SimpleWeb trace repository: http://traces.simpleweb.org/dropbox/",
keywords = "METIS-289701, Cloud Storage, EWI-22286, Internet Measurement, Dropbox, IR-81818",
author = "Idilio Drago and Marco Mellia and Munaf{\`o}, {Maurizio M.} and Anna Sperotto and R. Sadre and Aiko Pras",
year = "2012",
month = "11",
doi = "10.1145/2398776.2398827",
language = "Undefined",
isbn = "978-1-4503-1705-4",
publisher = "Association for Computing Machinery (ACM)",
pages = "481--494",
booktitle = "Proceedings of the 2012 ACM Conference on Internet measurement, IMC 2012",
address = "United States",

}

Drago, I, Mellia, M, Munafò, MM, Sperotto, A, Sadre, R & Pras, A 2012, Inside Dropbox: Understanding Personal Cloud Storage Services. in Proceedings of the 2012 ACM Conference on Internet measurement, IMC 2012. Association for Computing Machinery (ACM), New York, pp. 481-494. https://doi.org/10.1145/2398776.2398827

Inside Dropbox: Understanding Personal Cloud Storage Services. / Drago, Idilio; Mellia, Marco; Munafò, Maurizio M.; Sperotto, Anna; Sadre, R.; Pras, Aiko.

Proceedings of the 2012 ACM Conference on Internet measurement, IMC 2012. New York : Association for Computing Machinery (ACM), 2012. p. 481-494.

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Inside Dropbox: Understanding Personal Cloud Storage Services

AU - Drago, Idilio

AU - Mellia, Marco

AU - Munafò, Maurizio M.

AU - Sperotto, Anna

AU - Sadre, R.

AU - Pras, Aiko

PY - 2012/11

Y1 - 2012/11

N2 - Personal cloud storage services are gaining popularity. With a rush of providers to enter the market and an increasing offer of cheap storage space, it is to be expected that cloud storage will soon generate a high amount of Internet traffic. Very little is known about the architecture and the performance of such systems, and the workload they have to face. This understanding is essential for designing efficient cloud storage systems and predicting their impact on the network. This paper presents a characterization of Dropbox, the leading solution in personal cloud storage in our datasets. By means of passive measurements, we analyze data from four vantage points in Europe, collected during 42 consecutive days. Our contributions are threefold: Firstly, we are the first to study Dropbox, which we show to be the most widely-used cloud storage system, already accounting for a volume equivalent to around one third of the YouTube traffic at campus networks on some days. Secondly, we characterize the workload users in different environments generate to the system, highlighting how this reflects on network traffic. Lastly, our results show possible performance bottlenecks caused by both the current system architecture and the storage protocol. This is exacerbated for users connected far from storage data-centers. All measurements used in our analyses are publicly available in anonymized form at the SimpleWeb trace repository: http://traces.simpleweb.org/dropbox/

AB - Personal cloud storage services are gaining popularity. With a rush of providers to enter the market and an increasing offer of cheap storage space, it is to be expected that cloud storage will soon generate a high amount of Internet traffic. Very little is known about the architecture and the performance of such systems, and the workload they have to face. This understanding is essential for designing efficient cloud storage systems and predicting their impact on the network. This paper presents a characterization of Dropbox, the leading solution in personal cloud storage in our datasets. By means of passive measurements, we analyze data from four vantage points in Europe, collected during 42 consecutive days. Our contributions are threefold: Firstly, we are the first to study Dropbox, which we show to be the most widely-used cloud storage system, already accounting for a volume equivalent to around one third of the YouTube traffic at campus networks on some days. Secondly, we characterize the workload users in different environments generate to the system, highlighting how this reflects on network traffic. Lastly, our results show possible performance bottlenecks caused by both the current system architecture and the storage protocol. This is exacerbated for users connected far from storage data-centers. All measurements used in our analyses are publicly available in anonymized form at the SimpleWeb trace repository: http://traces.simpleweb.org/dropbox/

KW - METIS-289701

KW - Cloud Storage

KW - EWI-22286

KW - Internet Measurement

KW - Dropbox

KW - IR-81818

U2 - 10.1145/2398776.2398827

DO - 10.1145/2398776.2398827

M3 - Conference contribution

SN - 978-1-4503-1705-4

SP - 481

EP - 494

BT - Proceedings of the 2012 ACM Conference on Internet measurement, IMC 2012

PB - Association for Computing Machinery (ACM)

CY - New York

ER -

Drago I, Mellia M, Munafò MM, Sperotto A, Sadre R, Pras A. Inside Dropbox: Understanding Personal Cloud Storage Services. In Proceedings of the 2012 ACM Conference on Internet measurement, IMC 2012. New York: Association for Computing Machinery (ACM). 2012. p. 481-494 https://doi.org/10.1145/2398776.2398827