Abstract
We analyze dependencies in power law graph data (Web sample, Wikipedia sample and a preferential attachment graph) using statistical inference for multivariate regular variation. The well developed theory of regular variation is widely applied in extreme value theory, telecommunications and mathematical finance, and it provides a natural mathematical formalism for analyzing dependencies between variables with power laws. However, most of the proposed methods have never been used in the Web graph data mining. The present work fills this gap. The new insights this yields are striking: the three above-mentioned data sets are shown to have a totally different dependence structure between different graph parameters, such as in-degree and PageRank.
Original language | English |
---|---|
Title of host publication | WWW '08 |
Subtitle of host publication | Proceedings of the 17th International Conference on the World Wide Web 2008 |
Place of Publication | New York, NY |
Publisher | Association for Computing Machinery |
Pages | 1113-1114 |
Number of pages | 2 |
ISBN (Print) | 978-1-60558-085-2 |
DOIs | |
Publication status | Published - Apr 2008 |
Event | 17th International World Wide Web Conference, WWW 2008 - Beijing, China Duration: 21 Apr 2008 → 25 Apr 2008 Conference number: 17 |
Publication series
Name | |
---|---|
Publisher | ACM |
Number | WoTUG-31 |
Conference
Conference | 17th International World Wide Web Conference, WWW 2008 |
---|---|
Abbreviated title | WWW |
Country/Territory | China |
City | Beijing |
Period | 21/04/08 → 25/04/08 |
Keywords
- Regular variation
- Web
- Wikipedia
- Preferential attachment
- PageRank