### Abstract

Original language | English |
---|---|

Pages (from-to) | 47-69 |

Number of pages | 23 |

Journal | Internet mathematics |

Volume | 5 |

Issue number | 1-2 |

Publication status | Published - 2008 |

### Fingerprint

### Keywords

- EWI-17732
- IR-71099
- METIS-268957

### Cite this

*Internet mathematics*,

*5*(1-2), 47-69.

}

*Internet mathematics*, vol. 5, no. 1-2, pp. 47-69.

**A singular perturbation approach for choosing the PageRank damping factor.** / Avrachenkov, Konstatin; Litvak, Nelly; Pham, Kim Son.

Research output: Contribution to journal › Article › Academic › peer-review

TY - JOUR

T1 - A singular perturbation approach for choosing the PageRank damping factor

AU - Avrachenkov, Konstatin

AU - Litvak, Nelly

AU - Pham, Kim Son

PY - 2008

Y1 - 2008

N2 - We study the PageRank mass of principal components in a bow-tie web graph as a function of the damping factor $c$. It is known that the web graph can be divided into three principal components: SCC, IN, and OUT. The giant strongly connected component (SCC) contains a large group of pages having a hyperlink path connecting them. The pages in the IN (OUT) component have a path to (from) the SCC, but not back. Using a singular perturbation approach, we show that the PageRank share of the IN and SCC components remains high even for very large values of the damping factor, in spite of the fact that it drops to zero when $c$ tends to one. However, a detailed study of the OUT component reveals the presence of "dead ends" (small groups of pages linking only to each other) that receive an unfairly high ranking when $c$ is close to 1. We argue that this problem can be mitigated by choosing $c$ as small as ½.

AB - We study the PageRank mass of principal components in a bow-tie web graph as a function of the damping factor $c$. It is known that the web graph can be divided into three principal components: SCC, IN, and OUT. The giant strongly connected component (SCC) contains a large group of pages having a hyperlink path connecting them. The pages in the IN (OUT) component have a path to (from) the SCC, but not back. Using a singular perturbation approach, we show that the PageRank share of the IN and SCC components remains high even for very large values of the damping factor, in spite of the fact that it drops to zero when $c$ tends to one. However, a detailed study of the OUT component reveals the presence of "dead ends" (small groups of pages linking only to each other) that receive an unfairly high ranking when $c$ is close to 1. We argue that this problem can be mitigated by choosing $c$ as small as ½.

KW - EWI-17732

KW - IR-71099

KW - METIS-268957

M3 - Article

VL - 5

SP - 47

EP - 69

JO - Internet mathematics

JF - Internet mathematics

SN - 1542-7951

IS - 1-2

ER -