Predicting the long-term citation impact of recent publications

Clara Stegehuis, Nelli Litvak, Ludo Waltman

Research output: Contribution to journalArticleAcademicpeer-review

38 Downloads (Pure)

Abstract

A fundamental problem in citation analysis is the prediction of the long-term citation impact of recent publications. We propose a model to predict a probability distribution for the future number of citations of a publication. Two predictors are used: the impact factor of the journal in which a publication has appeared and the number of citations a publication has received one year after its appearance. The proposed model is based on quantile regression. We employ the model to predict the future number of citations of a large set of publications in the field of physics. Our analysis shows that both predictors (i.e., impact factor and early citations) contribute to the accurate prediction of long-term citation impact. We also analytically study the behavior of the quantile regression coefficients for high quantiles of the distribution of citations. This is done by linking the quantile regression approach to a quantile estimation technique from extreme value theory. Our work provides insight into the influence of the impact factor and early citations on the long-term citation impact of a publication, and it takes a step toward a methodology that can be used to assess research institutions based on their most recently published work.
Original languageUndefined
Pages (from-to)642-657
Number of pages16
JournalJournal of informetrics
Volume9
Issue number3
DOIs
Publication statusPublished - Jul 2015

Keywords

  • EWI-26223
  • Impact factor
  • Quantile estimation
  • Quantile regression
  • Citation impact
  • METIS-312697
  • IR-96849
  • Citation analysis
  • Prediction

Cite this

Stegehuis, Clara ; Litvak, Nelli ; Waltman, Ludo. / Predicting the long-term citation impact of recent publications. In: Journal of informetrics. 2015 ; Vol. 9, No. 3. pp. 642-657.
@article{a72f9a28933e4eb4be558b7ab9243ab4,
title = "Predicting the long-term citation impact of recent publications",
abstract = "A fundamental problem in citation analysis is the prediction of the long-term citation impact of recent publications. We propose a model to predict a probability distribution for the future number of citations of a publication. Two predictors are used: the impact factor of the journal in which a publication has appeared and the number of citations a publication has received one year after its appearance. The proposed model is based on quantile regression. We employ the model to predict the future number of citations of a large set of publications in the field of physics. Our analysis shows that both predictors (i.e., impact factor and early citations) contribute to the accurate prediction of long-term citation impact. We also analytically study the behavior of the quantile regression coefficients for high quantiles of the distribution of citations. This is done by linking the quantile regression approach to a quantile estimation technique from extreme value theory. Our work provides insight into the influence of the impact factor and early citations on the long-term citation impact of a publication, and it takes a step toward a methodology that can be used to assess research institutions based on their most recently published work.",
keywords = "EWI-26223, Impact factor, Quantile estimation, Quantile regression, Citation impact, METIS-312697, IR-96849, Citation analysis, Prediction",
author = "Clara Stegehuis and Nelli Litvak and Ludo Waltman",
note = "eemcs-eprint-26223 ; http://eprints.ewi.utwente.nl/26223",
year = "2015",
month = "7",
doi = "10.1016/j.joi.2015.06.005",
language = "Undefined",
volume = "9",
pages = "642--657",
journal = "Journal of informetrics",
issn = "1751-1577",
publisher = "Elsevier",
number = "3",

}

Predicting the long-term citation impact of recent publications. / Stegehuis, Clara; Litvak, Nelli; Waltman, Ludo.

In: Journal of informetrics, Vol. 9, No. 3, 07.2015, p. 642-657.

Research output: Contribution to journalArticleAcademicpeer-review

TY - JOUR

T1 - Predicting the long-term citation impact of recent publications

AU - Stegehuis, Clara

AU - Litvak, Nelli

AU - Waltman, Ludo

N1 - eemcs-eprint-26223 ; http://eprints.ewi.utwente.nl/26223

PY - 2015/7

Y1 - 2015/7

N2 - A fundamental problem in citation analysis is the prediction of the long-term citation impact of recent publications. We propose a model to predict a probability distribution for the future number of citations of a publication. Two predictors are used: the impact factor of the journal in which a publication has appeared and the number of citations a publication has received one year after its appearance. The proposed model is based on quantile regression. We employ the model to predict the future number of citations of a large set of publications in the field of physics. Our analysis shows that both predictors (i.e., impact factor and early citations) contribute to the accurate prediction of long-term citation impact. We also analytically study the behavior of the quantile regression coefficients for high quantiles of the distribution of citations. This is done by linking the quantile regression approach to a quantile estimation technique from extreme value theory. Our work provides insight into the influence of the impact factor and early citations on the long-term citation impact of a publication, and it takes a step toward a methodology that can be used to assess research institutions based on their most recently published work.

AB - A fundamental problem in citation analysis is the prediction of the long-term citation impact of recent publications. We propose a model to predict a probability distribution for the future number of citations of a publication. Two predictors are used: the impact factor of the journal in which a publication has appeared and the number of citations a publication has received one year after its appearance. The proposed model is based on quantile regression. We employ the model to predict the future number of citations of a large set of publications in the field of physics. Our analysis shows that both predictors (i.e., impact factor and early citations) contribute to the accurate prediction of long-term citation impact. We also analytically study the behavior of the quantile regression coefficients for high quantiles of the distribution of citations. This is done by linking the quantile regression approach to a quantile estimation technique from extreme value theory. Our work provides insight into the influence of the impact factor and early citations on the long-term citation impact of a publication, and it takes a step toward a methodology that can be used to assess research institutions based on their most recently published work.

KW - EWI-26223

KW - Impact factor

KW - Quantile estimation

KW - Quantile regression

KW - Citation impact

KW - METIS-312697

KW - IR-96849

KW - Citation analysis

KW - Prediction

U2 - 10.1016/j.joi.2015.06.005

DO - 10.1016/j.joi.2015.06.005

M3 - Article

VL - 9

SP - 642

EP - 657

JO - Journal of informetrics

JF - Journal of informetrics

SN - 1751-1577

IS - 3

ER -