TY - JOUR
T1 - Genome scans for selection and introgression based on k-nearest neighbour techniques
AU - Pfeifer, Bastian
AU - Alachiotis, Nikolaos
AU - Pavlidis, Pavlos
AU - Schimek, Michael G.
N1 - Funding Information:
We thank Fernando Racimo and Ben Rosenzweig for their valuable comments on our manuscript.
Publisher Copyright:
© 2020 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd
PY - 2020/11
Y1 - 2020/11
N2 - In recent years, genome-scan methods have been extensively used to detect local signatures of selection and introgression. Most of these methods are either designed for one or the other case, which may impair the study of combined cases. Here, we introduce a series of versatile genome-scan methods applicable for both cases, the detection of selection and introgression. The proposed approaches are based on nonparametric k-nearest neighbour (kNN) techniques, while incorporating pairwise Fixation Index (FST) and pairwise nucleotide differences (dxy) as features. We benchmark our methods using a wide range of simulation scenarios, with varying parameters, such as recombination rates, population background histories, selection strengths, the proportion of introgression and the time of gene flow. We find that kNN-based methods perform remarkably well compared with the state-of-the-art. Finally, we demonstrate how to perform kNN-based genome scans on real-world genomic data using the population genomics R-package popgenome.
AB - In recent years, genome-scan methods have been extensively used to detect local signatures of selection and introgression. Most of these methods are either designed for one or the other case, which may impair the study of combined cases. Here, we introduce a series of versatile genome-scan methods applicable for both cases, the detection of selection and introgression. The proposed approaches are based on nonparametric k-nearest neighbour (kNN) techniques, while incorporating pairwise Fixation Index (FST) and pairwise nucleotide differences (dxy) as features. We benchmark our methods using a wide range of simulation scenarios, with varying parameters, such as recombination rates, population background histories, selection strengths, the proportion of introgression and the time of gene flow. We find that kNN-based methods perform remarkably well compared with the state-of-the-art. Finally, we demonstrate how to perform kNN-based genome scans on real-world genomic data using the population genomics R-package popgenome.
KW - Adaptation
KW - Genome scans
KW - Introgression
KW - k-nearest neighbours
KW - UT-Hybrid-D
U2 - 10.1111/1755-0998.13221
DO - 10.1111/1755-0998.13221
M3 - Article
SN - 1755-098X
VL - 20
SP - 1597
EP - 1609
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
IS - 6
ER -