Accelerated inference of positive selection on whole genomes

Nikolaos Alachiotis, Charalampos Vatsolakis, Grigorios Chrysos, Dionisios Pnevmatikatos

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

2 Citations (Scopus)

Abstract

Positive selection is the tendency of beneficial traits to increase in prevalence in a population. Its detection carries theoretical significance and has practical applications, from shedding light on the forces that drive adaptive evolution to identifying drug-resistant mutations in pathogens. With next-generation sequencing producing a plethora of genomic data for population genetic analyses, the increased computational complexity of existing methods and/or inefficient memory management hinders the efficient analysis of large-scale datasets. To this end, we devise a system-level solution that couples a generic out-of-core algorithm for parsing genomic data with a decoupled access/execute accelerator architecture, thereby providing a method-independent infrastructure for the rapid and scalable inference of positive selection. We employ a novel detection mechanism that mostly relies on integer arithmetic operations, which fit well to FPGA fabric, while yielding qualitatively superior results than current state-of-the-art methods. We deploy a high-end system that pairs Hybrid Memory Cube with a mid-range FPGA, forming a high-throughput streaming accelerator that achieves 751x, 62x, and 20x faster analyses of simulated genomes than the widely used software tools SweepFinder2 (1 thread), OmegaPlus (40 threads), and SweeD (40 threads), respectively. Importantly, our solution can scan thousands of human genomes and millions of genetic polymorphisms (1000 Genomes dataset, 5,008 samples) in a matter of hours, requiring between 4 and 22 minutes per autosome, depending on the chromosomal length.

Original languageEnglish
Title of host publicationProceedings - 2018 International Conference on Field-Programmable Logic and Applications, FPL 2018
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages202-209
Number of pages8
ISBN (Electronic)978-1-5386-8517-4
ISBN (Print)978-1-5386-8518-1
DOIs
Publication statusPublished - 9 Nov 2018
Externally publishedYes
Event28th International Conference on Field-Programmable Logic and Applications, FPL 2018 - Dublin, Ireland
Duration: 26 Aug 201830 Aug 2018
Conference number: 28

Publication series

NameProceedings - International Conference on Field-Programmable Logic and Applications (FPL)
PublisherIEEE
Volume2018
ISSN (Print)1946-147X
ISSN (Electronic)1946-1488

Conference

Conference28th International Conference on Field-Programmable Logic and Applications, FPL 2018
Abbreviated titleFPL 2018
Country/TerritoryIreland
CityDublin
Period26/08/1830/08/18

Keywords

  • Decoupled access/execute architecture
  • Hardware accelerator
  • Population genomics
  • Selective sweep detection

Fingerprint

Dive into the research topics of 'Accelerated inference of positive selection on whole genomes'. Together they form a unique fingerprint.

Cite this