Efficient computation of linkage disequilibria as dense linear algebra operations

Nikolaos Alachiotis, Thom Popovici, Tze Meng Low

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

9 Citations (Scopus)

Abstract

Genomic datasets are steadily growing in size asmore genomes are sequenced and new genetic variants arediscovered. Datasets that comprise thousands of genomes andmillions of single-nucleotide polymorphisms (SNPs), exhibitexcessive computational demands that can lead to prohibitivelylong analyses, yielding the deployment of high-performancecomputational approaches a prerequisite for the thoroughanalysis of current and future large-scale datasets. In this work, we demonstrate that the computational kernel for calculatinglinkage disequilibria (LD) in genomes, i.e., the non-randomassociations between alleles at different loci, can be cast interms of dense linear algebra (DLA) operations, leveraging thecollective knowledge in the DLA community in developing high-performance implementations for various microprocessor ar-chitectures. The proposed approach for computing LD achievesbetween 84% and 95% of the theoretical peak performance ofthe machine, and is up to 17X faster than existing LD kernelimplementations. Furthermore, we argue that, the currenttrend of increasing the SIMD (Single Instruction MultipleData) register width in microprocessors yields minor benefitsfor assessing LD, resulting in an increasing gap betweenperformance attainable by LD computations and the theoreticalpeak of the microprocessor architecture, suggesting the needfor hardware support.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE 30th International Parallel and Distributed Processing Symposium, IPDPS 2016
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages418-427
Number of pages10
ISBN (Electronic)978-1-5090-3682-0
ISBN (Print)978-1-5090-3683-7
DOIs
Publication statusPublished - 18 Jul 2016
Externally publishedYes
Event30th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016 - Chicago, United States
Duration: 23 May 201627 May 2016
Conference number: 30

Conference

Conference30th IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2016
Abbreviated titleIPDPSW 2016
Country/TerritoryUnited States
CityChicago
Period23/05/1627/05/16

Keywords

  • Dense linear algebra
  • Linkage disequilibrium
  • Matrix multiplication
  • Population genetics

Fingerprint

Dive into the research topics of 'Efficient computation of linkage disequilibria as dense linear algebra operations'. Together they form a unique fingerprint.

Cite this