A deep learning approach to prediction of blood group antigens from genomic data

Camous Moslemi*, Susanne Sækmose, Rune Larsen, Thorsten Brodersen, Jakob T. Bay, Maria Didriksen, Kaspar R. Nielsen, Mie T. Bruun, Joseph Dowsett, Khoa M. Dinh, Christina Mikkelsen, Kati Hyvärinen, Jarmo Ritari, Jukka Partanen, Henrik Ullum, Christian Erikstrup, Sisse R. Ostrowski, Martin L. Olsson, Ole B. Pedersen

*Corresponding author

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Abstract

Background: Deep learning methods are revolutionizing natural science. In this study, we aim to apply such techniques to develop blood type prediction models based on cheap to analyze and easily scalable screening array genotyping platforms. Methods: Combining existing blood types from blood banks and imputed screening array genotypes for ~111,000 Danish and 1168 Finnish blood donors, we used deep learning techniques to train and validate blood type prediction models for 36 antigens in 15 blood group systems. To account for missing genotypes a denoising autoencoder initial step was utilized, followed by a convolutional neural network blood type classifier. Results: Two thirds of the trained blood type prediction models demonstrated an F1-accuracy above 99%. Models for antigens with low or high frequencies like, for example, Cw, low training cohorts like, for example, Cob, or very complicated genetic underpinning like, for example, RhD, proved to be more challenging for high accuracy (>99%) DL modeling. However, in the Danish cohort only 4 out of 36 models (Cob, Cw, D-weak, Kpa) failed to achieve a prediction F1-accuracy above 97%. This high predictive performance was replicated in the Finnish cohort. Discussion: High accuracy in a variety of blood groups proves viability of deep learning-based blood type prediction using array chip genotypes, even in blood groups with nontrivial genetic underpinnings. These techniques are suitable for aiding in identifying blood donors with rare blood types by greatly narrowing down the potential pool of candidate donors before clinical grade confirmation.
OriginalsprogEngelsk
TidsskriftTransfusion
Vol/bind64
Udgave nummer11
Sider (fra-til)2179-2195
Antal sider17
ISSN0041-1132
DOI
StatusUdgivet - nov. 2024

Emneord

  • AI
  • Blood antigen
  • Blood types
  • Convolutional neural network
  • Deep learning
  • Denoising autoencoder
  • Genetic prediction
  • Illumina GSA

Citer dette