Wellcome Sanger Institute
Sanger Institute Science Collaboration

Quantum Pangenomics

Genomics is currently driving a huge improvement in our understanding of Human development and disease. Pangenomics is the next step on this journey, as understanding the variation in many genomes is key to unravelling how genetic traits can affect health outcomes. Unfortunately, building and analysing a pangenome is computationally intensive. Quantum computing may offer ways of overcoming this complexity, allowing more accurate pangenome representations to be constructed. This will enable more precise description of genetic variation, leading to improved diagnostics and treatments for a diverse range of medical conditions.

Genome sequencing is vital for applications used in monitoring disease outbreaks and personalised medicine. The structure of many challenging problems in computational genomics and pangenomics in particular makes them suitable candidates for speedups promised by quantum computing. The resulting advances have the potential to unlock transformative health benefits that depend on large-scale genomic analysis.

In Phase 1 of the project, our team adapted complex problems like genome assembly and construction of phylogenetic trees into a hybrid quantum-classical framework, enabling promising quantum speedups with emerging technology. Our key innovations include scalable quantum data encoding algorithms, setting the stage for storing and manipulating significant amounts of genomic data, and faster algorithms for genome assembly and phylogenetic tree inference. 

As we move into Phase 2, we look forward to simulating our new approaches with HPC using machine-learning-oriented encoding schemes and tensor network methods. Alongside this work, we will test the ability of our algorithms to resolve parts of the genome graph that are intractable classically. We will gain further insight into performance at scale, enabling us to move forward in Phase 3 to implementation on real quantum hardware. In collaboration with quantum hardware vendors, we will ensure that our proposed implementations account for hardware-specific architecture and noise properties.

 

Sanger people

Photo of Dave Holland

Dave Holland

Principal Systems Administrator

Photo of Dr Peter Clapham

Dr Peter Clapham

ISG Team Leader

Photo of Dr James McCafferty

Dr James McCafferty

Chief Information Officer

Photo of Robert Davies

Robert Davies

Senior Scientific Manager

Photo of James Bonfield

James Bonfield

Principal Software Developer

Photo of Andrew Whitwham

Andrew Whitwham

Senior Software Developer

External Contributors

Photo of Dr Sergii Strelchuk

Dr Sergii Strelchuk

Principal Investigator

Department of Applied Mathematics and Theoretical Physics at the University of Cambridge

Photo of Professor Richard Durbin, FRS

Professor Richard Durbin, FRS

Principal Investigator

Department of Genetics at the University of Cambridge

Photo of Dr David Yuan

Dr David Yuan

Project Lead at EMBL-EBI

Archive Infrastructure and Technology

Photo of Joshua Cudby

Joshua Cudby

Student

Department of Applied Mathematics and Theoretical Physics at the University of Cambridge

Photo of Orson Ye

Orson Ye

Student

Department of Applied Mathematics and Theoretical Physics at the University of Cambridge

External partners and funders

External

Wellcome Leap Quantum for Bio (Q4Bio) Supported Challenge Program.

The ultimate goal is to demonstrate the potential of quantum computing in solving critical health challenges

External

Faculty of Mathematics

University of Cambridge

External

EMBL-EBI

External

Department of Mathematics

Kyiv Academic University

 
See full index

Publications

Loading publications...