2024 February 29 | David Porubsky, Evan E. Eichler
Over the past 25 years, advancements in genomic technology have significantly transformed the discovery of germline variants in human genetics. The field has evolved from microarrays to short-read sequencing (SRS) and now long-read sequencing (LRS), enabling comprehensive detection of all forms of genetic variation with a single assay. This transition promises to deepen our understanding of human health and biology, offering new insights into the dynamic mutational processes shaping our genomes.
Early technologies like chromosomal microarrays (CMAs) and short-read sequencing (SRS) enabled the detection of copy number variations (CNVs) and single-nucleotide variants (SNVs), respectively. SRS revolutionized variant discovery by providing precision at the base-pair level, while LRS offers longer reads, native DNA sequencing, and direct detection of modified base pairs, allowing for more comprehensive genetic variation analysis.
The integration of LRS with SRS data has led to the creation of the first telomere-to-telomere (T2T) human genome, providing access to previously uncharted regions of the genome, including centromeres and segmental duplications. This has improved the accuracy of variant detection and enabled the identification of complex structural variants (SVs), including inversions and duplications.
LRS has also facilitated the development of phased genome assemblies, allowing for the resolution of both maternal and paternal haplotypes. This has enhanced the understanding of genetic variation and its impact on disease, including the identification of recurrent inversions and interlocus gene conversion (IGC) events.
The future of human genetics lies in the development of a single assay capable of characterizing all forms of genetic variation. This would involve the use of pangenome references and advanced graph-based assemblies to improve genotyping accuracy and enable the discovery of novel variants. While challenges remain in terms of cost, computational complexity, and clinical adoption, the potential benefits for personalized medicine and genetic counseling are significant.
The ongoing advancements in LRS technology, coupled with the development of pangenome references, are paving the way for a more comprehensive understanding of human genetic variation and its implications for health and disease.Over the past 25 years, advancements in genomic technology have significantly transformed the discovery of germline variants in human genetics. The field has evolved from microarrays to short-read sequencing (SRS) and now long-read sequencing (LRS), enabling comprehensive detection of all forms of genetic variation with a single assay. This transition promises to deepen our understanding of human health and biology, offering new insights into the dynamic mutational processes shaping our genomes.
Early technologies like chromosomal microarrays (CMAs) and short-read sequencing (SRS) enabled the detection of copy number variations (CNVs) and single-nucleotide variants (SNVs), respectively. SRS revolutionized variant discovery by providing precision at the base-pair level, while LRS offers longer reads, native DNA sequencing, and direct detection of modified base pairs, allowing for more comprehensive genetic variation analysis.
The integration of LRS with SRS data has led to the creation of the first telomere-to-telomere (T2T) human genome, providing access to previously uncharted regions of the genome, including centromeres and segmental duplications. This has improved the accuracy of variant detection and enabled the identification of complex structural variants (SVs), including inversions and duplications.
LRS has also facilitated the development of phased genome assemblies, allowing for the resolution of both maternal and paternal haplotypes. This has enhanced the understanding of genetic variation and its impact on disease, including the identification of recurrent inversions and interlocus gene conversion (IGC) events.
The future of human genetics lies in the development of a single assay capable of characterizing all forms of genetic variation. This would involve the use of pangenome references and advanced graph-based assemblies to improve genotyping accuracy and enable the discovery of novel variants. While challenges remain in terms of cost, computational complexity, and clinical adoption, the potential benefits for personalized medicine and genetic counseling are significant.
The ongoing advancements in LRS technology, coupled with the development of pangenome references, are paving the way for a more comprehensive understanding of human genetic variation and its implications for health and disease.