Creating a universal SNP and small indel variant caller with deep neural networks

Creating a universal SNP and small indel variant caller with deep neural networks

March 20, 2018 | Ryan Poplin12, Pi-Chuan Chang2, David Alexander2, Scott Schwartz2, Thomas Colthurst2, Alexander Ku2, Dan Newburger1, Jojo Dijamco1, Nam Nguyen1, Pegah T. Afshar1, Sam S. Gross1, Lizzie Dorfman12, Cory Y. McLean12, Mark A. DePristo*1,2
The paper introduces DeepVariant, a deep learning-based approach for calling genetic variants from next-generation sequencing (NGS) data. Traditional variant callers, such as GATK, rely on hand-crafted statistical models that can produce thousands of errors and missed variants. DeepVariant uses a deep convolutional neural network (CNN) to learn statistical relationships between images of read pileups around putative variant sites and ground-truth genotype calls. This approach outperforms existing tools, winning the "highest performance" award for SNPs in a FDA-administered variant calling challenge. DeepVariant generalizes across genome builds and even to other mammalian species, allowing non-human sequencing projects to benefit from human ground truth data. The model can also learn to call variants in various sequencing technologies and experimental designs, demonstrating its robustness and versatility. The paper provides detailed descriptions of the DeepVariant workflow, including candidate variant identification, image creation, and deep learning model training and inference.The paper introduces DeepVariant, a deep learning-based approach for calling genetic variants from next-generation sequencing (NGS) data. Traditional variant callers, such as GATK, rely on hand-crafted statistical models that can produce thousands of errors and missed variants. DeepVariant uses a deep convolutional neural network (CNN) to learn statistical relationships between images of read pileups around putative variant sites and ground-truth genotype calls. This approach outperforms existing tools, winning the "highest performance" award for SNPs in a FDA-administered variant calling challenge. DeepVariant generalizes across genome builds and even to other mammalian species, allowing non-human sequencing projects to benefit from human ground truth data. The model can also learn to call variants in various sequencing technologies and experimental designs, demonstrating its robustness and versatility. The paper provides detailed descriptions of the DeepVariant workflow, including candidate variant identification, image creation, and deep learning model training and inference.
Reach us at info@study.space
Understanding A universal SNP and small-indel variant caller using deep neural networks