Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool

Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool

2021, 7(2), 1–9 | Aíne O'Toole, Emily Scher, Anthony Underwood, Ben Jackson, Verity Hill, John T. McCrone, Rachel Colquhoun, Chris Ruis, Khalil Abu-Dahab, Ben Taylor, Corin Yeats, Louis du Plessis, Daniel Maloney, Nathan Medd, Stephen W. Attwood, David M. Aanensen, Edward C. Holmes, Oliver G. Pybus, Andrew Rambaut
The article introduces the Pangolin tool, a computational method designed to assign the most likely Pango lineage to SARS-CoV-2 genome sequences. The tool addresses the challenge of managing and interpreting the vast amount of genomic data generated during the pandemic. Pangolin uses a machine learning model, pangoLEARN, trained on designated sequences and lineage designations, to assign lineages to new sequences. The tool is available as a web application and command-line tool, facilitating access to actionable information for researchers. The authors describe the development, performance, and limitations of Pangolin, highlighting its robustness to missing data, ambiguous sequences, and novel recombinant genomes. They also discuss future directions, including the potential for adapting the tool to other viral outbreaks and the need for regular updates to the training data. The Pangolin tool has been instrumental in facilitating genomic epidemiology and public health decision-making during the pandemic.The article introduces the Pangolin tool, a computational method designed to assign the most likely Pango lineage to SARS-CoV-2 genome sequences. The tool addresses the challenge of managing and interpreting the vast amount of genomic data generated during the pandemic. Pangolin uses a machine learning model, pangoLEARN, trained on designated sequences and lineage designations, to assign lineages to new sequences. The tool is available as a web application and command-line tool, facilitating access to actionable information for researchers. The authors describe the development, performance, and limitations of Pangolin, highlighting its robustness to missing data, ambiguous sequences, and novel recombinant genomes. They also discuss future directions, including the potential for adapting the tool to other viral outbreaks and the need for regular updates to the training data. The Pangolin tool has been instrumental in facilitating genomic epidemiology and public health decision-making during the pandemic.
Reach us at info@study.space