2020 | David Lähnemann, Johannes Köster, Ewa Szczurek, Davis J. McCarthy, Stephanie C. Hicks, Mark D. Robinson, Catalina A. Vallejos, Kieran R. Campbell, Niko Beerenwinkel, Ahmed Mahfouz, Luca Pinello, Pavel Skums, Alexandros Stamatakis, Camille Stephan-Otto Attolini, Samuel Aparicio, Jasmijn Baaijens, Marlene Balvert, Buys de Barbanson, Antonio Cappuccio, Giacomo Corleone, Bas E. Dutilh, Maria Florescu, Victor Guryev, Rens Holmer, Katharina Jahn, Thamar Jessurun Lobo, Emma M. Keizer, Indu Khatri, Szymon M. Kielbasa, Jan O. Korbel, Alexey M. Kozlov, Tzu-Hao Kuo, Boudewijn P.F. Lelieveldt, Ion I. Mandoiu, John C. Marion, Tobias Marschall, Felix Mölder, Amir Niknejad, Alicja Rączkowska, Marcel Reinders, Jeroen de Ridder, Antoine-Emmanuel Saliba, Antonios Somarakis, Oliver Stegle, Fabian J. Theis, Huan Yang, Alex Zelikovsky, Alice C. McHardy, Benjamin J. Raphael, Sohrab P. Shah and Alexander Schönhuth
The recent surge in microfluidics and combinatorial indexing strategies, along with low sequencing costs, has enabled single-cell sequencing technology. This has led to a data revolution in single-cell biology, posing unique data science challenges. The authors outline eleven key challenges for advancing single-cell data science (SCDS). These challenges span transcriptomics, genomics, and phylogenomics, and include issues such as handling sparsity in scRNA-seq, quantifying measurement uncertainty, integrating data across samples and experiments, and scaling to higher dimensionalities. The paper emphasizes the need for computationally efficient and statistically sound methods to manage the vast amounts of data generated by single-cell sequencing. It also highlights the importance of benchmarking and validating analysis tools. The challenges are categorized into recurring themes and specific challenges, with a focus on the need for flexible statistical frameworks, reliable reference systems, and generalizing trajectory inference. The paper also discusses the importance of integrating multiple types of data and the need for methods that can handle the unique challenges of single-cell data, such as high sparsity and technical noise. The authors conclude that SCDS is entering a new era, requiring innovative solutions to address the complex data science problems arising from single-cell sequencing.The recent surge in microfluidics and combinatorial indexing strategies, along with low sequencing costs, has enabled single-cell sequencing technology. This has led to a data revolution in single-cell biology, posing unique data science challenges. The authors outline eleven key challenges for advancing single-cell data science (SCDS). These challenges span transcriptomics, genomics, and phylogenomics, and include issues such as handling sparsity in scRNA-seq, quantifying measurement uncertainty, integrating data across samples and experiments, and scaling to higher dimensionalities. The paper emphasizes the need for computationally efficient and statistically sound methods to manage the vast amounts of data generated by single-cell sequencing. It also highlights the importance of benchmarking and validating analysis tools. The challenges are categorized into recurring themes and specific challenges, with a focus on the need for flexible statistical frameworks, reliable reference systems, and generalizing trajectory inference. The paper also discusses the importance of integrating multiple types of data and the need for methods that can handle the unique challenges of single-cell data, such as high sparsity and technical noise. The authors conclude that SCDS is entering a new era, requiring innovative solutions to address the complex data science problems arising from single-cell sequencing.