2020 | David Lähnemann, Johannes Köster, Ewa Szczurek, Davis J. McCarthy, Stephanie C. Hicks, Mark D. Robinson, Catalina A. Vallejos, Kieran R. Campbell, Niko Beerenwinkel, Ahmed Mahfouz, Luca Pinello, Pavel Skums, Alexandros Stamatakis, Camille Stephan-Otto Attolini, Samuel Aparicio, Jasmijn Baaijens, Marleen Balvert, Buys de Barbanson, Antonio Cappuccio, Giacomo Corleone, Bas E. Dutilh, Maria Florescu, Victor Guryev, Rens Holmer, Katharina Jahn, Thamar Jessurun Lobo, Emma M. Keizer, Indu Khatri, Szymon M. Kielbasa, Jan O. Korbel, Alexey M. Kozlov, Tzu-Hao Kuo, Boudewijn P.F. Lelieveldt, Ion I. Mandouiu, John C. Marioni, Tobias Marschall, Felix Mölder, Amir Niknejad, Alicja Rączkowska, Marcel Reinders, Jeroen de Ridder, Antoine-Emmanuel Saliba, Antonios Somarakis, Oliver Stegle, Fabian J. Theis, Huan Yang, Alex Zelikovsky, Alice C. McHardy, Benjamin J. Raphael, Sohrab P. Shah, Alexander Schönthuth
The article outlines eleven grand challenges in single-cell data science, highlighting the unique data science problems arising from the recent advancements in microfluidics and low sequencing costs. These challenges are categorized into transcriptomics, genomics, and phylogenomics, with each challenge addressing specific issues such as handling sparsity in single-cell RNA sequencing, defining flexible statistical frameworks for complex differential patterns, mapping single cells to reference atlases, and generalizing trajectory inference. The authors emphasize the need for scalable and statistically sound methods to handle the vast amounts of data generated, quantify measurement uncertainties, and integrate data from different types of measurements. They also discuss the importance of benchmarking and validating analysis tools to ensure the reliability and accuracy of single-cell data analysis. The challenges are designed to guide future research and development in single-cell data science, making it accessible to researchers from various fields.The article outlines eleven grand challenges in single-cell data science, highlighting the unique data science problems arising from the recent advancements in microfluidics and low sequencing costs. These challenges are categorized into transcriptomics, genomics, and phylogenomics, with each challenge addressing specific issues such as handling sparsity in single-cell RNA sequencing, defining flexible statistical frameworks for complex differential patterns, mapping single cells to reference atlases, and generalizing trajectory inference. The authors emphasize the need for scalable and statistically sound methods to handle the vast amounts of data generated, quantify measurement uncertainties, and integrate data from different types of measurements. They also discuss the importance of benchmarking and validating analysis tools to ensure the reliability and accuracy of single-cell data analysis. The challenges are designed to guide future research and development in single-cell data science, making it accessible to researchers from various fields.