SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler

SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler

2012 | Ruibang Luo, Binghang Liu, Yinlong Xie, Zhenyu Li, Weihua Huang, Jianying Yuan, Guangzhu He, Yanxiang Chen, Qi Pan, Yunjie Liu, Jingbo Tang, Gengxiong Wu, Hao Zhang, Yujian Shi, Yong Liu, Chang Yu, Bo Wang, Yao Lu, Changlei Han, David W Cheung, Siu-Ming Yiu, Shaoliang Peng, Zhu Xiaoqian, Guangming Liu, Xiangke Liao, Yingrui Li, Huanming Yang, Jian Wang, Tak-Wah Lam, Jun Wang
The paper introduces SOAPdenovo2, an improved version of the de novo genome assembler SOAPdenovo. The authors address several challenges in short-read de novo assembly, including continuity, accuracy, and coverage, particularly in repeat regions. Key improvements in SOAPdenovo2 include enhanced error correction, reduced memory consumption during graph construction, better resolution of repeat regions, increased assembly length and coverage in scaffolding, and improved gap closure. Benchmarking using the Assemblathon1 and GAGE datasets shows that SOAPdenovo2 outperforms its predecessor and other assemblers in terms of assembly length and accuracy. The updated assembly of the 2008 Asian (YH) genome using SOAPdenovo2 results in significantly longer contigs and scaffolds, higher genome coverage, and lower memory consumption. The paper also provides an updated assembly of the YH genome, which is 3-fold longer in contig length and 50-fold longer in scaffold length compared to the first published version.The paper introduces SOAPdenovo2, an improved version of the de novo genome assembler SOAPdenovo. The authors address several challenges in short-read de novo assembly, including continuity, accuracy, and coverage, particularly in repeat regions. Key improvements in SOAPdenovo2 include enhanced error correction, reduced memory consumption during graph construction, better resolution of repeat regions, increased assembly length and coverage in scaffolding, and improved gap closure. Benchmarking using the Assemblathon1 and GAGE datasets shows that SOAPdenovo2 outperforms its predecessor and other assemblers in terms of assembly length and accuracy. The updated assembly of the 2008 Asian (YH) genome using SOAPdenovo2 results in significantly longer contigs and scaffolds, higher genome coverage, and lower memory consumption. The paper also provides an updated assembly of the YH genome, which is 3-fold longer in contig length and 50-fold longer in scaffold length compared to the first published version.
Reach us at info@study.space