Hybracter: enabling scalable, automated, complete and accurate bacterial genome assemblies

Hybracter: enabling scalable, automated, complete and accurate bacterial genome assemblies

08 May 2024 | George Bouras¹,², Ghaus Houtak¹,², Ryan R. Wick³, Vijini Mallawaarachchi⁴, Michael J. Roach⁴,⁵, Bhavya Papudeshi⁴, Louise M. Judd³, Anna E. Sheppard⁶, Robert A. Edwards⁴ and Sarah Vreugde¹,²
Hybracter is a new automated tool for assembling complete bacterial genomes using long-read sequencing. It allows for fast, accurate, and scalable assembly using either long-reads alone or in combination with short-reads. Hybracter outperforms existing tools like Unicycler in accuracy and speed, especially for hybrid assemblies. It also excels in recovering small plasmids, which are often missed by long-read assemblers. Hybracter is designed to be efficient and scalable, supporting high-performance computing and cloud environments. It is freely available as open-source software on GitHub, Bioconda, and PyPI. The tool uses a comprehensive workflow that includes long-read assembly, polishing with short-reads, and plasmid recovery. Hybracter is particularly effective in recovering complete plasmid genomes and is compatible with both hybrid and long-read-only assembly approaches. It is also optimized for parallel processing, enabling the assembly of hundreds or thousands of genomes with a single command. Hybracter is available with detailed documentation and benchmarking data, making it a valuable tool for bacterial genome assembly. The study demonstrates that Hybracter provides accurate and efficient genome assemblies, especially for plasmid recovery, and is suitable for large-scale genomic studies.Hybracter is a new automated tool for assembling complete bacterial genomes using long-read sequencing. It allows for fast, accurate, and scalable assembly using either long-reads alone or in combination with short-reads. Hybracter outperforms existing tools like Unicycler in accuracy and speed, especially for hybrid assemblies. It also excels in recovering small plasmids, which are often missed by long-read assemblers. Hybracter is designed to be efficient and scalable, supporting high-performance computing and cloud environments. It is freely available as open-source software on GitHub, Bioconda, and PyPI. The tool uses a comprehensive workflow that includes long-read assembly, polishing with short-reads, and plasmid recovery. Hybracter is particularly effective in recovering complete plasmid genomes and is compatible with both hybrid and long-read-only assembly approaches. It is also optimized for parallel processing, enabling the assembly of hundreds or thousands of genomes with a single command. Hybracter is available with detailed documentation and benchmarking data, making it a valuable tool for bacterial genome assembly. The study demonstrates that Hybracter provides accurate and efficient genome assemblies, especially for plasmid recovery, and is suitable for large-scale genomic studies.
Reach us at info@study.space