The Ensembl genome database project

The Ensembl genome database project

2002, Vol. 30, No. 1 | T. Hubbard, D. Barker, E. Birney1,* G. Cameron1, Y. Chen1, L. Clark, T. Cox, J. Cuff, V. Curwen, T. Down, R. Durbin, E. Eyras, J. Gilbert, M. Hammond1, L. Humminiecki1, A. Kasprzyk1, H. Lehtovaslaioh1, P. Lijnzaad1, C. Melsopp1, E. Mongin1, R. Pettett, M. Pocock, S. Potter, A. Rust1, E. Schmidt1, S. Searle, G. Slater1, J. Smith, W. Spooner, A. Stabenau1, J. Stalker, E. Stupka1, A. Ureta-Vidal1, I. Vastrik1 and M. Clamp
The Ensembl genome database project, a collaboration between the Wellcome Trust Sanger Institute and the European Bioinformatics Institute (EBI), provides a comprehensive and stable source of automatic annotation for the human genome sequence. It integrates confirmed gene predictions with external data sources and is available as an interactive web site or flat files. Ensembl is also an open-source software project designed to handle large genomes and associated data from sequence analysis to storage and visualization. The project aims to be portable and easy to apply to different organisms and data types, following the spirit of open-source community projects like Linux. Ensembl's gene annotation process includes ab initio predictions, homology, and gene prediction HMMs, with a focus on minimizing false positives. The Ensembl web site offers various views and tools for data analysis, including BLAST and SSAHA search capabilities. The software system is built on a relational database and reusable components, with a core feature being the 'Virtual Contig' object for seamless handling of draft genome data. Ensembl also supports the Distributed Annotation System (DAS) standard, enabling users to view and compare annotations from multiple sources. The project is funded by the Wellcome Trust and EMBL, and it collaborates with other genome communities to annotate additional genomes.The Ensembl genome database project, a collaboration between the Wellcome Trust Sanger Institute and the European Bioinformatics Institute (EBI), provides a comprehensive and stable source of automatic annotation for the human genome sequence. It integrates confirmed gene predictions with external data sources and is available as an interactive web site or flat files. Ensembl is also an open-source software project designed to handle large genomes and associated data from sequence analysis to storage and visualization. The project aims to be portable and easy to apply to different organisms and data types, following the spirit of open-source community projects like Linux. Ensembl's gene annotation process includes ab initio predictions, homology, and gene prediction HMMs, with a focus on minimizing false positives. The Ensembl web site offers various views and tools for data analysis, including BLAST and SSAHA search capabilities. The software system is built on a relational database and reusable components, with a core feature being the 'Virtual Contig' object for seamless handling of draft genome data. Ensembl also supports the Distributed Annotation System (DAS) standard, enabling users to view and compare annotations from multiple sources. The project is funded by the Wellcome Trust and EMBL, and it collaborates with other genome communities to annotate additional genomes.
Reach us at info@study.space
[slides and audio] The Ensembl genome database project