Received September 22, 2004; Accepted October 5, 2004 | Aron Marchler-Bauer*, John B. Anderson, Praveen F. Cherukuri, Carol DeWeese-Scott, Lewis Y. Geer, Marc Gwadz, Siqian He, David I. Hurwitz, John D. Jackson, Zhaoxi Ke, Christopher J. Lanczycki, Cynthia A. Liebert, Chunlei Liu, Fu Lu, Gabriele H. Marchler, Mikhail Mullokandov, Benjamin A. Shoemaker, Vahan Simonyan, James S. Song, Paul A. Thiessen, Roxanne A. Yamashita, Jodie J. Yin, Dachuan Zhang and Stephen H. Bryant
The Conserved Domain Database (CDD) is a protein classification component of NCBI's Entrez query and retrieval system, linked to other databases such as Proteins, Taxonomy, and PubMed. CDD provides a fast, interactive tool called CD-Search to identify conserved domains in new protein sequences. The database is updated and curated to reduce redundancy and improve functionality. CDD contains models from Pfam, SMART, COG, and NCBI-curated models, with pre-calculated CD-Search results available for proteins in Entrez. Users can explore domain hierarchies, sequence trees, and taxonomic diversity, and access alignment data and analysis algorithms. Future developments include distributing software for visualizing CD family hierarchies and sequence trees.The Conserved Domain Database (CDD) is a protein classification component of NCBI's Entrez query and retrieval system, linked to other databases such as Proteins, Taxonomy, and PubMed. CDD provides a fast, interactive tool called CD-Search to identify conserved domains in new protein sequences. The database is updated and curated to reduce redundancy and improve functionality. CDD contains models from Pfam, SMART, COG, and NCBI-curated models, with pre-calculated CD-Search results available for proteins in Entrez. Users can explore domain hierarchies, sequence trees, and taxonomic diversity, and access alignment data and analysis algorithms. Future developments include distributing software for visualizing CD family hierarchies and sequence trees.