2014-07-18 | The International Wheat Genome Sequencing Consortium (IWGSC), Mayer, Klaus FX, Rogers, Jane et al.
The International Wheat Genome Sequencing Consortium (IWGSC) produced a chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. This sequence includes 124,201 gene loci distributed across the homeologous chromosomes and subgenomes. Comparative gene analysis showed that high sequence similarity and structural conservation are retained after polyploidization, with limited gene loss. However, dynamic gene gain, loss, and duplication have occurred since the divergence of wheat lineages. Subgenomes exhibit high transcriptional autonomy and no global dominance. These insights aid in faster gene isolation, genetic marker development, and precise breeding to meet global food demands.
The hexaploid bread wheat genome consists of three subgenomes (A, B, D), with each subgenome containing about 5.5 Gb of DNA. The genome is highly repetitive, making it challenging to generate a reference sequence. Early efforts focused on coding sequences, but recent advances in sequencing technologies have enabled the assembly of the genome. The genome contains 106,000 functional protein-coding genes, with 124,201 annotated gene loci distributed across the subgenomes. The A, B, and D subgenomes show varying levels of gene conservation and synteny with related species.
The genome contains a high proportion of transposable elements, with 81% of raw reads and 76.6% of assembled sequences containing repeats. The distribution of transposons across the subgenomes shows that class I elements (retroelements) are more abundant in the A genome, while class II elements (DNA transposons) are more abundant in the D genome. The B genome has a lower number of LTR retroelements, suggesting that transposon activity post-polyploidization has introduced more recent amplifications into the B genome.
The genome contains 270 different putative microRNA molecules, with 98,068 predicted miRNA-coding loci. These miRNAs may play a role in gene regulation and development. The genome also contains 976,962 loci with 1,265,548 distinct splicing variants. These genes show varying levels of expression and are distributed across the subgenomes. The A, B, and D subgenomes have similar numbers of genes, with the B subgenome containing the highest number of gene loci.
The genome has a high degree of synteny with related species, with 58% of genes on the D genome chromosomes showing higher synteny than those on the A and B chromosomes. The genome also shows variation in gene copy retention or loss patterns depending on the gene family considered. The D subgenome, the most recent addition to the hexaploid genome, shows slightly lower gene loss than the more ancient A and B subgenomes.
The genome hasThe International Wheat Genome Sequencing Consortium (IWGSC) produced a chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. This sequence includes 124,201 gene loci distributed across the homeologous chromosomes and subgenomes. Comparative gene analysis showed that high sequence similarity and structural conservation are retained after polyploidization, with limited gene loss. However, dynamic gene gain, loss, and duplication have occurred since the divergence of wheat lineages. Subgenomes exhibit high transcriptional autonomy and no global dominance. These insights aid in faster gene isolation, genetic marker development, and precise breeding to meet global food demands.
The hexaploid bread wheat genome consists of three subgenomes (A, B, D), with each subgenome containing about 5.5 Gb of DNA. The genome is highly repetitive, making it challenging to generate a reference sequence. Early efforts focused on coding sequences, but recent advances in sequencing technologies have enabled the assembly of the genome. The genome contains 106,000 functional protein-coding genes, with 124,201 annotated gene loci distributed across the subgenomes. The A, B, and D subgenomes show varying levels of gene conservation and synteny with related species.
The genome contains a high proportion of transposable elements, with 81% of raw reads and 76.6% of assembled sequences containing repeats. The distribution of transposons across the subgenomes shows that class I elements (retroelements) are more abundant in the A genome, while class II elements (DNA transposons) are more abundant in the D genome. The B genome has a lower number of LTR retroelements, suggesting that transposon activity post-polyploidization has introduced more recent amplifications into the B genome.
The genome contains 270 different putative microRNA molecules, with 98,068 predicted miRNA-coding loci. These miRNAs may play a role in gene regulation and development. The genome also contains 976,962 loci with 1,265,548 distinct splicing variants. These genes show varying levels of expression and are distributed across the subgenomes. The A, B, and D subgenomes have similar numbers of genes, with the B subgenome containing the highest number of gene loci.
The genome has a high degree of synteny with related species, with 58% of genes on the D genome chromosomes showing higher synteny than those on the A and B chromosomes. The genome also shows variation in gene copy retention or loss patterns depending on the gene family considered. The D subgenome, the most recent addition to the hexaploid genome, shows slightly lower gene loss than the more ancient A and B subgenomes.
The genome has