摘要:Bread wheat (Triticum aestivum L.) hexaploid genome is a result of several multiple hybridization events between three diploid progenitor species, comprising three individual subgenomes: A, B, and D. To track the events that have shaped the modern wheat genome, a phylogenetic-based in silico model was developed. The model is based on a comparative analysis between each of the three subgenomes and its progenitor: the subgenomes A and D were compared respectively to Triticum urartu and Aegilops tauschii, while for the still disputable B subgenome predecessor the model plant Brachypodium distachyon was used. The phylogenetic trees for this analysis were obtained from Ensembl Plants. All analyses were done with custom-developed Python scripts. The dataset shows almost equal gene distribution between the three subgenomes: A – 31%, B – 37,3%, and D – 31,7%. The evolutionary relationships were traced using a Species Overlap algorithm, and 54 807 groups of homologs involving wheat genes were detected, including 35 933 orthologous and 18 874 paralogous relationships. The homeologous relations between the three subgenomes were included within the paralogs. 77% of the orthologous groups involved Triticum urartu, 79% - Aegilops tauschii and 64% - Brachypodium distachyon. The paralogs between Triticum urartu and Triticum aestivum were 68%, between Aegilops tauschii and Triticum aestivum, – 71%, and for the Brachypodium distachyon – 42%. Comparative phylogenetic analysis allows identification of the closest homolog and probable predecessor for majority of the available wheat genes. The distribution of genes, orthologs and paralogs also implies an opportunity for subsequent functional prediction and other related analyses
关键词:hexaploid wheat; evolution; phylogeny; in silico comparative analysis