Entary Figures S1 and S2). Most duplicated genes also showed similar expression pattern in leaf except GrKMT1A;4b/4c/4d (Supplementary Figures S1 and S2), suggesting that some duplicated genes undergone functional differentiation but others not.MethodsSequences of SET domain-containing proteins from Arabidopsis thaliana were retrieved from the official website (https://www.arabidopsis.org/Blast/index.jsp). The sequences of SET domain of these sequences were used as queries to search G. raimondii homologs (http://www.phytozome.net, version 10.3) using the BLASTp. The sequence of SET domain-containing proteins of rice was extracted from Huang et al.9 and web http://www.phytozome.net (version 10.3). All the sequences were re-confirmed in SMART database (http://smart.embl-heidelberg. de/). The gene loci information of G. raimondii was used to generate the chromosome maps by the Mapchart 2.2 program55. When candidate genes was found to be both > 70 coverage of shorter full-length-CDS sequence and >70 identical in the sequence of their encoding amino acids, they were regarded as duplicated genes21. When the duplicated genes were located within 100 kb and were separated by ten or fewer non-homologues, they were defined as tandem duplicated genes22. The coverage of full-length-CDS sequence and the get XAV-939 similarity of amino acid sequences were detected by Blastn/Blastp in NCBI.Identification of SET domain-containing proteins and construction of chromosome map.Analysis of gene structure, domain organization and phylogenetic tree. The gene structure was reconstructed using Gene Structure Display Server (http://gsds.cbi.pku.edu.cn/). Domain organization was confirmed by SMART and NCBI (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi), and the low-complexity filter was turned off, and the Expect Value was set at 10. Then the site information of domains was subjected to Dog2.0 to construct the proteins organization sketch map56. Multiple sequence alignments of SET domains were carried out by the Clustal W program57 and the resultant file was subjected to phylogenic analysis using the MEGA 6.0 program58. Based on the full-length protein sequences, the phylogenetic trees were constructed using Neighbor-Joining methods with Partial deletion and p-distance Method, Bootstrap test of 1000 replicates for internal branch reliability. Plant material and high temperature treatment.G. raimondii seedlings were grown in greenhouse at 28 under a 10 h day/14 h night cycle. 5-week-old seedlings with 5? true leaves were placed in a growth chamber at high temperature condition (38 ; 28 as a mock) for 12, 24, and 48 h. The leaves were harvested at the appropriate time points as indicated (triplicate samples were collected at each time point) for detecting genes expression in response to HT. The roots, stems and leaves were collected from plants at the stage of 5? true leaves and the petals, anther and ovary were sampled on the day of flowering for gene expression analysis of tissue/ organ. The materials were quick frozen in liquid nitrogen and stored at -70 for further analysis.RNA extraction and real-time CI-1011 manufacturer quantitative RT-PCR. Total RNA was extracted from the materials mentioned above using TRIzol reagent kit (Invitrogen, Carlsbad, CA, US) according to the manufacturer’s specification. The yield of RNA was determined using a NanoDrop 2000 spectrophotometer (Thermo Scientific, USA), and the integrity was evaluated using agarose gel electrophoresis stained with et.Entary Figures S1 and S2). Most duplicated genes also showed similar expression pattern in leaf except GrKMT1A;4b/4c/4d (Supplementary Figures S1 and S2), suggesting that some duplicated genes undergone functional differentiation but others not.MethodsSequences of SET domain-containing proteins from Arabidopsis thaliana were retrieved from the official website (https://www.arabidopsis.org/Blast/index.jsp). The sequences of SET domain of these sequences were used as queries to search G. raimondii homologs (http://www.phytozome.net, version 10.3) using the BLASTp. The sequence of SET domain-containing proteins of rice was extracted from Huang et al.9 and web http://www.phytozome.net (version 10.3). All the sequences were re-confirmed in SMART database (http://smart.embl-heidelberg. de/). The gene loci information of G. raimondii was used to generate the chromosome maps by the Mapchart 2.2 program55. When candidate genes was found to be both > 70 coverage of shorter full-length-CDS sequence and >70 identical in the sequence of their encoding amino acids, they were regarded as duplicated genes21. When the duplicated genes were located within 100 kb and were separated by ten or fewer non-homologues, they were defined as tandem duplicated genes22. The coverage of full-length-CDS sequence and the similarity of amino acid sequences were detected by Blastn/Blastp in NCBI.Identification of SET domain-containing proteins and construction of chromosome map.Analysis of gene structure, domain organization and phylogenetic tree. The gene structure was reconstructed using Gene Structure Display Server (http://gsds.cbi.pku.edu.cn/). Domain organization was confirmed by SMART and NCBI (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi), and the low-complexity filter was turned off, and the Expect Value was set at 10. Then the site information of domains was subjected to Dog2.0 to construct the proteins organization sketch map56. Multiple sequence alignments of SET domains were carried out by the Clustal W program57 and the resultant file was subjected to phylogenic analysis using the MEGA 6.0 program58. Based on the full-length protein sequences, the phylogenetic trees were constructed using Neighbor-Joining methods with Partial deletion and p-distance Method, Bootstrap test of 1000 replicates for internal branch reliability. Plant material and high temperature treatment.G. raimondii seedlings were grown in greenhouse at 28 under a 10 h day/14 h night cycle. 5-week-old seedlings with 5? true leaves were placed in a growth chamber at high temperature condition (38 ; 28 as a mock) for 12, 24, and 48 h. The leaves were harvested at the appropriate time points as indicated (triplicate samples were collected at each time point) for detecting genes expression in response to HT. The roots, stems and leaves were collected from plants at the stage of 5? true leaves and the petals, anther and ovary were sampled on the day of flowering for gene expression analysis of tissue/ organ. The materials were quick frozen in liquid nitrogen and stored at -70 for further analysis.RNA extraction and real-time quantitative RT-PCR. Total RNA was extracted from the materials mentioned above using TRIzol reagent kit (Invitrogen, Carlsbad, CA, US) according to the manufacturer’s specification. The yield of RNA was determined using a NanoDrop 2000 spectrophotometer (Thermo Scientific, USA), and the integrity was evaluated using agarose gel electrophoresis stained with et.