Wuhan Zhang,Pingyong Sun,Qiang He,Fu Shu,Jie Wang,Huafeng Deng*
Rice is one of the most important grain crops and staple foods for more than half of the global population [1].Improving rice yield is an important means to fight hunger caused in part by a rapidly growing population along with reduced arable land area and occurring climate change and disease.Grain weight is a key component of rice grain yield,which is primarily defined by grain shape that is determined by length,width and thickness.It is well known that all these three gain traits are controlled by quantitative trait loci(QTLs),affected by the environment[2,3].High-precision genome sequences and various molecular markers have allowed mapping and identification of hundreds of QTLs associated with grain traits in rice (http://www.gramene.org/).Many QTLs have been identified from different rice germplasms by a map-based cloning approach and these accomplishments imply the promise to help understand the molecular mechanisms underlying seed development and find ways to improve rice yield.
GS3,a major QTL for grain weight and length with a minor role in grain width and thickness,was recently fine mapped to a genomic region of 7.9 kb on chromosome 3 using 5,740 BC3F2plants [4].GS3 encodes a putative transmembrane protein composed of four domains,and each functions differently in regulating grain size[5].Sequence analyses showed that large grains are due to an early stop codon from a substitution in the second exon and,suggesting that GS3 functions as a negative regulator of grain size.Similarly,loss of GS3 leads to grain enlargement,which is true for GW2 [6],qSW5 [7] and TGW6[8].The major QTL for thousand-grain weight,TGW6,is mapped on chromosome 6 and encodes a novel protein with indole-3-acetic acid(IAA)-glucose hydrolase activity.Deletion of 1-bp in TGW6 exon results in a premature stop codon to prevent the production of the mature protein.It has been shown that function loss of the TGW6 allele results in simultaneous increase of grain weight and yield[8].
Furthermore,GS5 is a recently cloned QTL,which variation is associated with grain size diversity in rice,thus may be useful in improving yield in rice and,potentially,other crops[9].Its spatial expression patterns demonstrate that higher expression of GS5 results in larger grains,suggesting that GS5 is a positive regulator of grain size [9].Another QTL affecting grain width and yield,GW8,encodes a protein to positively regulate grain size.The GW8 function on grain is attributable to a critical deletion polymorphism in the promoter region.In contrast,a loss-of-function mutation brings about a better quality of appearance[10].
In this study,we report the identification and fine mapping of GS2 candidate gene.Our results demonstrated that GS2 was a novel gene involved in the regulation of grain length and width in rice.The identification and functional characterization of GS2 will help breed high-yield rice varieties and understand the underlying molecular mechanisms to control grain shape in rice and other crops.
Big-grain rice line CDL was crossed with a medium-grain line R1126.The resultant F1plants were selfed to yield a F2population of 1000 individuals,and the following recombined inbred lines(RIL).A differentiation of grain shape was observed in RIL28 line of F6,indicating heterozygous.The individual plants of RIL28 were selfed to generate a F7population.800 F7RIL28 population segregated medium-grain (length ≤11 mm,width ≤3 mm) and big-grain (length >11 mm,width >3 mm)progenies in a ratio of 3:1,and these progenies were used to preliminarily map the grain-shape QTL using bulked segregant analysis (BSA) and recessive-class analysis (RCA) method.A locus between markers RM13819 and RM13863 on chromosome 2,designated as GS2,was clearly associated with the variation of grain phenotypes.The segregating populations were developed for fine-mapping of GS2 from the 10th plant in F7of RIL28 line,named RIL28-10 which was heterozygous in the GS2 region flanked by RM13819 and RM13863.The selfing progenies of the selected residual heterozygous line (RHL) RIL28-10 produced RHL-F2(3000 individuals) and RHL-F3(30,000 individuals)population.Grain length and width were averaged from randomly chosen ten mature,filled and grains of 100 RHL-F2individuals.The ten grains were lined up end to end along Vernier calipers to measure the length,and then arranged by breadth to measure grain width.Genetic analyses were conducted according to the frequency distribution maps of grain length and the χ2-test.All rice materials were provided by the Hunan Hybrid Rice Research Center and planted in a field in Chunhua,Changsha City(summer)or Sanya,Hainan(winter).
Simple sequence repeat (SSR) markers distributed in whole genome were identified using publicly available rice genomic sequences (http://www.gramene.org/).Single feature polymorphism (SFP) and intron length polymorphism (ILP)markers were obtained from Edwards et al.,[11]and Wang et al.,[12].Other additional primers were designed and evaluated using Primer Premier 5.Sequence comparisons of the japonica cultivar Nipponbare (http://www.ncbi.nlm.nih.gov/guide/) and the indica cultivar R1126 (the whole genome of R1126 was resequenced by BGI)within the target region of the chromosome were first analyzed online to obtain information on potential insertions/deletions (InDels).Primers were designed and evaluated for potential InDels containing the R1126 sequence using Primer Premier 5.Eight newly developed InDel markers are listed in Table 1.Total genomic DNA was extracted from fresh leaves using the CTAB method [13],and PCR analysis was performed according to Sun et al.[14].
Molecular marker analysis was carried out according to the method described by Zuo et al.,[15]with minor modifications.Briefly,polymorphic markers between the two parents were first analyzed in a small population including the two parents,ten F7medium-grain plants,and ten F7big-grain plants.Then,the markers selected from this small population were further utilized to screen a part of big-grain individuals and all medium-grain individuals in the same segregating population for linkage analysis.Finally,data were collected and transformed according to the requirement of MAPMAKER 3.0[16]to construct the linkage map.
Random evaluation of grain length and width of 100 RHL-F2individuals revealed a continuous bimodal distribution(Fig.1).Individuals of RHL-F2population were classified into medium-grain(length ≤11 mm,width ≤3 mm,24 individuals)and big-grain (length >11 mm,width >3 mm,76 individuals)groups with a segregation ratio for big versus medium grain fit to a ratio of 3:1 (χ2= 0.05 <χ20.05,1= 3.84).Large populations were investigated in F7,RHL-F2and RHL-F3with 179,720 and 7400 medium grain individuals found in the total populations of 800,3000,30,000 individuals,respectively.Likewise,the segregation ratios of big versus medium grain fit to a ratio of 3:1(χ2= 2.80,1.55,1.76 <χ20.05,1= 3.84).
A total of 129 polymorphic markers were detected between R1126 and CDL from 400 SSR,SFP and ILP markers,and 113 well-distributed polymorphic markers were used to survey the ten medium-grain plants,ten big-grain plants of F7population and parents.The GS2 gene was roughly mapped to the interval between RM13819 and RM13863 on the long arm of chromosome 2.We found that six SSR markers,namely RM3289,RM1342,RM5305,RM13819,RM3212 and RM13863,located on chromosome 2 were clearly associated with the medium-grain phenotype.After further studying 179 F7medium-grain plants using these six markers,the GS2 gene was located between RM13819 and RM13863 with genetic distances of 0.84 cM and 0.28 cM,respectively.Furthermore,0 recombinant was detected by marker RM3212.These data were derived according to the recombinants revealed by each marker,covering a ~553-kb physical segment on the region of rice chromosome 2 (Fig.2-A).
To fine-map the GS2 locus,29 polymorphic InDels were selected from 142 InDels developed according to the information on the sequence (R1126 and Nipponbare) between RM3212 and RM13863.Further genotyping 2576 medium grain plants of the RHL-F3revealed one recombinant in the proximity of GL2-35-1 and GL2-12.In addition,RM3212 and GL2-11 were verified to be linked to the GS2 gene.The GS2 locus was therefore finally narrowed down to the genomic region flanked by GL2-35-1 and GL2-12,a fragment of approximately 33.2 kb in length(Fig.2-B).
Table 1-InDel markers developed for fine mapping of the GS2 locus.
Fig.1-Frequency distribution in RHL-F2 population of R1126/CDL.A:Grain phenotypes of GS2GS2 and gs2gs2,Scale bar represents 1 cm;B:Frequency distribution in RHL-F2 individuals of R1126/CDL showing a continuous bimodal distribution,which suggested that GS2 functions as a dominant gene in control of grain length and width(GS2GS2:Grain length = 13.08,Grain width = 3.39;gs2gs2:Grain length = 10.06,Grain width = 2.96).
In the 33.2-kb genomic interval of the Nipponbare genome,a total of three putative genes including LOC_Os02g47280,LOC_Os02g47290 and LOC_Os02g47300 were predicted by TIGR rice annotation (http://rice.plantbiology.msu.edu/cgibin/gbrowse/rice/) (Fig.2-B).LOC_Os02g 47280 encoded a putative growth-regulating factor; LOC_Os02g47290 and LOC_Os02g47300 encoded hypothetical proteins with no further evidence such as expressed sequence tag (EST) or RNA.
Fig.2-Fine mapping of the GS2 gene.A:local linkage map of SSR markers and GS2 gene in the F7 population of R1126/CDL.The genetic map distance(cM)between two adjacent molecular markers(right)is shown on the left;B:physical map encompassing the GS2 locus,showing the ~33.2-kb chromosomal region delimited by GL2-35-1 and GL2-12.Numbers in bracket indicate recombinant events detected for each marker among 2576 medium grain RHL-F3 plants.Numbers under the markers represent the relative genetic distance(cM)between two adjacent markers.The physical distances between markers were deduced by analyzing the sequences released from http://rice.plantbiology.msu.edu/,and three candidates were predicted by TIGR annotation within this region(http://rice.plantbiology.msu.edu/cgi-bin/gbrowse/rice/).
Because of the recent developments in bioinformatics and genome sequencing to yield an impressive number of molecular markers,many major QTLs responsible for grain shape and yield have been fine mapped and cloned in the past 20 years.In this paper,we fine mapped GS2 using RHL population developed from a big-grain rice line CDL and a medium-grain line R1126.GS2,which controls grain length and width,was narrowed down to a candidate genomic region of 33.2 kb defined by the InDel markers GL2-35-1 and GL2-12 in the long arm of rice chromosome 2.Three annotated genes(LOC_Os02g47280,LOC_Os02g47290 and LOC_Os02g47300)were identified within the critical 33.2-kb genomic region of Nipponbare (japonica) genome (http://rice.plantbiology.msu.edu/).LOC_Os02g47290 and LOC_Os02g47300 encoded hypothetical proteins with no gene ontology annotation; thus those two genes might have no or marginal direct relevance to the grain shape development according to their putative functions.
The LOC_Os02g47280 encodes a growth-regulating factor protein,which belongs to the GRF family of proteins consisting of twelve members.The protein of LOC_Os02g47280 has two putative alternative splice forms,and both contain a WRC domain and a QLQ domain.Interestingly,the protein of LOC_Os02g47280 shares homology with a protein in Brachypodium,Zea mays L.,Populus L.and Sorghum vulgare Pers.(http://rice.plantbiology.msu.edu/).The WRC domain contains a putative nuclear localization signal and a zinc-finger motif (C3H).The WRC domain was suggested to be involved in DNA binding while QLQ domain was shown to affect protein–protein interactions [17].Recently it was demonstrated that LOC_Os02g47280 is downregulated by miR396 during grain development in rice [18].Therefore,LOC_Os02g47280 should be considered the most likely candidate for GS2.We are currently investigating a genetic complementation of the candidate gene by transformation and other functional analyses.
To date,more than 40 QTLs related to grain shape and yield have been primarily mapped on chromosome 2 of rice(http://www.gramene.org/).Some of these are located in the proximity of GS2.For example,the QTL qGL-2a,which affects grain length,was mapped in an interval between the restriction fragment length polymorphism (RFLP) marker C560 and C1408,accounting for 11.7% of total phenotypic variations[19].Another QTL qGL-2 responsible for grain length was detected between the RFLP marker C601and R3393 using chromosome segment substitution line (CSSL) populations across eight environments [20].Of note,the QTL associated with grain number,gpl2.1,was detected at the end of the long arm of chromosome 2 using an IR64/O.rufipogon BC2F2population [21].However the QTL gpl2.1 explained only 4.8%of phenotypic variations.In addition,four QTLs controlling grain shape and yield have been cloned on chromosome 2:qGY2–1 [22],GW2 [6],LP [23] and PGL2 [24].Precisely,GW2,a major QTL of grain width and weight,was mapped to a 8.2-kb interval flanked by markers W024 and W004 with a set of near-isogenic lines (NILs,BC2F2/BC3F2/BC3F4) developed from a sub-specific cross of the japonica cultivar Wy3 and indica cultivar FAZ1 [6].The LARGER PANICLE (LP) gene encodes a Kelch repeat-containing F-box protein and plays an important role in regulating plant architecture,particularly panicle architecture.Therefore,LP is an important genetic resource for improvement of grain production in plants.
In the present study,GS2 was characterized as a dominant gene for grain length and width,and the allele from indica cultivar CDL was dominant for big-grain.The GS2 gene was finally localized to an interval of ~33.2 kb between the InDel markers GL2-35-1 and GL2-12,which with approximately 2557 kb and 20,741 kb distant from the PGL-2 (LOC_Os02g51320) and GW2(LOC_Os02g14720) loci,respectively,was not allelic to the previously reported PGL-2 and GW2.Thus we consider GS2 as a new dominant gene for rice grain length and width.
Only few studies have been reported on the molecular mechanisms underlying rice grain shape[3].Therefore,identification of novel genetic loci that regulate grain shape and characterization of their respective genes would enhance our understanding on rice seed development.We report GS2 as a novel gene controlling grain length and width.The molecular markers closely linked to GS2 can promote the breeding of high-yield rice varieties and the characterization of GS2 function(s)will provide a new approach to understanding seed development in rice and other crops.
The research presented in this study identified a novel gene GS2 responsible for grain length and width in rice.GS2 was localized to an interval of ~33.2 kb between the markers GL2-35-1 and GL2-12 on chromosome 2.Three annotated genes were identified within the GS2 locus from Nipponbare genome,and the LOC_Os02g47280 was considered the most likely candidate for GS2.No QTL responsible for grain shape and yield has been fine mapped and cloned at GS2 locus.
This work is financially supported by the National High Technology Research and Development Program of China(2011AA10A101) and the Hunan Provincial Natural Science Foundation of China(10JJ2025).
[1] Y.H.Wang,J.Y.Li,Branching in rice,Curr.Opin.Plant Biol.14(2011) 94–99.
[2] X.J.Song,M.Ashikari,Toward an optimum return from crop plants,Rice 1(2008) 135–143.
[3] Y.Z.Xing,Q.F.Zhang,Genetic and molecular bases of rice yield,Annu.Rev.Plant Biol.61(2010) 421–442.
[4] C.C.Fan,Y.Z.Xing,H.L.Mao,T.T.Lu,B.Han,C.G.Xu,X.H.Li,Q.F.Zhang,GS3,a major QTL for grain length and weight and minor QTL for grain width and thickness in rice,encodes a putative transmembrane protein,Theor.Appl.Genet.112(2006) 1164–1171.
[5] H.L.Mao,S.Y.Sun,J.L.Yao,C.R.Wang,S.B.Yu,C.G.Xu,X.H.Li,Q.F.Zhang,Linking differential domain functions of the GS3 protein to natural variation of grain size in rice,Proc.Natl.Acad.Sci.U.S.A.107 (2010) 19579–19584.
[6] X.J.Song,W.Huang,M.Shi,M.Z.Zhu,H.X.Lin,A QTL for rice grain width and weight encodes a previously unknown RING-type E3 ubiquitin ligase,Nat.Genet.39(2007) 623–630.
[7] A.Shomura,T.Izawa,K.Ebana,T.Ebitani,H.Kanegae,S.Konishi,M.Yano,Deletion in a gene associated with grain size increased yields during rice domestication,Nat.Genet.40(2008) 1023–1028.
[8] K.Ishimaru,N.Hirotsu,Y.Madoka,N.Murakami,N.Hara,H.Onodera,T.Kashiwagi,K.Ujiie,B.Shimizu,A.Onishi,H.Miyagawa,E.Katoh,Loss of function of the IAA-glucose hydrolase gene TGW6 enhances rice grain weight and increases yield,Nat.Genet.45(2013) 707–711.
[9] Y.B.Li,C.C.Fan,Y.Z.Xing,Y.H.Jiang,L.J.Luo,L.Sun,D.Shao,C.J.Xu,X.H.Li,J.H.Xiao,Y.Q.He,Q.F.Zhang,Natural variation in GS5 plays an important role in regulating grain size and yield in rice,Nat.Genet.43(2011) 1266–1269.
[10] S.K.Wang,K.Wu,Q.B.Yuan,X.Y.Liu,Z.B.Liu,X.Y.Lin,R.Z.Zeng,H.T.Zhu,G.J.Dong,Q.Qian,G.Q.Zhang,X.D.Fu,Control of grain size,shape and quality by OsSPL16 in rice,Nat.Genet.44(2012) 950–954.
[11] J.D.Edwards,J.Janda,M.T.Sweeney,A.B.Gaikwad,B.Liu,H.Leung,D.W.Galbraith,Development and evaluation of a high-throughput,low-cost genotyping platform based on oligonucleotide microarrays in rice,Plant Methods (2008),http://dx.doi.org/10.1186/1746-4811-4-13.
[12] X.S.Wang,X.Q.Zhao,J.Zhu,W.R.Wu,Genome-wide investigation of intron length polymorphisms and their potential as molecular markers in rice (Oryza sativa L.),DNA Res.12 (2005) 417–427.
[13] M.G.Murray,W.F.Thompson,The isolation of high molecular weight plant DNA,Nucl.Acids Res.8(1980) 4321–4325.
[14] P.Y.Sun,J.L.Liu,Y.Wang,N.Jiang,S.H.Wang,Y.S.Dai,J.Gao,Z.Q.Li,S.J.Pan,D.Wang,W.Li,X.L.Liu,Y.H.Xiao,E.M.Liu,G.L.Wang,L.Y.Dai,Molecular mapping of the blast resistance gene Pi49 in the durably resistant rice cultivar Mowanggu,Euphytica 192(2013)45–54.
[15] L.Zuo,S.C.Li,M.G.Chu,S.Q.Wang,Q.M.Deng,L.Ding,J.Zhang,Y.Wen,A.P.Zheng,P.Li,Phenotypic characterization,genetic analysis,and molecular mapping of a new mutant gene for male sterility in rice,Genome 51 (2008) 303–308.
[16] E.Lander,P.Green,J.Abrahamson,Mapmarker:an interactive computer package for constructing primary genetics linkage maps of experimental and natural populations,Genetics 1(1987)174–181.
[17] J.H.Kim,D.Choi,H.Kende,The AtGRF family of putative transcription factors is involved in leaf and cotyledon growth in Arabidopsis,Plant J.36 (2003) 94–104.
[18] Y.Lan,N.Su,Y.Shen,R.Z.Zhang,F.Q.Wu,Z.J.Cheng,J.L.Wang,X.Zhang,X.P.Guo,C.L.Lei,J.Wang,L.Jiang,L.Mao,J.M.Wan,Identification of novel MiRNAs and MiRNA expression profiling during grain development in indica rice,BMC Genomics 13(2012),http://dx.doi.org/10.1186/1471-2164-13-264.
[19] Z.F.Li,J.M.Wan,J.F.Xia,H.Q.Zhai,Mapping quantitative trait loci underlying appearance quality of rice grains(Oryza sativa L.),Acta Genet.Sin.30 (2003) 251–259.
[20] X.Y.Wan,J.M.Wan,J.F.Weng,L.Jiang,J.C.Bi,C.M.Wang,H.Q.Zhai,Stability of QTLs for rice grain dimension and endosperm chalkiness characteristics across eight environments,Theor.Appl.Genet.110(2005)1334–1346.
[21] E.M.Septiningsih,J.Prasetiyono,E.Lubis,T.H.Tai,T.Tjubaryat,S.Moeljopawiro,S.R.McCouch,Identification of quantitative trait loci for yield and yield components in an advanced backcross population derived from the Oryza sativa variety IR64 and the wild relative O.rufipogon,Theor.Appl.Genet.107 (2003) 1419–1432.
[22] G.M.He,X.J.Luo,F.Tian,K.G.Li,Z.F.Zhu,W.Su,X.Y.Qian,Y.C.Fu,X.K.Wang,C.Q.Sun,J.S.Yang,Haplotype variation in structure and expression of a gene cluster associated with a quantitative trait locus for improved yield in rice,Genome Res.16 (2006) 618–626.
[23] M.Li,D.Tang,K.J.Wang,X.R.Wu,L.L.Lu,H.X.Yu,M.H.Gu,C.J.Yan,Z.K.Cheng,Mutations in the F-box gene LARGER PANICLE improve the panicle architecture and enhance the grain yield in rice,Plant Biotechnol.J.9 (2011) 1002–1013.
[24] D.Heang,H.Sassa,An atypical bHLH protein encoded by POSITIVE REGULATOR OF GRAIN LENGTH 2 is involved in controlling grain length and weight of rice through interaction with a typical bHLH protein APG,Breed.Sci.62(2012)133–141.