Hong Zhou ,Wei Luo ,Shng Go ,Jin M ,Meixue Zhou ,Yuming Wei ,Youling Zheng ,Yxi Liu ,*,Chunji Liu
a Triticeae Research Institute,Sichuan Agricultural University,Wenjiang,Chengdu 611130,Sichuan,China
b CSIRO Agriculture Flagship,306 Carmody Road,St.Lucia,QLD 4067,Australia
c School of Land and Food and Tasmanian Institute of Agriculture,University of Tasmania,Hobart 7250,Australia
ABSTRACT Kernel weight (KW),together with kernel number per unit area,determines yield of cereal crops.Here,two barley recombinant inbred lines (RILs) populations with a shared parent were used to identify loci controlling KW.One is Baudin/AWCS276 (BA) for which a linkage map was available.Several largeeffect QTL controlling KW were detected in this population.Another is Morex/AWCS276(MA).A linkage map with 5273 makers formed 1454 clusters,was constructed by the genotyping by sequence(GBS)data of 201 RILs from this population.A single marker was selected to represent each of the clusters and the linkage map constructed with these markers covers a total length of 1022.4 cM with an average interval of approximately 0.7 cM between loci.Three of the large-effect loci controlling KW(located on 2HL,6HL,and 7HL,respectively)identified from the BA population were also detected in the MA population under different environments.The locus on 6HL was detected in each of the experiments conducted for both populations thus was selected for developing near isogenic lines(NILs).Apart from KW,the two isolines for each pair of the putative NILs obtained showed no significant difference for any of the morphological characteristics assessed.The average difference in KW between the isolines for all the NILs obtained was about 15%based on assessments under both glasshouse and field conditions.Taken advantage that high quality genome assemblies for both Morex and AWCS276 are available,we identified candidate genes underlying two of the three loci based on an orthologous analysis.The NILs developed and the candidate genes identified in this study should facilitate the cloning and functional analysis of genes regulating KW in barley.
Keywords:Barley Kernel weight Near isogenic lines AWCS276
Barley (Hordeum vulgareL.) was one of the first domesticated grains and it is widely cultivated in temperate regions worldwide.The global production of barley in the 2017–2018 crop years was about 142 million metric tons.Barley is commonly used in beverage production,as animal feed,as well as edible grain in bread,cereal and soup[1].Like other cereal crops,barley yield is determined by kernel weight(KW)and kernel number harvested per unit area.KW also affects both malt yield and feed quality in barley [2,3].
Similar to the situations in other crop species,KW in barley has been intensively investigated [4–7].Results from QTL mapping studies indicate that the genetic control of KW is likely to be highly quantitative as several loci can routinely be detected from a single population [7].Further,results from previous studies also show that strong interactions may exist between a targeted characteristic and other traits [8–11].In regarding to KW in barley,it can be affected by grain size,endosperm hardness and grain density [5].It can also be affected by inflorescence structure [7,12–14].Due to the heterogeneity in genetic background in segregating populations routinely used for QTL mapping,lines containing a given allele do not always perform better than those containing the alternative allele thus average performances between lines with or without a target allele must be compared.
The existence of possible interactions between the characteristic under study and other traits mean that QTL detected in a mapping study can only be treated as putative and they need to be validated [15–17].Near isogenic lines (NILs) are powerful genetic stocks that can be effectively used to validate and evaluate the effects of a given locus in different genetic backgrounds[11,15,16].With the availability of linked markers,the method based on the heterogeneous inbred family (HIF) analysis has been widely used in generating NILs for various characteristics[11,15,18].
In the study reported here,we mapped loci controlling KW in barley using two populations of recombinant inbred lines (RILs).One of the populations was generated from two genotypes for which the whole genome assemblies were available,making it possible to identify candidate genes for each of the loci detected through sequence homology analysis.We also generated NILs targeting the most stable locus detected in both populations and used them to validate the effect of this loci in different genetic backgrounds.
Two populations of RILs were used to detect QTL for KW in this study.AWCS276,a wild barley (H.spontaneumL.) was the shared parent between the two populations (Baudin/AWCS276 (BA) and Morex/AWCS276 (MA)).The linkage map for the BA population was reported in a previous study[19].The MA population,containing 201 RILs,was generated from a cross between Morex and AWCS276 using the single-seed-descendent method with eight cycles of self-pollination based on the accelerated generation technique [20].
NILs were developed based on the method of HIF described earlier [15,21].A single marker linked closely to the targeted locus was selected for developing the NILs.PCR reactions for the marker(Table S1) were performed in 25 μL mixtures containing 50 ng genomic DNA,0.4 μmol L-1of forward and reverse primer,3 mmol L-1MgCl2,1 mmol L-1dNTPs,and 2.5 UTaqDNA polymerase.The PCR program included an initial denaturation at 94 °C for 5 min,followed by 35 cycles of 94 °C for 1 min,60 °C for 1 min,72 °C for 1 min,and a final extension at 72 °C for 10 min.The PCR products were mixed with Gel-Red stain and separated on 2.5% agarose gels.The marker was used to identify heterozygous plants from F4lines of the MA population.The identified plants were self-pollinated,and eight plants derived from each of them were used for the next round of selection.This process of identifying heterozygous individuals and self-pollination was repeated until F9generation.Two lines,one with the allele from Morex and the other with the allele from AWCS276,were then selected from each of the heterozygous individuals and treated as a putative NIL pair.
The BA population was evaluated in three consecutive seasons at the Wenjiang Research Station in Chengdu,China(30°36′N/103°41′E) in 2013 (WJ13),2014 (WJ14),and 2015(WJ15),respectively.Each of these experiments consisted of two replicates.Each replicate had ten plants grown in a single row(1.5 m in length),and distance between rows was 15 cm.Five plants from the middle of each row were harvested for measuring KW.
The MA population was assessed once in a field trial and twice in the glasshouses.The field trial was conducted at the CSIRO Gatton Research Station(27°33′S,152°16′E) in June 2019.It consisted of two experiments,one with 4-week-vernalized seedlings(experiment GA19R1)and the other 6-week-vernalized seedlings(experiment GA19R2).Seeds were germinated in Petri dishes on three layers of filter paper saturated with water and then placed in a 6 °C cold room with constant lighting for vernalization.Each of the experiments contained two replicates,each with ten vernalized seedlings spaced(20 cm)planted in a single row.Row spacing was 25 cm.The two glasshouse trials were all conducted at the Queensland Bioscience Precinct (QBP) in Brisbane,one in December 2018(experiment GH18)and the other in July 2019(experiment GH19).Each of the glasshouse experiments consisted of three replicates.Three plants,each in a separate 2.0 L pot,were used in each of the replicates.A randomized complete block design was used for each of the experiments.Seeds were harvested from mature plants and dried in a 32 °C oven for ten days before being used for measuring KW.KW was based on the average of three replicated measurements of 100 randomly chosen kernels from each line used in these experiments.
The NILs were assessed in two experiments,one in a QBP glasshouse (experiment GH19) and the other at the CSIRO Gatton Research Station in June 2019(experiment GA19).GH19 consisted of three replicates,each with five plants in different pots of 2.0 L.GA19 also consisted of three replicates,each with five plants growing in a single row.The plants within each row have a spacing of 20 cm,and the distance between rows was 25 cm.Kernel length(KL),kernel width (KWI),flag leaf width (FLW),flag leaf length(FLL),plant height (PH),spike exertion length (PEL),spike length(SL),tiller number (TN),and kernel number per spike (KNS) were measured.Measurements were taken from the main tillers of the middle three plants in each row and used for analysis.
Significance test was conducted using the Student’st-test.Broad-sense heritability (H2) for each trait was estimated aswhereis the genetic variance,is the G × E variance,is the error,n is the number of environments,and r is the number of replicates[22].Theandvalues were obtained based on the analysis of variance (ANOVA) using SAS 9.2 (SAS Institute Inc.,Cary,NC,USA).
For the MA population,total genomic DNA was isolated from young leaves of all the RILs and the two parents using the CTAB method.The GBS data were generated by the Agriculture Victoria Research,Australia.Paired-end reads for each sample were error corrected and overlapped to produce single reads using the PEAR software with default parameters [23].PEAR sequences from each sample were trimmed of the GBS probe sequences,concatenated and filtered to produce a unique list of allele sequences referred to as the population-specific allele-specific reference (psASR).Filtering applied included rejecting allele sequences having a frequency of occurrence of <4 in any sample or a minor allele frequency of<0.25 across the samples.The psASR sequences where aligned to the Morex genome assembly using the GYDLE software(Gydle Inc.).Alignment parameters included requirement for a minimum of 70% homology between the psASR sequence and reference genome across 95% of the psASR sequence.Exhaustive searching of the alignment space ensured that GYDLE returned the best alignment for each psASR sequence.Alignment information for each psASR sequence was used to identify alleles originating from the same locus in order to facilitate codominant genotype calling.Those psASR sequences not aligned to the Morex genome assembly or had multiple hits with equal alignment score were classified as unmapped loci and treated as dominant markers for genotype calling.PEAR sequences for each sample were aligned to the psASR using GYDLE software using parameters that allowed up to 2 mismatches and guaranteed reporting of the best alignment.Those PEAR sequences having more than one hit to the psASR with the same alignment score were discarded.Genotype calling for mapped loci with multiple alleles was performed using a custom script that implements Bayesian probability statistics to call codominant genotypes based on the number of PEAR sequences aligned to each allele at a given locus.Genotype calling for mapped loci with only one allele (i.e.monomorphic loci) or unmapped loci was performed using a custom script that applies a minimum read coverage of 2 to call dominant genotypes.Raw SNP calls with minor allele frequency (MAF) of >0.05 in the biparental population were concatenated into a single variant call file(VCF)and used for linkage map construction.SNPs with more than 15% missing rate and those that were monomorphic between the two parents were eliminated.The retained SNPs were sorted into groups using MSTMap in the R/ASMap package with the default parameters [24].
QTL analysis was conducted using MAPQTL 6.0 [25].A test of 1000 permutations was used to identify the LOD threshold,corresponding to a genome-wide false discovery rate of 5% (P<0.05).
QTL detected in multiple environments were used to delineate the physical intervals.Tag sequences in GBS dataset were used to Blast against the Morex Assembly of RefSeq v2.0 [26] to get physical positions.Gene lists in the candidate regions were downloaded from the website ftp://ftp.ensemblgenomes.org/pub/release-44/pl ants/gff3/ hordeum_vulgare.At the same time,sequences of kernel-related genes from rice were collected and used to blast against Morex genes in the putative QTL intervals.Candidate genes detected in Morex were then used to Blast against the genome assembly of AWCS276[27]to detect sequence differences between the two parents of the MA population.
Of the three parental genotypes for the two RIL populations used in this study,AWCS276 has the lowest KW (Table 1;Fig.1).Transgressive segregation with normal distribution for KW was detected in each of the experiments conducted for both populations.Phenotypic variation coefficients ranged from 18.1% to 20.3% among the experiments for the BA population and ranged from 19.4% to 23.8% for the MA population (Tables 1,S2,and S3;Fig.S1).The broad-sense heritability was 0.681 and 0.684 for BA and MA,respectively,suggesting that genetic effects are the major determinant of the phenotypic variance on KW.
Approximately 173 million sequencing reads were obtained from 201 RILs of the MA population.The numbers of reads for each of these lines varied from 301,340 to 2178,836 with an average of nearly 0.85 million.A total of 21,598 SNPs with >25% minimum allele frequency (MAF) were identified from the GBS data.Following the discarding of those monomorphic and those missing more than 15% of the values,5273 markers were retained and they formed 1454 clusters.A single marker was used to represent each of these clusters in linkage map construction.They formed seven linkage groups corresponding to each of the chromosomes.The genetic lengths of the linkage groups varied from 115.7 cM (4H)to 178.2 cM (2H).The total length of the linkage map is about 1022.4 cM with an average distance of 0.7 cM between markers(Tables S4 and S5).SNP distribution and characteristics of the linkage map are presented in detail in Tables S3 and S4.For the BA population,we used the genetic map reported in a previous study[19].
Five QTL for KW were detected from the BA population from the experiments conducted over the three seasons.They were located on chromosomes 2H,3H,3H,6H,and 7H,respectively.These QTL contributed between 10.5 (KW-BA-2H) and 30.3% (KW-BA-3H.2)of the phenotypic variance among the experiments.Only one of them,KW-BA-6H,was constantly detected in each of the experiments.LOD scores for this locus varied from 3.8 to 5.8 among the three experiments (Table 2;Fig.S2).
There were also five QTL for KW detected from the MA population.They located on 2H,3H,6H,7H,and 7H,respectively.Two of them,includingKW-MA-2HandKW-MA-6H,were detected in each of the four experiments.KW-MA-3Hon chromosome arm 3HS was detected in two of the experiments.KW-MA-7H.1on chromosome arm 7HS andKW-MA-7H.2on 7HL were both detected in only one of the experiments (Table 2;Fig.2).
Three of the QTL detected between the two populations likely shared similar physical locations:KW-BA-2HandKW-MA-2Hlocated in the interval of 627–700 Mb on chromosome arm 2HL,KW-BA-6HandKW-MA-6Hin the interval of 450–545 Mb on 6HL,andKW-BA-7HandKW-MA-7H.2in the interval of 593–637 Mb on 7HL.These were the three loci we focused on in further analyses in the study reported here and they were named asKW-2H,KW-6H,andKW-7H,respectively.The alleles for increased KW for all the three QTL were derived from the female parents (Baudin and Morex,respectively).
NILs targeting the locusKW-6Hwere obtained based on the approach of HIF analysis [32] with the use of one closely linked marker.Nine pairs of putative NILs were obtained.Significant difference in KW between the isolines for seven of these putative NILs was detected in both the assessments conducted.The differences between the isolines for each of the NIL pairs ranged from 9.7%to 23.3%with an average of 15.5%(Table 3).Consistent differences for any of the other characteristics assessed (including KL,KWI,FLW,PEL,SL,TN,FLL,PH,and KNS) were not detected between the isolines for any of the NIL pairs (Table 3;Fig.S3).
Sequences for 71 kernel-related genes from rice were identified.They were used to BLAST the assembled genomes of Morex and AWCS276.This analysis detected three orthologs (includingHORVU2Hr1G102800,HORVU2Hr1G096350,andHORVU2Hr1G099220) in theKW-2Hinterval.These genes were orthologous to rice genesSPL33,SMG2,andSPL29,respectively.Sequence similarities of the three genes between rice and barley ranged from 75.1% to 85.9%.There were 2 SNPs (G →A) between Morex and AWCS276 in an exon ofHORVU2Hr1G102800,and they were both synonymous mutations.Three SNPs in exons and a 260 bp InDel in the second intron were detected forHOR-VU2Hr1G096350between the two parental genotypes.They included one C →G transversion and two T →C transitions.Of these three SNPs,only one was non-synonymous and it produces an amino acid residue substitution at position 763(Proline →Alanine).Two SNPs (G →A) were found in an exon ofHORVU2Hr1G099220between the two parents.The one at position 1411 produced an amino acid residue substitution(Alanine → Threonine).Only a single ortholog (HOR-VU6Hr1G068370) was identified in theKW-6Hinterval and it is homologous to the rice geneOsGRF4with a sequence similarity of 62.0%.There was only one SNP (G →T) in the last exon which leaded to an amino acid substitution (Aspartic acid →Tyrosine).There were also three insertion-deletions in the upstream region of the translation start codon between the two parental genotypes.No orthologs controlling KW in rice were detected in the interval harbouring theKW-7Hlocus (Table 4;Fig.3).
Table 1 Phenotypic performance and distribution of kernel weight among the three parents and two populations of recombinant inbred lines used in this study.
Table 2 QTL for barley kernel weight detected in each of the two populations of recombinant lines in each of the environments.
Table 3 Difference in various characteristics measured between the two isolines for each of the seven pairs of near isogenic lines targeting the kernel weight locus on chromosome arm 6HL.
Table 4 High confidence genes in the targeted intervals of barley and their orthologs in rice.
Fig.1.Seed morphologies of the three parental lines for the two RIL populations used in this study.
Fig.2.Locations of QTL for kernel weight in the Morex/AWCS276 population.QTL regions are marked with bars in different colours.QTL detected in GH18 (red),in GH19(green),in GA19R2 (blue),in GA19R1 (pink).
KW is a key component affecting not only kernel yield but also malting quality in barley [2,3].By assessing two populations of recombinant inbred lines with a shared parent,we investigated the genetic control of KW in the study reported here.One of these populations was developed as part of this study between two genotypes for which whole genome assemblies were available.GBS data for 201 of the RILs from the MA population was used in generating the linkage map which had an average interval of about 0.7 cM between loci.Three of the large-effect loci controlling KW,located on 2HL,6HL,and 7HL,respectively,were detected from both populations.NILs targeting the locus on 6HL were generated based on the method of HIF with the use of a single marker.Apart from KW,significant differences for any other characteristics assessed were not detected between the two isolines for any of the NIL pairs obtained.Significant differences in KW between the isolines were detected for seven of the NIL pairs.The presence of the AWCS276 allele reduced the KW by an average of 15.3%based on results from the glasshouse assays and by an average of 15.5%based on results from the field assay.
It is of interesting to note that,although many loci conditioning KW have been reported,none of them seems to be more distally located than any of the three loci reported here.The locus on 2HL is in a similar region as those described by Xue et al.[28],Lex et al.[29],and Wang et al.[30].Ren et al.[39] also reported a locus on this chromosome arm but it was about 130 Mb more proximally located.The locus on 6HL locates in a similar region as the ones reported by Walker et al.[5]and Lex et al.[29].Arifuzzaman et al.[31]also reported a locus on this chromosome arm butit located about 100 Mb more proximally located than the one described here.The one on chromosome arm 7HL locates in a similar region as those reported by Arifuzzaman et al.[31] and Wang et al.[30].Ren et al.[39]also reported a locus on this chromosome arm but it located about 105 Mb proximally compared with the one described here.It is well known that recombination rates can vary among populations [40–42] but they do not explain whether the more distal locations for all the three loci reported here are simply a coincidence.
The initial method of HIF for developing NILs was based on the use of flanking markers.If both the flanking markers are closely linked with their targeted locus,the rate of failure for this approach should be rare.However,linkage drag can be a significant issue when flanking markers derived from QTL mapping are used.This is because that the minimum difference between the two isolines for a given NIL pair would be the full length of the chromosomal fragment flanked by the markers.The fragment can be large if the markers are not tightly linked with the targeted locus,which is often the case when only QTL mapping results are available in selecting the markers.To obtain NILs with reduced sizes of the chromosome segments differentiating the isolines,a single marker linked with a targeted locus can be used [15,18].However,recombination between a marker and its targeted locus could occur thus some of the NILs obtained using a single linked marker could be false which do not segregate at the targeted locus.This is the likely reason why,similar to the results obtained in previous studies [18,43],two of the putative NIL pairs generated in this study failed to show significant difference between their isolines for KW.
Fig.3.Gene structures and genome sequence alignments of candidate genes for KW-2H and KW-6H between Morex and AWCS276.
Sequence comparisons between Morex and AWCS276 for genes with known effects on KW in rice identified candidate genes underlying KW for two of the three loci reported in this study.Assessing the functions of these candidate genes could lead to rapid isolation of genes underlying these loci.These results demonstrate the huge advantage of genetic mapping based on populations for which the genome assemblies of their parental genotypes are available.With the rapid advancement in sequencing capacity,it should not take long before this approach can be routinely used for crop species with large and complex genomes thus dramatically enhancing our ability to identify and clone genes from such species.
CRediT authorship contribution statement
Chunji Liu,Youliang Zheng,Yuming Wei,Meixue Zhou,Jian Ma,and Yaxi Liu:conceived the study.Hong Zhou,Wei Luo,and Shang Gao:collected and analysed data.Hong Zhou and Chunji Liu:prepared the manuscript with contribution from other authors.
Declaration of competing interest
Authors declare that there are no conflicts of interest.
Acknowledgments
This study was supported by the National Natural Science Foundation of China(31771794),the National Key Research and Development Program of China (2017YFD0100900 and 2016YFD0101004),the Outstanding Youth Foundation of the Department of Science and Technology of Sichuan Province(2016JQ0040),and the International Science&Technology Cooperation Program of the Bureau of Science and Technology of Chengdu China(2015DFA306002015-GH03-00008-HZ).The authors are also grateful to Drs.Matt Hayden and Kerrie L.Forrest at Agriculture Victoria Research for helping in generating the GBS data on the mapping population.Hong Zhou,Wei Luo,and Shang Gao are grateful to the Sichuan Agricultural University and the China Scholarship Council for funding their visit to CSIRO.
Appendix A.Supplementary data
Supplementary data for this article can be found online at https://doi.org/10.1016/j.cj.2020.07.010.