Skip to main content

Analysis of Population Structure and Genetic Diversity in Rice Germplasm Using SSR Markers: An Initiative Towards Association Mapping of Agronomic Traits in Oryza Sativa



Genetic diversity is the main source of variability in any crop improvement program. It serves as a reservoir for identifying superior alleles controlling key agronomic and quality traits through allele mining/association mapping. Association mapping based on LD (Linkage dis-equilibrium), non-random associations between causative loci and phenotype in natural population is highly useful in dissecting out genetic basis of complex traits. For any successful association mapping program, understanding the population structure and assessing the kinship relatedness is essential before making correlation between superior alleles and traits. The present study was aimed at evaluating the genetic variation and population structure in a collection of 192 rice germplasm lines including local landraces, improved varieties and exotic lines from diverse origin.


A set of 192 diverse rice germplasm lines were genotyped using 61 genome wide SSR markers to assess the molecular genetic diversity and genetic relatedness. Genotyping of 192 rice lines using 61 SSRs produced a total of 205 alleles with the PIC value of 0.756. Population structure analysis using model based and distance based approaches revealed that the germplasm lines were grouped into two distinct subgroups. AMOVA analysis has explained that 14 % of variation was due to difference between with the remaining 86 % variation may be attributed by difference within groups.


Based on these above analysis viz., population structure and genetic relatedness, a core collection of 150 rice germplasm lines were assembled as an association mapping panel for establishing marker trait associations.


Rice, being the staple food crop for more than 50 % of the world population is cultivated in 163 million hectares with the production of 491 million tonnes. About 90 % of the world’s rice is produced in Asia and India contributes 20 % of the world’s production. This record level production and productivity is due to the availability and exploitation of rich genetic diversity existing in rice germplasm of India. For precise genetic manipulation of complex quantitative traits like, yield, tolerance against biotic/abiotic stresses, quality etc., understanding the genetic/molecular basis of target traits needs to be investigated thoroughly.

The genetic basis of important agronomic traits has been unraveled through Quantitative Trait Loci (QTL) mapping either through linkage mapping (bi-parental mapping populations) or through LD mapping (natural populations). Although traditional linkage based QTL-mapping has become an important tool in gene tagging of crops, it has few limitations viz., 1) classical linkage mapping involves very high cost; 2) it has low resolution as it can resolve only a few alleles and 3) it has limitations towards fine mapping of QTLs as it needs BC-NILs. These limitations can be overcome by the LD based approach of “Association Mapping” using the natural populations. Association mapping serves as a tool to mine the elite genes by structuring the natural variation present in a germplasm. It was successfully exploited in various crops such as rice, maize, barley, durum wheat, spring wheat, sorghum, sugarcane, sugarbeet, soybean, grape, forest tree species and forage grasses (Abdurakhmonov and Abdukarimov 2008).

Before performing an association analysis in a population, it is essential to determine the population structure which can reduce type I and II errors in association mapping due to unequal allele frequency distribution between subgroups that causes spurious association between molecular markers and trait of interest (Pritchard et al. 2000). Similar attempts were recently undertaken to define population structure in rice using different germplasm lines and by developing core collection from national collections and international collections (Ebana et al. 2008; Jin et al. 2010; Zhang et al. 2011; Agrama et al. 2010 and Liakat Ali et al. 2011). Simple Sequence repeat (SSR) markers have been commonly used in genetic diversity studies in rice because of high level of polymorphism which helps to establish the relationship among the individuals even with less number of markers (McCouch et al. 1997). For similar studies, SSR markers were used alone by Jin et al. (2010); Hesham et al. (2008); Sow et al. (2014); Das et al. (2013) and Choudhury et al. (2013) or along with SNP markers by Courtois et al. (2012) and Zhao et al. (2011). The objectives of this present study were to evaluate the genetic variation and to examine the population structure of 192 rice germplasm accessions that comprises of local landraces, improved varieties and exotic lines from diverse origin.


Genetic Diversity

All the 192 rice germplasm lines were genotyped using 61 SSR (microsatellite) markers which produced a total of 205 alleles (Additional file 1: Figure S1). Among these 205 alleles, 5 % were considered as rare (showed an allele frequency of < 5 %). The number of alleles per loci varied from 2 to 7 with an average of 3 alleles per locus. The highest number of alleles were detected for the loci RM316 (7) and the lowest was detected for a group of markers viz., RM171, RM284, RM455, RM514, RM277, RM 5795, HvSSR0247, RM 559, RM416 and RM1227. PIC value represents the relative informativeness of each marker and in the present study, the average PIC value was found to be 0.468. The highest genetic diversity is explained by the landraces included in this study with the mean PIC value of 0.416. PIC values ranged between 0.146 for RM17616 to 0.756 for RM316. Heterozygosity was found to be very low which may be due to autogamous nature of rice. Expected heterozygosity or Gene diversity (He) computed according to Nei (1973) varied from 0.16 (RM17616) to 0.75 (RM287) with the average of 0.52 (Table 1).

Table 1 Details of SSR loci used for genotyping in the 192 rice accessions and their genetic diversity parameters


Population structure of the 192 germplasm lines was analysed by Bayesian based approach. The estimated membership fractions of 192 accessions for different values of k ranged between 2 and 5 (Fig. 1). The log likelihood revealed by structure showed the optimum value as 2 (K = 2). Similarly the maximum of adhoc measure ΔK was found to be K = 2 (Fig. 2), which indicated that the entire population can be grouped into two subgroups (SG1 and SG2). Based on the membership fractions, the accessions with the probability of ≥ 80 % were assigned to corresponding subgroups with others categorized as admixture (Fig. 3).

Fig. 1

Pattern of variation of 192 accessions based on 61 SSR markers. The K values are based on the run with highest likelihood. Bar length represent the membership probability of accessions belonging to different subgroups

Fig. 2

Population structure of 192 accessions based on 61 SSR markers (K = 2) and Graph of estimated membership fraction for K = 2. The maximum of adhoc measure ΔK determined by structure harvester was found to be K = 2, which indicated that the entire population can be grouped into two subgroups (SG1 and SG2)

Fig. 3

Population structure of 192 accessions arranged based on inferred ancestry. Based on the membership fractions, the accessions with the probability of ≥ 80 % were assigned to corresponding subgroups with others categorized as admixture

SG1 consisted of 134 accessions with most of the landraces and varieties of Indian origin and SG2 consisted of 38 accessions which composed of non Indian accessions. Twenty accessions were retained to be admixture. The subgroup SG1 was dominated by indica subtype whereas the subgroup SG2 consisted mostly of japonica group. When the number of subgroups increased from two to five, the accessions in both the subgroups were classified into sub-sub groups (Table 2). As SG1 consisted of 134 accessions mostly of Indian origin, an independent STRUCTURE analysis was performed for this subgroup. ΔK showed its maximum value for K =3 which indicated that SG1 could be further classified into three sub-sub groups (Fig. 4). The differentiation in origin and seasonal differentiation of rice varieties contributed for this clustering.

Table 2 Population structure group of accessions based on Inferred ancestry values
Fig. 4

Population structure of 134 accessions in sub group-1 and membership probability of assigning genotypes of sub group-1 (K = 3)

Clustering analysis based on Unweighted Pair Group Method with Arithmetic Mean (UPGMA) method using DARwin separated the accessions into two main groups which showed similar results as STRUCTURE analysis. The group I in UPGMA tree consists of both indigenous and agronomically improved varieties whereas the other group consists of exotic accessions. In UPGMA tree, the accessions within group 1 and 2 clustered into smaller sub groups based on their origin and types. Most of the landraces and varieties have been clustered in upper branches of the tree whereas the exotic accessions have been clustered in lower branches of the tree (Fig 5). Hence the clustering analysis by two classification methods revealed high level of similarity in clustering the genotypes. PCoA was used to characterize the subgroups of the germplasm set. A two- dimensional scatter plot involving all 192 accessions has shown that the first two PCA axes accounted for 12.6 and 4.9 % of the genetic variation among populations (Fig 6).

Fig. 5

Unrooted neighbour joining tree of 192 rice varieties. The landraces and varieties used in the study has clustered in the upper branches of the tree whereas the exotic accessions has positioned in the lower branches of the tree

Fig. 6

Principal Coordinates of 192 accessions based on 61 SSR loci. Coord 1 and Coord 2 represent first and second coordinates, respectively. The two PCA axes accounted for 12.6 and 4.9 % of the genetic variation among populations

Genetic Variance Analysis

The hierarchial distribution of molecular variance by AMOVA and pair-wise analysis revealed highly significant genetic differentiation among the groups. It revealed that 14 % of the total variation was between the groups, while 86 % was among individuals within groups (Tables 3 and 4). Calculation of Wright’s F statistic at all SSR loci revealed that FIS was 0.50 and FIT was 0.56. Determination of FST for the polymorphic loci across all accessions has shown FST as 0.14 which implies high genetic variation (Table 4). The pairwise FST estimate among sub-groups has indicated that the two groups are significantly different from each other (Table 3).

Table 3 AMOVA between groups and Pair wise comparison using Fst values (GenAlEx)
Table 4 AMOVA between groups and accessions and Fixation indices (Arlequin software)


Genetic diversity is the key determinant of germplasm utilization in crop improvement. Population with high level of genetic variation is the valuable resource for broadening the genetic base in any breeding program. The panel of 192 accessions in this study with landraces, varieties as well as breeding lines has different salient agronomic traits. Few landraces included in this study i.e., Mappillai samba (Krishnanunni et al. 2015), Jyothi, Njavara (Deepa et al. 2008), Kavuni (Valarmathi et al. 2015) derived breeding line has therapeutic properties. Many lines included in this study are drought tolerant (Nootripathu, Norungan, Vellaikudaivazhai, kallundaikar, kodai, kalinga 3, Kinandang patong, azucena, mattaikar, IR65907-116-1, karuthakar, mattakuruvai, manavari, kallundai, kodaikulathan, kattikar, poongar, thogai samba, vellaikattai, kattukuthalam, kalvalai, chivapu chithiraikar, vellai chithiraikar, kudaivazhai and murugankar). Few lines have significant level of micronutrients in it (Nachimuthu et al. 2014). This panel has its importance because of its major component as traditional landraces with valuable agronomic traits that are cultivated in the small pockets of Tamil Nadu, India.

Molecular markers help us to understand the level of genetic diversity that exists among traditional races, varieties and exotic accessions which can be exploited in rice breeding programs. The genetic architecture of diverse germplasm lines can be precisely estimated by assessing the STRUCTURE of the population using molecular markers viz., SSRs or SNPs etc., (Horst and Wenzel 2007; Powell et al. 1996; Varshney et al. 2007). In this study, the genetic diversity among the accessions was evaluated by model based clustering and distance based clustering approach using the SSR genotypic data.

Regarding genetic divergence of the population consisting of local landraces, exotic cultivars and breeding lines, 61 polymorphic markers have detected a total of 205 alleles across 192 individuals. The number of alleles varied from 2 to 7 per locus and the average was 3 alleles per locus. Several previous reports have indicated the number of alleles per locus, polymorphic information content and gene diversity of 4.8–14.0, 0.63–0.70 and 6.2–6.8 respectively (Garris et al. 2005; Ram et al. 2007). In the current study, the average number of alleles (3 alleles/locus) is slightly lesser than the average number of alleles (3.88 alleles/ locus) reported by Zhang et al. (2011) in rice core collection with 150 rice varieties from south Asia and Brazil and Jin et al. (2010) who has reported the average alleles per locus as 3.9 in 416 rice accessions collected from China. Using three sets of germplasm lines (Thai (47), IRRI germplasm (53) amd other Oryza species (5)), Chakhonkaen et al. (2012) has reported 127 alleles for all loci, with a mean of 6.68 alleles per locus, and a mean Polymorphic Information Content (PIC) of 0.440 by screening with 19 InDel markers.

Chen et al. (2011) has reported the average gene diversity of 0.358 and polymorphic information content of 0.285 from 300 rice accessions from different rice growing areas of the world with 372 SNP markers. The gene diversity detected in this study (0.52) is comparable to overall gene diversity of rice core collection (0.544) from China, North Korea, Japan, Philippines, Brazil, Celebes, Java, Oceanina and Vietnam (Zhang et al. 2011) and it is higher than US accession panel with average gene diversity of 0.43 (Agrama and Eizenga 2008) and Chinese rice accession panel by Jin et al. (2010) with the average gene diversity of 0.47. The gene diversity reported in our study is lesser than gene diversity (0.68) reported by (Liakat Ali et al. 2011). Most of the diversity panel with global accessions has the gene diversity of 0.5 to 0.7 (Garris et al. 2005; Liakat Ali et al. 2011; Ni et al. 2002). These results on global accessions help to infer that this diversity panel of 192 germplasm lines represents a large proportion of the genetic diversity that exists in major rice growing Asian continent.

The PIC value was 0.468 which varied from 0.146 for RM17616 with only 2 two alleles to 0.756 for RM316 that allowed the amplification of 7 alleles. The PIC value was found to be 0.418 for SG1 which had the majority of indica accessions. The subgroup SG2 dominated by japonica accessions had the PIC value of 0.414. Hence, both the subgroups contribute in a major way for population diversity. As this population encompass different rice materials i.e., landraces, varieties and breeding lines, the molecular diversity is contributed majorly by landraces. These values are similar to those found by Courtois et al. (2012) who reported the PIC value from 0.16 to 0.78 with the average of 0.49 in European rice germplasm collection and in Chinese rice collection of 416 accessions by Jin et al. (2010), who has given similar PIC value of 0.4214. It is also consistent with PIC value (0.48) attained by Zhang et al. (2011). In this study, significant amount of rare alleles was identified which indicates that these rare alleles contribute well to the overall genetic diversity of the population.

Model based approach by STRUCTURE is implemented frequently for studying population structure by various researchers (Agrama et al. 2007, Agrama and Eizenga 2008; Garris et al. 2005; Zhang et al. 2007, 2011; Jin et al. 2010; Liakat Ali et al. 2011, Chakhonkaen et al. 2012 Courtois et al. 2012, Das et al. 2013). Courtois et al. (2012) has successfully detected two subgroups in their study population and assigned rice varieties into two groups with few admixture lines. Jin et al. (2010) has identified seven sub populations among 416 rice accessions from China. Das et al. (2013) has grouped a collection of 91 accessions of rice landraces from eastern and north eastern India into four groups.

Assigning of genotypes to the subgroups based on ancestry threshold vary between different research groups. Zhao et al. (2010) and Courtois et al. (2012) used an ancestry threshold of 80 % to identify accessions belonging to a specific subpopulation. Liakat Ali et al. (2011) has steup the threshold as 60 % and identified 33 accessions as admixtures as the threshold of 80 % consider more genotypes as admixtures. In the current study, a stringent threshold of 80 % ancestry value leaves only 20 genotypes as admixtures.

Population structure analysis in different rice diversity panel has indicated the existence of two to eight sub population in rice (Zhang et al. 2007, Zhang et al. 2009, Zhang et al. 2011, Garris et al. 2005, Agrama et al. 2007, Liakat Ali et al. 2011, Chakhonkaen et al. 2012 and Das et al. 2013). In the current rice diversity panel of 192 accessions based on the criterion of maximum membership probabilities, 134 accessions were assigned to SG1 which is dominated by indica subtype with most of the landraces and varieties of Indian origin and SG2 consisted of 38 accessions which composed mostly of japonica accessions of exotic origin. Similar population structure of two subgroups was observed in previous research by Zhang et al. (2009) in a collection of 3024 rice landraces in China. Zhang et al. (2011) has reported two distinct subgroups in a rice core collection. Courtois et al. (2012) has successfully classified two subgroups as japonica and non japonica accessions in European core collection of rice. The results indicated that two subgroups are due to the different adaptation behavior of accessions to different ecological environment as indica and japonica accessions has independent evolution frame and the origin of Indian rice accessions from indica cultivars. Hence the major criterion for population structure in this panel is indica – japonica subtype. This study includes large number of traditional landraces and varieties from Indian Subcontinent and few exotic accessions randomly selected from IRRI worldwide collection. It clarifies the relationship between Indian germplasm and exotic accessions which indicates that germplasm lines varies based on its ecology and also shows higher level of genetic diversity exists within this population.

Further structure analysis of SG1 that consisted of 134 lines indicated that it can be further subdivided in to three sub sub-groups. The three sub sub-groups classification has the factor of ecosystem and seasonal variation as the major factors for population structure. This results is in accordance with the inference that indica group has higher genetic diversity than japonica accessions which was given by various researchers (Gao et al. 2005; Lu et al. 2005; Lapitan et al. 2007; Caicedo et al. 2007; Liakat Ali et al. 2011; Garris et al. 2005; Qi et al. 2006; Qi et al. 2009); as this subgroup has indica accessions. Liakat Ali et al. (2011) has substantiated this statement with the reason of the indica subpopulation occupying the largest rice growing region which has a varied environments, ecological conditions and soil type.

The result of model based analysis is in accordance with the clustering pattern of Neighbour joining tree and Principal Coordinate Analysis. The first two principal coordinates explained 12.6 and 4.8 % of the molecular variance. Similar pattern of molecular variance explanation was observed by Zhang et al. (2011) for two population subgroups.

Calculation of Wright’s F Statistic at all loci revealed the deviation from Hardy- Weinberg law for molecular variation within the population. The result of Fst indicates higher divergence existing between subgroups of the population. Higher FIT, which is measured at subgroup level in whole population, has indicated lack of equilibrium across the groups and lack of heterozygosity most likely due to the inbreeding nature of rice.

The present study revealed that several unexploited landraces of Tamil Nadu, India which is widely cultivated by the farmers in different parts of the state. Ecological and evolutionary history contributes for the genetic diversity maintained in a population. The varieties with diverse ecosystems and wide eco-geographical conditions contribute for the genetic diversity among rice varieties in this population.

For establishing a core collection for association studies, two step approach followed by Breseghello and Sorrells (2006) and Courtois et al. (2012) was used. This approach involves the determination of population structure and then sampling can be done based on the relatedness of the accessions in the population. Those accessions that show high magnitude of genetic relatedness can be eliminated to develop core collection with diverse representatives. Based on this idea, out of 192 accessions, 150 (Table 5) were selected to form association mapping panel which can be utilized either by genome wide or candidate gene specific association mapping for linking the genotypic and phenotypic variation.

Table 5 Genotypes selected for association mapping panel


This study analyze the pattern of divergence exists in a population of 192 rice accessions that constitute our rice diversity panel for association mapping. Based on various statistical methods, we identified two sub groups within 192 rice accessions selected for establishing association mapping panel. The average number of alleles per locus and gene diversity has indicated the existence of broad genetic base in this collection. The result of structure analysis is in accordance with clustering method of neighbor joining tree and principal coordinate analysis. Thus, the results of this study which indicates the genetic diversity of the accessions can be utilized to predict approaches such as association analysis, classical mapping population development; parental line selection in breeding programs and hybrid development for exploiting the natural genetic variation exists in this population.


Plant Material

A collection consisting of 192 rice accessions was used in this study, which consist of land races and varieties collected from nine different states of India as well as from Argentina, Bangladesh, Brazil, Bulgaria, China, Colombia, Indonesia, Philippines, Taiwan, Uruguay, Venezuela and United States (Table 6).

Table 6 Germplasm accessions used in the study

Microsatellite Genotyping

DNA Isolation and PCR Amplification

DNA was extracted from leaf tissue by grinding with liquid nitrogen using CTAB method (Saghai-Maroof et al. 1984.). It was diluted to a final concentration of 30 ng μl−1 for enabling polymerase chain reactions. DNA amplification parameters such as specificity, efficiency and fidelity are strongly influenced by the components of the PCR reaction and by thermal cycling conditions (Caetano-Anolles and Brant 1991). Therefore, the careful optimization of reaction components and conditions will ultimately result in more reproducible and efficient amplification. The concentrations of primers, template DNA, Master Mix, and annealing temperature was optimized on eight diverse accessions for 156 SSR markers distributed on the 12 chromosomes by modified Taguchi method (Cobb and CIarkson 1994). Microsatellite primer sequences, annealing temperature and chromosomal locations are obtained from GRAMENE database ( Sixty one SSR primer pairs which produce polymorphic allele amplification were chosen to genotype the entire set of germplasm collection.

The volume of the PCR reaction system was 10 μl. The PCR reaction mixture of 10 μl had 0.4 mM dNTPs, 4 mM of MgCl2, 150 mM of Tris–HCl, 10 pmoles of forward and reverse primer and 0.05 U Taq polymerase with 30 ng of DNA. Polymerase chain reaction was performed in BIORAD THERMAL CYCLER using the following program: 94 °C for 2 min, 35 cycles of 94 °C for 45 sec, 50–60 °C for 1 min, 72 °C for 2 min with a final extension of 72 °C for ten min.

Polyacrylamide Gel Electrophoresis

Amplified products were size separated in native polyacrylamide gel electrophoresis using 6 % (w/v) polyacrylamide gel according to Sambrook et al. (2001) in vertical electrophoresis tank with 1X TBE at 150 V. The gel size was determined using standard molecular weight size markers after the bands were detected by silver staining.

Allele Scoring

The bands were visualized in a cluster of two to six in the stained gels for most of the markers. Based on the expected product size given in the GRAMENE website (Additional file 2: Table S1), the size of the most intensely amplified bands around the expected product size for each microsatellite marker was identified using standard molecular weight size markers (20 bp DNA ladder, GeNeI Company). Then the stained gel was dried and documented using light box. Allele score was given based on the presence of a particular size allele in each of the germplasm. The presence was denoted as 1 and absence of an allele as 0 and it was rechecked manually (Additional file 3: Table S2).

Data Analysis

A 1/0 matrix was constructed based on the presence and absence of alleles for the set of 61 markers. This SSR genotype data was analyzed for genetic diversity and population structure.

Genetic Diversity

For a set of accessions, genetic diversity parameters such as number of alleles per locus, allele frequency, heterozygosity and polymorphic information index (PIC) was estimated using the program POWERMARKER Ver3.25 (Liu and Muse 2005). Allele frequency represents the frequency of particular allele for each marker. Heterozygosity is the proportion of heterozygous individuals in the population. Polymorphic information content that represent the amount of polymorphism within a population was estimated based on Botstein et al. (1980).

To assess genetic structure, model based approach and distance based approach were used. Model based approach was utilized with Structure ver 2.3.4 software (Pritchard et al. 2000). The actual number of subpopulation which is denoted by K was identified by this method. For that, the project was run with the following parameter set: the possibility of admixture and allele frequency correlated. Run length was given as 150,000 burning period length followed by 150,000 Markov Chain Monte Carlo (MCMC) replication. Each k value was run for 10 times with k value varying from 1 to 10. The optimum k value was determined by plotting the mean estimate of the log posterior probability of the data (L (K) against the given K value. True number of subpopulation was identified using the maximal value of L (K). An adhoc quantity ΔK proposed by (Evanno et al. 2005) based on second order rate of change of the likelihood function with respect to K estimated using Structure Harvester (Earl 2012) has also shown a clear peak at the optimal K value.

Distance based approach which is based on calculating pair wise distance matrix was computed by calculating a dissimilarity matrix using a shared allele index with DARwin software (Perrier and Jacquemoud-Collet 2006). An unweighted neighbor joining tree was constructed using the calculated dissimilarity index. The genetic distance between accessions was estimated using NEI coefficient (Nei 1972) with bootstrap procedure of resampling (1000) across markers and individuals from allele frequencies. To determine the association among the accessions, unweighted pair group method with arithmetic mean (UPGMA) tree was also drawn using Powermarker and viewed in MEGA 6.0 software (Tamura et al. 2013).

The presence of molecular variance within and between hierarchical population structure estimated by Structure was assessed via Analysis of molecular variance (AMOVA) by Arlequin (Excoffier et al. 2005). F statistics which include FIT, deviations from Hardy- Weinberg expectation across the whole population, FIS deviation from Hardy- Weinberg expectation within a population and FST, correlation of alleles between subpopulation was calculated using AMOVA approach in Arlequin. AMOVA and Principal Coordinate analysis of the germplasm set was performed based on Nei (Nei 1973) distance matrix using GenAlEx 6.5 (Peakall and Smouse 2012).


  1. Abdurakhmonov IY, Abdukarimov A (2008) Application of association mapping to understanding the genetic diversity of plant germplasm resources. Int J Plant Genomics 2008:574927

    PubMed Central  Article  PubMed  Google Scholar 

  2. Agrama H, Eizenga G (2008) Molecular diversity and genome-wide linkage disequilibrium patterns in a worldwide collection of Oryza sativa and its wild relatives. Euphytica 160(3):339–355

    CAS  Article  Google Scholar 

  3. Agrama H, Eizenga G, Yan W (2007) Association mapping of yield and its components in rice cultivars. Mol Breed 19(4):341–356

    Article  Google Scholar 

  4. Agrama HA, Yan W, Jia M, Fjellstrom R, McClung AM (2010) Genetic structure associated with diversity and geographic distribution in the USDA rice world collection. Nat Sci 2(04):247

    Google Scholar 

  5. Botstein D, White RL, Skolnick M, Davis RW (1980) Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet 32(3):314

    PubMed Central  CAS  PubMed  Google Scholar 

  6. Breseghello F, Sorrells ME (2006) Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics 172(2):1165–1177

    PubMed Central  Article  PubMed  Google Scholar 

  7. Caetano-Anolles G, Brant B (1991) DNA amplification fingerprinting using very short arbitrary oligonucleotide primers. Nat Biotechnol 9(6):553–557

    CAS  Article  Google Scholar 

  8. Caicedo AL, Williamson SH, Hernandez RD, Boyko A, Fledel-Alon A, York TL, Polato NR, Olsen KM, Nielsen R, McCouch SR (2007) Genome-wide patterns of nucleotide polymorphism in domesticated rice. PLoS Genet 3(9):e163

    PubMed Central  Article  Google Scholar 

  9. Chakhonkaen S, Pitnjam K, Saisuk W, Ukoskit K, Muangprom A (2012) Genetic structure of Thai rice and rice accessions obtained from the international rice research institute. Rice 5(1):19

    Article  Google Scholar 

  10. Chen H, He H, Zou Y, Chen W, Yu R, Liu X, Yang Y, Gao Y-M, Xu J-L, Fan L-M (2011) Development and application of a set of breeder-friendly SNP markers for genetic analyses and molecular breeding of rice (Oryza sativa L.). Theor Appl Genet 123(6):869–879

    Article  PubMed  Google Scholar 

  11. Choudhury B, Khan ML, Dayanandan S (2013) Genetic structure and diversity of indigenous rice (Oryza sativa) varieties in the Eastern Himalayan region of Northeast India. Springer Plus 2(1):228

    PubMed Central  Article  PubMed  Google Scholar 

  12. Cobb BD, CIarkson JM (1994) A simple procedure for optimising the polymerase chain reaction (PCR) using modified Taguchi methods. Nucleic Acids Res 22(18):3801–3805

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  13. Courtois B, Frouin J, Greco R, Bruschi G, Droc G, Hamelin C, Ruiz M, Clément G, Evrard J-C, van Coppenole S (2012) Genetic diversity and population structure in a European collection of rice. Crop Sci 52(4):1663–1675

    Article  Google Scholar 

  14. Das B, Sengupta S, Parida SK, Roy B, Ghosh M, Prasad M, Ghose TK (2013) Genetic diversity and population structure of rice landraces from Eastern and North Eastern States of India. BMC Genet 14(1):71

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  15. Deepa G, Singh V, Naidu KA (2008) Nutrient composition and physicochemical properties of Indian medicinal rice–Njavara. Food Chem 106(1):165–171

    CAS  Article  Google Scholar 

  16. Earl DA (2012) STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the evanno method. Conserv Genet Resour 4(2):359–361

    Article  Google Scholar 

  17. Ebana K, Kojima Y, Fukuoka S, Nagamine T, Kawase M (2008) Development of mini core collection of Japanese rice landrace. Breed Sci 58(3):281–291

    Article  Google Scholar 

  18. Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14(8):2611–2620

    CAS  Article  PubMed  Google Scholar 

  19. Excoffier L, Laval G, Schneider S (2005) Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol Bioinformatics Online 1:47

    CAS  Google Scholar 

  20. Gao L-Z, Zhang C-H, Chang L-P, Jia J-Z, Qiu Z-E, Dong Y-S (2005) Microsatellite diversity within Oryza sativa with emphasis on indica–japonica divergence. Genet Res 85(01):1–14

    CAS  Article  PubMed  Google Scholar 

  21. Garris AJ, Tai TH, Coburn J, Kresovich S, McCOUCH S (2005) Genetic structure and diversity in Oryza sativa L. Genetics 169(3):1631–1638

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  22. Hesham AA, Yan W, Fjellstrom R, Jia M, McClung A (2008) Genetic diversity and relationships assessed by SSRs in the USDA World-Wide Rice Germplasm Collection. In: The 2008 joint annual meeting

    Google Scholar 

  23. Horst L, Wenzel G (2007) Molecular marker systems in plant breeding and crop improvement, vol 55. Springer, Berlin

    Google Scholar 

  24. Jin L, Lu Y, Xiao P, Sun M, Corke H, Bao J (2010) Genetic diversity and population structure of a diverse set of rice germplasm for association mapping. Theor Appl Genet 121(3):475–487

    Article  PubMed  Google Scholar 

  25. Krishnanunni K, Senthilvel P, Ramaiah S, Anbarasu A (2015) Study of chemical composition and volatile compounds along with in-vitro assay of antioxidant activity of two medicinal rice varieties: Karungkuravai and Mappilai samba. Journal of Food Science and Technology 52 (5):2572-2584. doi:10.1007/s13197-014-1292-z

  26. Lapitan VC, Brar DS, Abe T, Redoña ED (2007) Assessment of genetic diversity of Philippine rice cultivars carrying good quality traits using SSR markers. Breed Sci 57(4):263–270

    CAS  Article  Google Scholar 

  27. Liakat Ali M, McClung AM, Jia MH, Kimball JA, McCouch SR, Georgia CE (2011) A rice diversity panel evaluated for genetic and agro-morphological diversity between subpopulations and its geographic distribution. Crop Sci 51(5):2021–2035

    Article  Google Scholar 

  28. Liu K, Muse SV (2005) PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 21(9):2128–2129

    CAS  Article  PubMed  Google Scholar 

  29. Lu H, Redus MA, Coburn JR, Rutger JN, McCOUCH SR, Tai TH (2005) Population structure and breeding patterns of 145 US rice cultivars based on SSR marker analysis. Crop Sci 45(1):66–76

    CAS  Article  Google Scholar 

  30. McCouch SR, Chen X, Panaud O, Temnykh S, Xu Y, Cho YG, Huang N, Ishii T, Blair M (1997) Microsatellite marker development, mapping and applications in rice genetics and breeding. Plant Mol Biol 35(1-2):89–99

    CAS  Article  PubMed  Google Scholar 

  31. Nachimuthu VV, Robin S, Sudhakar D, Rajeswari S, Raveendran M, Subramanian K, Tannidi S, Pandian BA (2014) Genotypic variation for micronutrient content in traditional and improved rice lines and its role in biofortification programme. Indian J Sci Technol 7(9):1414–1425

    Google Scholar 

  32. Nei M (1972) Genetic distance between populations. Am Nat 106:283–292

    Article  Google Scholar 

  33. Nei M (1973) Analysis of gene diversity in subdivided populations. Proc Natl Acad Sci 70(12):3321–3323

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  34. Ni J, Colowit PM, Mackill DJ (2002) Evaluation of genetic diversity in rice subspecies using microsatellite markers. Crop Sci 42(2):601–607

    CAS  Article  Google Scholar 

  35. Peakall R, Smouse PE (2012) GenAlEx 6.5: genetic analysis in excel. Population genetic software for teaching and research—an update. Bioinformatics 28(19):2537–2539

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  36. Perrier X, Jacquemoud-Collet J (2006) DARwin software

    Google Scholar 

  37. Powell W, Morgante M, Andre C, Hanafey M, Vogel J, Tingey S, Rafalski A (1996) The comparison of RFLP, RAPD, AFLP and SSR (microsatellite) markers for germplasm analysis. Mol Breed 2(3):225–238

    CAS  Article  Google Scholar 

  38. Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155(2):945–959

    PubMed Central  CAS  PubMed  Google Scholar 

  39. Qi Y, Zhang D, Zhang H, Wang M, Sun J, Wei X, Qiu Z, Tang S, Cao Y, Wang X (2006) Genetic diversity of rice cultivars (Oryza sativa L.) in China and the temporal trends in recent fifty years. Chinese Sci Bull 51(6):681–688

    CAS  Article  Google Scholar 

  40. Qi Y, Zhang H, Zhang D, Wang M, Sun J, Ding L, Wang F, Li Z (2009) Assessing indica japonica differentiation of improved rice varieties using microsatellite markers. J Genet Genomics 36(5):305–312

    CAS  Article  PubMed  Google Scholar 

  41. Ram SG, Thiruvengadam V, Vinod KK (2007) Genetic diversity among cultivars, landraces and wild relatives of rice as revealed by microsatellite markers. J Appl Genet 48(4):337–345

    Article  PubMed  Google Scholar 

  42. Saghai-Maroof MA, Soliman KM, Jorgensen AR, Allard RW (1984) Ribosomal DNA spacer length polymorphisms in barley: Mendelian inheritance, chromosomal location and population dynamics. Proc Natl Acad Sci U S A 81:8014–8018

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  43. Sambrook J, Russell DW, Russell DW (2001) Molecular cloning: a laboratory manual (3-volume set), vol 999. Cold spring harbor laboratory press, Cold Spring Harbor, New York

    Google Scholar 

  44. Sow M, Ndjiondjop M-N, Sido A, Mariac C, Laing M, Bezançon G (2014) Genetic diversity, population structure and differentiation of rice species from Niger and their potential for rice genetic resources conservation and enhancement. Genet Resour Crop Evol 61(1):199–213

    Article  Google Scholar 

  45. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S (2013) MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol 30(12):2725–2729

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  46. Thornsberry JM, Goodman MM, Doebley J, Kresovich S, Nielsen D, Buckler ES (2001) Dwarf8 polymorphisms associate with variation in flowering time. Nat Genet 28(3):286–289

    CAS  Article  PubMed  Google Scholar 

  47. Valarmathi R, Raveendran M, Robin S, Senthil N (2015) Unraveling the nutritional and therapeutic properties of ‘Kavuni’ a traditional rice variety of Tamil Nadu. Journal of Plant Biochemistry and Biotechnology 24 (3):305-315. doi:10.1007/s13562-014-0274-6 

  48. Varshney RK, Chabane K, Hendre PS, Aggarwal RK, Graner A (2007) Comparative assessment of EST-SSR, EST-SNP and AFLP markers for evaluation of genetic diversity and conservation of genetic resources using wild, cultivated and elite barleys. Plant Sci 173(6):638–649

    CAS  Article  Google Scholar 

  49. Zhang D, Zhang H, Wei X, Qi Y, Wang M, Sun J, Ding L, Tang S, Cao Y, Wang X (2007) Genetic structure and diversity of Oryza sativa L. in Guizhou, China. Chinese Sci Bull 52(3):343–351

    CAS  Article  Google Scholar 

  50. Zhang D, Zhang H, Wang M, Sun J, Qi Y, Wang F, Wei X, Han L, Wang X, Li Z (2009) Genetic structure and differentiation of Oryza sativa L. in China revealed by microsatellites. Theor Appl Genet 119(6):1105–1117

    CAS  Article  PubMed  Google Scholar 

  51. Zhang P, Li J, Li X, Liu X, Zhao X, Lu Y (2011) Population structure and genetic diversity in a rice core collection (Oryza sativa L.) investigated with SSR markers. PLoS One 6(12):e27565

    PubMed Central  CAS  Article  PubMed  Google Scholar 

  52. Zhao K, Tung C-W, Eizenga GC, Wright MH, Ali ML, Price AH, Norton GJ, Islam MR, Reynolds A, Mezey J (2011) Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat Commun 2:467

    PubMed Central  Article  PubMed  Google Scholar 

  53. Zhao K, Wright M, Kimball J, Eizenga G, McClung A, Kovach M, Tyagi W, Ali ML, Tung C-W, Reynolds A (2010) Genomic diversity and introgression in O. sativa reveal the impact of domestication and breeding on the rice genome. PloS one 5(5):e10780

Download references


This work was supported by a grant from Department of Biotechnology, Government of India under Rice biofortification with enhanced iron and zinc in high yielding non basmati cultivars through marker assisted breeding and transgenic approaches- Phase II (E28SO) scheme. I thank Dr. Yasodha from Institute of Forest Genetics and Tree Breeding, Coimbatore for helping in the analysis.

Author information



Corresponding author

Correspondence to Vishnu Varthini Nachimuthu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

RoS, VVN, RS and SKK prepared the samples. VVN carried out the genotyping, data analysis and drafted the manuscript. BAP, GP, KG participated in genotyping and data preparation. RoS designed the experiment and revised the manuscript. SD, RS, RM, SM participated in study design and revised the manuscript. All authors read and approve the final manuscript.

Additional files

Additional file 1: Figure S1.

Allelic pattern of different SSR markers used in this study. (JPG 1.03 MB)

Additional file 2: Table S1.

Expected product size obtained from Gramene and observed product size for the SSR markers used in this study. (XLS 170 kb)

Additional file 3: Table S2.

Allele matrix of 192 accessions x 61 SSRs. (XLSX 10 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Nachimuthu, V.V., Muthurajan, R., Duraialaguraja, S. et al. Analysis of Population Structure and Genetic Diversity in Rice Germplasm Using SSR Markers: An Initiative Towards Association Mapping of Agronomic Traits in Oryza Sativa . Rice 8, 30 (2015).

Download citation


  • Rice
  • Genetic diversity
  • Population structure
  • Polymorphism information content
  • Molecular variance
  • Association mapping