Genetic Diversity and Population Structure of Korean Soybean Collection Using 75 Microsatellite Markers

Gi-An Lee; Yu-Mi Choi; Jung-Yoon Yi; Jong-Wook Chung; Myung-Chul Lee; Kyung-Ho Ma; Sok-young Lee; Jin-Woong Cho; Jung-Ro Lee

doi:10.7740/kjcs.2014.59.4.492

Preview

Original Research Article

The Korean Journal of Crop Science. 31 December 2014. 492-497
https://doi.org/10.7740/kjcs.2014.59.4.492

Genetic Diversity and Population Structure of Korean Soybean Collection Using 75 Microsatellite Markers

Gi-An Lee¹

Yu-Mi Choi¹

Jung-Yoon Yi¹

Jong-Wook Chung¹

Myung-Chul Lee¹

Kyung-Ho Ma¹

Sok-young Lee¹

Jin-Woong Cho²^*

Jung-Ro Lee¹^*

¹National Agrobiodiversity Center, National Academy of Agricultural Science, RDA, Jeonju-si, Jeollabuk-do, Korea

²College of Agricultural and Life Science, Chungnam National University, Daejeon 305-764, Korea

^{*Corresponding Author}

License (open-access, http://creativecommons.org/licenses/by-nc/3.0/):

This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

ABSTRACT

Soybean (Glycine max L.) is crucial legume crop as source of high quality vegetable protein and oil, and Korea is regarded as a part of center of soybean origin. To expand the information of conserved genetic diversity, we analyzed the genetic variability of soybean collection mainly introduced Korean accessions using 75 microsatellite markers. A total of 1,503 alleles with an average value of 20.0 alleles were detected among 644 accessions. Korean collection revealed average allele number of 13.4 while Chinese, Japanese and Southeast Asian accessions showed 9.0, 5.4 and 6.5 mean alleles, respectively. Especially, Korean accessions showed more number of private allele per locus as 3.4 contrary to other geographical groups. The mean expected heterozygosity and polymorphic information content was 0.654 and 0.616, respectively, and expected heterozygosity values were not significantly distinguished according to the geographical groups. The phylogenetic dendrogram and deduced population structure based on DNA profiles of 75 SSR loci showed Korean accessions formed distinct gene pool against Chinese accessions, and could be divided into five subpopulations. Korean soybean accessions have specific genetic diversity and might be serve the valuable alleles for bio-industry as a part of the center of soybean origin.

Keywords

microsatellites

genetic diversity

soybean

Glycine max L

MAIN

Soybean (Glycine max L.) is cultivated over the world and especially has been a crucial legume crop in eastern Asia region including Korea, China, and Japan because this crop has been important source of high quality vegetable protein and oil for food products. Recently, the usage of soybean for biodiesel has made this crop more economically important, and the whole-genome sequencing of Williams 82 (G. max) and IT182932 (G. soja) have been completed accordingly (Hisano et al., 2007; Kim et al., 2010; Schmutz et al., 2010). Yu & Kiang (1993) reported that Korea might be one of the major center of soybean origins based on electrophoretic assays of wild soybean (G. soja) and cultivated soybeans (G. max). Lee et al. (2008) also suggested that Korea is a major center of diversity for wild soybean and potentially could serve useful alleles not found in other parts of the world while Dong et al. (2004) regarded northeastern region of China as major center of soybean origin. Furthermore, the collection of Korea and China revealed divergent genotypes while accessions of Japan was genetically similar to Korean accessions (Lee et al., 2011; Li & Nelson, 2001).

The soybean cultivars, derived from a few accessions, show narrow genetic background, and this means cultivated varieties might be vulnerable to new biotic and abiotic stress along with global environmental changes (Hyten et al., 2006; Mulato et al., 2010). The genetic diversity of soybean including landraces and wild relatives is needed for breeding program and bioindustry as a valuable allele source. Especially, landraces collected in center of origins are useful for genetic improvement of cultivars in that wild relatives generally reveal many unwanted traits and show low introgression rate (Song et al., 2013). Genetic analysis using molecular markers have been consistently studied to understand the genetic variability of soybeans, and have revealed the genetic structure and diversity of various soybean collections (Choi et al., 1999; Dong et al., 2004; Guan et al., 2010; Lee et al., 2008; Li et al., 2008; Wang et al., 2006; Yoon et al., 2009).

Microsatellite or simple sequence repeats (SSRs) markers has been used for genetic diversity analysis because microsatellites are plentiful variations over the genome and highly reproducible and easy for genotyping as a codominant markers. The high level of polymorphism was detected in several soybean collections; 18.3 alleles (6 microsatellite loci) in Korean soybean accessions (Yoon et al., 2009), 25.5 alleles (46 microsatellite loci) in wild soybean collection (Lee et al., 2008), 12.2 alleles (60 microsatellite loci) in Chinese soybean accessions (Wang et al., 2006), 19.7 alleles (59 microsatellite loci) in soybean landraces of China (Li et al., 2008). Using microsatellite markers, Choi et al. (1999) analyzed the collection acquired from five major rivers in Korea and they supported that SSR markers are useful tool to classify cultivated soybeans and wild soybeans, and Dong et al. (2004) analyzed genetic diversity of cultivated soybean accessions collected from several provinces of China and proposed one possible diversity center.

In this paper, we analyzed the genetic diversity of soybean collection mainly introduced Korean accession from USDA National Plant Germplasm System of USA using genome-wide 75 microsatellite markers.

Materials and Methods

Plant materials and DNA extraction

A total of 644 soybean and wild soybean accessions were used in this study; 619 soybean (G. max) accessions (507 - Korea, 57 - China, 19 - Japan, 36 - Southeastern Asia) and 25 wild soybean (G. soja) accessions (6 - Korea, 9 - China, 10 – Japan) were included for diversity analysis (Table 1, supplementary Table S1). The majority of Korean soybean accessions were re-introduced from USDA National Plant Germplasm System of USA because these accessions were not conserved in Korean National Agrobiodiversity Center before.

Table 1.

Summary information on soybean (Glycine max & G. soja) accessions used in this study.

Species	Origin	Accession number

Soybean (Glycine max)	Korea	507
	China	57
	Japan	19
	Southeastern Asia	36

Sub total		619

Wild soybean (Glycine soja)	Korea	6
	China	9
	Japan	10

Sub total		25

Total		644

All plant materials were distributed from the National Agrobiodiverstiy Center, National Academy of Agricultural Science (NAAS), Rural Development Administration (RDA), Republic of Korea. Total genomic DNA was extracted by modified cetyltrimethylammonium bromide (CTAB) method (Dellaporta et al., 1983) using young leaf tissue after grinding in liquid nitrogen. DNA concentration and quality were determined with a NanoDrop ND-1000 spectrometer (NanoDrop Technologies, Wilmington, DE, USA), and DNA stocks were finally diluted to working solutions of of 20 ng/μl.

Genotyping with microsatellite markers

Genome-wide 75 microsatellite markers were chosen along the soybean genetic linkage map. The reproducibility of PCR product size was evaluated by performing three replications using two different samples for each SSR primer. The M13Ftail PCR method of Schuelke (2000) was used to measure the size of PCR products, as described previously (Lee et al., 2014). The PCR master mix was prepared as follows: Each 20 μl reaction mixture contained 20 ng template gDNA, 1X PCR buffer, 2 uM specific primer, 200 uM dNTPs, 4 uM M13 universal primer, 6 uM normal reverse primer, and 0.5 U Taq-polymerase (SolGent, Korea). PCR was performed as follows: denaturation at 94°C (3 min) then 30 cycles each of 94°C (30 s), 55°C (45 s), and 72°C (1 min), followed by 10 cycles of 94°C (30 s), 53°C (45 s), and 72°C (1 min), and a final extension at 72°C for 10 min with a PTC-200 thermocycler (MJ Research, Waltham, MA, USA). Amplified PCR products were resolved using an ABI PRISM 3130xl Genetic Analyzer (Applied Biosystems, Foster City, CA, USA) with GeneScan 500 ROX (6-carboxy-rhodamine) as molecular size standard (35–500 bp).

Data analysis

As a indicator of genetic diversity, the major allele frequency (M_AF), number of alleles (N_A), observed heterozygosity (H_O), expected heterozygosity (He) and polymorphism information content (PIC), were calculated using the genetic analysis package PowerMarker V3.25 (Liu & Muse, 2005). Phylogenetic dendrogram was constructed with an unweighted pair-group method of the arithmetic average (UPGMA) based on genetic distance matrix using the same software. The Structure software (V2.2) (Pritchard et al., 2007) was utilized to detect possible subpopulations (K=1 to K=10) with a model allowing for admixture and correlated allele frequencies using a burn-in period of 50,000 and MCMC repeats of 50,000 followed by 3 iterations. The optimal number of populations corresponds to the highest peak in the ΔK graph (Evanno et al., 2005), and the accessions with membership probabilities ≥ 70% were assigned to each subpopulation. The molecular variance for geographical regions was calculated by molecular variance (AMOVA) approach using the GenALEx software (ver. 6.1) (Peakall & Smouse, 2006).

Results

Genetic variability of soybeans

To reveal the genetic diversity of retained Korean soybean accessions in 75 SSR loci, major allele frequency (M_AF), number of alleles (N_A), expected heterozygosity (H_E), observed heterozygosity (H_O) and polymorphic information content (PIC) are analyzed and presented in Table 2 and supplementary Tabel S2. The mean value of M_AF was 0.476 (satt276; 0.111~ satt328; 0.913) and 0.278 (satt316 and satt157; 0.091~satt229; 0.833) in soybean and wild soybean accessions, respectively. A total of 1,503 alleles (soybean; 1,200 alleles, wild soybean; 835 alleles) with an average value of 20.0 alleles (soybean; 16.0 mean alleles, wild soybean; 11.4 mean alleles) were detected in 75 SSR loci. Among the tested soybean samples, The Korean accessions revealed average allele number of 13.4 (2~39) while Chinese, Japanese and Southeast Asian accessions showed 9.0, 5.4 and 6.5 mean alleles, respectively. Interestingly, the mean private allele number of Korean group was 3.4 contrary to other soybean geographical groups ranged from 0.2 to 0.7 mean alleles (Fig. 1). The mean expected heterozygosity (H_E) and polymorphic information content (PIC) among soybean accessions was ranged from a 0.162 on satt328 to 0.954 on satt276 (mean = 0.654) and from 0.156 on satt328 to 0.952 on satt323 (mean = 0.616), respectively. The average HE values were not significantly distinguished according to the geographical group as Korea of 0.628, China of 0.637, Japan of 0.638 and Southeast Asia of 0.600 while wild soybeans revealed higher mean H_E values of 0.826 ranged from 0.299 to 0.941.

http://static.apub.kr/journalsite/sites/kjcs/2014-059-04/A03505904013/images/KJCS-59-492_F1.jpg

Fig. 1.

Allele variability according to the origin of tested 644 soybean accessions.

Table 2.

Summary of genetic diversity indices in 644 soybean accessions using 75 microsatellite markers.

Group	Subgroup	N	M_AF	N_A	H_E	H_o	PIC

Soybean (G. max)	Sub-total	619	0.476 (0.111~0.913)	16 (3~41)	0.654 (0.162~0.954)	0.019 (0~0.13)	0.616 (0.156~0.952)
	Region-1 (Korea)	507	0.502 (0.115~0.944)	13.427 (2~39)	0.628 (0.106~0.951)	0.008 (0~0.141)	0.589 (0.101~0.949)
	Region-2 (China)	57	0.507 (0.125~0.889)	8.987 (3~22)	0.637 (0.201~0.928)	0.038 (0~0.109)	0.602 (0.187~0.924)
	Region-3 (Japan)	19	0.483 (0.167~0.842)	5.44 (2~13)	0.638 (0.266~0.898)	0.02 (0~0.368)	0.589 (0.231~0.89)
	Region-4 (Southeastern Asia)	36	0.522 (0.167~0.972)	6.52 (2~21)	0.6 (0.054~0.914)	0.137 (0~0.371)	0.559 (0.053~0.908)
Wild soybean (G. soja)	Sub-total	25	0.278 (0.091~0.833)	11.373 (3~20)	0.826 (0.299~0.941)	0.045 (0~0.261)	0.81 (0.289~0.938)
	Region-1 (Korea)	6	0.368 (0.167~1)	4.627 (1~7)	0.71 (0~0.847)	0.092 (0~0.5)	0.668 (0~0.828)
	Region-2 (China)	9	0.356 (0.111~1)	5.627 (1~9)	0.742 (0~0.889)	0.017 (0~0.2)	0.71 (0~0.878)
	Region-3 (Japan)	10	0.378 (0.111~0.9)	5.88 (2~10)	0.733 (0.18~0.889)	0.044 (0~0.222)	0.702 (0.164~0.878)

*Abbreviation : M_AF, major allele frequency; N_A, number of alleles, H_O, observed heterozygosity; H_E, expected heterozygosity; PIC, polymorphic information content.

Genetic relationship and population structure

To reveal genetic relationship of tested 644 accessions, we constructed the phylogenetic dendrogram based on DNA profiles of 75 SSR loci (Fig. 2). The soybean and wild soybean were critically divergent, and soybean accessions could be divided into three groups in a genetic distance value of 0.72; Group-1 was mainly composed of Korean and Japanese soybean accessions, main Chinese accessions were appertained into Gropu-2 and Group-3 inculded mainly Southeast Asian accessions. According to the phylogenetic analysis Korean and Chinese soybean gene pools were divergent, and Japanese accessions tied into Korean gene pool while Southeast Asian accessions showed near genetic relationship to Chinese accessions.

http://static.apub.kr/journalsite/sites/kjcs/2014-059-04/A03505904013/images/KJCS-59-492_F2.jpg

Fig. 2.

Phylogenetic dendrogram (a) and model-based population structure based on SSR profiles of 644 soybean accessions.

The model-based Bayesian clustering method was used to detect the possible subpopulations along the tested soybean gene pools, and we could find significant population structure. For each K value (K = 1 to 10), we determined the optimal value of K by calculating ΔK. As we used soybean and wild soybean accessions in this study, we surveyed the highest ΔK value over “K=3”. The seven subpopulations, referred to as Pop-1~ Pop-7, were observed with the highest ΔK value (data not shown), and 331 accessions were admixture type with an membership probability < 70% (Fig. 2). Most accessions were distinguished according to the geographical origin; soybean accessions could be divided into six subpopulations (Pop-2~ Pop-7) while wild soybean accessions were tied into Pop-1 group. Korean accessions were sub-divided into five subpopulations (Pop-2~Pop-6), and the Pop-3 included three Japanese accessions. Chinese and Southeast Asian accessions were located in Pop-7, and some Korean accessions were included in this group. The AMOVA analysis showed that the molecular variance between the geographical groups accounted for 13% and the remaining 87% were due to the differences within the geographical groups.

Discussion

Soybean is important legume crop for protein and oil source, but the recent cultivated varieties reveals narrow genetic background and weak adaptability to global environment changes (Hyten et al., 2006; Mulato et al., 2010). As Korea is one of the center of soybean origins, we analyzed the genetic diversity of Korean soybean collection using 75 microsatellite markers for a sustainable conservation and utilization.

In this study, Korean soybean accessions revealed average allele number (N_A) of 13.4, H_E of 0.628 and PIC of 0.589, and the N_A of Korean accession was higher than other regional groups (China – 9.0, Japan – 5.4, Southeast Asia - 6.5). As we analyzed the more Korean samples than other geographical regions, detection of more alleles in Korean collection is expectable results, but the average private allele number of Korean accessions was 3.4 contrary to other soybean geographical groups ranged from 0.2 to 0.7 mean private alleles. This means that the Korean accessions conserve distinct alleles and might be valuable sources for breeding program and bio-industry with the regional specific alleles. In previous studies, (Li et al., 2011) reported average 14.2 alleles and 0.806 genetic diversity in Chinese core soybean collection at 55 microsatellite loci, and (Guan et al., 2010) reported 16.2 alleles with a mean gene diversity of 0.84 in Chinese and Japanese soybean collection at 46 SSR loci. Our results showed a little low allele numbers and genetic diversity values, but determining the diversity level is inappropriate because of different microsatellite loci and collection. The wild soybean (G. soja) revealed higher mean H_E values of 0.826 than soybean (G. max) of 0.654, and this is reasonable in that wild relative generally conserve more genetic diversity than domesticated landraces.

The phylogenetic dendrogram and deduced population structure based on DNA profiles of 75 SSR loci showed that Korean and Chinese soybean accessions formed divergent gene pools. The AMOVA for geographical groups revealed that 13% and 87% of total variation were due to the differences between and within geographical groups, respectively, and this indicated that the significant geographical differentiation was present. Japanese accessions showed high genetic relationship to Korean soybean gene pools, whereas Southeast Asian accessions were similar to Chinese accessions. As previous studies (Lee et al., 2011; Lee et al., 2008; Li & Nelson, 2001; Yu & Kiang, 1993), soybean germplasm of Korea region conserve distinct genetic diversity compared to Chinese accessions, and Japanese and Korean accessions were more similar to each other. By model-based bayesian clustering method showed that Korean soybean accessions could be subdivided into five subpopulation (Pop-2~Pop6). The results in this study indicate that Korean soybean accessions have specific genetic diversity and might be serve the valuable alleles for bio-industry as a part of the center of soybean origin.

Continuous utilization of limited elite cultivars reduces the genetic background of cultivated soybean, and the adaptability against rapidly changing circumstance might decline accordingly. This study serves that Korean soybean accessions show distinct genetic diversity and is composed of several subpopulations. The detected specific alleles and population structures could elevate the utilization Korean soybean genetic resources.

Acknowledgements

This study was supported by the Rural Development Administration (RDA), a grant (Code # PJ008625) from the National Academy of Agricultural Science, RDA, Republic of Korea.

REFERENCES

IY Choi, JH Kang, HS Song and NS Kim, Genes Genet. Syst, Genetic diversity measured by simple sequence repeat variations among the wild soybean, Glycine soja, collected along the riverside of five major rivers in Korea, 74(4); 169-177 (1999)

SL Dellaporta, J Wood and JB Hicks, Plant Mol. Biol. Rep, A plant DNA minipreparation: version II, 1(4); 19-21 (1983)

Y Dong, L Zhao, B Liu, Z Wang, Z Jin and H Sun, Theor. Appl. Genet, The genetic diversity of cultivated soybean grown in China, 108(5); 931-936 (2004)

G Evanno, S Regnaut and J Goudet, Mol. Ecol, Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study, 14(8); 2611-2620 (2005)

R Guan, R Chang, Y Li, L Wang, Z Liu and L Qiu, Genet. Resour. Crop Evol, Genetic diversity comparison between Chinese and Japanese soybeans (Glycine max (L.) Merr.) revealed by nuclear SSRs, 57(2); 229-242 (2010)

H Hisano, S Sato, S Isobe, S Sasamoto, T Wada, A Matsuno, T Fujishiro, M Yamada, S Nakayama and Y Nakamura, DNA Res, Characterization of the soybean genome using EST-derived microsatellite markers, 14(6); 271-281 (2007)

DL Hyten, Q Song, Y Zhu, IY Choi, RL Nelson, JM Costa, JE Specht, RC Shoemaker and PB Cregan, Proc. Natl. Acad. Sci. US, Impacts of genetic bottlenecks on soybean genome diversity, 103(45); 16666-16671 (2006)

JY Song, XM Piao, YM Choi, GA Lee, JW Chung, JR Lee, YI Jung, HJ Park and MC Lee, Plant Breed Biotechno, Evaluation of genetic diversity and comparison of biochemical traits of soybean (Glycine max L.) germplasm collections, 1(4); 374-384 (2013)

MY Kim, S Lee, K Van, TH Kim, SC Jeong, IY Choi, DS Kim, YS Lee, D Park and J Ma, Proc. Natl Acad. Sci. US, Whole-genome sequencing and intensive analysis of the undomesticated soybean (Glycine soja Sieb. and Zucc.) genome, 107(51); 22032-22037 (2010)

GA Lee, JS Sung, SY Lee, JW Chung, JY Yi, YG Kim and MC Lee, Mol Ecol Resou, Genetic assessment of safflower (Carthamus tinctorius L.) collection with microsatellite markers acquired via pyrosequencing method, 14(1); 69-78 (2014)

JD Lee, TD Vuong, H Moon, JK Yu, R Nelson, HT Nguyen and JG Shannon, Crop Sci, Genetic diversity and population structure of Korean and Chinese soybean [Glycine max (L.) Merr.] accessions, 51(3); 1080-1088 (2011)

JD Lee, JK Yu, YH Hwang, S Blake, YS So, GJ Lee, H Nguyen and JG Shannon, Crop Sci, Genetic Diversity of Wild Soybean (Glycine soja Sieb. and Zucc.) Accessions from South Korea and Other Countries, 48(2); 606-616 (2008)

YH Li, MJ Smulders, RZ Chang and LJ Qiu, Conserv. Genet, Genetic diversity and association mapping in a collection of selected Chinese soybean accessions based on SSR marker analysis, 12(5); 1145-1157 (2011)

Y Li, R Guan, Z Liu, Y Ma, L Wang, L Li, F Lin, W Luan, P Chen and Z Yan, Theor. Appl. Genet, Genetic structure and diversity of cultivated soybean (Glycine max (L.) Merr.) landraces in China, 117(6); 857-871 (2008)

Z Li and RL Nelson, Crop Sci, Genetic diversity among soybean accessions from three countries measured by RAPDs, 41(4); 1337-1347 (2001)

K Liu and SV Muse, Bioinformatics, PowerMarker: an integrated analysis environment for genetic marker analysis, 21(9); 2128-2129 (2005)

BM Mulato, M Möller, MI Zucchi, V Quecini and JB Pinheiro, Pesqui. Agropecu. Bras, Genetic diversity in soybean germplasm identified by SSR and EST-SSR markers, 45(3); 276-283 (2010)

R Peakall and PE Smouse, Mol. Ecol. Note, GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research, 6(1); 288-295 (2006)

J Pritchard, X Wen and D Falush, Documentation for structure software: Version 2, Department of Human Genetics, University of Chicago; Department of Statistics, University of Oxford. (2007)

J Schmutz, SB Cannon, J Schlueter, J Ma, T Mitros, W Nelson, DL Hyten, Q Song, JJ Thelen and J Cheng, Nature, Genome sequence of the palaeopolyploid soybean, 463(7278); 178-183 (2010)

M Schuelke, Nat. Biotechnol, An economic method for the fluorescent labeling of PCR fragments, 18(2); 233-234 (2000)

L Wang, R Guan, L Zhangxiong, R Chang and L Qiu, Crop Sci, Genetic diversity of Chinese cultivated soybean revealed by SSR markers, 46(3); 1032-1038 (2006)

MS Yoon, J Lee, CY Kim, JH Kang, EG Cho and HJ Baek, Euphytic, DNA profiling and genetic diversity of Korean soybean (Glycine max (L.) Merrill) landraces by SSR markers, 165(1); 69-77 (2009)

H Yu and YT Kiang, Euphytic, Genetic variation in South Korean natural populations of wild soybean (Glycine soja), 68(3); 213-221 (1993)

The Korean Journal of Crop Science ISSN:0252-9777(Print) 2287-8432(Online) 한국작물학회지

Preview

Genetic Diversity and Population Structure of Korean Soybean Collection Using 75 Microsatellite Markers

ABSTRACT

MAIN

Table 1.

Fig. 1.

Table 2.

Fig. 2.

Acknowledgements

REFERENCES