Journal Search Engine

ISSN : 1225-8504(Print)
ISSN : 2287-8165(Online)

Journal of the Korean Society of International Agriculture Vol.28 No.1 pp.73-83
DOI : https://doi.org/10.12719/KSIA.2016.28.1.73

Assessment of Genetic Diversity and Population Structure of the Sub Core Set in Sesame (Sesamum indicum) using SSR Markers

Sun-Kyung Min, Buwoong Choi, Jong-Hyun Park^*, Jong-Wook Chung^**, Kyu-Won Kim, Yong-Jin Park^†

Department of Plant Resources, College of Industrial Sciences, Kongju National University, Yesan 32439, Korea.
^*Food Grain Policy Division, Ministry of Agriculture, Food and Rural Affairs, Sejong-si 30110, Korea.
^**National Agrobiodiversity Centre, National Academy of Agricultural Science, Rural Development Administration, Jeonju 54875, Korea

Corresponding author +82-41-330-1213yjpark@kongju.ac.kr

Received August 6, 2015 Review January 13, 2016 Accepted February 11, 2016

Abstract

Sesame is queen of oil seed crops and widely cultivated in Asia and Africa. The aim of this study was to develop a mini sub core set representing the diverse germplasm of sesame and to assess the genetic diversity, population structure and phylogenetic relationship of the resulted sub core set to be used in whole genome resequencing platform. One hundred twelve accessions out of 277 accessions were selected by the PowerCore program. A total of 155 alleles were captured from the 158 alleles detected in the primary core population, and rare alleles and specific alleles were also maintained in the sub core set accessions representing almost 100% of the primary core population. Among the sub core set accessions, four sub populations were observed with some admixture accessions. Although the genetic diversity of Pop-1 which includes most accessions from Korea is relatively lower than that of other three sub populations, it can maintain maximum number of accessions in the sub core set with the same percentage as in the primary core set probably because of the specific features of these accessions. Based on this framework of genetically defined populations, the effective use and conservation management of Sesamum indicum for crop improvement might be possible.

Key Words : genetic diversity , Sesamum indicum , power marker , sub core set

SSR 마커를 이용한 참깨(Sesamum indicum) 소규모 핵심집단의 유전적 다양성 및 집단 구조 분석

민 선경, 최 부웅, 박 종현^*, 정 종욱^**, 김 규원, 박 용진^†

공주대학교 식물자원학과
^*농림축산식품부 식량정책과
^**국립농업과학원 농업유전자원센터

초록

키워드 :

This article has been cited by 0 article in crossref

Cited-By

Funding:

Ministry of Education
National Research Foundation
2014R1A1A2057073

참깨(Sesamum indicum L.) (2n = 26)는 주요 유 료 작물로 옛날부터 아시아, 아프리카 및 남미의 열대 지방에 서 재배되어왔다(Bedigian, 2003). 참깨는 게놈 크기가 최대 369Mb로 추정되며(Zhang et al., 2013), Sesamum 속은 36종 이 보고되었고(Kobayashi, 1981) 그 중에서 Sasemum indicum L.은 가장 많이 재배되고 있는 종이다(Nayar and Mehra 1970). 일반적으로 자식성 작물로 여겨지며 자연교잡률의 범위 가 5%미만이다(Pathirana, 1994; Rheenen, 1980). 참깨는 사람 이 사용한 가장 오래된 유료종자 작물로 알려져 있으며(Joshi 1961; Weiss 1983; Mabberley 1997), 인디아 또는 아프리카가 기원으로 추정되고 있다(Bedigian and Harlan 1986; Bedigian et al., 1985; Bedigian 2003; Bedigian 2010; Nanthakumar et al., 2000; Fuller 2003; Kumar and Hiremath, 2008).

참깨 종자는 땅콩, 대두, 유채종자를 포함한 주요 유료종자 작물가운데 가장 많은 기름함량을 가지며(Anilakumar et al., 2010), 단백질, 비타민이 풍부할 뿐만 아니라, 세사민, 세사몰린 과 같은 항산화 물질을 다수 포함하고 있어(Namiki 1995; Moazzami and Kamal-Eldin 2006) ‘유류종자의 여왕’으로 불 린다. 그러나 세계의 많은 유전자 은행은 작물 유전자원 수집 품종 수의 급격한 증가로 인해 개발, 평가 및 활용에 어려움 을 가지고 있으며, 유지비용 문제에 직면하고 있다(Holden, 1984). Frankel and Brown (1984)는 이 문제를 해결하기 위 해 최소한의 개체 수를 통한 선발로 최대한 유전적 다양성을 포함하는 핵심집단(core collection) 개념을 제안하였다. 재배종 과 야생종의 관계에 대한 유전적 다양성 및 유연관계의 정보 는 식물 유전자원을 효과적으로 활용하는데 필요한 필수적인 것이라 할 수 있다.

Microsatellites과 SSRs는 작물 유전연구에 유용한 방법으로 서 집단 유전학과 유전자 매핑 분야에서 가깝게 연관되어 있 는 종이나 품종 사이의 유전적 관계를 밝히는데 활용되고 있 다(Zhao et al., 2009; Cho et al., 2010; Ma et al., 2010; Li et al., 2011; Hong et al., 2013). 교배 방법에서 참깨 Polymorpic SSR 마커의 사용은 Dixit et al., (2005), Jin et al., (2009), Jyothi (2009), Spandana et al., (2012)에 의해 보고 된 바가 있다. 참깨의 표현형과 유전자형을 이용하는 다양한 핵심집단 선발 방법론이 개발되어 왔다 그러나 농업형질과 형 태적 형질과 같은 표현형은 환경 요인에 의해서 강하게 편의 가 발생하고 또한 재배 조건에 의해서도 영향을 받게 된다. 이러한 문제를 해결하기 위해 유전자형을 이용한 핵심집단 선 발 방법이 발달하였는데 그 중 PowerCore 소프트웨어(Kim et al., 2007)는 기존의 방법보다 효과적으로 유전적 다양성을 포 함한 다양한 정보를 압출할 수 있다. 벼, 대두, 아마란스의 연 구에서 이를 사용하여 핵심집단을 선발한 바 있다.(Lu et al., 2009; Zhao et al., 2010a,b; Moe et al., 2012; Aye et al., 2013).

본 연구는 참깨의 유전적 다양성과 집단 구조분석을 통해 품종 육성 및 기초자료로 제공하기 위해, Park et al., (2014) 이 발표한 참깨 유전자원 핵심집단 277점을 농촌진흥청 국립 농업과학원 국립유전자원센터에서 분양을 받아 총 14개의 SSR 마커로 PowerCore의 휴리스틱 선발 분석을 실시하여 소규모 핵심집단을 구축하고 효율적인 자원 관리 및 육종에서 사용할 수 있는 기초 자료로서 제공하고자 한다.

재료 및 방법

식물 재료

농촌진흥청 유전자원센터에서 제공받은 모 핵심집단 277점 을 Park et al., (2014)이 분석에 이용한 14개의 SSR 마커로 genotype하여 112점의 핵심집단을 구축하였으며(Table 1), 각 참깨의 원산지는 한국 19점, 일본 5점, 이란 7점, 인도 12점, 중국 7점, 미국 5점 등 총 15개국이 포함되었다.

SSR genotyping

전체 DNA 추출은 Qiagen DNA extraction kit (Qiagen, Seoul, Republic of Korea)를 사용하여 추출하였다. DNA 추 출 후 14개의 SSR marker 정보(Dixit et al., 2005., Jin et al., 2009.)를 활용해 Schuelke (2000) 방법으로 Polymerase Chain Reaction (PCR)을 수행하였다(Lee et al., 2008). SSR allele 분석은 ABI-PRISM 3100 DNA sequencer (Applied Biosystems, Foster City, CA, USA)의 GeneScan 3.7 software (Applied Biosystems)를 이용하였으며, allele의 정확한 측정을 위해 GeneScan 500 ROX (6-carbon-X-rhodamine) molecular size standards (35-500bp)를 Genotyper 3.7 software (Applied Biosystems)와 함께 사용하였다.

Development of a core set for sesame

모 핵심집단의 allele로부터 효율적으로 allele를 포함시키기 위해 Zhao et al. (2010 a), Moe et al. (2012), and Aye et al. (2013)의 연구에 의하여 이미 사용성이 입증된 바가 있 는 PowerCore 1.0 (Kim et al., 2007)의 휴리스틱 알고리즘 을 이용하여 핵심집단을 선발하였다. 모집단과 선발된 대립유 전자간의 분석을 위해 Excel (Microsoft Office; Microsoft, Redmond, WA, USA)을 이용하였다.

Genetic diversity analysis and evaluation

Powermarker 3.25 (Liu and Muse 2005) 프로그램을 이용 하여 number of alleles (NA), number of specific alleles (SA), number of rare alleles (RA), major allele frequency (MF), polymorphism information content (PIC)에 대한 분석을 하였 고, 내장되어 있는 Mega 4.0 (Tamura et al. 2007) 프로그램 을 사용해 unrooted neighbor-joining tree를 작성하였다.

각 마커에 대한 PIC는 다음 공식에 의하여 계산하였다.

$P I C = 1 - \sum_{i = 1}^{n} p_{i}^{2} - 2 \sum_{i = 1}^{n - 1} \sum_{j = i + 1}^{n} p_{i}^{2} p_{j}^{2}$

단, p는 i번째 SSR 마커의 j번째 패턴의 상대적 빈도 (Botstein et al., 1980).

Population structure analysis of the core set

핵심집단의 구조 분석을 위하여 The model-based program Structure 2.2 (Pritchard et al., 2000; Falush et al., 2003)를 사용하였으며, 설정된 값은 burn-in 100,000, run length은 150,000으로 하였다. Structure 프로그램에서 집단 (K)는 1에서 10까지 분석하였으며, 이 모델은 최대 확률을 가진 K를 구하 기 위해서 ΔK 값을 이용하였다. ΔK는 Structure (Evanno et al., 2005) 프로그램에서 추론된 클러스터 개수에 대한 로그 확률의 2차 변화와 관계된 량으로, 각각의 자원은 70%이상의 멤버십 추정 확률을 가지는 하나의 집단(sub 집단)으로 할당 되었다.

결 과

Sub core set development and allele capture efficiency

PowerCore를 이용하여 참깨 모 집단 277점으로부터 소규모 핵심집단을 구축한 결과, 112점의 소규모 핵심집단을 구축하 였으며 이는 모 집단 품종의 40.4%에 해당하며. 총 allele의 수는 158개 중 155개로 모 집단의 98%에 해당한다. 또한 SSR 마커의 대립유전자 분포는 모 집단과 비슷한 양상을 보였다 (Table 2, Fig 1).

Genetic diversity of the sub core set

Power Marker를 이용한 소규모 핵심집단과 모 핵심집단의 분석결과를 NA, GD, PIC, NG와 같은 척도로 평가하였을 때 비슷한 양상을 보였으며(Table 2), 평균 값은 각각 11.07, 0.65, 0.61 및 20.79였다.

MF는 소규모 핵심집단에서 감소한 것으로 관찰되었다. 모 집단에서는 38개의 specific allele와 112개의 rare allele로, 각 각의 평균은 2.71과 8.0인 반면에, 소규모 핵심집단에서는 45 개의 specific allele와 109개의 rare allele가 관찰되었으며 평 균은 3.21, 7.79로 나타났다. 특히, specific allele는 모 핵심 집단보다 소규모 핵심집단에서 45개로 증가한 것으로 관찰되 었다(Table 2).

Population structure of the sub core set

참깨 112점을 model-based program 및 14개의 SSR 마커 를 이용해 집단 구조 분석한 바 L(K)은 K과 같이 정확한 형 태를 보이지 않았으며, 정확한 K 값을 얻기 위해 Evanno et al., (2005) 제안한 ΔK을 사용하였다. Fig 2, 3

선발된 112점을 분석하기 위해 Alpha parameter를 최대로 하는 ΔK를 선택하였다. 최대 ΔK 값은 K = 4이고, alpha parameter는 0.1058이었으며, 모든 accessions은 4개의 subpopulation (membership > 70%)로 추정되었다. Pop-1는 19점, Pop-2 20점, Pop-3에는 21점이 포함되었고 16점은 Pop-4에 포함되었다. 다른 36점은 admixture에 포함되었다(Table 1, Fig. 4). 흥미롭게도 Pop-1에는 대부분 한국 품종이 포함되었 고 Pop-3은 중동 아시아, Pop-4는 남아시아 지역 품종이 포함 되었다. 마지막으로 Pop-2에는 다양한 지역의 품종이 포함되 었다.

유전적 차이 기반 분석은 112점에 속한 shared allele 빈도 로 계산하였고, unrooted phylogram는 Powermarker 3.23 와 Mega 4 (Tamura et al. 2007)를 이용하였다. 집단 구조분석의 유사 클러스터링 패턴을 분석하였고(Fig. 5), model-based 집 단 분석 결과에 따라 다른 색으로 구분하였다.

각 model-based 집단의 유전적 다양성을 측정한 결과(Table 3), Pop-1의 gene diversity는 0.483, allelic richness 평균은 4.643이 고 PIC는 0.445로 4개의 subpopulation 중에 가장 낮은 것으 로 관찰되었으나, Pop-2와 Pop-3, Pop-4는 gene diversity, PIC값이 Pop-1에 비해 모두 높았다. 소규모 핵심집단의 최대 점유 품종은 한국품종으로서 모 핵심집단에서의 수와 비율에 큰 차이가 없었다(Table 4).

고 찰

핵심집단 구축은 유전자원관리의 유용한 관리를 위해 Frankel (1984)가 제안하였다. 본 연구는 소규모 핵심집단은 다양한 유전자원에 대하여 대표성을 띄는 whole genome resequencing의 모 집단으로부터 구축되었으며. 277점의 모 집 단에서 대표적인 소규모 핵심집단 112점을 구축하였다.

소규모 핵심집단의 품종은 모 핵심집단의 대부분의 척도를 대변하고 있을 뿐만 아니라(Table 2), 전체 품종의 60%를 감 소시켰다(Table 4). 이는 Zhao et al. (2010a), Moe et al. (2012), and Aye et al. (2013)의 연구에 의하여 이미 사용성 이 입증된 바가 있는 PowerCore (Kim et al., 2007)는 본 연구에서도 그 사용성이 입증되었다 할 수 있겠다.

좋은 핵심집단은 최대한 적은 수의 자원으로 최대한 많은 유전적 다양성을 대표하는데 있다. 핵심집단 또는 allelemining set의 구축은 방대한 양의 유전자원에서 표현형 검사 나 육종에 이용할 수 있으며(Zhao et al., 2010a), allelic richness는 풍부한 다양성 관련 척도이다(Schoen and Brown 1993; Bataillon et al., 1996).

또한 소규모 집단에서 specific allele는 증가는PowerCore가 품종 수를 감소시키고 최대한의 다양성을 보여준다고 할 수 있으며, 중복성의 품종을 줄이거나 어떤 allele 선발할 때 specific allele를 구별해 준다(Aye et al., 2013). 이는 핵심집 단 내에서 specific allele의 증가를 도출할 것이고, 이러한 결 과는 중요한 allele를 보유하는 핵심집단 방식을 구현하여 나 타낼 것이다.

본 연구에서 소규모 핵심집단은 모 핵심집단의 대부분의 allele 다양성을 포함하였다(Table 2, Fig. 1). 따라서 핵심집단 은 277점에서 112점으로 수를 감소시키면서 최대의 유전적 다 양성을 보여주며, 전체 참깨 선발로부터 대표성을 나타낸다. Brown et al., (1987) 은 핵심집단의 수는 기본 선발집단에서 5 ~ 10%를 포함하여야 되며 적어도 70%이상 유전적 다양성을 대변할 수 있어야 한다고 하였으며, Diwan et al. (1995) 은 핵심집단은 항상 10% 이상 설정되어야 한다고 제안하였다. 또 한 van Hintum (1995)은 선정된 품종은 핵심집단의 특정 목 적에 따라 5 ~ 20% 의존한다 하였다. 따라서 선발된 핵심집단 은 본 연구의 모든 품종의 대표성을 충분히 나타낼 것으로 볼 수 있다.

참깨 원산지는 아프리카에서 중동지역을 거쳐 인도, 중국, 한국 일본 순으로 되어 있다(Bedigian et al., 1986; Bedigian and Harlan, 1986; Bedigian, 2003). 본 소규모 핵심집단의 sub-population group cluster를 각 자원의 원산지와 대응시켜 본 결과는 Fig. 6와 같다. 원산지 아프리카는 Pop-3가 우점 하였다. 원산지 아프리카와 가까운 유럽 쪽은 아프리카와 비슷 하게 Pop-3이 다수를 차지하였으며(Afghanistan, Iran, Turkey), 이 보다 더 먼 지역에서는 Pop-4가 증가하다가 (Nepal, China, India, Philippines), 이에서 더 멀어질수록 Pop-2과 Pop-1이 증가하는 양상을 보였다.

구조분석을 통해 4개의 subpopulation과 admixture가 동시에 있는 것으로 관찰되었으며(Table 1, Fig. 4), unrooted phylogram에서도 비슷한 경향을 보였다(Fig. 5). 본 연구결과에서 admixed/hybrid genotype이 존재한다는 사실은 참깨에서 hybridization/introgression가 빈번히 발생한다는 것을 시사한다. Hybridization과 introgression은 그것을 탐색하고 범위를 명확 하게 추정하는 것은 어려우나, 재배종과 야생종 및 잡초성 종 사이의 유전적 재조합은 재배작물의 기원을 밝히는 데 매우 중요하다. 본 연구 결과에서의 subpopulation 들은 다양한 지 역이 속해있는 Pop-2를 제외하고 지역에 따라 population이 구분되는 것으로 관찰되었다. Pop-1는 한국 품종이 대부분이 었으며 그럼에도 불구하고 소규모 핵심집단과 모 집단의 최대 포함 비율은 유지 되었다. 비록 Pop-1은 4개의 subpopulation 중 가장 낮은 gene diversity를 가졌지만 이는 한국품종의 고 유 특수성을 보여주는 결과라고 해석할 수 있다(Table 4).

결론적으로, 본 연구를 통해 277점의 모 집단으로부터 총 155 alleles, SSR locus 별로 평균 11.07 alleles를 가진 소규 모 핵심집단을 112점을 동정하였다. 이는 모 집단의 대부분 allele를 포함한 결과이다. 집단 구조 분석을 통해서 유전적 거 리에 따라 군집화를 수행한 결과 소규모 핵심집단은 4개의 subpopulation으로 나누어짐을 발견하였다. 따라서, 본 연구에 서 구축한 소규모 핵심집단을 기반으로 참깨 육종 및 연구자 들이 품종을 육성하는데 아직 이용하지 않은 유용한 allele들 을 도입하기 위한 참깨 유전자원의 효율적인 관리 및 유용유 전자 선발을 위한 기초 자료로서 유용하게 활용 될 수 있을 것으로 보이며, 향후 참깨 World Collection을 이용한 대규모 유전자원으로부터 선발된 참깨 핵심집단의 유전자형 변이와 주요 농업형질에 대한 association 분석을 통해 종실특성, 종실 의 지방산, 잎의 향, 색소 및 기능성 성분에 대한 유용 대립 유전자 대량 발굴로 참깨 molecular designed breeding 분야 에 활용될 수 있을 것으로 판단된다. 또한, 참깨 분리집단을 육성하여, 차후 유전체 분석을 통해 고밀도 유전자 연관지도 작성에 활용할 수 있으며, 분리집단의 주요 농업형질, 종실 품 질 특성, 생리활성 물질함량에 대한 특성 평가를 수행하여 목 적형질에 연관된 분자표지를 개발하여 목적형질에 대한 초기 세대 선발효율을 높일 수 있을 것으로 보인다. 고밀도 연관지 도를 활용하여 유전자 지도에 기초한 유전자 클로닝 등에 응 용될 수 있을 것이며, 참깨 또는 다른 작물과의 비교유전체 정보를 이용한 기능연구에 활용 가능할 것으로 사료된다.

적 요

국립유전자원센터에서 분양받은 참깨 모 핵심집단 277점 에 대해 PowerCore 의 휴리스틱 알고리즘을 이용하여 112점 의 소규모 핵심집단을 작성하였다.
본 연구에서 작성한 112점의 소규모 핵심집단은, 115개의 대립유전자를 가지고 있으며, 이들 대립유전자는 모 핵심집단 에서 총 158개의 유전자로부터 유래되었고, 소규모 핵심집단의 rare alleles은 모 핵심집단과 거의 100% 가깝게 표현되었다.
Structure 프로그램을 이용하여 집단 구조 분석을 수행한 결과, 소규모 핵심집단은 4개의 sub 집단과 admixture 집단 구조를 가지고 있음이 관찰되었다. Sub 집단 중 대부분의 한 국품종을 포함한 Pop-1은 유전적 다양성이 다른 3가지 sub 집단보다 낮지만, 모 핵심집단과 유사한 특징을 같은 비율을 지니고 있기 때문에 소규모 핵심집단은 최소한의 품종으로 표 현할 수 있다.

ACKNOWLEDGMENTS

이 논문은 2015년도 정부(교육부)의 재원으로 한국연구재 단의 지원을 받아 수행된 기초연구사업임(과제번호: 2014R1A 1A2057073).

Figure

Fig. 1..

Allele frequency histograms for 14 SSR markers between the core set (277acc) and sub core set (112acc) of sesame.

Fig. 2..

(Log) Likelihood of the data (n=112), as a function of K (the number of groups used to stratify the sample).

Fig. 3..

Values of ΔK, with its modal value detecting a true K of four groups (K = 4).

Fig. 4..

Model-based ancestry for each of the 112 accessions based on the 14 simple sequence repeat marker used to build the Q matrix. The numbers represent the serial number of the accessions and predefined population in the bracket.

Fig. 5..

Unrooted neighbor-joining tree (UPGMA) based on Nei’s genetic distance matrix (shared allele frequency) among 112 sesame accessions. The colour corresponds to that of model-based populations.

Fig. 6..

The geometric distribution of model-based sub-populations of 112 core accessions.

Table

Table 1..

List of the 112 accessions selected as the sub core set and their model-based clusters.

Sl. No.	Acc. No.	Country code	Country	Geographical region of origin	Inferred cluster

1	169955	AFG	Afghanistan	South Asia	Admixture
2	184426	AFG	Afghanistan	South Asia	2
3	184427	AFG	Afghanistan	South Asia	1
4	184428	AFG	Afghanistan	South Asia	Admixture
5	184429	AFG	Afghanistan	South Asia	3
6	184430	AFG	Afghanistan	South Asia	4
7	184431	AFG	Afghanistan	South Asia	Admixture
8	184432	AFG	Afghanistan	South Asia	3
9	184434	AFG	Afghanistan	South Asia	3
10	184436	AFG	Afghanistan	South Asia	3
11	169338	CHN	China	East Asia	2
12	169359	CHN	China	East Asia	2
13	169626	CHN	China	East Asia	Admixture
14	184452	CHN	China	East Asia	Admixture
15	196047	CHN	China	East Asia	2
16	196049	CHN	China	East Asia	2
17	196068	CHN	China	East Asia	2
18	184318	EGY	Egypt	North Africa	3
19	184321	EGY	Egypt	North Africa	3
20	184326	EGY	Egypt	North Africa	3
21	184519	EGY	Egypt	North Africa	Admixture
22	184520	EGY	Egypt	North Africa	3
23	184292	IND	India	South Asia	4
24	184425	IND	India	South Asia	4
25	184466	IND	India	South Asia	3
26	184472	IND	India	South Asia	4
27	184473	IND	India	South Asia	Admixture
28	184475	IND	India	South Asia	4
29	184476	IND	India	South Asia	4
30	184492	IND	India	South Asia	4
31	184600	IND	India	South Asia	Admixture
32	184718	IND	India	South Asia	Admixture
33	184726	IND	India	South Asia	1
34	196116	IND	India	South Asia	Admixture
35	184510	IRN	Iran	West Asia	3
36	184511	IRN	Iran	West Asia	3
37	184514	IRN	Iran	West Asia	Admixture
38	184515	IRN	Iran	West Asia	Admixture
39	184516	IRN	Iran	West Asia	3
40	184517	IRN	Iran	West Asia	Admixture
41	184736	IRN	Iran	West Asia	3
42	184527	JPN	Japan	East Asia	Admixture
43	184529	JPN	Japan	East Asia	Admixture
44	184532	JPN	Japan	East Asia	Admixture
45	192443	JPN	Japan	East Asia	2
46	209652	JPN	Japan	East Asia	4
47	029144	KOR	Korea	East Asia	Admixture
48	029182	KOR	Korea	East Asia	1
49	029497	KOR	Korea	East Asia	2
50	029580	KOR	Korea	East Asia	1
51	029757	KOR	Korea	East Asia	Admixture
52	029817	KOR	Korea	East Asia	1
53	029860	KOR	Korea	East Asia	1
54	029873	KOR	Korea	East Asia	1
55	030125	KOR	Korea	East Asia	1
56	102678	KOR	Korea	East Asia	Admixture
57	102975	KOR	Korea	East Asia	1
58	103159	KOR	Korea	East Asia	1
59	113273	KOR	Korea	East Asia	Admixture
60	113593	KOR	Korea	East Asia	1
61	156334	KOR	Korea	East Asia	Admixture
62	156362	KOR	Korea	East Asia	1
63	193010	KOR	Korea	East Asia	1
64	193692	KOR	Korea	East Asia	1
65	195373	KOR	Korea	East Asia	1
66	184354	MEX	Mexico	North America	Admixture
67	184356	MEX	Mexico	North America	4
68	184366	MEX	Mexico	North America	2
69	184546	MEX	Mexico	North America	Admixture
70	196091	MEX	Mexico	North America	2
71	184756	NPL	Nepal	South Asia	4
72	184757	NPL	Nepal	South Asia	4
73	200589	NPL	Nepal	South Asia	1
74	169941	PAK	Pakistan	South Asia	4
75	184569	PAK	Pakistan	South Asia	4
76	184570	PAK	Pakistan	South Asia	Admixture
77	184579	PAK	Pakistan	South Asia	Admixture
78	184586	PAK	Pakistan	South Asia	Admixture
79	184305	PHL	Philippines	South Asia	2
80	184308	PHL	Philippines	South Asia	4
81	184311	PHL	Philippines	South Asia	2
82	184313	PHL	Philippines	South Asia	Admixture
83	184314	PHL	Philippines	South Asia	2
84	184317	PHL	Philippines	South Asia	3
85	169949	RUS	Russia	East Europe	1
86	184651	RUS	Russia	East Europe	Admixture
87	184653	RUS	Russia	East Europe	2
88	184654	RUS	Russia	East Europe	1
89	184747	RUS	Russia	East Europe	Admixture
90	184753	RUS	Russia	East Europe	4
91	169244	TUR	Turkey	South Europe	3
92	169248	TUR	Turkey	South Europe	3
93	169397	TUR	Turkey	South Europe	3
94	169404	TUR	Turkey	South Europe	3
95	169409	TUR	Turkey	South Europe	Admixture
96	184593	TUR	Turkey	South Europe	3
97	184602	TUR	Turkey	South Europe	4
98	184615	TUR	Turkey	South Europe	Admixture
99	184623	TUR	Turkey	South Europe	Admixture
100	184636	TUR	Turkey	South Europe	2
101	184642	TUR	Turkey	South Europe	2
102	184643	TUR	Turkey	South Europe	3
103	184644	TUR	Turkey	South Europe	3
104	184727	TUR	Turkey	South Europe	Admixture
105	184269	USA	America	North America	Admixture
106	184272	USA	America	North America	2
107	184399	USA	America	North America	Admixture
108	184686	USA	America	North America	2
109	192412	USA	America	North America	1
110	184409	VEN	Venezuela	North America	2
111	184715	VEN	Venezuela	North America	2
112	184734	VEN	Venezuela	North America	Admixture

Table 2..

Comparing the allele richness and genetic diversity between the core set (277 accessions) and sub core set (112 accessions) of sesame.

MF : major allele frequency, SA : specific allele, RA : rare allele, GD : gene diversity, NA : number of alleles, PIC : polymorphic information content, NG : genotype No

	Sesame core set (277 accessions)	Sesame sub core set (112 accessions)
Loci	MF	SA	RA	GD	NA	PIC	NG	MF	SA	RA	GD	NA	PIC	NG
GBssr-sa-5	0.39	4	12	0.74	16	0.71	31	0.36	4	11	0.78	15	0.75	30
GBssr-sa-8	0.49	1	9	0.69	13	0.65	26	0.39	2	10	0.77	13	0.74	26
GBssr-sa-34	0.71	0	1	0.42	3	0.34	5	0.75	0	1	0.38	3	0.32	5
GBssr-sa-40	0.46	1	3	0.70	7	0.65	13	0.42	1	4	0.71	7	0.66	13
GBssr-sa-58	0.69	3	8	0.47	10	0.42	14	0.66	5	8	0.51	10	0.47	13
GBssr-sa-72	0.70	2	5	0.49	8	0.46	16	0.61	2	4	0.59	8	0.55	16
GBssr-sa-83	0.78	3	5	0.38	8	0.36	11	0.74	3	5	0.43	8	0.41	11
GBssr-sa-108	0.23	3	12	0.86	18	0.84	40	0.21	4	12	0.87	18	0.86	40
GBssr-sa-123	0.38	7	14	0.76	18	0.73	30	0.34	6	13	0.80	17	0.78	29
GBssr-sa-135	0.50	1	5	0.61	8	0.54	13	0.43	4	5	0.65	8	0.58	13
GBssr-sa-164	0.60	1	1	0.48	3	0.37	4	0.59	1	1	0.49	3	0.38	4
GBssr-sa-178	0.53	3	4	0.51	6	0.39	7	0.53	3	4	0.53	6	0.42	7
GBssr-sa-182	0.50	6	22	0.72	25	0.71	49	0.42	7	20	0.80	24	0.79	48
GBssr-sa-184	0.29	3	11	0.82	15	0.79	36	0.26	3	11	0.85	15	0.84	36
Total	7.23	38	112	8.64	158	7.96	295	6.71	45	109	9.17	155	8.56	291
Mean	0.52	2.71	8	0.62	11.29	0.57	21.07	0.48	3.21	7.79	0.65	11.07	0.61	20.79

Table 3..

Genetic diversity of model-based populations for the selected entries of sub core set.

	Sample size	NA	GD	PIC
Overall	112	11.071	0.655	0.611
Pop-1	19	4.643	0.483	0.445
Pop-2	20	4.571	0.564	0.514
Pop-3	20	5.571	0.522	0.489
Pop-4	16	5.214	0.529	0.496

Table 4..

Genetic diversity contributed by country among primary core set and sub core set.

No	Country	Sesame core set (277 acc.)	Sesame sub core set (112 acc.)
1	Afghanistan	14	5.05	10	8.93
2	China	27	9.75	7	6.25
3	Egypt	15	5.42	5	4.46
4	India	19	6.86	12	10.71
5	Iran	10	3.61	7	6.25
6	Japan	13	4.69	5	4.46
7	Korea	47	16.97	19	16.96
8	Mexico	23	8.30	5	4.46
9	Nepal	12	4.33	3	2.68
10	Pakistan	12	4.33	5	4.46
11	Philippines	11	3.97	6	5.36
12	Russia	15	5.42	6	5.36
13	Turkey	21	7.58	14	12.50
14	America (USA)	25	9.03	5	4.46
15	Venezuela	13	4.69	3	2.68

		277	100	112	100

Reference

Aye A K , Moe K T , Chung J W , Baek H J , Park Y J (2013) Genetic diversity and population structure of the selected core set in Amaranthus using SSR markers , Plant Breeding, Vol.132 ; pp.165-173
Anilakumar K R , Pal A , Khanumand F , Bawa A S (2010) Nutritional, medicinal and industrial uses of sesame (Sesamum indicum L) seeds-an overview , Agriculturae Conspectus Scientificus (ACS), Vol.75 ; pp.159-168
Bataillon T M , David J L , Schoen D J (1996) Neutral genetic markers and conservation genetics: simulated germplasm collections , Genetics, Vol.144 ; pp.409-417
Bedigian D (2010) Characterization of sesame (Sesamum indicum L.) germplasm: a critique , Genet Resour Crop Evol, Vol.57 ; pp.641-647
Bedigian D (2003) Evolution of sesame revisited: domestication, diversity and prospects , Genet Resour Crop Evol, Vol.50 ; pp.779-787
Bedigian D , Harlan J R (1986) Evidence for cultivation of sesame in the ancient world , Econ Bot, Vol.40 ; pp.137-154
Bedigian D , Smyth C A , Harlan J R (1986) Patterns of morphological variation in sesamum indicum , Econ Bot, Vol.40 ; pp.353-365
Bedigian D , Seigler D S , Harlan J R (1985) Sesamin, sesamolin and the origin of sesame , Biochem Syst Ecol, Vol.13 ; pp.133-139
Botstein D , White R L , Skolnick M , Davis R W (1980) Construction of a genetic linkage map in man using restriction fragment length polymorphisms , Am J Hum Genet, Vol.32 ; pp.314-331
Soybean genetics newsletter-United States Agricultural Research Service (USA)Brown A , Grace J , Speer S (1987) Designation of a core collection of perennial Glycine ,
Cho Y I , Chung J W , Lee G A , Ma K H , Dixit A , Gwag J G , Park Y J (2010) Development and characterization of twenty-five new polymorphic microsatellite markers in proso millet (Panicum miliaceum L) , Genes & Genomics, Vol.32 ; pp.267-273
Diwan N , McIntosh M , Bauchan G (1995) Methods of developing a core collection of annual Medicago species , Theor Appl Genet, Vol.90 ; pp.755-761
Dixit A , Jin M H , Chung J W , Yu J W , Chung H K , Ma K H , Park Y J , Cho E G (2005) Development of polymorphic microsatellite markers in sesame (Sesamum indicum L) , Molecular Ecology Notes, Vol.5 ; pp.736-738
Evanno G , Regnaut S , Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study , Mol Ecol, Vol.14 ; pp.2611-2620
Falush D , Stephens M , Pritchard J K (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies , Genetics, Vol.164 ; pp.1567-1587
Frankel O (1984) Genetic perspectives of germplasm conservation , Genetic manipulation: impact on man and society, Cambridge University Press, ; pp.161-170
Frankel O , Brown A Holden JHW , Williams JT (1984) Plant genetic resources today: a critical appraisal , Crop genetic resources: conservation & evaluation,
Fuller D Q (2003) Further evidence on the prehistory of sesame , Asian Agri-History, Vol.7 ; pp.127-137
Hong W J , Aye A K , Park Y J (2013) Cultivar Identification of Chrysanthemum (Dendranthema grandiflorum. Ramat.) using SSR Markers , Korean J. Intl. Agri, Vol.25 (4) ; pp.385-394
IPGRIHintum T V , Hodgkin T , Brown A , Hintum T V , Morales E (1995) Hierarchical approaches to the analysis of the genetic diversity of crop plants , Core collections of plant genetic resources, Wiley-Sayce Publication,
Holden J (1984) The second ten years , Crop genetic resources: conservation and evaluation, ; pp.277-285
Jarvis D I , Hodgkin T (1999) Wild relatives and crop cultivars: detecting natural introgression and farmer selection of new genetic combinations in agroecosystems , Mol Ecol, Vol.8 ; pp.S159-S173
Joshi A (1961) Sesame—A Monograph , Hyderabad: Indian Central Oilseeds Committee,
Jin M H , Lee J R , Yu J W , Chung J W , Ma K H , Dixit A , Kim D H , Paek N C , Cho E G , Park Y J (2009) Development and characterization of microsatellite markers for utilization in diversity analysis of sesame (Sesamum indicum L) germplasm collection , Konkuk Journal of Life Science and Environment, Vol.31 ; pp.1-10
Joshi A B , Narayananj E , Vasudeva R (1961) Sesamum , Sesamum,
Jyothi B (2009) Molecular mapping and characterization of yield QTL and tagging of wilt resistance gene (s) in sesame , (Sesamum indicum L),
Kim K W , Chung H K , Cho G T , Ma K H , Chandrabalan D , Gwag J G , Kim T S , Cho E G , Park Y J (2007) Power- Core: a program applying the advanced M strategy with a heuristic search for establishing core sets , Bioinformatics, Vol.23 ; pp.2155-2162
Kumar A , Hiremath S (2008) Cytological analysis of interspecific hybrid between Sesamum indicum L× S. X S.orientaleL.var.malabaricum , Karnataka J Agric Sci, Vol.21 ; pp.498-502
Lu F H , Chung J W , Cho Y I , Kim T S , Park Y J (2009) Analysis of Genetic Diversity and Population Structure of Rice Cultivars from the Americas Using SSR Markers , Korean J. Intl. Agri, Vol.21 (4) ; pp.268-275
Lee J K , Hong G Y , Dixit A , Chung J W , Ma K H , Lee J K , Kang H Y , Cho Y H , Gwag J G , Park Y J (2008) Characterization of microsatellite loci developed for Amaranthus hypochondriacus and their cross-amplifications in wild species , Conserv Genet, Vol.9 ; pp.243-246
Liu K , Muse S V (2005) PowerMarker: an integrated analysis environment for genetic marker analysis , Bioinformatics, Vol.21 ; pp.2128- 2129
Mabberley D J (1997) The Plant-book: A Portable Dictionary of the Vascular Plants , Cambridge university press, ; pp.658
Ma K H , Kim K H , Dixit A , Chung I M , Gwag J G , Kim T S , Park Y J (2010) Assessment of genetic diversity and relationships among Coix lacryma-jobi accessions using microsatellite markers , Biol Plant, Vol.54 ; pp.272-278
Moazzami A A , Kamal-Eldin A (2006) Sesame seed is a rich source of dietary lignans , J Am Oil Chem Soc, Vol.83 ; pp.719-723
Moe K T , Gwag J G , Park Y J (2012) Efficiency of Power- Core in core set development using amplified fragment length polymorphic markers in mungbean , Plant breeding, Vol.131 ; pp.110-117
Namiki M (1995) The chemistry and physiological functions of sesame , Food Rev Int, Vol.11 ; pp.281-329
Nanthakumar G , Singh K , Vaidyanathan P (2000) Relationships between cultivated Sesame (Sesamum sp.) and the wild relatives based on morphological characters, isozymes and RAPD markers , Journal of Genetics & Breeding, Vol.54 ; pp.5-12
Nayar N , Mehra K (1970) Sesame: Its uses, botany, cytogenetics, and origin , Econ Bot, Vol.24 ; pp.20-31
Pathirana R (1994) Natural Cross?Pollination in Sesame (Sesamum indicum L) , Plant breeding, Vol.112 ; pp.167-170
Pritchard J K , Stephens M , Donnelly P (2000) Inference of population structure using multilocus genotype data , Genetics, Vol.155 ; pp.945-959
Schoen D J , Brown A H (1993) Conservation of allelic richness in wild crop relatives is aided by assessment of genetic markers , Proc Natl Acad Sci U S A, Vol.90 ; pp.10623-10627
Spandana B , Reddy V P , Prasanna G J , Anuradha G , Sivaramakrishnan S (2012) Development and characterization of microsatellite markers (SSR) in Sesamum (Sesamum indicum L) species , Appl Biochem Biotechnol, Vol.168 ; pp.1594-1607
Tamura K , Dudley J , Nei M , Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 40 , Mol Biol Evol, Vol.24 ; pp.1596-1599
Van Rheenen H (1980) Aspects of natural cross-fertilization in sesame (Sesamum indicum L) , Trop Agric, Vol.57 ; pp.53-59
Weiss E (1983) Sesame Weiss, EA Oilseed crops, Longman, ; pp.282-340
Zhang H , Miao H , Wang L , Qu L , Liu H , Wang Q , Yue M (2013) Genome sequencing of the important oilseed crop Sesamum indicum L , Genome Biol, Vol.14 ; pp.401
Zhao W G , Cho T , Ma K H , Chung J W , Gwag J G , Park Y J (2010) Development of an allele-mining set in rice using a heuristic algorithm and SSR genotype data with least redundancy for the post-genomic era , Mol Breed, Vol.26 ; pp.639-651
Zhao W , Chung J W , Ma K H , Kim T S , Kim S M , Shin D I , Kim C H , Koo H M , Park Y J (2009) Analysis of genetic diversity and population structure of rice cultivars from Korea, China and Japan using SSR markers , Genes & Genomics, Vol.31 ; pp.283- 292
Zhao W , Chung J W , Lee G A , Ma K H , Kim H H , Kim K T , Chung I M , Lee J K , Kim N S , Kim S M , Park Y J (2011) Molecular genetic diversity and population structure of a selected core set in garlic and its relatives using novel SSR markers , Plant Breeding, Vol.130 ; pp.46-54