Haplogroup R1a

Haplogroup R1a
Possible time of origin	22,000 YBP [1] to 25,000[2] years ago
Possible place of origin	Eurasia (see text).
Ancestor	Haplogroup R1
Descendants	Haplogroup R1a-Z282 (Europe), R1a-Z93 (Asia)
Defining mutations	R1a: L62, L63, L120, M420, M449, M511, M513; R1a1a: M17, M198, M512, M514, M515, L168, L449, L457, L566
Highest frequencies	See List of R1a frequency by population

Haplogroup R1a, or haplogroup R-M420, is a human Y-chromosome DNA haplogroup which is distributed in a large region in Eurasia, extending from Scandinavia and Central Europe to southern Siberia and South Asia.[3][2]

While R1a originated ca. 22,000[1] to 25,000[2] years ago, its subclade M417 (R1a1a1) diversified ca. 5,800 years ago.[4] The place of origin of the subclade plays a role in the debate about the origins of Proto-Indo-Europeans.

The SNP mutation R-M420 was discovered after R-M17 (R1a1a), which resulted in a reorganization of the lineage in particular establishing a new paragroup (designated R-M420*) for the relatively rare lineages which are not in the R-SRY10831.2 (R1a1) branch leading to R-M17.

Origins

R1a origins

Karafet et al. (2014) "rapid diversification process of K-M526 likely occurred in Southeast Asia, with subsequent westward expansions of the ancestors of haplogroups R and Q."[5]

The split of R1a (M420) is computed to ca. 22,000[1] or 25,000[2] years ago, which is the time of the last glacial maximum. A 2014 study by Peter A. Underhill et al., using 16,244 individuals from over 126 populations from across Eurasia, concluded that there was compelling evidence that "the initial episodes of haplogroup R1a diversification likely occurred in the vicinity of present-day Iran."[2]

Diversification of R1a1a1 (M417) and ancient migrations

R1a origins (Underhill 2019;[3] R1a1a origins (Pamjav et al. 2012); possible migration R1a to Baltic coast; and R1a1a oldest expansion and highest frequency (Underhill et al. 2014)

According to Underhill et al. (2014), the downstream R1a-M417 subclade diversified into Z282 and Z93 circa 5,800 years ago.[4][note 1] Even though R1a occurs as a Y-chromosome haplogroup among various languages such as Slavic and Indo-Iranian, the question of the origins of R1a1a is relevant to the ongoing debate concerning the urheimat of the Proto-Indo-European people, and may also be relevant to the origins of the Indus Valley Civilization. R1a shows a strong correlation with Indo-European languages of Southern and Western Asia and Central and Eastern Europe,[7][3] being most prevalent in Eastern Europe, West Asia, and South Asia. In Europe, Z282 is prevalent particularly while in Asia Z93 dominates. The connection between Y-DNA R-M17 and the spread of Indo-European languages was first noted by T. Zerjal and colleagues in 1999.[8]

Steppe origins

Proposed steppe dispersal of R1a1a

Semino et al. (2000) proposed Ukrainian origins, and a postglacial spread of the R1a1 gene during the Late Glacial Maximum, subsequently magnified by the expansion of the Kurgan culture into Europe and eastward.[9] Spencer Wells proposes Central Asian origins, suggesting that the distribution and age of R1a1 points to an ancient migration corresponding to the spread by the Kurgan people in their expansion from the Eurasian steppe.[10] According to Pamjav et al. (2012), R1a1a diversified in the Eurasian Steppes or the Middle East and Caucasus region:

Inner and Central Asia is an overlap zone for the R1a1-Z280 and R1a1-Z93 lineages [which] implies that an early differentiation zone of R1a1-M198 conceivably occurred somewhere within the Eurasian Steppes or the Middle East and Caucasus region as they lie between South Asia and Central- and Eastern Europe."[11]

Three genetic studies in 2015 gave support to the Kurgan theory of Gimbutas regarding the Indo-European Urheimat. According to those studies, haplogroups R1b and R1a, now the most common in Europe (R1a is also common in South Asia) would have expanded from the Russian steppes, along with the Indo-European languages; they also detected an autosomal component present in modern Europeans which was not present in Neolithic Europeans, which would have been introduced with paternal lineages R1b and R1a, as well as Indo-European languages.[12][13][14]

Source of R1a1a1 in Corded Ware culture

European middle-Neolithic period. Comb Ware culture ca. 4200 BCE – ca. 2000 BCE

Corded Ware culture (ca. 2900 BCE – ca. 2350 BCE

David Anthony considers the Yamnaya culture to be the Indo-European Urheimat.[15][16] According to Haak et al. (2015), a massive migration from the Yamnaya culture northwards took place ca. 2,500 BCE, accounting for 75% of the genetic ancestry of the Corded Ware culture, noting that R1a and R1b may have "spread into Europe from the East after 3,000 BCE".[17] Yet, all their seven Yamnaya samples belonged to the R1b-M269 subclade,[17] but no R1a1a has been found in their Yamnaya samples. This raises the question where the R1a1a in the Corded Ware culture came from, if it was not from the Yamnaya culture.[18]

Semenov & Bulat (2016) do argue for such an origin of R1a1a in the Corded Ware culture, noting that several publications point to the presence of R1a1 in the Comb Ware culture.[19][note 2]

Haak et al. (2015) found that part of the Yamnaya ancestry derived from the Middle East and that neolithic techniques probably arrived at the Yamnaya culture from the Balkans.[note 3] The Rossen culture (4,600–4,300 BC), which was situated on Germany and predates the Corded Ware culture, an old subclade of R1a, namely L664, can still be found.[note 4]

Transcaucasia & West Asian origins and possible influence on Indus Valley Civilization

Part of the South Asian genetic ancestry derives from west Eurasian populations, and some researchers have implied that Z93 may have come to India via Iran[21] and expanded there during the Indus Valley Civilization.[2][22]

Mascarenhas et al. (2015) proposed that the roots of Z93 lie in West Asia, and proposed that "Z93 and L342.2 expanded in a southeasterly direction from Transcaucasia into South Asia,"[21] noting that such an expansion is compatible with "the archeological records of eastward expansion of West Asian populations in the 4th millennium BCE culminating in the so-called Kura-Araxes migrations in the post-Uruk IV period."[21] Yet, Lazaridis noted that sample I1635 of Lazaridis et al. (2016), their Armenian Kura-Araxes sample, carried Y-haplogroup R1b1-M415(xM269)[note 5] (also called R1b1a1b-CTS3187).[23]

According to Underhill et al. (2014) the diversification of Z93 and the "early urbanization within the Indus Valley [...] occurred at [5,600 years ago] and the geographic distribution of R1a-M780 (Figure 3d[note 6]) may reflect this."[2][note 7] Poznik et al. (2016) note that 'striking expansions' occurred within R1a-Z93 at ~4,500–4,000 years ago, which "predates by a few centuries the collapse of the Indus Valley Civilisation."[22][note 8]

However, according to Narasimhan et al. (2018), steppe pastoralists are a likely source for R1a in India.[25][note 9]

Proposed South Asian origins

Kivisild et al. (2003) have proposed either South or West Asia,[26][note 10] while Mirabal et al. (2009) see support for both South and Central Asia.[7]

South Asian populations have the highest STR diversity within R1a1a,[27][28][7][3][1][29] and subsequent older TMRCA datings,[note 11] and R1a1a is present among both higher (Brahmin) castes and lower castes, although the presence is higher among Brahmin castes.[1][29] From these findings some researchers have concluded that R1a1a originated in South Asia,[28][1][note 12][note 13] excluding a substantial genetic influx from Indo-European migrants.[28][27][3]

However, this diversity, and the subsequent older TMRCA-datings, can also be explained by the historically high population numbers, which increases the likelihood of diversification and microsatellite variation.[32][33] According to Sengupta et al. (2006), "[R1a1 and R2] could have actually arrived in southern India from a southwestern Asian source region multiple times."[27][note 14] Silva et al. (2017) noted that R1a in South Asia most "likely spread from a single Central Asian source pool, there do seem to be at least three and probably more R1a founder clades within the Subcontinent, consistent with multiple waves of arrival."[33] According to Martin P. Richards, co-author of Silva et al. (2017), "[the prevalence of R1a in India was] very powerful evidence for a substantial Bronze Age migration from central Asia that most likely brought Indo-European speakers to India."[32][34]

Phylogeny

The R1a family tree now has three major levels of branching, with the largest number of defined subclades within the dominant and best known branch, R1a1a (which will be found with various names such as "R1a1" in relatively recent but not the latest literature).

Topology

The topology of R1a is as follows (codes [in brackets] non-isogg codes):[6][35][36][2][37] Tatiana et al. (2014) "rapid diversification process of K-M526 likely occurred in Southeast Asia, with subsequent westward expansions of the ancestors of haplogroups R and Q."[5]

P P295/PF5866/S8 (also known as K2b2).
R (R-M207)[36][6]
- R*
- R1 (R-M173)
  - R1*[36]
  - R1a (M420)[36] (Eastern Europe, Asia)[2]
    - R1a*[6]
    - R1a1[36] (M459/PF6235,[36] SRY1532.2/SRY10831.2[36])
      - R1a1 (M459)[36][6]
      - R1a1a (M17, M198)[36]
        R1a1a1 (M417, page7)[36]
        R1a1a1a (CTS7083/L664/S298)[36]
        
        R1a1a1b (S224/Z645, S441/Z647)[36]
        R1a1a1b1 (PF6217/S339/Z283)[36]
        R1a1a1b1a (Z282)[36] [R1a1a1a*] (Z282) (Europe)[38]
        R1a1a1b1a1[36] [The old topological code is R1a1a1b*，which is outdated and might lead to some confusion.][38] (M458)[36][38] [R1a1a1g] (M458)[37]
        [R1a1a1g*][37]
        
        [R1a1a1g1] (M334)[37]
        
        R1a1a1b1a1a (L260/S222)[36] [R1a1a1g2][37]
        
        R1a1a1b1a2[36] (S466/Z280, S204/Z91)[36]
        R1a1a1b1a2a[36]
        
        R1a1a1b1a2b (CTS1211)[36] [R1a1a1c*] (M558)[38] [R-CTS1211] (V2803/CTS3607/S3363/M558, CTS1211/S3357, Y34/FGC36457)[6]
        R1a1a1b1a2b3* (M417+, Z645+, Z283+, Z282+, Z280+, CTS1211+, CTS3402, Y33+, CTS3318+, Y2613+) (Gwozdz's Cluster K)[35]
        
        R1a1a1b1a2b3a (L365/S468)[36]
        
        R1a1a1b1a3 (Z284)[36] [R1a1a1a1] (Z284)[38]
        
        R1a1a1b2 (F992/S202/Z93)[36] [R1a1a2*] (Z93, M746)(Asia)[38]
        R1a1a1b2a (F3105/S340/Z94, L342.2/S278.2)[36] [R1a1b2a*] (Z95)[38] R-Z94 (Z94/F3105/S340, Z95/F3568)[6]
        R-Z2124 (Z2121/S3410, Z2124)[6]
        [R1a1b2a*] (Z2125)[38]
        [R1a1b2a*] (M434)[38] [R1a1a1f] (M434)[37]
        
        [R1a1b2a*] (M204)[38]
        
        [R1a1b2a1*] (M560)[38]
        
        [R1a1b2a2*] (M780, L657)[38] (India)[2]
        
        [R1a1b2a3*] (Z2122, M582)[38]
        
        [R1a1a1c] (M64.2, M87, M204)[37]
        
        [R1a1a1d] (P98)[37]
        [R1a1a1d2a][39]
        
        [R1a1a1e] (PK5)[37]
  - R1b (M343) (Western Europe)
- R2 (India)

Haplogroup R

Haplogroup R phylogeny

R (M207)

R1 (M173)

M420	R1a

M343	R1b

M173(xM420, M343)	R1*

R2 (M479)

R* M207(xM173, M479)

R-M173 (R1)

R1a is distinguished by several unique markers, including the M420 mutation. It is a subclade of Haplogroup R-M173 (previously called R1). R1a has the sister-subclades Haplogroup R1b-M343, and the paragroup R-M173*.

R-M420 (R1a)

R-M420, defined by the mutation M420, has two branches: R-SRY1532.2, defined by the mutation SRY1532.2, which makes up the vast majority; and R-M420*, the paragroup, defined as M420 positive but SRY1532.2 negative. (In the 2002 scheme, this SRY1532.2 negative minority was one part of the relatively rare group classified as the paragroup R1*.) Mutations understood to be equivalent to M420 include M449, M511, M513, L62, and L63.[3][40]

Only isolated samples of the new paragroup R-M420* were found by Underhill 2009, mostly in the Middle East and Caucasus: 1/121 Omanis, 2/150 Iranians, 1/164 in the United Arab Emirates, and 3/612 in Turkey. Testing of 7224 more males in 73 other Eurasian populations showed no sign of this category.[3]

R-SRY1532.2 (R1a1)

R1a1 is defined by SRY1532.2 or SRY10831.2 (understood to always include SRY10831.2, M448, L122, M459, and M516[3][41]). This family of lineages is dominated by M17 and M198. In contrast, paragroup R-SRY1532.2* lacks either the M17 or M198 markers.

The R-SRY1532.2* paragroup is apparently less rare than R1*, but still relatively unusual, though it has been tested in more than one survey. Underhill et al. (2009) reported 1/51 in Norway, 3/305 in Sweden, 1/57 Greek Macedonians, 1/150 Iranians, 2/734 ethnic Armenians, and 1/141 Kabardians.[3] Sahoo et al. (2006) reported R-SRY1532.2* for 1/15 Himachal Pradesh Rajput samples.[28]

R-M17/M198 (R1a1a)

The following SNPs are associated with R1a1a:

SNP	Mutation	Y-position (NCBI36)	Y-position (GRCh37)	RefSNP ID
M17	INS G	20192556	21733168	rs3908
M198	C->T	13540146	15030752	rs2020857
M512	C->T	14824547	16315153	rs17222146
M514	C->T	17884688	19375294	rs17315926
M515	T->A	12564623	14054623	rs17221601
L168	A->G	14711571	16202177	-
L449	C->T	21376144	22966756	-
L457	G->A	14946266	16436872	rs113195541
L566	C->T	-	-	-

R-M417 (R1a1a1)

R1a1a1 (R-M417) is the most widely found subclade, in two variations which are found respectively in Europe (R1a1a1b1 (R-Z282) ([R1a1a1a*] (R-Z282) (Underhill 2014)[2]) and Central and South Asia (R1a1a1b2 (R-Z93) ([R1a1a2*] (R-Z93) Underhill 2014)[2]).

R-Z282 (R1a1a1b1a) (Eastern Europe)

This large subclade appears to encompass most of the R1a1a found in Europe.[11]

R1a1a1b1a [R1a1a1a* (Underhill (2014))] (R-Z282*) occurs in northern Ukraine, Belarus, and Russia at a frequency of ~20%.[2]
R1a1a1b1a3 [R1a1a1a1 (Underhill (2014))] (R-Z284) occurs in Northwest Europe and peaks at ~20% in Norway.[2]
R1a1a1c (M64.2, M87, M204) is apparently rare: it was found in 1 of 117 males typed in southern Iran.[42]

R-M458 (R1a1a1b1a1)

Frequency distribution of R-M458

R-M458 is a mainly Slavic SNP, characterized by its own mutation, and was first called cluster N. Underhill et al. (2009) found it to be present in modern European populations roughly between the Rhine catchment and the Ural Mountains and traced it to "a founder effect that [...] falls into the early Holocene period, 7.9±2.6 KYA."[3] M458 was found in one skeleton from a 14th-century grave field in Usedom, Mecklenburg-Vorpommern, Germany.[43] The paper by Underhill et al. (2009) also reports a surprisingly high frequency of M458 in some Northern Caucasian populations (for example 27.5% among Karachays and 23.5% among Balkars, 7.8% among Karanogays and 3.4% among Abazas).

R-L260 (R1a1a1b1a1a) (Gwozdz's cluster P)

R1a1a1b1a1a (R-L260), commonly referred to as West Slavic or Polish, is a subclade of the larger parent group R-M458, and was first identified as an STR cluster by Pawlowski et al. 2002 and then by Gwozdz 2009. Thus, R-L260 was what Gwozdz 2009 called cluster "P." In 2010 it was verified to be a haplogroup identified by its own mutation (SNP).[44] It apparently accounts for about 8% of Polish men, making it the most common subclade in Poland. Outside of Poland it is less common. [45] In addition to Poland, it is mainly found in the Czech Republic and Slovakia, and is considered "clearly West Slavic." The founding ancestor of R-L260 is estimated to have lived between 2000 and 3000 years ago, i.e. during the Iron Age, with significant population expansion less than 1,500 years ago.[46]

R-M334

R-M334 ([R1a1a1g1],[37] a subclade of [R1a1a1g] (M458)[37] c.q. R1a1a1b1a1 (M458)[36]) was found by Underhill et al. (2009) only in one Estonian man and may define a very recently founded and small clade.[3]

R1a1a1b1a2 (S466/Z280, S204/Z91)

R1a1a1b1a2b3* (Gwozdz's Cluster K)

R1a1a1b1a2b3* (M417+, Z645+, Z283+, Z282+, Z280+, CTS1211+, CTS3402, Y33+, CTS3318+, Y2613+) (Gwozdz's Cluster K)[35] is a STR based group that is R-M17(xM458). This cluster is common in Poland but not exclusive to Poland.[46]

R1a1a1b1a2b3a (R-L365)

R1a1a1b1a2b3a (R-L365)[36] was early called Cluster G.

R1a1a1b2 (R-Z93) (Asia)

**Relative frequency of R-M434 to R-M17**
Region	People	N	R-M17		R-M434
Region	People	N	Number	Freq. (%)	Number	Freq. (%)
Pakistan	Baloch	60	9	15%	5	8%
Pakistan	Makrani	60	15	25%	4	7%
Middle East	Oman	121	11	9%	3	2.5%
Pakistan	Sindhi	134	65	49%	2	1.5%
Table only shows positive sets from N = 3667 derived from 60 Eurasian populations sample.[3]

This large subclade appears to encompass most of the R1a1a found in Asia.[11]

R-Z93* or R1a1a1b2* (R1a1a2* in Underhill (2014)) is most common (>30%) in the South Siberian Altai region of Russia, cropping up in Kyrgyzstan (6%) and in all Iranian populations (1-8%).[2]
R-Z2125 occurs at highest frequencies in Kyrgyzstan and in Afghan Pashtuns (>40%). At a frequency of >10%, it is also observed in other Afghan ethnic groups and in some populations in the Caucasus and Iran.[2]
- R-M434 is a subclade of Z2125. It was detected in 14 people (out of 3667 people tested), all in a restricted geographical range from Pakistan to Oman. This likely reflects a recent mutation event in Pakistan.[3]
R-M560 is very rare and was only observed in four samples: two Burushaski speakers (north Pakistan), one Hazara (Afghanistan), and one Iranian Azerbaijani.[2]
R-M780 occurs at high frequency in South Asia: India, Pakistan, Afghanistan, and the Himalayas. The group also occurs at >3% in some Iranian populations and is present at >30% in Roma from Croatia and Hungary.[2]

Geographic distribution of R1a1a

R1a among other European haplogroups

R1a

Distribution of R1a (purple) and R1b (red).

Historical

In Mesolithic Europe, R1a is characteristic of Eastern Hunter-Gatherers (EHGs).[47] A male EHG of the Veretye culture buried near Lake Lacha in Arkhangelsk Oblast, Russia ca. 10,700 BC was found to be a carrier of the paternal haplogroup R1a5-YP1301 and the maternal haplogroup U4a.[48][49][47] A Mesolithic male from Karelia ca. 8,800 BC to 7950 BC has been found to be carrying haplogroup R1a.[50] A Mesolithic male buried at Deriivka ca. 7000 BC to 6700 BC carried the paternal haplogroup R1a and the maternal U5a2a.[14] Another male from Karelia from ca. 5,500 to 5,000 BC, who was considered an EHG, carried haplogroup R1a.[12] A male from the Comb Ceramic culture in Kudruküla ca. 5,900 BC to 3,800 BC has been determined to be a carrier of R1a and the maternal U2e1.[51] Mathieson et al. (2015) found the paternal R1a-Z93[14] - the earliest sample of this clade ever found.[52] - at Alexandria, Ukraine ca. 4000 BC, Sredny Stog culture.[52] R1a has been found in the Corded Ware culture,[53][54] in which it is predomiant.[55] Examined males of the Bronze Age Fatyanovo culture belong entirely to R1a, specifically subclade R1a-Z93.[47][48][56]

Haplogroup R1a has later been found in ancient fossils associated with the Urnfield culture;[57] as well as the burial of the remains of the Sintashta,[13] Andronovo,[58] the Pazyryk,[59] Tagar,[58] Tashtyk,[58] and Srubnaya cultures, the inhabitants of ancient Tanais,[60] in the Tarim mummies,[61] and the aristocracy Xiongnu.[62] The skeletal remains of a father and his two sons, from an archaeological site discovered in 2005 near Eulau (in Saxony-Anhalt, Germany) and dated to about 2600 BCE, tested positive for the Y-SNP marker SRY10831.2. The Ysearch number for the Eulau remains is 2C46S. The ancestral clade was thus present in Europe at least 4600 years ago, in association with one site of the widespread Corded Ware culture.[53]

Europe

In Europe, the R1a1 sub-clade is found at highest levels among peoples of Central and Eastern European descent, with results ranging from 35-65% among Czechs, Hungarians, Poles, Slovaks, western Ukrainians (particularly Rusyns), Belarusians, Moldovans, and Russians.[63][64][9] In the Baltics, R1a1a frequencies decrease from Lithuania (45%) to Estonia (around 30%).[65][66][67][9][68]

There is a significant presence in peoples of Scandinavian descent, with highest levels in Norway and Iceland, where between 20 and 30% of men are in R1a1a.[69][70] Vikings and Normans may have also carried the R1a1a lineage westward; accounting for at least part of the small presence in the British Isles.[71][72] In East Germany, where Haplogroup R1a1a reaches a peak frequency in Rostock at a percentage of 31.3%, it averages between 20 and 30%.[73]

In Southern Europe R1a1a is not common, but significant levels have been found in pockets, such as in the Pas Valley in Northern Spain, areas of Venice, and Calabria in Italy.[74] The Balkans shows lower frequencies, and significant variation between areas, for example more than 30% in Slovenia, Croatia and Greek Macedonia, but less than 10% in Albania, Kosovo and parts of Greece on south from Olympus gorge.[75][67][9]

R1a is virtually composed only of the Z284 subclade in Scandinavia, which is only found in a single sample of a Slovenian in Eastern Europe, where the main subclade is Z282 (Z280 and M458) and there is a negligible representation of Z93 in each region other than Turkey.[2] West Slavs and Hungarians are characterized by a high frequency of the subclade M458 and a low Z92, a subclade of Z280. Hundreds of Slovenian samples and Czechs lack the Z92 subclade of Z280, while Poles, Slovaks, Croats and Hungarians only show a very low frequency of Z92.[2] The Balts, East Slavs, Serbs, Macedonians, Bulgarians and Romanians demonstrate a ratio Z280>M458 and a high, up to a prevailing share of Z92.[2] Balts and East Slavs have the same subclades and similar frequencies in a more detailed phylogeny of the subclades.[76][77] The Russian geneticist Oleg Balanovsky speculated that there is a predominance of the assimilated pre-Slavic substrate in the genetics of East and West Slavic populations, according to him the common genetic structure which contrasts East Slavs and Balts from other populations may suggest the explanation that the pre-Slavic substrate of the East Slavs consisted most significantly of Baltic-speakers, which at one point predated the Slavs in the cultures of the Eurasian steppe according to archaeological and toponymic references.[note 15]

Asia

Central Asia

Zerjal et al. (2002) found R1a1a in 64% of a sample of the Tajiks of Tajikistan and 63% of a sample of the Kyrgyz of Kyrgyzstan.[78]

Haber et al. (2012) found R1a1a-M17(xM458) in 26.0% (53/204) of a set of samples from Afghanistan, including 60% (3/5) of a sample of Nuristanis, 51.0% (25/49) of a sample of Pashtuns, 30.4% (17/56) of a sample of Tajiks, 17.6% (3/17) of a sample of Uzbeks, 6.7% (4/60) of a sample of Hazaras, and in the only sampled Turkmen individual.[79]

Di Cristofaro et al. (2013) found R1a1a-M198/M17 in 56.3% (49/87) of a pair of samples of Pashtuns from Afghanistan (including 20/34 or 58.8% of a sample of Pashtuns from Baghlan and 29/53 or 54.7% of a sample of Pashtuns from Kunduz), 29.1% (37/127) of a pool of samples of Uzbeks from Afghanistan (including 28/94 or 29.8% of a sample of Uzbeks from Jawzjan, 8/28 or 28.6% of a sample of Uzbeks from Sar-e Pol, and 1/5 or 20% of a sample of Uzbeks from Balkh), 27.5% (39/142) of a pool of samples of Tajiks from Afghanistan (including 22/54 or 40.7% of a sample of Tajiks from Balkh, 9/35 or 25.7% of a sample of Tajiks from Takhar, 4/16 or 25.0% of a sample of Tajiks from Samangan, and 4/37 or 10.8% of a sample of Tajiks from Badakhshan), 16.2% (12/74) of a sample of Turkmens from Jawzjan, and 9.1% (7/77) of a pair of samples of Hazara from Afghanistan (including 7/69 or 10.1% of a sample of Hazara from Bamiyan and 0/8 or 0% of a sample of Hazara from Balkh).[80]

Malyarchuk et al. (2013) found R1a1-SRY10831.2 in 30.0% (12/40) of a sample of Tajiks from Tajikistan.[81]

Ashirbekov et al. (2017) found R1a-M198 in 6.03% (78/1294) of a set of samples of Kazakhs from Kazakhstan. R1a-M198 was observed with greater than average frequency in the study's samples of the following Kazakh tribes: 13/41 = 31.7% of a sample of Suan, 8/29 = 27.6% of a sample of Oshaqty, 6/30 = 20.0% of a sample of Qozha, 4/29 = 13.8% of a sample of Qypshaq, 1/8 = 12.5% of a sample of Tore, 9/86 = 10.5% of a sample of Jetyru, 4/50 = 8.0% of a sample of Argyn, 1/13 = 7.7% of a sample of Shanyshqyly, 8/122 = 6.6% of a sample of Alimuly, 3/46 = 6.5% of a sample of Alban. R1a-M198 also was observed in 5/42 = 11.9% of a sample of Kazakhs of unreported tribal affiliation.[82]

South Asia

In South Asia, R1a1a has often been observed in a number of demographic groups.[28][27]

In India, high frequencies of this haplogroup is observed in West Bengal Brahmins (72%)[27] to the east, Gujarat Lohanas (60%) [3] to the west, Khatris (67%)[3] in the north and Iyengar Brahmins (31%)[27] in the south. It has also been found in several South Indian Dravidian-speaking Adivasis including the Chenchu (26%) and the Valmikis of Andhra Pradesh, Kota (22.58%)[83] and the Kallar of Tamil Nadu suggesting that R1a1a is widespread in Tribal Southern Indians.[26]

Besides these, studies show high percentages in regionally diverse groups such as Manipuris (50%)[3] to the extreme North East and among Punjabis (47%)[26] to the extreme North West.

In Pakistan it is found at 71% among the Mohanna tribe in Sindh province to the south and 46% among the Baltis of Gilgit-Baltistan to the north.[3] Among the Sinhalese of Sri Lanka, 23% were found to be R1a1a (R-SRY1532) positive.[84] Hindus of Chitwan District in the Terai region Nepal show it at 69%.[85]

East Asia

The frequency of R1a1a is comparatively low among some Turkic-speaking groups like Yakuts, yet levels are higher (19 to 28%) in certain Turkic or Mongolic-speaking groups of Northwestern China, such as the Bonan, Dongxiang, Salar, and Uyghurs.[10][86][87]

A Chinese paper published in 2018 found R1a-Z94 in 38.5% (15 / 39) of a sample of Keriyalik Uyghurs from Darya Boyi / Darya Boye Village, Yutian County, Xinjiang (于田县达里雅布依乡), R1a-Z93 in 28.9% (22/76) of a sample of Dolan Uyghurs from Horiqol township, Awat County, Xinjiang (阿瓦提县乌鲁却勒镇), and R1a-Z93 in 6.3% (4/64) of a sample of Loplik Uyghurs from Karquga / Qarchugha Village, Yuli County, Xinjiang (尉犁县喀尔曲尕乡). R1a(xZ93) was observed only in one of 76 Dolan Uyghurs.[88] Note that Darya Boyi Village is located in a remote oasis formed by the Keriya River in the Taklamakan Desert.

A 2011 Y-dna study found that 10% of Northern Han Chinese from eastern Gansu and 8.9% of Northern Han from western Henan had the Y-dna R1a1.[89] In a 2014 paper, R1a1a has been detected in 1.8% (2/110) of Chinese samples. These two samples (R-M17, R-M198, R-M434, R-M458 for both) belonged to Han individuals from Fujian and Shanxi provinces.[90]

In Eastern Siberia, R1a1a is found among certain indigenous ethnic groups including Kamchatkans and Chukotkans, and peaking in Itel'man at 22%.[91]

West Asia

R1a1a has been found in various forms, in most parts of Western Asia, in widely varying concentrations, from almost no presence in areas such as Jordan, to much higher levels in parts of Kuwait and Iran. The Shimar (Shammar) Bedouin tribe in Kuwait show the highest frequency in the Middle East at 43%.[92][93][94]

Wells 2001, noted that in the western part of the country, Iranians show low R1a1a levels, while males of eastern parts of Iran carried up to 35% R1a1a. Nasidze et al. 2004 found R1a1a in approximately 20% of Iranian males from the cities of Tehran and Isfahan. Regueiro 2006 in a study of Iran, noted much higher frequencies in the south than the north.

A newer study has found 20.3% R-M17* among Kurdish samples which were taken in the Kurdistan Province in western Iran, 9.7% among Mazandaranis in North Iran in the province of Mazandaran, 9.4% among Gilaks in province of Gilan, 12.8% among Persian and 17.6% among Zoroastrians in Yazd, 18.2% among Persians in Isfahan, 20.3% among Persians in Khorasan, 16.7% Afro-Iranians, 18.4% Qeshmi "Gheshmi", 21.4% among Persian Speaking Bandari people in Hormozgan and 25% among the Baloch people in Sistan and Baluchestan Province.[95]

Di Cristofaro et al. (2013) found haplogroup R1a in 9.68% (18/186) of a set of samples from Iran, though with a large variance ranging from 0% (0/18) in a sample of Iranians from Tehran to 25% (5/20) in a sample of Iranians from Khorasan and 27% (3/11) in a sample of Iranians of unknown provenance. All Iranian R1a individuals carried the M198 and M17 mutations except one individual in a sample of Iranians from Gilan (n=27), who was reported to belong to R1a-SRY1532.2(xM198, M17).[80]

Malyarchuk et al. (2013) found R1a1-SRY10831.2 in 20.8% (16/77) of a sample of Persians collected in the provinces of Khorasan and Kerman in eastern Iran, but they did not find any member of this haplogroup in a sample of 25 Kurds collected in the province of Kermanshah in western Iran.[81]

Haplogroup R1a1a was found at elevated levels among a sample of the Israeli population who self-designated themselves as Levites and Ashkenazi Jews (Levites comprise approximately 4% of Jews). Behar et al. (2003) reported R1a1a to be the dominant haplogroup in Ashkenazi Levites (52%), although rare in Ashkenazi Cohanim (1.3%).[64]

Further to the north of these Middle Eastern regions on the other hand, R1a1a levels start to increase in the Caucasus, once again in an uneven way. Several populations studied have shown no sign of R1a1a, while highest levels so far discovered in the region appears to belong to speakers of the Karachay-Balkar language among whom about one quarter of men tested so far are in haplogroup R1a1a.[3]

The frequency of R1a1a is comparatively low among some Turkic-speaking groups including Turks and Azeris.

Historic naming of R1a

The historic naming system commonly used for R1a was inconsistent in different published sources, because it changed often; this requires some explanation.

In 2002, the Y Chromosome Consortium (YCC) proposed a new naming system for haplogroups (YCC 2002), which has now become standard. In this system, names with the format "R1" and "R1a" are "phylogenetic" names, aimed at marking positions in a family tree. Names of SNP mutations can also be used to name clades or haplogroups. For example, as M173 is currently the defining mutation of R1, R1 is also R-M173, a "mutational" clade name. When a new branching in a tree is discovered, some phylogenetic names will change, but by definition all mutational names will remain the same.

The widely occurring haplogroup defined by mutation M17 was known by various names, such as "Eu19", as used in (Semino et al. 2000) in the older naming systems. The 2002 YCC proposal assigned the name R1a to the haplogroup defined by mutation SRY1532.2. This included Eu19 (i.e. R-M17) as a subclade, so Eu19 was named R1a1. Note, SRY1532.2 is also known as SRY10831.2 The discovery of M420 in 2009 has caused a reassignment of these phylogenetic names.(Underhill et al. 2009 and ISOGG 2012) R1a is now defined by the M420 mutation: in this updated tree, the subclade defined by SRY1532.2 has moved from R1a to R1a1, and Eu19 (R-M17) from R1a1 to R1a1a.

More recent updates recorded at the ISOGG reference webpage involve branches of R-M17, including one major branch, R-M417.

Contrasting family trees for R1a, showing the evolution of understanding of this clade

2002 Scheme proposed in (YCC 2002)

2009 Scheme as per (Underhill et al. 2009)

ISOGG tree as per January 2011

As M420 went undetected, M420 lineages were classified as either R1* or R1a (SRY1532.2, also known as SRY10831.2)

R1
M173

R1*

All cases without M343 or SRY1532.2 (including a minority M420+ cases)

R1a
SRY1532.2
(SRY10831.2)

R1a*

R1a1
M17, M198

	R1a1*

M56	R1a1a

M157	R1a1b

M87, M204 M64.2	R1a1c

R1b
M343

sibling clade to R1a

After 2009, a new layer was inserted covering all old R1a, plus its closest known relatives

R1
M173

R1*

All cases without M343 or M420 (smaller than old "R1a*")

R1a
M420

R1a* All cases with M420 but without SRY1532.2

R1a1
SRY1532.2

	R1a1(Old R1a)

R1a1a
M17, M198

R1a1a*

M56

R1a1a1

M157

R1a1a2

M64.2,..

R1a1a3

P98

R1a1a4

PK5

R1a1a5

M434

R1a1a6

M458

	R1a1a7*

M334	R1a1a7a

Page68

R1a1a8

R1b
M343

Sibling clade to R1a (same as before)

Latest information

R1
M173

R1* (As before)

R1a
M420

R1a* (As before)

R1a1
SRY1532.2

R1a1* (As before)

R1a1a
M17

	R1a1a* (As before)

R1a1a1
M417, Page7

R1a1a1*

M56

R1a1a1a

M157

R1a1a1b

M64.2,..

R1a1a1c

P98

R1a1a1d

PK5

R1a1a1e

M434

R1a1a1f

Z283

R1a1a1g*

M458

	R1a1a1g1*

M334	R1a1a1g1a

L260	R1a1a1g1b

Z280

	R1a1a1g2*

P278.2	R1a1a1g2a

L365	R1a1a1g2b

L366	R1a1a1g2c

Z92	R1a1a1g2d

Z284

	R1a1a1g3*

P278.2	R1a1a1g3a

Z93

R1a1a1h*

L342.2

	R1a1a1h1*

L657	R1a1a1h1a

R1b
M343

Sibling clade to R1a (same as before)

gollark: Well, shell has accursuous quoting rules.

gollark: . is fine, ! is mostly fine, spaces can be bee, ? is dangerous, ' more so, " even morererer so, anomalous Unicode particularly.

gollark: Well, I'll be at the Lunar Site.

gollark: They are some of the least fearsome of punctuation.

gollark: Surely your code can deal with *spaces*.

Notes

According to Family Tree, they diversified ca. 5,000 years ago.[6]
Semenov & Bulat (2016) refer to the following publications:
5. Haak, Wolfgang (2015). "Massive migration from the steppe is a source for Indo-European languages in Europe". Nature. 522 (7555): 207–211. arXiv:1502.02783. Bibcode:2015Natur.522..207H. bioRxiv 10.1101/013433. doi:10.1038/NATURE14317. PMC 5048219. PMID 25731166.
6. Mathieson, Iain (2015). "Eight thousand years of natural selection in Europe". bioRxiv 10.1101/016477.
8. Chekunova Е.М., Yartseva N.V., Chekunov М.К., Мazurkevich А.N. The First Results of the Genotyping of the Aboriginals and Human Bone Remains of the Archeological Memorials of the Upper Podvin’e. // Archeology of the lake settlements of IV—II Thousands BC: The chronology of cultures and natural environment and climatic rhythms. Proceedings of the International Conference, Devoted to the 50-year Research of the Pile Settlements on the North-West of Russia. St. Petersburg, 13–15 November 2014.
9. Jones, ER; Gonzalez-Fortes, G; Connell, S; Siska, V; Eriksson, A; Martiniano, R; McLaughlin, RL; Gallego Llorente, M; Cassidy, LM; Gamba, C; Meshveliani, T; Bar-Yosef, O; Müller, W; Belfer-Cohen, A; Matskevich, Z; Jakeli, N; Higham, TF; Currat, M; Lordkipanidze, D; Hofreiter, M; Manica, A; Pinhasi, R; Bradley, DG (2015). "Upper Palaeolithic genomes reveal deep roots of modern Eurasians". Nat Commun. 6: 8912. Bibcode:2015NatCo...6.8912J. doi:10.1038/ncomms9912. PMC 4660371. PMID 26567969.
Yet, Haak et al. also explicitly state: "...a type of Near Eastern ancestry different from that which was introduced by early farmers."[20]
According to Family Tree DNA, L664 formed 4,700 ybp, that is, 2,700 BCE.[6]
Lazaridis, Twitter, 18 juni 2016: "I1635 (Armenia_EBA) is R1b1-M415(xM269). We'll be sure to include in the revision. Thanks to the person who noticed! #ILovePreprints."
See also "Big deal of 2016: the territory of present-day Iran cannot be the Indo-European homeland". Eurogenes Blog. November 26, 2016, for a discussion of the same topic.
See map for M780 distribution at Dieneke's Anthropology Blog, Major new article on the deep origins of Y-haplogroup R1a (Underhill et al. 2014)[24]
According to Family Tree DNA, M780 formed 4700 ybp.[6] This dating coincides with the eastward movement between 2800 and 2600 BCE of the Yamnaya culture into the region of the Poltavka culture, a predecessor of the Sintashta culture, from which the Indo-Iranians originated. M780 is concentrated in the Ganges Valley, the locus of the classic Vedic society.
Poznik et al. (2016) calculate with a generation time of 30 years; a generation time of 20 years yields other results.
"The evidence that the Steppe_MLBA [Middle to Late Bronze Age] cluster is a plausible source for the Steppe ancestry in South Asia is also supported by Y chromosome evidence, as haplogroup R1a which is of the Z93 subtype common in South Asia today [Underhill et al. (2014), Silva et al. (2017)] was of high frequency in Steppe_MLBA (68%) (16), but rare in Steppe_EMBA [Early to Middle Bronze Age] (absent in our data)."[25]
Kivisild et al. (2003): "Haplogroup R1a, previously associated with the putative Indo-Aryan invasion, was found at its highest frequency in Punjab but also at a relatively high frequency (26%) in the Chenchu tribe. This finding, together with the higher R1a-associated short tandem repeat diversity in India and Iran compared with Europe and central Asia, suggests that southern and western Asia might be the source of this haplogroup."[26]
Lucotte (2015) dates the origin in the subcontinent to approximately 15,500 year before present[30] Datations show that the Z93 Pakistano-Indian group is the most ancient (about 15,5 K years); in Europe, the Eastern populations are the most ancient (about 12,5 K years) and the Northern ones the most recent.
Sahoo et al. (2006): "... one should expect to observe dramatically lower genetic variation among Indian Rla lineages. In fact, the opposite is true: the STR haplotype diversity on the background of R1a in Central Asia (and also in Eastern Europe) has already been shown to be lower than that in India (6). Rather, the high incidence of R1* and Rla throughout Central Asian European populations (without R2 and R* in most cases) is more parsimoniously explained by gene flow in the opposite direction, possibly with an early founder effect in South or West Asia.[31]
Sharma et al. (2009): "A peculiar observation of the highest frequency (up to 72.22%) of Y-haplogroup R1a1* in Brahmins hinted at its presence as a founder lineage for this caste group. Further, observation of R1a1* in different tribal population groups, existence of Y-haplogroup R1a* in ancestors and extended phylogenetic analyses of the pooled dataset of 530 Indians, 224 Pakistanis and 276 Central Asians and Eurasians bearing the R1a1* haplogroup supported the autochthonous origin of R1a1 lineage in India and a tribal link to Indian Brahmins. However, it is important to discover novel Y-chromosomal binary marker(s) for a higher resolution of R1a1* and confirm the present conclusions."[1]
Sengupta et al. (2006): "The widespread geographic distribution of HG R1a1-M17 across Eurasia and the current absence of informative subdivisions defined by binary markers leave uncertain the geographic origin of HG R1a1-M17. However, the contour map of R1a1-M17 variance shows the highest variance in the northwestern region of India [...] The question remains of how distinctive is the history of L1 relative to some or all of R1a1 and R2 representatives. This uncertainty neutralizes previous conclusions that the intrusion of HGs R1a1 and R2 from the northwest in Dravidian-speaking southern tribes is attributable to a single recent event. [R1a1 and R2] could have actually arrived in southern India from a southwestern Asian source region multiple times, with some episodes considerably earlier than others. Considerable archeological evidence exists regarding the presence of Mesolithic peoples in India (Kennedy 2000), some of whom could have entered the subcontinent from the northwest during the late Pleistocene epoch. The high variance of R1a1 in India (table 12), the spatial frequency distribution of R1a1 microsatellite variance clines (fig. 4), and expansion time (table 11) support this view."[27]
Балановский (2015), p. 208 (in Russian) Прежде всего, это преобладание в славянских популяциях дославянского субстрата — двух ассимилированных ими генетических компонентов – восточноевропейского для западных и восточных славян и южноевропейского для южных славян...Можно с осторожностью предположить, что ассимилированный субстратмог быть представлен по преимуществу балтоязычными популяциями. Действительно, археологические данные указыва ют на очень широкое распространение балтских групп перед началом расселения славян. Балтскийсубстрату славян (правда, наряду с финно-угорским) выявляли и антропологи. Полученные нами генетические данные — и на графиках генетических взаимоотношений, и по доле общих фрагментов генома — указывают, что современные балтские народы являются ближайшими генетически ми соседями восточных славян. При этом балты являются и лингвистически ближайшими род ственниками славян. И можно полагать, что к моменту ассимиляции их генофонд не так сильно отличался от генофонда начавших свое широкое расселение славян. Поэтому если предположить,что расселяющиеся на восток славяне ассимилировали по преимуществу балтов, это может объяснить и сходство современных славянских и балтских народов друг с другом, и их отличия от окружающих их не балто-славянских групп Европы...В работе высказывается осторожное предположение, что ассимилированный субстрат мог быть представлен по преимуществу балтоязычными популяциями. Действительно, археологические данные указывают на очень широкое распространение балтских групп перед началом расселения славян. Балтский субстрат у славян (правда, наряду с финно-угорским) выявляли и антропологи. Полученные в этой работе генетические данные — и на графиках генетических взаимоотношений, и по доле общих фрагментов генома — указывают, что современные балтские народы являются ближайшими генетическими соседями восточных славян.

References

Sharma et al. 2009.
Underhill et al. 2014.
Underhill et al. 2009.
Underhill et al. 2014, p. 130.
Karafet et al. 2014.
"R1a tree". YFull.
Mirabal et al. 2009.
Zerjal, T.; et al. (1999). "The use of Y-chromosomal DNA variation to investigate population history: recent male spread in Asia and Europe". In S.S. Papiha; R. Deka & R. Chakraborty (eds.). Genomic diversity: applications in human population genetics. New York: Kluwer Academic/Plenum Publishers. pp. 91–101. ISBN 978-0-3064-6295-5.
Semino et al. 2000.
Wells 2001.
Pamjav et al. 2012.
Haak et al. 2015.
Allentoft et al. 2015.
Mathieson et al. 2015.
Anthony 2007.
Anthony & Ringe 2015.
Haak et al. 2015, p. 5.
Semenov & Bulat 2016.
Semenov & Bulat 2016, p. 41.
Haak et al. 2015, p. 4.
Mascarenhas et al. 2015, p. 9.
Poznik et al. 2016, p. 5.
Arame's English blog, Y DNA from ancient Near East
"Dienekes' Anthropology Blog: Major new article on the deep origins of Y-haplogroup R1a (Underhill et al. 2014)". March 27, 2014. Retrieved December 20, 2019.
Narasimhan et al. 2018.
Kivisild et al. 2003.
Sengupta 2006.
Sahoo et al. 2006.
Thangaraj et al. 2010.
Gérard, Lucotte (2015). "The Major Y-Chromosome Haplotype XI – Haplogroup R1a in Eurasia" (PDF). Hereditary Genetics.
Sahoo et al. 2006, p. 845-846.
Joseph, Tony (16 June 2017). "How genetics is settling the Aryan migration debate". The Hindu.
Silva et al. 2017.
""Heavily sex-biased" population dispersals into the Indian Subcontinent (Silva et al. 2017)". Eurogenes Blog. March 28, 2017.
"About Us". Family Tree DNA. Retrieved December 20, 2019.
"ISOGG 2017 Y-DNA Haplogroup R". isogg.org. Retrieved December 20, 2019.
"Haplogroup R (Y-DNA) - SNPedia". www.snpedia.com. Retrieved December 20, 2019.
Underhill et al. 2014, p. 125.
"R1a in Yamnaya". Eurogenes Blog. March 21, 2016. Archived from the original on 2018-05-05. Retrieved December 20, 2019.
"Y-DNA Haplogroup R and its Subclades". International Society of Genetic Genealogy (ISOGG). Retrieved 8 January 2011.
Krahn, Thomas. "Draft Y-Chromosome Tree". Family Tree DNA. Archived from the original on 2013-05-26. Retrieved 2012-12-07.
Regueiro 2006.
J. Freder, Die mittelalterlichen Skelette von Usedom [The mediaeval skeletons of Usedom], Berlin 2010, p. 86 (Dissertation Free University Berlin 2010).
Gwozdz, Peter (6 Aug 2018). "Polish Y-DNA Clades".
Pawlowski et al. 2002.
Gwozdz 2009.
Saag et al. 2020, p. 5.
Saag et al. 2020, p. 29, Table 1.
Saag et al. 2020, Supplementary Data 2, Row 4.
Fu et al. 2016.
Saag et al. 2017.
Anthony 2019, pp. 16, 17.
Haak et al. 2008.
Brandit et al. 2013.
Malmström et al. 2019, p. 2.
Saag et al. 2020, Supplementary Data 2, Rows 5-49.
Schweitzer, D. (23 March 2008). "Lichtenstein Cave Data Analysis" (PDF). dirkschweitzer.net. Archived from the original (PDF) on 14 August 2011. Summary in English of Schilz (2006).
Keyser et al. 2009.
Ricaut et al. 2004.
Корниенко И. В., Водолажский Д. И. Использование нерекомбинантных маркеров Y-хромосомы в исследованиях древних популяций (на примере поселения Танаис)//Материалы Донских антропологических чтений. Ростов-на-Дону, Ростовский научно-исследовательский онкологический институт, Ростов-на-Дону, 2013.
Chunxiang Li et al. 2010.
Kim et al. 2010.
Balanovsky et al. 2008.
Behar et al. 2003.
Kasperaviciūte, Kucinskas & Stoneking 2005.
Battaglia et al. 2008.
Rosser et al. 2000.
Tambets et al. 2004.
Bowden et al. 2008.
Dupuy et al. 2005.
Passarino et al. 2002.
Capelli et al. 2003.
Kayser et al. 2005.
Scozzari et al. 2001.
Pericić et al. 2005.
"UNTITLED". pereformat.ru (in Russian).
"UNTITLED". www.rodstvo.ru.
Zerjal et al. 2002.
Haber et al. 2012.
Di Cristofaro et al. 2013.
Malyarchuk et al. 2013.
Ashirbekov et al. 2017.
Arunkumar 2012.
Toomas Kivisild; Siiri Rootsi; Mait Metspalu; Ene Metspalu; Juri Parik; Katrin Kaldma; Esien Usanga; Sarabjit Mastana; Surinder S. Papiha; Richard Villems. "The Genetics of Language and Farming Spread in India" (PDF). In P. Bellwwood; C. Renfrew (eds.). Examining the farming/language dispersal hypothesis. McDonald Institute Monographs. Cambridge University. pp. 215–222. Retrieved December 20, 2019.
Fornarino et al. 2009.
Wang et al. 2003.
Zhou et al. 2007.
Liu Shu-hu et al. 2018.
Zhong et al. 2011.
Yan et al. 2014.
Lell et al. 2002.
Mohammad et al. 2009.
Nasidze et al. 2004.
Nasidze et al. 2005.
Grugni et al. 2012.

Sources

Allentoft, Morten E.; Sikora, Martin; Sjögren, Karl-Göran; Rasmussen, Simon; Rasmussen, Morten; Stenderup, Jesper; Damgaard, Peter B.; Schroeder, Hannes; et al. (2015). "Population genomics of Bronze Age Eurasia". Nature. 522 (7555): 167–172. Bibcode:2015Natur.522..167A. doi:10.1038/nature14507. PMID 26062507. S2CID 4399103.
Anthony, David W. (2007), The Horse The Wheel And Language. How Bronze-Age Riders From the Eurasian Steppes Shaped The Modern World, Princeton University Press
Anthony, David (Spring–Summer 2019). "Archaeology, Genetics, and Language in the Steppes: A Comment on Bomhard". Journal of Indo-European Studies. 47 (1–2). Retrieved January 9, 2020.
Anthony, David; Ringe, Don (2015), "The Indo-European Homeland from Linguistic and Archaeological Perspectives", Annual Review of Linguistics, 1: 199–219, doi:10.1146/annurev-linguist-030514-124812
ArunKumar, G; Soria-Hernanz, DF; Kavitha, VJ; Arun, VS; Syama, A; Ashokan, KS (2012). "Population Differentiation of Southern Indian Male Lineages Correlates with Agricultural Expansions Predating the Caste System". PLOS ONE. 7 (11): e50269. Bibcode:2012PLoSO...750269A. doi:10.1371/journal.pone.0050269. PMC 3508930. PMID 23209694.
Ashirbekov, E. E.; et al. (2017). "Distribution of Y-Chromosome Haplogroups of the Kazakh from the South Kazakhstan, Zhambyl, and Almaty Regions" (PDF). Reports of the National Academy of Sciences of the Republic of Kazakhstan. 6 (316): 85–95. ISSN 2224-5227.
Balanovsky O, Rootsi S, Pshenichnov A, Kivisild T, Churnosov M, Evseeva I, Pocheshkhova E, Boldyreva M, et al. (2008). "Two Sources of the Russian Patrilineal Heritage in Their Eurasian Context". American Journal of Human Genetics. 82 (1): 236–250. doi:10.1016/j.ajhg.2007.09.019. PMC 2253976. PMID 18179905.
Балановский, О. П. (2015-11-30). Генофонд Европы (in Russian). KMK Scientific Press. ISBN 9785990715707.
Battaglia V, Fornarino S, Al-Zahery N, Olivieri A, Pala M, Myres NM, King RJ, Rootsi S, et al. (2008). "Y-chromosomal evidence of the cultural diffusion of agriculture in southeast Europe". European Journal of Human Genetics. 17 (6): 820–30. doi:10.1038/ejhg.2008.249. PMC 2947100. PMID 19107149.
Behar D, Thomas MG, Skorecki K, Hammer MF, Bulygina E, Rosengarten D, Jones AL, Held K, et al. (2003). "Multiple Origins of Ashkenazi Levites: Y Chromosome Evidence for Both Near Eastern and European Ancestries" (PDF). American Journal of Human Genetics. 73 (4): 768–779. doi:10.1086/378506. PMC 1180600. PMID 13680527.
Bowden GR, Balaresque P, King TE, Hansen Z, Lee AC, Pergl-Wilson G, Hurley E, Roberts SJ, et al. (2008). "Excavating Past Population Structures by Surname-Based Sampling: The Genetic Legacy of the Vikings in Northwest England". Molecular Biology and Evolution. 25 (2): 301–309. doi:10.1093/molbev/msm255. PMC 2628767. PMID 18032405.
Brandit, G.; et al. (The Genographic Consortium) (2013). "Ancient DNA Reveals Key Stages in the Formation of Central European Mitochondrial Genetic Diversity". Science. 342 (6155): 257–261. Bibcode:2013Sci...342..257B. doi:10.1126/science.1241844. PMC 4039305. PMID 24115443.
Capelli C, Redhead N, Abernethy JK, Gratrix F, Wilson JF, Moen T, Hervig T, Richards M, et al. (2003). "A Y Chromosome Census of the British Isles" (PDF). Current Biology. 13 (11): 979–84. doi:10.1016/S0960-9822(03)00373-7. PMID 12781138. S2CID 526263. also at "University College London" (PDF).
Chunxiang Li; Hongjie Li; Yinqiu Cui; Chengzhi Xie; Dawei Cai; Wenying Li; Victor H Mair; Zhi Xu; et al. (2010). "Evidence that a West-East admixed population lived in the Tarim Basin as early as the early Bronze Age" (PDF). BMC Biology. 8 (1): 15. doi:10.1186/1741-7007-8-15. PMC 2838831. PMID 20163704. Archived from the original (PDF) on 27 April 2011.
Di Cristofaro J, Pennarun E, Mazières S, Myres NM, Lin AA, Temori SA, Metspalu M, Metspalu E, et al. (2013). "Afghan Hindu Kush: Where Eurasian Sub-Continent Gene Flows Converge". PLOS ONE. 8 (10). e76748. Bibcode:2013PLoSO...876748D. doi:10.1371/journal.pone.0076748. PMC 3799995. PMID 24204668.
Dupuy BM, Stenersen M, Lu TT, Olaisen B (2005). "Geographical heterogeneity of Y-chromosomal lineages in Norway" (PDF). Forensic Science International. 164 (1): 10–19. doi:10.1016/j.forsciint.2005.11.009. PMID 16337760.
Fornarino, Simona; Pala, Maria; Battaglia, Vincenza; Maranta, Ramona; Achilli, Alessandro; Modiano, Guido; Torroni, Antonio; Semino, Ornella; et al. (2009). "Mitochondrial and Y-chromosome diversity of the Tharus (Nepal): a reservoir of genetic variation". BMC Evolutionary Biology. 9: 154. doi:10.1186/1471-2148-9-154. PMC 2720951. PMID 19573232.
Fu, Qiaomei; et al. (May 2, 2016). "The genetic history of Ice Age Europe". Nature. 534 (7606): 200–205. Bibcode:2016Natur.534..200F. doi:10.1038/nature17993. hdl:10211.3/198594. PMC 4943878. PMID 27135931.
Grugni V, Battaglia V, Kashani BH, Parolo S, Al-Zahery N, Achilli A, Olivieri A, Gandini F, Houshmand M, Sanati MH, Torroni A, Semino O (2012). "Ancient Migratory Events in the Middle East: New Clues from the Y-Chromosome Variation of Modern Iranians". PLOS ONE. 7 (7). e41252. Bibcode:2012PLoSO...741252G. doi:10.1371/journal.pone.0041252. PMC 3399854. PMID 22815981.
Gwozdz (2009). "Y-STR Mountains in Haplospace, Part II: Application to Common Polish Clades" (PDF). Journal of Genetic Genealogy. 5 (2).
Haak, W.; Brandt, G.; Jong, H. N. d.; Meyer, C.; Ganslmeier, R.; Heyd, V.; Hawkesworth, C.; Pike, A. W. G.; et al. (2008). "Ancient DNA, Strontium isotopes, and osteological analyses shed light on social and kinship organization of the Later Stone Age". Proceedings of the National Academy of Sciences. 105 (47): 18226–18231. Bibcode:2008PNAS..10518226H. doi:10.1073/pnas.0807592105. PMC 2587582. PMID 19015520.
Haak, Wolfgang; Lazaridis, Iosif; Patterson, Nick; Rohland, Nadin; Mallick, Swapan; Llamas, Bastien; Brandt, Guido; Nordenfelt, Susanne; et al. (2015). "Massive migration from the steppe is a source for Indo-European languages in Europe". bioRxiv. 522 (7555). 013433. arXiv:1502.02783. Bibcode:2015Natur.522..207H. bioRxiv 10.1101/013433. doi:10.1038/NATURE14317. PMC 5048219. PMID 25731166.
Haber M, Platt DE, Ashrafian Bonab M, Youhanna SC, Soria-Hernanz DF, Martínez-Cruz B, Douaihy B, Ghassibe-Sabbagh M, et al. (2012). "Afghanistan's ethnic groups share a Y-chromosomal heritage structured by historical events". PLOS ONE. 7 (3). e34288. Bibcode:2012PLoSO...734288H. doi:10.1371/journal.pone.0034288. PMC 3314501. PMID 22470552.
Karafet, Tatiana M.; Mendez, Fernando L.; Sudoyo, Herawati; Lansing, J. Stephen; Hammer, Michael F. (2014). "Improved phylogenetic resolution and rapid diversification of Y-chromosome haplogroup K-M526 in Southeast Asia". Nature. 23 (3): 369–373. doi:10.1038/ejhg.2014.106. PMC 4326703. PMID 24896152.
Kasperaviciūte, D.; Kucinskas, V.; Stoneking, M. (2005). "Y Chromosome and Mitochondrial DNA Variation in Lithuanians". Annals of Human Genetics. 68 (5): 438–452. doi:10.1046/j.1529-8817.2003.00119.x. PMID 15469421. S2CID 26562505.
Kayser M, Lao O, Anslinger K, Augustin C, Bargel G, Edelmann J, Elias S, Heinrich M, et al. (2005). "Significant genetic differentiation between Poland and Germany follows present-day political borders, as revealed by Y-chromosome analysis" (PDF). Human Genetics. 117 (5): 428–443. doi:10.1007/s00439-005-1333-9. PMID 15959808. S2CID 11066186. Archived from the original (PDF) on 2009-03-04.
Keyser, Christine; Bouakaze, Caroline; Crubézy, Eric; Nikolaev, Valery G.; Montagnon, Daniel; Reis, Tatiana; Ludes, Bertrand (2009). "Ancient DNA provides new insights into the history of south Siberian Kurgan people". Human Genetics. 126 (3): 395–410. doi:10.1007/s00439-009-0683-0. PMID 19449030. S2CID 21347353.
Kim, Kijeong; Brenner, Charles H.; Mair, Victor H.; Lee, Kwang-Ho; Kim, Jae-Hyun; Gelegdorj, Eregzen; Batbold, Natsag; Song, Yi-Chung; et al. (2010). "A western Eurasian male is found in 2000-year-old elite Xiongnu cemetery in Northeast Mongolia". American Journal of Physical Anthropology. 142 (3): 429–440. doi:10.1002/ajpa.21242. PMID 20091844.
Kivisild, T; Rootsi, S; Metspalu, M; Mastana, S; Kaldma, K; Parik, J; Metspalu, E; Adojaan, M; et al. (2003). "The Genetic Heritage of the Earliest Settlers Persists Both in Indian Tribal and Caste Populations". AJHG. 72 (2): 313–32. doi:10.1086/346068. PMC 379225. PMID 12536373.
Lazaridis, Iosif; et al. (2016). "Genomic insights into the origin of farming in the ancient Near East". Nature. 536 (7617): 419–424. Bibcode:2016Natur.536..419L. doi:10.1038/nature19310. PMC 5003663. PMID 27459054.
Lell JT, Sukernik RI, Starikovskaya YB, Su B, Jin L, Schurr TG, Underhill PA, Wallace DC (2002). "The Dual Origin and Siberian Affinities of Native American Y Chromosomes" (PDF). American Journal of Human Genetics. 70 (1): 192–206. doi:10.1086/338457. PMC 384887. PMID 11731934. Archived from the original (PDF) on 2003-04-22.
Liu Shu-hu; Nizam Yilihamu; Rabiyamu Bake; Abdukeram Bupatima; Dolkun Matyusup (2018). "A study of genetic diversity of three isolated populations in Xinjiang using Y-SNP". Acta Anthropologica Sinica. 37 (1): 146–156. Lay summary – Indo-European.eu.
Malmström, Helena; Günther, Torsten; Svensson, Emma M.; Juras, Anna; Fraser, Magdalena; Munters, Arielle R.; Pospieszny, Łukasz; Tõrv, Mari; et al. (October 9, 2019). "The genomic ancestry of the Scandinavian Battle Axe Culture people and their relation to the broader Corded Ware horizon". Proceedings of the Royal Society B. 286 (1912). doi:10.1098/rspb.2019.1528. PMC 6790770. PMID 31594508.
Malyarchuk, Boris; Derenko, Miroslava; Wozniak, Marcin; Grzybowski, Tomasz (2013). "Y-chromosome variation in Tajiks and Iranians". Annals of Human Biology. 40 (1): 48–54. doi:10.3109/03014460.2012.747628. PMID 23198991. S2CID 2752490.
Mascarenhas, Desmond D.; Raina, Anupuma; Aston, Christopher E.; Sanghera, Dharambir K. (2015). "Genetic and Cultural Reconstruction of the Migration of an Ancient Lineage". BioMed Research International. 2015: 651415. doi:10.1155/2015/651415. PMC 4605215. PMID 26491681.
Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Alpaslan Roodenberg, Songul; Harney, Eadaoin; Stewardson, Kristin; et al. (2015). "Eight thousand years of natural selection in Europe". bioRxiv. 016477. doi:10.1101/016477.
Mirabal, Sheyla; Regueiro, M; Cadenas, AM; Cavalli-Sforza, LL; Underhill, PA; Verbenko, DA; Limborska, SA; Herrera, RJ; et al. (2009). "Y-Chromosome distribution within the geo-linguistic landscape of northwestern Russia". European Journal of Human Genetics. 17 (10): 1260–1273. doi:10.1038/ejhg.2009.6. PMC 2986641. PMID 19259129.
Mohammad T, Xue Y, Evison M, Tyler-Smith C (2009). "Genetic structure of nomadic Bedouin from Kuwait". Heredity. 103 (5): 425–433. doi:10.1038/hdy.2009.72. PMC 2869035. PMID 19639002.
Narasimhan, Vagheesh M.; Anthony, David; Mallory, James; Reich, David (2018), The Genomic Formation of South and Central Asia, bioRxiv 10.1101/292581, doi:10.1101/292581
Nasidze I, Ling EY, Quinque D, Dupanloup I, Cordaux R, Rychkov S, Naumova O, Zhukova O, et al. (2004). "Mitochondrial DNA and Y-Chromosome Variation in the Caucasus" (PDF). Annals of Human Genetics. 68 (Pt 3): 205–221. doi:10.1046/j.1529-8817.2004.00092.x. PMID 15180701. Archived from the original (PDF) on 2004-10-30.
Nasidze I, Quinque D, Ozturk M, Bendukidze N, Stoneking M (2005). "MtDNA and Y-chromosome Variation in Kurdish Groups" (PDF). Annals of Human Genetics. 69 (Pt 4): 401–412. doi:10.1046/j.1529-8817.2005.00174.x. PMID 15996169. Archived from the original (PDF) on 2009-08-23.
Pamjav, Horolma; Fehér, Tibor; Németh, Endre; Pádár, Zsolt (2012), "Brief communication: new Y-chromosome binary markers improve phylogenetic resolution within haplogroup R1a1", American Journal of Physical Anthropology, 149 (4): 611–615, doi:10.1002/ajpa.22167, PMID 23115110, S2CID 4820868
Passarino G, Cavalleri GL, Lin AA, Cavalli-Sforza LL, Børresen-Dale AL, Underhill (2002). "Different genetic components in the Norwegian population revealed by the analysis of mtDNA and Y chromosome polymorphisms". European Journal of Human Genetics. 10 (9): 521–529. doi:10.1038/sj.ejhg.5200834. PMID 12173029.
Pathak, Ajai K.; Kadian, Anurag; Kushniarevich, Alena; Montinaro, Francesco; Mondal, Mayukh; Ongaro, Linda; Singh, Manvendra; Kumar, Pramod; et al. (6 December 2018). "The Genetic Ancestry of Modern Indus Valley Populations from Northwest India". The American Journal of Human Genetics. 103 (6): 918–929. doi:10.1016/j.ajhg.2018.10.022. PMC 6288199. PMID 30526867.
Pawlowski, R; Dettlaff-Kakol, A; MacIejewska, A; Paszkowska, R; Reichert, M; Jezierski, G (2002). "Population genetics of 9 Y-chromosome STR loci w Northern Poland". Arch. Med. Sadowej Kryminol. 52 (4): 261–277. PMID 14669672.
Pericić M, Lauc LB, Klarić IM, Rootsi S, Janićijević B, Rudan I, Terzić R, Colak I, et al. (2005). "High-resolution phylogenetic analysis of southeastern Europe traces major episodes of paternal gene flow among Slavic populations". Mol. Biol. Evol. 22 (10): 1964–75. doi:10.1093/molbev/msi185. PMID 15944443.
Poznik GD, et al. (2016). "Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences". Nature Genetics. 48 (6): 593–599. doi:10.1038/ng.3559. hdl:11858/00-001M-0000-002A-F024-C. PMC 4884158. PMID 27111036.
Regueiro, M; Cadenas, AM; Gayden, T; Underhill, PA; Herrera, RJ (2006). "Iran: Tricontinental Nexus for Y-Chromosome Driven Migration". Hum Hered. 61 (3): 132–143. doi:10.1159/000093774. PMID 16770078. S2CID 7017701.
Ricaut F, Keyser-Tracqui C, Bourgeois I, Crubézy E, Ludes B (2004). "Genetic Analysis of a Scytho-Siberian Skeleton and Its Implications for Ancient Central Asian Migrations". Human Biology. 76 (1): 109–25. doi:10.1353/hub.2004.0025. PMID 15222683. S2CID 35948291.
Rosser ZH, Zerjal T, Hurles ME, Adojaan M, Alavantic D, Amorim A, Amos W, Armenteros M, et al. (2000). "Y-Chromosomal Diversity in Europe Is Clinal and Influenced Primarily by Geography, Rather than by Language". American Journal of Human Genetics. 67 (6): 1526–1543. doi:10.1086/316890. PMC 1287948. PMID 11078479.
Saag, Lehti; Varul, Liivi; Scheib, Christiana Lyn; Stenderup, Jesper; Allentoft, Morten E.; Saag, Lauri; Pagani, Luca; Reidla, Maere; et al. (July 24, 2017). "Extensive Farming in Estonia Started through a Sex-Biased Migration from the Steppe". Current Biology. Cell Press. 27 (14): 2185–2193. doi:10.1016/j.cub.2017.06.022. PMID 28712569.
Saag, Lehti; Vasilyev, Sergey V.; Varul, Liivi; Kosorukova, Natalia V.; Gerasimov, Dmitri V.; Oshibkina, Svetlana V.; Griffith, Samuel J.; Solnik, Anu; et al. (July 3, 2020). "Genetic ancestry changes in Stone to Bronze Age transition in the East European plain". bioRxiv. doi:10.1101/2020.07.02.184507. S2CID 220366142.
Sahoo, S; Singh, A; Himabindu, G; Banerjee, J; Sitalaximi, T; Gaikwad, S; Trivedi, R; Endicott, P; et al. (2006). "A prehistory of Indian Y chromosomes: Evaluating demic diffusion scenarios". Proceedings of the National Academy of Sciences. 103 (4): 843–848. Bibcode:2006PNAS..103..843S. doi:10.1073/pnas.0507714103. PMC 1347984. PMID 16415161.
Scozzari R, Cruciani F, Pangrazio A, Santolamazza P, Vona G, Moral P, Latini V, Varesi L, et al. (2001). "Human Y-Chromosome Variation in the Western Mediterranean Area: Implications for the Peopling of the Region" (PDF). Human Immunology. 62 (9): 871–84. CiteSeerX 10.1.1.408.4857. doi:10.1016/S0198-8859(01)00286-5. PMID 11543889.
Semenov, Alexander S.; Bulat, Vladimir V. (2016), "Ancient Paleo-DNA of Pre-Copper Age North-Eastern Europe: Establishing the Migration Traces of R1a1 Y-DNA Haplogroup", European Journal of Molecular Biotechnology, 11 (1): 40–54, doi:10.13187/ejmb.2016.11.40, S2CID 172131289
Semino, O; Passarino, G; Oefner, PJ; Lin, AA; Arbuzova, S; Beckman, LE; De Benedictis, G; Francalacci, P; et al. (2000). "The Genetic Legacy of Paleolithic Homo sapiens sapiens in Extant Europeans: A Y Chromosome Perspective" (PDF). Science. 290 (5494): 1155–1159. Bibcode:2000Sci...290.1155S. doi:10.1126/science.290.5494.1155. PMID 11073453. Archived from the original (PDF) on 2003-11-25.
Sengupta, S; Zhivotovsky, LA; King, R; Mehdi, SQ; Edmonds, CA; Chow, CE; Lin, AA; Mitra, M; et al. (2006). "Polarity and Temporality of High-Resolution Y-Chromosome Distributions in India Identify Both Indigenous and Exogenous Expansions and Reveal Minor Genetic Influence of Central Asian Pastoralists". American Journal of Human Genetics. 78 (2): 202–21. doi:10.1086/499411. PMC 1380230. PMID 16400607.
Sharma, S; Rai, E; Sharma, P; Jena, M; Singh, S; Darvishi, K; Bhat, AK; Bhanwer, AJ; et al. (2009). "The Indian origin of paternal haplogroup R1a1(*)substantiates the autochthonous origin of Brahmins and the caste system". Journal of Human Genetics. 54 (1): 47–55. doi:10.1038/jhg.2008.2. PMID 19158816.
Schilz, Felix (2006). Molekulargenetische Verwandtschaftsanalysen am prähistorischen Skelettkollektiv der Lichtensteinhöhle [Molecular genetic kinship analysis on the prehistoric skeleton collective of the Lichtenstein Cave] (PDF) (Dissertation) (in German). Göttingen: Mathematisch-Naturwissenschaftlichen Fakultäten der Georg-August-Universität.
Silva, Marina; et al. (2017). "A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals". BMC Evolutionary Biology. 17 (1): 88. doi:10.1186/s12862-017-0936-9. PMC 5364613. PMID 28335724.
Tambets K, Rootsi S, Kivisild T, Help H, Serk P, Loogväli EL, Tolk HV, Reidla M, et al. (2004). "The Western and Eastern Roots of the Saami—the Story of Genetic 'Outliers' Told by Mitochondrial DNA and Y Chromosomes". American Journal of Human Genetics. 74 (4): 661–682. doi:10.1086/383203. PMC 1181943. PMID 15024688.
Thangaraj, Kumarasamy; Naidu, B. Prathap; Crivellaro, Federica; Tamang, Rakesh; Upadhyay, Shashank; Sharma, Varun Kumar; Reddy, Alla G.; Walimbe, S. R.; et al. (2010). Cordaux, Richard (ed.). "The Influence of Natural Barriers in Shaping the Genetic Structure of Maharashtra Populations". PLOS ONE. 5 (12): e15283. Bibcode:2010PLoSO...515283T. doi:10.1371/journal.pone.0015283. PMC 3004917. PMID 21187967.
Underhill, PA; Myres, NM; Rootsi, S; Metspalu, M; Zhivotovsky, LA; King, RJ; Lin, AA; Chow, CE; et al. (4 November 2009). "Separating the post-Glacial coancestry of European and Asian Y chromosomes within haplogroup R1a". European Journal of Human Genetics (published April 2010). 18 (4): 479–84. doi:10.1038/ejhg.2009.194. PMC 2987245. PMID 19888303.
Underhill, Peter A.; et al. (26 March 2014). "The phylogenetic and geographic structure of Y-chromosome haplogroup R1a". European Journal of Human Genetics (published January 2015). 23 (1): 124–131. doi:10.1038/ejhg.2014.50. PMC 4266736. PMID 24667786. "PDF" (PDF). Archived from the original (PDF) on 2016-08-16. Retrieved 2016-06-12.
Wang, Wei; Wise, Cheryl; Baric, Tom; Black, Michael L.; Bittles, Alan H. (2003). "The origins and genetic structure of three co-resident Chinese Muslim populations: The Salar, Bo'an and Dongxiang". Human Genetics. 113 (3): 244–52. doi:10.1007/s00439-003-0948-y. PMID 12759817. S2CID 11138499.
Wells, R.S. (2001), "The Eurasian Heartland: A continental perspective on Y-chromosome diversity", Proceedings of the National Academy of Sciences of the USA, 98 (18): 10244–10249, Bibcode:2001PNAS...9810244W, doi:10.1073/pnas.171305098, PMC 56946, PMID 11526236
Yan, Shi; Wang, Chuan-Chao; Zheng, Hong-Xiang; Wang, Wei; Qin, Zhen-Dong; Wei, Lan-Hai; Wang, Yi; Pan, Xue-Dong; et al. (29 August 2014). "Y Chromosomes of 40% Chinese Descend from Three Neolithic Super-Grandfathers". PLOS ONE. 9 (8). e105691. arXiv:1310.3897. Bibcode:2014PLoSO...9j5691Y. doi:10.1371/journal.pone.0105691. PMC 4149484. PMID 25170956.
Y Chromosome Consortium "YCC" (2002). "A Nomenclature System for the Tree of Human Y-Chromosomal Binary Haplogroups". Genome Research. 12 (2): 339–348. doi:10.1101/gr.217602. PMC 155271. PMID 11827954.
Zerjal, Tatiana; Wells, R. Spencer; Yuldasheva, Nadira; Ruzibakiev, Ruslan; Tyler-Smith, Chris (2002). "A Genetic Landscape Reshaped by Recent Events: Y-Chromosomal Insights into Central Asia". The American Journal of Human Genetics. 71 (3): 466–82. doi:10.1086/342096. PMC 419996. PMID 12145751.
Zhong H, Shi H, Qi XB, Duan Y, Tan PP, Jin L, SU B, Ma RZ (January 2011). "Extended Y chromosome investigation suggests postglacial migrations of modern humans into East Asia via the northern route". Molecular Biology and Evolution. 28 (1): 717–27. doi:10.1093/molbev/msq247. PMID 20837606.
Zhou, Ruixia; An, Lizhe; Wang, Xunling; Shao, Wei; Lin, Gonghua; Yu, Weiping; Yi, Lin; Xu, Shijian; et al. (2007). "Testing the hypothesis of an ancient Roman soldier origin of the Liqian people in northwest China: a Y-chromosome perspective". Journal of Human Genetics. 52 (7): 584–91. doi:10.1007/s10038-007-0155-0. PMID 17579807.

External links

DNA Tree

FTDNA R1a Y-chromosome Haplogroup Project
R1a1a1 and Subclades Y-DNA Project – Background Family Tree DNA R1a1a1

TMRCA

TMRCA = Time to Most Recent Common Ancestor

Various

Danish Demes Regional DNA Project: Y-DNA Haplogroup R1a
Eurogenes Blog, The Poltovka outlier
Avotaynu Online, The Y-DNA Fingerprint of the Shpoler Zeida, a Tzaddik Who Touched the World

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.

[vanOven2014-111] Van Oven M, Van Geystelen A, Kayser M, Decorte R, Larmuseau HD (2014). "Seeing the wood for the trees: a minimal reference phylogeny for the human Y chromosome". Human Mutation. 35 (2): 187–91. doi:10.1002/humu.22468. PMID 24166809.

[ISOGG2015-112] International Society of Genetic Genealogy (ISOGG; 2015), Y-DNA Haplogroup Tree 2015. (Access date: 1 February 2015.)

[A0-T-113] Haplogroup A0-T is also known as A-L1085 (and previously as A0'1'2'3'4).

[A1-114] Haplogroup A1 is also known as A1'2'3'4.

[LT-115] Haplogroup LT (L298/P326) is also known as Haplogroup K1.

[K2-116] Between 2002 and 2008, Haplogroup T-M184 was known as "Haplogroup K2". That name has since been re-assigned to K-M526, the sibling of Haplogroup LT.

[K2a-117] Haplogroup K2a (M2308) and its primary subclade K-M2313 were separated from Haplogroup NO (F549) in 2016. (This followed the publication of: Poznik GD, Xue Y, Mendez FL, et al. (2016). "Punctuated bursts in human male demography inferred from 1,244 worldwide Y-chromosome sequences". Nature Genetics. 48 (6): 593–9. doi:10.1038/ng.3559. PMC 4884158. PMID 27111036. In the past, other haplogroups, including NO (M214) and K2e had also been identified with the name "K2a".

[K2b-118] Haplogroup K2b (M1221/P331/PF5911) is also known as Haplogroup MPS.

[K2e-119] Haplogroup K2e (K-M147) was previously known as "Haplogroup X" and "K2a" (but is a sibling subclade of the present K2a).

[K-M2313-120] K-M2313*, which as yet has no phylogenetic name, has been documented in two living individuals, who have ethnic ties to India and South East Asia. In addition, K-Y28299, which appears to be a primary branch of K-M2313, has been found in three living individuals from India. See: Poznik op. cit.; YFull YTree v5.08, 2017, "K-M2335", and; PhyloTree, 2017, "Details of the Y-SNP markers included in the minimal Y tree" (Access date of these pages: 9 December 2017)

[K2b1-121] Haplogroup K2b1 (P397/P399) is also known as Haplogroup MS, but has a broader and more complex internal structure.

[P-122] Haplogroup P (P295) is also klnown as K2b2.

[S-123] Haplogroup S, as of 2017, is also known as K2b1a. (Previously the name Haplogroup S was assigned to K2b1a4.)

[M-124] Haplogroup M, as of 2017, is also known as K2b1b. (Previously the name Haplogroup M was assigned to K2b1d.)

[7] According to Family Tree, they diversified ca. 5,000 years ago.[6]

[21] Semenov & Bulat (2016) refer to the following publications:
5. Haak, Wolfgang (2015). "Massive migration from the steppe is a source for Indo-European languages in Europe". Nature. 522 (7555): 207–211. arXiv:1502.02783. Bibcode:2015Natur.522..207H. bioRxiv 10.1101/013433. doi:10.1038/NATURE14317. PMC 5048219. PMID 25731166.
6. Mathieson, Iain (2015). "Eight thousand years of natural selection in Europe". bioRxiv 10.1101/016477.
8. Chekunova Е.М., Yartseva N.V., Chekunov М.К., Мazurkevich А.N. The First Results of the Genotyping of the Aboriginals and Human Bone Remains of the Archeological Memorials of the Upper Podvin’e. // Archeology of the lake settlements of IV—II Thousands BC: The chronology of cultures and natural environment and climatic rhythms. Proceedings of the International Conference, Devoted to the 50-year Research of the Pile Settlements on the North-West of Russia. St. Petersburg, 13–15 November 2014.
9. Jones, ER; Gonzalez-Fortes, G; Connell, S; Siska, V; Eriksson, A; Martiniano, R; McLaughlin, RL; Gallego Llorente, M; Cassidy, LM; Gamba, C; Meshveliani, T; Bar-Yosef, O; Müller, W; Belfer-Cohen, A; Matskevich, Z; Jakeli, N; Higham, TF; Currat, M; Lordkipanidze, D; Hofreiter, M; Manica, A; Pinhasi, R; Bradley, DG (2015). "Upper Palaeolithic genomes reveal deep roots of modern Eurasians". Nat Commun. 6: 8912. Bibcode:2015NatCo...6.8912J. doi:10.1038/ncomms9912. PMC 4660371. PMID 26567969.

[23] Yet, Haak et al. also explicitly state: "...a type of Near Eastern ancestry different from that which was introduced by early farmers."[20]

[24] According to Family Tree DNA, L664 formed 4,700 ybp, that is, 2,700 BCE.[6]

[27] Lazaridis, Twitter, 18 juni 2016: "I1635 (Armenia_EBA) is R1b1-M415(xM269). We'll be sure to include in the revision. Thanks to the person who noticed! #ILovePreprints."
See also "Big deal of 2016: the territory of present-day Iran cannot be the Indo-European homeland". Eurogenes Blog. November 26, 2016, for a discussion of the same topic.

[30] See map for M780 distribution at Dieneke's Anthropology Blog, Major new article on the deep origins of Y-haplogroup R1a (Underhill et al. 2014)[24]

[FamilyTree-M780-31] According to Family Tree DNA, M780 formed 4700 ybp.[6] This dating coincides with the eastward movement between 2800 and 2600 BCE of the Yamnaya culture into the region of the Poltavka culture, a predecessor of the Sintashta culture, from which the Indo-Iranians originated. M780 is concentrated in the Ganges Valley, the locus of the classic Vedic society.

[32] Poznik et al. (2016) calculate with a generation time of 30 years; a generation time of 20 years yields other results.

[34] "The evidence that the Steppe_MLBA [Middle to Late Bronze Age] cluster is a plausible source for the Steppe ancestry in South Asia is also supported by Y chromosome evidence, as haplogroup R1a which is of the Z93 subtype common in South Asia today [Underhill et al. (2014), Silva et al. (2017)] was of high frequency in Steppe_MLBA (68%) (16), but rare in Steppe_EMBA [Early to Middle Bronze Age] (absent in our data)."[25]

[Kivisild2003-36] Kivisild et al. (2003): "Haplogroup R1a, previously associated with the putative Indo-Aryan invasion, was found at its highest frequency in Punjab but also at a relatively high frequency (26%) in the Chenchu tribe. This finding, together with the higher R1a-associated short tandem repeat diversity in India and Iran compared with Europe and central Asia, suggests that southern and western Asia might be the source of this haplogroup."[26]

[41] Lucotte (2015) dates the origin in the subcontinent to approximately 15,500 year before present[30] Datations show that the Z93 Pakistano-Indian group is the most ancient (about 15,5 K years); in Europe, the Eastern populations are the most ancient (about 12,5 K years) and the Northern ones the most recent.

[R1a-India-43] Sahoo et al. (2006): "... one should expect to observe dramatically lower genetic variation among Indian Rla lineages. In fact, the opposite is true: the STR haplotype diversity on the background of R1a in Central Asia (and also in Eastern Europe) has already been shown to be lower than that in India (6). Rather, the high incidence of R1* and Rla throughout Central Asian European populations (without R2 and R* in most cases) is more parsimoniously explained by gene flow in the opposite direction, possibly with an early founder effect in South or West Asia.[31]

[R1a-India2-44] Sharma et al. (2009): "A peculiar observation of the highest frequency (up to 72.22%) of Y-haplogroup R1a1* in Brahmins hinted at its presence as a founder lineage for this caste group. Further, observation of R1a1* in different tribal population groups, existence of Y-haplogroup R1a* in ancestors and extended phylogenetic analyses of the pooled dataset of 530 Indians, 224 Pakistanis and 276 Central Asians and Eurasians bearing the R1a1* haplogroup supported the autochthonous origin of R1a1 lineage in India and a tribal link to Indian Brahmins. However, it is important to discover novel Y-chromosomal binary marker(s) for a higher resolution of R1a1* and confirm the present conclusions."[1]

[47] Sengupta et al. (2006): "The widespread geographic distribution of HG R1a1-M17 across Eurasia and the current absence of informative subdivisions defined by binary markers leave uncertain the geographic origin of HG R1a1-M17. However, the contour map of R1a1-M17 variance shows the highest variance in the northwestern region of India [...] The question remains of how distinctive is the history of L1 relative to some or all of R1a1 and R2 representatives. This uncertainty neutralizes previous conclusions that the intrusion of HGs R1a1 and R2 from the northwest in Dravidian-speaking southern tribes is attributable to a single recent event. [R1a1 and R2] could have actually arrived in southern India from a southwestern Asian source region multiple times, with some episodes considerably earlier than others. Considerable archeological evidence exists regarding the presence of Mesolithic peoples in India (Kennedy 2000), some of whom could have entered the subcontinent from the northwest during the late Pleistocene epoch. The high variance of R1a1 in India (table 12), the spatial frequency distribution of R1a1 microsatellite variance clines (fig. 4), and expansion time (table 11) support this view."[27]

[92] Балановский (2015), p. 208 (in Russian) Прежде всего, это преобладание в славянских популяциях дославянского субстрата — двух ассимилированных ими генетических компонентов – восточноевропейского для западных и восточных славян и южноевропейского для южных славян...Можно с осторожностью предположить, что ассимилированный субстратмог быть представлен по преимуществу балтоязычными популяциями. Действительно, археологические данные указыва ют на очень широкое распространение балтских групп перед началом расселения славян. Балтскийсубстрату славян (правда, наряду с финно-угорским) выявляли и антропологи. Полученные нами генетические данные — и на графиках генетических взаимоотношений, и по доле общих фрагментов генома — указывают, что современные балтские народы являются ближайшими генетически ми соседями восточных славян. При этом балты являются и лингвистически ближайшими род ственниками славян. И можно полагать, что к моменту ассимиляции их генофонд не так сильно отличался от генофонда начавших свое широкое расселение славян. Поэтому если предположить,что расселяющиеся на восток славяне ассимилировали по преимуществу балтов, это может объяснить и сходство современных славянских и балтских народов друг с другом, и их отличия от окружающих их не балто-славянских групп Европы...В работе высказывается осторожное предположение, что ассимилированный субстрат мог быть представлен по преимуществу балтоязычными популяциями. Действительно, археологические данные указывают на очень широкое распространение балтских групп перед началом расселения славян. Балтский субстрат у славян (правда, наряду с финно-угорским) выявляли и антропологи. Полученные в этой работе генетические данные — и на графиках генетических взаимоотношений, и по доле общих фрагментов генома — указывают, что современные балтские народы являются ближайшими генетическими соседями восточных славян.

[FOOTNOTESharma_et_al.2009-1] Sharma et al. 2009.

[FOOTNOTEUnderhill_et_al.2014-2] Underhill et al. 2014.

[FOOTNOTEUnderhill_et_al.2009-3] Underhill et al. 2009.

[FOOTNOTEUnderhill_et_al.2014130-4] Underhill et al. 2014, p. 130.

[FOOTNOTEKarafet_et_al.2014-5] Karafet et al. 2014.

[yfull-R1a-6] "R1a tree". YFull.

[FOOTNOTEMirabal_et_al.2009-8] Mirabal et al. 2009.

[9] Zerjal, T.; et al. (1999). "The use of Y-chromosomal DNA variation to investigate population history: recent male spread in Asia and Europe". In S.S. Papiha; R. Deka & R. Chakraborty (eds.). Genomic diversity: applications in human population genetics. New York: Kluwer Academic/Plenum Publishers. pp. 91–101. ISBN 978-0-3064-6295-5.

[FOOTNOTESemino_et_al.2000-10] Semino et al. 2000.

[FOOTNOTEWells2001-11] Wells 2001.

[FOOTNOTEPamjav_et_al.2012-12] Pamjav et al. 2012.

[FOOTNOTEHaak_et_al.2015-13] Haak et al. 2015.

[FOOTNOTEAllentoft_et_al.2015-14] Allentoft et al. 2015.

[FOOTNOTEMathieson_et_al.2015-15] Mathieson et al. 2015.

[FOOTNOTEAnthony2007-16] Anthony 2007.

[FOOTNOTEAnthonyRinge2015-17] Anthony & Ringe 2015.

[FOOTNOTEHaak_et_al.20155-18] Haak et al. 2015, p. 5.

[FOOTNOTESemenovBulat2016-19] Semenov & Bulat 2016.

[FOOTNOTESemenovBulat201641-20] Semenov & Bulat 2016, p. 41.

[FOOTNOTEHaak_et_al.20154-22] Haak et al. 2015, p. 4.

[FOOTNOTEMascarenhas_et_al.20159-25] Mascarenhas et al. 2015, p. 9.

[FOOTNOTEPoznik_et_al.20165-26] Poznik et al. 2016, p. 5.

[28] Arame's English blog, Y DNA from ancient Near East

[29] "Dienekes' Anthropology Blog: Major new article on the deep origins of Y-haplogroup R1a (Underhill et al. 2014)". March 27, 2014. Retrieved December 20, 2019.

[FOOTNOTENarasimhan_et_al.2018-33] Narasimhan et al. 2018.

[FOOTNOTEKivisild_et_al.2003-35] Kivisild et al. 2003.

[FOOTNOTESengupta2006-37] Sengupta 2006.

[FOOTNOTESahoo_et_al.2006-38] Sahoo et al. 2006.

[FOOTNOTEThangaraj_et_al.2010-39] Thangaraj et al. 2010.

[40] Gérard, Lucotte (2015). "The Major Y-Chromosome Haplotype XI – Haplogroup R1a in Eurasia" (PDF). Hereditary Genetics.

[FOOTNOTESahoo_et_al.2006845-846-42] Sahoo et al. 2006, p. 845-846.

[Joseph_2017-45] Joseph, Tony (16 June 2017). "How genetics is settling the Aryan migration debate". The Hindu.

[FOOTNOTESilva_et_al.2017-46] Silva et al. 2017.

[48] ""Heavily sex-biased" population dispersals into the Indian Subcontinent (Silva et al. 2017)". Eurogenes Blog. March 28, 2017.

[ftdna-r1a-49] "About Us". Family Tree DNA. Retrieved December 20, 2019.

[ISOGG2016-50] "ISOGG 2017 Y-DNA Haplogroup R". isogg.org. Retrieved December 20, 2019.

[snpedia-51] "Haplogroup R (Y-DNA) - SNPedia". www.snpedia.com. Retrieved December 20, 2019.

[FOOTNOTEUnderhill_et_al.2014125-52] Underhill et al. 2014, p. 125.

[53] "R1a in Yamnaya". Eurogenes Blog. March 21, 2016. Archived from the original on 2018-05-05. Retrieved December 20, 2019.

[54] "Y-DNA Haplogroup R and its Subclades". International Society of Genetic Genealogy (ISOGG). Retrieved 8 January 2011.

[55] Krahn, Thomas. "Draft Y-Chromosome Tree". Family Tree DNA. Archived from the original on 2013-05-26. Retrieved 2012-12-07.

[FOOTNOTERegueiro2006-56] Regueiro 2006.

[57] J. Freder, Die mittelalterlichen Skelette von Usedom [The mediaeval skeletons of Usedom], Berlin 2010, p. 86 (Dissertation Free University Berlin 2010).

[58] Gwozdz, Peter (6 Aug 2018). "Polish Y-DNA Clades".

[FOOTNOTEPawlowski_et_al.2002-59] Pawlowski et al. 2002.

[FOOTNOTEGwozdz2009-60] Gwozdz 2009.

[FOOTNOTESaag_et_al.20205-61] Saag et al. 2020, p. 5.

[FOOTNOTESaag_et_al.202029Table_1-62] Saag et al. 2020, p. 29, Table 1.

[FOOTNOTESaag_et_al.2020Supplementary_Data_2,_Row_4-63] Saag et al. 2020, Supplementary Data 2, Row 4.

[FOOTNOTEFu_et_al.2016-64] Fu et al. 2016.

[FOOTNOTESaag_et_al.2017-65] Saag et al. 2017.

[FOOTNOTEAnthony201916,_17-66] Anthony 2019, pp. 16, 17.

[FOOTNOTEHaak_et_al.2008-67] Haak et al. 2008.

[FOOTNOTEBrandit_et_al.2013-68] Brandit et al. 2013.

[FOOTNOTEMalmström_et_al.20192-69] Malmström et al. 2019, p. 2.

[FOOTNOTESaag_et_al.2020Supplementary_Data_2,_Rows_5-49-70] Saag et al. 2020, Supplementary Data 2, Rows 5-49.

[71] Schweitzer, D. (23 March 2008). "Lichtenstein Cave Data Analysis" (PDF). dirkschweitzer.net. Archived from the original (PDF) on 14 August 2011. Summary in English of Schilz (2006).

[FOOTNOTEKeyser_et_al.2009-72] Keyser et al. 2009.

[FOOTNOTERicaut_et_al.2004-73] Ricaut et al. 2004.

[74] Корниенко И. В., Водолажский Д. И. Использование нерекомбинантных маркеров Y-хромосомы в исследованиях древних популяций (на примере поселения Танаис)//Материалы Донских антропологических чтений. Ростов-на-Дону, Ростовский научно-исследовательский онкологический институт, Ростов-на-Дону, 2013.

[FOOTNOTEChunxiang_Li_et_al.2010-75] Chunxiang Li et al. 2010.

[FOOTNOTEKim_et_al.2010-76] Kim et al. 2010.

[FOOTNOTEBalanovsky_et_al.2008-77] Balanovsky et al. 2008.

[FOOTNOTEBehar_et_al.2003-78] Behar et al. 2003.

[FOOTNOTEKasperaviciūteKucinskasStoneking2005-79] Kasperaviciūte, Kucinskas & Stoneking 2005.

[FOOTNOTEBattaglia_et_al.2008-80] Battaglia et al. 2008.

[FOOTNOTERosser_et_al.2000-81] Rosser et al. 2000.

[FOOTNOTETambets_et_al.2004-82] Tambets et al. 2004.

[FOOTNOTEBowden_et_al.2008-83] Bowden et al. 2008.

[FOOTNOTEDupuy_et_al.2005-84] Dupuy et al. 2005.

[FOOTNOTEPassarino_et_al.2002-85] Passarino et al. 2002.

[FOOTNOTECapelli_et_al.2003-86] Capelli et al. 2003.

[FOOTNOTEKayser_et_al.2005-87] Kayser et al. 2005.

[FOOTNOTEScozzari_et_al.2001-88] Scozzari et al. 2001.

[FOOTNOTEPericić_et_al.2005-89] Pericić et al. 2005.

[90] "UNTITLED". pereformat.ru (in Russian).

[91] "UNTITLED". www.rodstvo.ru.

[FOOTNOTEZerjal_et_al.2002-93] Zerjal et al. 2002.

[FOOTNOTEHaber_et_al.2012-94] Haber et al. 2012.

[FOOTNOTEDi_Cristofaro_et_al.2013-95] Di Cristofaro et al. 2013.

[FOOTNOTEMalyarchuk_et_al.2013-96] Malyarchuk et al. 2013.

[FOOTNOTEAshirbekov_et_al.2017-97] Ashirbekov et al. 2017.

[FOOTNOTEArunkumar2012-98] Arunkumar 2012.

[99] Toomas Kivisild; Siiri Rootsi; Mait Metspalu; Ene Metspalu; Juri Parik; Katrin Kaldma; Esien Usanga; Sarabjit Mastana; Surinder S. Papiha; Richard Villems. "The Genetics of Language and Farming Spread in India" (PDF). In P. Bellwwood; C. Renfrew (eds.). Examining the farming/language dispersal hypothesis. McDonald Institute Monographs. Cambridge University. pp. 215–222. Retrieved December 20, 2019.

[FOOTNOTEFornarino_et_al.2009-100] Fornarino et al. 2009.

[FOOTNOTEWang_et_al.2003-101] Wang et al. 2003.

[FOOTNOTEZhou_et_al.2007-102] Zhou et al. 2007.

[FOOTNOTELiu_Shu-hu_et_al.2018-103] Liu Shu-hu et al. 2018.

[FOOTNOTEZhong_et_al.2011-104] Zhong et al. 2011.

[FOOTNOTEYan_et_al.2014-105] Yan et al. 2014.

[FOOTNOTELell_et_al.2002-106] Lell et al. 2002.

[FOOTNOTEMohammad_et_al.2009-107] Mohammad et al. 2009.

[FOOTNOTENasidze_et_al.2004-108] Nasidze et al. 2004.

[FOOTNOTENasidze_et_al.2005-109] Nasidze et al. 2005.

[FOOTNOTEGrugni_et_al.2012-110] Grugni et al. 2012.

Haplogroup R1a

Origins

R1a origins

Diversification of R1a1a1 (M417) and ancient migrations

Steppe origins

Proposed steppe dispersal of R1a1a

Source of R1a1a1 in Corded Ware culture

Transcaucasia & West Asian origins and possible influence on Indus Valley Civilization

Proposed South Asian origins

Phylogeny

Topology

Haplogroup R

R-M173 (R1)

R-M420 (R1a)

R-SRY1532.2 (R1a1)

R-M17/M198 (R1a1a)

R-M417 (R1a1a1)

R-Z282 (R1a1a1b1a) (Eastern Europe)

R-M458 (R1a1a1b1a1)

R-L260 (R1a1a1b1a1a) (Gwozdz's cluster P)

R-M334

R1a1a1b1a2 (S466/Z280, S204/Z91)

R1a1a1b1a2b3* (Gwozdz's Cluster K)

R1a1a1b1a2b3a (R-L365)

R1a1a1b2 (R-Z93) (Asia)

Geographic distribution of R1a1a

Historical

Europe

Asia

Central Asia

South Asia

East Asia

West Asia

Historic naming of R1a

See also

Y-DNA R-M207 subclades

Y-DNA backbone tree

Notes

References

Sources

Further reading

External links