Haplogroup O-K18


Haplogroup O-K18 also known as O-F2320 and Haplogroup O1b1, is a human Y-chromosome DNA haplogroup. Haplogroup O-K18 is a descendant branch of Haplogroup O-P31. Based on its disjunct distribution, O-K18 can be further divided into south subclade O1b1a1-PK4 and north subclade O1b1a2-PAGE59. O-PAGE59 is widely distributed in East Asia, whereas O-PK4 is more frequent in South China and Southeast Asia. O-PK4 is best known for the high frequency of its O-M95 subclade among populations of Southeast Asia and among speakers of Austroasiatic languages in South Asia.

Origin

In a paper published in 2011 by a group of Chinese researchers affiliated with Fudan University, it has been suggested that China is the origin of the expansion of haplogroup O-M268, the parent haplogroup of O-F2320.

Distribution

Haplogroup O-K18 is distributed widely in Asia, from southern India to the Altai Mountains and Central Asia in the west, and from Indonesia to northern China and Japan in the east. According to its distribution, O-K18 can be roughly divided into north subclade O-PAGE59 and south subclade O-PK4. O-PAGE59 are relatively rare and are more abundant in Northern and Eastern parts of China. It is also found at low frequencies of approximately 1% or less at the periphery of its distribution in other Indo-Pacific area like Vietnamese, Koreans, Japanese, West Kalimantan, Hazaras, and Arabs. The other haplogroup O-PK4 consists of O-F838 and O-M95. O-F838 are more frequent in the South Han in China, showing the same trend with the its parallel branch O-M95 in China. The other branch, O-M95, is the best known subclade for the whole Y Haplogroup O-K18. O-M95 is found only at marginally low frequencies of approximately 1% at the periphery of its distribution in southern India, Central Asia, northern China, and Japan, but many populations within the vast intervening territory in South Asia, Southeast Asia, and southern China display a greatly elevated frequency of Haplogroup O-M95 Y-chromosomes. Haplogroup O-M122, which attains its peak frequency among the Sino-Tibetan and Hmong–Mien peoples of China and Southeast Asia, and Haplogroup O-M119, which predominates among Taiwanese aborigines and many populations of the Philippines, also generally occur among speakers of Austroasiatic languages in South China and the Indochinese Peninsula, but usually at much lower frequencies than Haplogroup O-M95.
According to the National Geographic project regarding O-M95: The Austro-Asiatic language family developed in groups containing men from this lineage. As these groups spread across Southeast Asia in successive waves, they spread their language. Today, the distribution of men from this lineage matches the pattern of these waves of migration. It is 42 percent of male lineages in Java, 40 percent of male lineages in Vietnam, and 38 percent of male lineages in Borneo. It accounts for 28 percent of the male population in Malaysia. It is present in Sumatra in about 14 percent of the male population. In mainland China, it is, on average, about 3 percent of the male population. In South Asia, it is 9 percent of the Pardhan, between 1 and 2 percent of the Andh, and 10 percent of the Naikpod. It is around 59 percent of Balinese male lineages.
PopulationFrequencynSourceSNPs
Shompen1.0012M95
Nicobarese1.0011M95
Juang0.98049M95
Bonda0.95242M95
Juang0.88954M95
Gadaba0.88927M95
Birhor0.88762M95
Lamet0.85735M95
Ho0.84445M95
Han Chinese 0.83831M95
Korku0.81459M95
Baiga 0.81042M95
Inh0.79434M95
Mawasi 0.71839M95
Kharia 0.70634M95=24
Katu0.68945M95
Santhal0.68990M95=62
Mal0.66050M95
Ho0.65879M95
Bo0.64328M95
Talieng0.62935M95
Brau0.62532M95
Khmu0.60851M95
Asur0.60288M95
Oy0.60050M95
Korwa0.59542M95
Li 0.58834M95
Balinese0.573641M95
Alak0.56730M95
Suy0.56439M95
Mech0.55020M95=11
Bit0.53628M95
Ho0.53628M95=15
Aheu0.52638M95
Savara0.52421M95
Baiga 0.52223M95
Blang
0.50028M95=14
Bugan0.50032M95
Java 0.49261M95
Kharia0.48637M95
Ngeq0.48635M95
Birhor0.480100M95=48
Naxi 0.47540M95=19
Jeh0.46932M95
Santhal0.468109M95
Bouyei0.46745M95=21
Munda0.45353M95
Lachung0.45020M95=9
Lawa
0.44050M95=22
Sui0.44050M95=22
Chin
0.42119M95=8
Laven0.42050M95
So0.42050M95
Buyi 0.41748M95=20
Muong0.41712M95
Khasi0.41392M95
Kharia0.38936M95
Zhuang 0.38347M95=18
Santhal 0.37351M95=19
Buyi0.37135M95
Kol 0.37162M95=23
Java0.362141M95
Lao 0.36025M95
Munda0.35194M95=33
Santhal0.35020M95
Mon
0.33318M95=6
Kinh0.33315M95
Oraon0.31991M95
Banjarmasin, Indonesia0.31822M95
Malaysia0.31332M95
Mountain Kimmun0.31332M95
Blang0.30852M95
Miao 0.30649M95
Cham 0.30559M95
Khasi0.303353M95
Shan
0.30020M95=6
Sulawesi0.29417M95
Dai 0.28649M95=14
Lisu 0.28649M95=14
Thailand0.25375M95
Tai Khün
0.25024M95=6
Miao 0.25048M95=12
Zhuang0.25028M95=7
Lowland Kimmun0.24441M95
Tai Lue
0.24291M95=22
Zhuang 0.235166M95
Palyu0.23330M95
Cambodian0.23126M95=6
Khon Mueang
0.229205M95=47
Wa
0.22631M95=7
Tai Yuan
0.22485M95=18
M95=1
Ambon0.22218M95
Kota Kinabalu, Malaysia0.21565M95
Dai 0.21428M95=6
Northern Mien0.21233M95
Flower-head Mien0.21119M95
Mech 0.21119M95=4
Borneo 0.20986M95
Dai 0.20035M95=7
Dong0.20020M95=4
Southern Mien0.19431M95
CDX
0.19252Z1018=4
Z23849=2
F2517=2
Y26463=1
Y9322*=1
Sumatra0.19226M95
Zhuang
0.19063M95=12
Mahali 0.18832M95
Garo0.18233M95
Blue Kimmun0.17928M95
Thai 0.17617M95
Sui 0.17540M95=7
Xinhmul0.17229M95
Malagasy0.17135M95
Miao 0.16349M95
Lowland Yao0.16131M95
Batak Toba 0.15838M95
Rajbanshi
0.15645M95=7
Daur0.15439M95
Bamar
0.15372M95=11
Western Mien0.14947M95
Paharia
0.130100M95=13
Mandar 0.13054M95
Pahng0.12931M95
De'ang
0.12516M95=2
Bai 0.12050M95=6
Kalimantan0.12025M95
Kinh 0.11876M95
Garo0.11371M95
Oraon0.109110M95=12
Yao 0.10647M95=5
Kataang0.10837M95
Bai 0.10030M95=3
Yami0.10030M95
Han 0.09265PK4
Qiang0.09133M95
Top Board Mien0.09111M95
Thin Board Mien0.09111M95
Miao 0.090100M95
She0.08834M95
Yi 0.08547M95=4
Hanoi, Vietnam0.08324M95
Tai Yong
0.07726M95=2
Native Mien0.07341M95
Vietnamese0.07170M95
Minnan 0.06760M95
Han Chinese 0.061165M95
Hmong Daw 0.05951M95
Hakka 0.05934M95
Hani0.05934M95
Jingpo 0.05917M95=1
Yao 0.05735M95
Bunu0.05636M95
Dai 0.05020M95=1
Kapingamarangi0.04821M95
Flores0.046394M95
Jino 0.04445M95=2
Japanese0.04347M95
Lahu 0.04347M95=2
Utsat 0.04272M95=3
Tibetan 0.04050M95=2
Western Samoa0.04025M95
Ewenki 0.03826M95
Akha 0.03727M95
Bengali 0.03754M95=2
Northern She 0.03656M95
Han 0.035258M95
Miao 0.03458M95
Han 0.03330M95
Han 0.03132M95
Han 0.031129PK4
Han 0.02934M95
Dhimal 0.02836M95=1
Achang 0.02540M95=1
Tharu 0.024164M95
Hani 0.02441M95=1
Han 0.024167PK4
Japanese0.024210M95
Tharu 0.02245M95
Mongol 0.02246M95=1
Han 0.02147M95=1
Philippines0.02148M95
Nu 0.02050M95=1
Tharu 0.020202
M95
Rajbanshi 0.02051M95=1
Uzbek0.01954M95
Han 0.01855M95
Japanese0.01856M95
Yao 0.01760M95
Taiwan plains tribes0.016370M95
Japanese0.0132390M95
Japanese0.008263M95
Sumba0.003350M95
Hui 0.00015M95=0
Dulong 0.00028M95=0
Manchu 0.00041M95=0
Mosuo 0.00047M95=0
Pumi 0.00047M95=0
Philippines0.000146M95
Taiwan mountain aborigines0.000325M95

Subclade Distribution

O-K18

O-CTS10887/O-PAGE59
O-PAGE59 is relatively rare and is usually marked as O1B*/O2*-M268 in the past academic report. It shares a common ancestor with its outgroup, O-PK4, approximately 24,400 years before present according to 23mofang.
It is mainly distributed in East Asia and is mainly found in Han Chinese and occasionally found in Taiwan plains tribes, Vietnamese, Dai, Filipinos, Koreans, Japanese, West Kalimantan, Hazaras, and Arabs. TMRCA of Han Chinese, Dai, Vietnamese, and Japanese members estimated to be 15,900 ybp. Relative paper illustrates O-P31/M268 is found in North China, East China and South China.
Analysis of DNA extracted from a tooth from what are believed to be the remains of Cao Ding shows that he belonged to this clade. The researchers also found that the Y-chromosome of Cao Ding matches those of self-proclaimed living descendants of Cao Cao who hold lineage records dating back to more than 100 generations ago. Cao Cao laid the foundation of Cao Wei, one of three major states that succeeded the Han Dynasty of China.
In Yangshao culture, there is an ancient male who belongs to O-PAGE59 haplogroup in WangGou site. This is currently the earliest discovered ancient DNA related to O-PAGE59.

O-PK4

Coalescence age of O-PK4 is 12,900 years before present and it mainly consists of two downstream O-F838 and O-M95. It is best known for the high frequency of its O-M95 subclade among populations of Southeast Asia and among speakers of Austroasiatic languages in South Asia.
O-F838
This lineage has been relocated upstream of M95 following a paper published on the subject in 2011. Found in three samples of Han Chinese: 3/65 = 4.6% South China, 1/129 = 0.8% North China, 1/167 = 0.6% East China.
Peng et al. found O-PK4, which probably should belong to O-F838 according to the phylogenetic tree of human Y-DNA as it is currently resolved, in a Bamar individual in Ayeyarwady Region, Myanmar.
Trejaut et al. found O-PK4 in one of 18 individuals sampled on Ambon Island, Indonesia, one of 24 individuals sampled in Hanoi, Vietnam, six of 258 miscellaneous Han volunteers in Taiwan, one of 60 Minnan in Taiwan, and one of 85 Siraya in Pingtung, Taiwan.
Wang et al. found O-PK4 in two of a sample of 46 Khams Tibetans from Xinlong County, Sichuan.

O-M95

This subclade is downstream from O-PK4. It reaches high frequencies among the populations of the islands of Sumatra, Java, Bali, and Borneo in western Indonesia . It has been found to be by far the most common Y-chromosome haplogroup among the Balinese, occurring in approximately 58.6% of a sample of Balinese men. It has been found in 17.1% of a sample of Malagasy in Madagascar and in 1.7% of a sample of Swahili people in Kilifi, Kenya. It is one of the most frequently occurring Y-DNA haplogroups among men in Malaysia, Thailand, Laos, Cambodia, Vietnam, and Myanmar. It is also very common among minority ethnic groups in India and China, especially those who have ethnolinguistic connections with populations in Southeast Asia.
O-M95 is relatively infrequent in other populations, but a study published in 2006 has found it in samples of Daurs, Qiang people, She people, Hani people, Yao people in Liannan, Guangdong, Japanese people, Evenks in China, Han Chinese in Lanzhou, Gansu, Han Chinese in Yili, Xinjiang, Han Chinese in Chengdu, Sichuan, and Yao people in Bama, Guangxi.
A study published in 2010 found O-M95 in 57.3% Bali, 49.2% Java, 31.3% Malaysia, 20.9% Borneo, 15.8% Toba people in Sumatra, 13.0% Mandar people in Sulawesi, 7.1% Vietnam, 6.1% Han Chinese, 4.6% Flores, 3.4% Miao in China, 2.1% Philippines, 1.7% Yao in China, and 0.3% Sumba.
Trejaut et al. found O-M95 in 36.2% Java, 29.4% Sulawesi, 25.3% general population of Thailand, 25% Malaysia, 22.2% Ambon, 19.2% Sumatra, 12.0% Kalimantan, 10.0% Yami, 8.3% Hanoi, Vietnam, 6.7% Minnan in Taiwan, 5.9% Hakka in Taiwan, 3.7% Akka in Thailand, 3.5% miscellaneous Han in Taiwan, 1.8% Han in Fujian, 1.6% Taiwan Plains Tribes. The authors did not find any cases of O-M95 among their samples from the Philippines or Taiwan Highlands Tribes.

O-M88

This subclade is downstream from O-M95. According to YFull YTree v7.03.00, all extant members of O-M88, which is also known as O-M111, share a most recent common ancestor approximately 6,200 years before present. The entire O-M88 clade is estimated to share a most recent common ancestor with its nearest outgroup, O-CTS5854, approximately 9,700 years before present.
O-M88 is frequently found among Tai peoples, Vietnamese people, Hani-Akha people, She people, and some tribal peoples in Laos, with a moderate distribution among Cambodians, Qiang people, Yi people, Tujia people, Hlai, Miao, Yao, Cham people, Taiwanese aborigines, populations of Borneo, the Philippines, and Malaysia , Han Chinese of Sichuan, Hunan, Guangxi, Guangdong, Yunnan, and Taiwan.
Trejaut et al. found O-M88 in 37.5% Bunun, 25.9% Akka in Thailand, 25.0% Hanoi, Vietnam, 17.3% general population of Thailand, 5.0% Java, 3.4% Philippines, 3.3% Yami, 2.9% Hakka in Taiwan, 1.7% Minnan in Taiwan, 1.55% Han in Taiwan, and 0.54% Taiwan Plains Tribes.

O-M297

More research is needed on this lineage. It is claimed to be downstream from M95 and parallel to M88.

Phylogenetics

Phylogenetic history

Prior to 2002, there were in academic literature at least seven naming systems for the Y-Chromosome Phylogenetic tree. This led to considerable confusion. In 2002, the major research groups came together and formed the Y-Chromosome Consortium. They published a joint paper that created a single new tree that all agreed to use. Later, a group of citizen scientists with an interest in population genetics and genetic genealogy formed a working group to create an amateur tree aiming at being above all timely. The table below brings together all of these works at the point of the landmark 2002 YCC Tree. This allows a researcher reviewing older published literature to quickly move between nomenclatures.
YCC 2002/2008 ''''''YCC 2002 YCC 2005 YCC 2008 YCC 2010r ISOGG 2006ISOGG 2007ISOGG 2008ISOGG 2009ISOGG 2010ISOGG 2011ISOGG 2012
O-M17526VII1U28Eu16H9IO*OOOOOOOOOO
O-M11926VII1U32Eu16H9HO1*O1aO1aO1aO1aO1aO1aO1aO1aO1aO1a
O-M10126VII1U32Eu16H9HO1aO1a1O1a1aO1a1aO1a1O1a1O1a1aO1a1aO1a1aO1a1aO1a1a
O-M5026VII1U32Eu16H10HO1bO1a2O1a2O1a2O1a2O1a2O1a2O1a2O1a2O1a2O1a2
O-P3126VII1U33Eu16H5IO2*O2O2O2O2O2O2O2O2O2O2
O-M9526VII1U34Eu16H11GO2a*O2aO2aO2aO2aO2aO2aO2aO2aO2a1O2a1
O-M8826VII1U34Eu16H12GO2a1O2a1O2a1O2a1O2a1O2a1O2a1O2a1O2a1O2a1aO2a1a
O-SRY46520VII1U35Eu16H5IO2b*O2bO2bO2bO2bO2bO2bO2bO2bO2bO2b
O-47z5VII1U26Eu16H5IO2b1O2b1aO2b1O2b1O2b1aO2b1aO2b1O2b1O2b1O2b1O2b1
O-M12226VII1U29Eu16H6LO3*O3O3O3O3O3O3O3O3O3O3
O-M12126VII1U29Eu16H6LO3aO3aO3a1O3a1O3a1O3a1O3a1O3a1O3a1O3a1aO3a1a
O-M16426VII1U29Eu16H6LO3bO3bO3a2O3a2O3a2O3a2O3a2O3a2O3a2O3a1bO3a1b
O-M15913VII1U31Eu16H6LO3cO3cO3a3aO3a3aO3a3O3a3O3a3aO3a3aO3a3aO3a3aO3a3a
O-M726VII1U29Eu16H7LO3d*O3cO3a3bO3a3bO3a4O3a4O3a3bO3a3bO3a3bO3a2bO3a2b
O-M11326VII1U29Eu16H7LO3d1O3c1O3a3b1O3a3b1-O3a4aO3a3b1O3a3b1O3a3b1O3a2b1O3a2b1
O-M13426VII1U30Eu16H8LO3e*O3dO3a3cO3a3cO3a5O3a5O3a3cO3a3cO3a3cO3a2c1O3a2c1
O-M11726VII1U30Eu16H8LO3e1*O3d1O3a3c1O3a3c1O3a5aO3a5aO3a3c1O3a3c1O3a3c1O3a2c1aO3a2c1a
O-M16226VII1U30Eu16H8LO3e1aO3d1aO3a3c1aO3a3c1aO3a5a1O3a5a1O3a3c1aO3a3c1aO3a3c1aO3a2c1a1O3a2c1a1

Research publications

The following research teams per their publications were represented in the creation of the YCC Tree.

Phylogenetic trees

This phylogenetic tree of haplogroup O subclades is based on the YCC 2008 tree and subsequent published research.

Genetics

Y-DNA O subclades

Y-DNA backbone tree

Footnotes

Works cited

Books
Conference Posters
Journals