Spurious languages


Spurious languages are languages that have been reported as existing in reputable works, while other research has reported that the language in question did not exist. Some spurious languages have been proven to not exist. Others have very little evidence supporting their existence, and have been dismissed in later scholarship. Others still are of uncertain existence due to limited research.
Below is a sampling of languages that have been claimed to exist in reputable sources but have subsequently been disproved or challenged. In some cases a purported language is tracked down and turns out to be another, known language. This is common when language varieties are named after places or ethnicities.
Some alleged languages turn out to be hoaxes, such as the Kukurá language of Brazil or the Taensa language of Louisiana. Others are honest errors that persist in the literature despite being corrected by the original authors; an example of this is , the name given in 1892 to two Colonial word lists, one of Tlingit and one of a Salishan language, that were mistakenly listed as Patagonian. The error was corrected three times that year, but nonetheless "Hongote" was still listed as a Patagonian language a century later in Greenberg.
In the case of New Guinea, one of the most linguistically diverse areas on Earth, some spurious languages are simply the names of language surveys that the data was published under. Examples are , Kia,, , listed as Indo-Pacific languages in ; these are actually rivers that gave their names to language surveys in the Greater Awyu languages and Ok languages of New Guinea.

Dubious languages

Dubious languages are those whose existence is uncertain. They include:
Glottolog, maintained at the Max Planck Institute for Evolutionary Anthropology in Leipzig, classifies several languages, some with ISO 639 codes, as spurious/unattested. These include:
Also some Aeta ethnic names with no cultural memory of an extinct language:
Other ISO codes that Glottolog regards as spurious, because they are not a distinct language, are polyphyletic, or have not been shown to exist, include:
Ir ,
Skagit ,
Snohomish ,
Ahirani ,
Pokangá ,
Chetco ,
Arakwal ,
Anasi ,
Yarí* ,
Yola ,
Seru* ,
Gowli ,
Mina ,
Degaru* ,
Bubia ,
Gbati-ri ,
Tetete* ,
Kannada Kurumba ,
Vatrata* ,
Kofa* ,
Old Turkish ,
Tingui-Boto* ,
Imeraguen ,
Yauma ,
Rufiji ,
Ngong ,
Dombe ,
Subi* ,
Mawayana* ,
Kwak ,
Potiguára* ,
Coxima* ,
Chipiajes* ,
Cagua* ,
Kakauhua ,
Yangho* ,
Takpa* ,
N'Ko ,
Sara Dunjo ,
Putoh ,
Bainouk-Samik ,
Kamba * ,
Bikaru-Bragge* ,
Baga Binari ,
Baga Sobané ,
Ontenu* ,
Baga Kaloum ,
Munda ,
Aduge* ,
Khalaj** ,
Buso* ,
Uokha* ,
Ihievbe* ,
Coyaima* ,
Natagaimas* ,
Odut* ,
Chilean Quechua ,
Quetzaltepec Mixe ,
Kang ,
Thu Lao ,
Pu Ko* ,
Gey ,
Kakihum* ,
Bonjo* ,
Katukína* ,
Warduji ,
Lui ,
Lama ,
Inpui Naga* ,
Puimei Naga* ,
Purum ,
Welaung* ,
Lumba-Yakkha* ,
Phangduwali / Lambichhong* ,
Lingkhim ,
Northwestern Tamang ,
Southwestern Tamang ,
Kayort* ,
Loarki ,
Con ,
Gengle ,
Kuanhua* ,
Yarsun ,
Kabixí* ,
Vasekela Bushman ,
Maligo ,
Pao ,
Bhalay / Gowlan* ,
Balau* ,
Kuku-Mangk ,
Buya* ,
Aramanik ,
Mediak ,
Kisankasa ,
Southwestern Nisu ,
Tawang Monpa* ,
Adap ,
Southern Lolopo ,
Eastern Lalu ,
Ndonde Hamba* ,
Lang'e* ,
Lopi* ,
Laopang ,
Kunggara ,
Chuanqiandian Cluster Miao ,
Karipuna do Amapa*

Spurious according to Ethnologue and ISO 639-3

Following is a list of ISO 639-3 language codes which have been retired since the standard was established in 2006, arranged by the year in which the change request was submitted; in most cases the actual retirement took effect in the beginning of the following year. Also included is a partial list of languages that appeared at one time in Ethnologue but were removed prior to 2006, arranged by the first edition in which they did not appear.
The list includes codes that have been retired from ISO 639-3 or languages removed from Ethnologue because the language apparently does not exist and cannot be identified with an existing language. The list does not include instances where the "language" turns out to be a spelling variant of another language or the name of a village where an already known language is spoken; these are cases of duplicates, which are resolved in ISO 639-3 by a code merger. It does include "languages" for which there is no evidence or which cannot be found.
SIL codes are upper case; ISO codes are lower case. Once retired, ISO 639-3 codes are not reused. SIL codes that were retired prior to 2006 may have been re-used or may have reappeared as ISO codes for other languages.

Removed from Ethnologue, 12th ed., 1992

And several supposed extinct Arawakan languages of Venezuela and Colombia:
Additional languages and codes were retired in 2015, due to a lack of evidence that they existed, but were not necessarily spurious as languages.

Retired 2016