A human language is any language used by people to communicate or convey ideas.
The human language type is used for languages that are members of the lowest-level language family. For example, the Kapampangan language belongs to the hierarchy of language families Central Luzon (lowest level) - Northern Philippine - Philippine - Malayo-Polynesian - Austronesian (highest level).
For more information, please see the Freebase wiki page on human language.
More about Best Human Language of All Time:
Best Human Language of All Time is a public top list created by Listnerd on Rankly.com on November 27th 2012. Items on the Best Human Language of All Time top list are added by the Rankly.com community and ranked using our secret ranking sauce. Best Human Language of All Time has gotten 2.125 views and has gathered 624 votes from 624 voters. Only owner can add items. Just members can vote.
Best Human Language of All Time is a top list in the Education & Books category on Rankly.com. Are you a fan of Education & Books or Best Human Language of All Time? Explore more top 100 lists about Education & Books on Rankly.com or participate in ranking the stuff already on the all time Best Human Language of All Time top list below.
If you're not a member of Rankly.com, you should consider becoming one. Registration is fast, free and easy. At Rankly.com, we aim to give you the best of everything - including stuff like the Best Human Language of All Time list.
Get your friends to vote! Spread this URL or share:
Tocharian or Tokharian (/təˈkɛəriən/ or /təˈkɑriən/) —also known as Agni-Kuchi or Agni-Kuči (outdated: Arśi-Kuči)— is an extinct branch of the Indo-European language family, formerly spoken in oases on the northern edge of the Tarim Basin (now part of the Xinjiang Uyghur Autonomous Region of China). Two branches of Tocharian are known from documents dating from the 3rd to 9th centuries AD:
Prakrit documents from 3rd century Kroran on the southeast edge of the Tarim Basin contain loanwords and names that appear to come from another variety of Tocharian, dubbed Tocharian C. All these languages became extinct after Uyghur tribes expanded into the area.
The existence of the Tocharian languages and alphabet was not even suspected until archaeological exploration of the Tarim basin by Aurel Stein in the early 20th century brought to light fragments of manuscripts in an unknown language.
It soon became clear that these fragments were actually written in two distinct but related languages belonging to a hitherto unknown branch of Indo-European, now known as Tocharian. The discovery of Tocharian upset some theories about the relations of Indo-European languages and revitalized their study.
Bavarian (German: Bairisch (help·info), Austro-Bavarian: Boarisch, IPA: /bɑrɪʃ/, is a major group of Upper German varieties spoken in the southeast of the German language area: Bavaria.
The Bavarian regiolect has its origins in the Germanic tribe known as the Bavarii, who established a tribal duchy, which covered much of what is today Bavaria and some of Austria in the early Middle Ages and was eventually subdued by Charlemagne. However, they gradually migrated down the Danube and into the Alps to all those areas where Bavarian dialects are spoken. German linguists refer to this speech variety, a group of three East Upper German dialects, as Bairisch "Bavarian". They are divided into Oberpfälzisch (Upper Palatinian, i.e. North[ern] Bavarian), Donaubairisch (Danube or Danubian Bavarian, i.e. Central Bavarian) and Alpenbairisch (Alpine Bavarian, i.e. South[ern] Bavarian).
These areas had been provinces of the Roman Empire, and the languages of the population were based on Latin, but this language was replaced by the Germanic dialects of the immigrants as the previous inhabitants were assimilated or forced out. This development contrasts with that in the provinces of Gallia and
The Thracian language was the Indo-European language spoken in ancient times in Southeastern Europe by the Thracians, the northern neighbors of the Ancient Greeks. The Thracian language exhibits satemization: it either belonged to the Satem group of Indo-European languages or it was strongly influenced by Satem languages. The language, of which little is known from written sources was extinct by the Early Middle Ages.
Excluding Dacian (either because it may have been a distinct Thracic language, or because, as some linguists hypothesize, it was a different branch of Indo-European) the Thracian language was spoken in what is now the southern half of Bulgaria, eastern Republic of Macedonia, Northern Greece, European Turkey and in parts of Bithynia (North-Western Asiatic Turkey).
Eastern Serbia is usually considered by paleolinguists to have been a Daco-Moesian language area. Moesian (after Vladimir Georgiev et al.) is grouped with Dacian.
Little is known for certain about the Thracian language, since no phrase beyond a few words in length has been satisfactorily deciphered, and the sounder decipherments given for the shorter phrases may not be completely accurate. Some of the longer
Gheg (or Geg) is one of the two major varieties of Albanian language. The other one is Tosk, on which standard Albanian is based. The dividing line between these two varieties is the Shkumbin River, which winds its way through central Albania.
Gheg is spoken in Northern Albania, Kosovo, northwestern Republic of Macedonia, southern Montenegro, Serbia, and Turkey. It is also spoken in parts of Sicily and southern Italy.
There are still some authors who write in Gheg. Gheg is used on television and in newspapers in Kosovo only.
Gheg has several dialects, notably:
Italian linguist Carlo Tagliavini puts the Gheg speech of Kosovo and Macedonia in "Eastern Geg".
Assimilations are common in Gheg, but are not part of the Albanian literary language which only follows Tosk Albanian.
Shanghainese (上海閒話 [z̥ɑ̃̀héɦɛ̀ɦʊ̀]/上海言话 in Shanghainese), or the Shanghai language (simplified Chinese: 上海话 or 沪语; traditional Chinese: 上海話 or 滬語), is a dialect of Wu Chinese spoken in the city of Shanghai and the surrounding region. It is classified as part of the Sino-Tibetan family of languages. Shanghainese, like other Wu dialects, is largely not mutually intelligible with other Chinese varieties such as Mandarin. The term "Shanghainese" in English sometimes refers to all Wu Chinese dialects, though it is only partially intelligible with some other subbranches of the Wu language group.
Shanghainese is a representative dialect of Northern Wu; it contains vocabulary and expressions from the entire Northern Wu area (southern Jiangsu, northern Zhejiang). With nearly 14 million speakers, Shanghainese is also the largest single coherent form of Wu Chinese. It once served as the regional lingua franca of the entire Yangtze River Delta region.
Shanghainese is rich in consonants and pure vowels [i y ɪ e ø ɛ ə ɐ a ɑ ɔ ɤ o ʊ u] (of which 8 are phonemic). Like other northern Wu dialects, the Shanghai dialect has voiced initials [b d ɡ ɦ z v dʑ ʑ]. Neither Mandarin nor Cantonese has voiced
Yue, commonly known as Cantonese, is a primary branch of Chinese spoken in southern China.
The issue of whether Yue is a language in its own right or a dialect of a single Chinese language depends on conceptions of what a language is. Like the other branches of Chinese, Yue is considered a dialect for ethnic, political, and cultural reasons, but it is also considered a distinct language for linguistic reasons. Spoken Cantonese is mutually unintelligible with other varieties of Chinese, though intelligible to a certain degree in its written form.
The areas of China with the highest concentration of speakers are the provinces of Guangdong and (eastern) Guangxi and the regions of Hong Kong and Macau. There are also substantial Cantonese- and Taishanese-speaking minorities overseas in Southeast Asia, Canada, Australia, and the United States.
The prototypical use of the name "Cantonese" in English is for the Guangzhou (Canton) dialect of Yue, but it is commonly used for Yue as a whole. To avoid confusion, academic texts may call the primary branch of Chinese Yue, following the Mandarin pinyin spelling, and either restrict "Cantonese" to its common usage as the dialect of Guangzhou, or
The Kongo language, or Kikongo, is the Bantu language spoken by the Bakongo and Bandundu people living in the tropical forests of the Democratic Republic of the Congo, the Republic of the Congo and Angola. It is a tonal language and formed the base for Kituba, a Bantu creole and lingua franca throughout much of west central Africa. It was spoken by many of those who were taken from the region and sold as slaves in the Americas. For this reason, while Kongo still is spoken in the above-mentioned countries, creolized forms of the language are found in ritual speech of African-derived religions in Brazil, Jamaica, Cuba, Dominican Republic, and especially in Haiti. It is also one of the sources of the Gullah people's language and the Palenquero creole in Colombia. The vast majority of present-day speakers live in Africa. There are roughly seven million native speakers of Kongo, with perhaps two million more who use it as a second language.
It is also the base for a creole used throughout the region: Kituba also called Kikongo de L'état or Kikongo ya Leta ("Kongo of the state" in French or Kongo), Kituba and Monokituba (also Munukituba). The constitution of the Republic of the Congo
American Sign Language (ASL) is a sign language, a language in which the hands, arms, head, facial expression and body language are used to speak without sound. Like other sign languages, its grammar and syntax are distinct from any oral language, including English. In the 1960s, ASL was sometimes referred to as "Ameslan" but this term is now considered obsolete.
ASL is the predominant sign language of deaf communities in the United States and English-speaking parts of Canada. Although the United Kingdom and the United States / Canada share English as a common oral and written language, British Sign Language (BSL) is a completely different language from ASL, and they are not mutually intelligible. ASL is instead related to French Sign Language.
Besides North America, dialects of ASL or ASL-based creoles are used, sometimes alongside indigenous sign languages, in the Philippines, Malaysia, Singapore, the Dominican Republic, El Salvador, Haiti, Puerto Rico, Côte d'Ivoire, Burkina Faso, Ghana, Togo, Benin, Nigeria, Chad, Gabon, the Democratic Republic of the Congo, the Central African Republic, Mauritania, Kenya, Madagascar, Zimbabwe, Barbados, Bolivia, China, Trinidad and Tobago and
Khanty (Xanty), known previously as Ostyak, is the language of the Khant peoples. It is spoken in Khanty–Mansi and Yamalo-Nenets autonomous okrugs as well as in Aleksandrovsky and Kargosoksky districts of Tomsk Oblast in Russia. According to the 1994 Salminen and Janhunen study, there were 12,000 Khanty-speaking people in Russia.
The Khanty language has a large number of dialects. The western group includes the Obdorian, Ob, and Irtysh dialects. The eastern group includes the Surgut and Vakh-Vasyugan dialects, which, in turn, are subdivided into thirteen other dialects. All these dialects differ significantly from each other by phonetic, morphological, and lexical features to the extent that the three main "dialects" (northern, southern and eastern) are mutually unintelligible. Thus, based on their significant multifactorial differences, Eastern, Northern and Southern Khanty could be considered separate but closely related languages.
Cyrillic (version as of 2000)
Cyrillic (version as of 1958)
The Khanty written language was first created after the October Revolution on the basis of the Latin script in 1930, and then with the Cyrillic alphabet (with the additional
Vestinian is a scholarly term referring to an extinct Indo-European language documented only in two surviving inscriptions of the Roman Republic. It is presumed to have been anciently spoken by the tribe of the Vestini, who occupied the region within current Abruzzo from Gran Sasso to the Adriatic Sea in east-central Italy during that time. Vestini is the Roman exonym for the people. Not enough of their presumed language survives to classify it beyond Italic. Vestinian is one of a number of scantily attested Italic languages spoken in small regions of the Appennines directly east of Rome called generally "the minor dialects." There is currently no agreement on their precise classification.
Only two inscriptions survive.
CIL 1.394 from near Navelli in the Abruzzo, dated mid-third-century BC, is:
Translation into Latin:
Translation into English:
Romani (also Romany, Gypsy, or Gipsy; Romani: romani ćhib) is any of several languages of the Romani people belonging to the Indo-Aryan branch of the Indo-European language family. Many varieties of Romani are divergent and sometimes considered languages of their own. The largest of these are Vlax Romani (about 900,000 speakers), Balkan Romani (700,000), Carpathian Romani (500,000) and Sinti Romani (300,000). Some Romani communities speak mixed languages based on the surrounding language with retained Romani-derived vocabulary – these are known by linguists are Para-Romani varieties, rather than dialects of the Romani language itself.
Speakers of the Romani language usually refer to the language as řomani čhib "the Romani language" or řomanes "in a Rom way." This derives from the Romani word řom, meaning either "a member of the (Romani) group" or "husband". This is also where the term "Roma" derives in English, although some Roma groups refer to themselves using other demonyms (e.g. 'Kaale', 'Sinti', etc.). The English spelling "Rromani language" may also be found, reflecting a different transcription of the Romani phoneme ř.
Before the late nineteenth century, English-language
Franco-Provençal (Francoprovençal), Arpitan, or Romand (in Switzerland) (Vernacular: francoprovençâl, arpetan, patoué; Italian: francoprovenzale, arpitano; French: francoprovençal, arpitan, patois) is a Romance language with several distinct dialects that form a linguistic sub-group separate from Langue d'Oïl and Langue d'Oc. The name Franco-Provençal was given to the language by G.I. Ascoli in the 19th century because it shared features with French and Provençal without belonging to either. Arpitan, a neologism, is becoming a popular name for the language and the people who speak it.
Today, the largest number of Franco-Provençal speakers reside in the Aosta Valley, an autonomous region of Italy. The language is also spoken in alpine valleys in the province of Turin, two isolated towns in Foggia, and rural areas of the Romandie region of Switzerland. It constitutes one of the Gallo-Romance languages of France and is classified as a regional language of France, though its use is marginal. Organizations are attempting to preserve it through cultural events, education, scholarly research, and publishing.
The number of speakers of Franco-Provençal has been declining significantly.
The Mongolian language (in Mongolian script: , Mongɣol kele; in Mongolian Cyrillic: Монгол хэл, Mongol khel) is the official language of Mongolia and the best-known member of the Mongolic language family. The number of speakers across all its dialects may be 5.2 million, including the vast majority of the residents of Mongolia and many of the Mongolian residents of the Inner Mongolia autonomous region of China. In Mongolia, the Khalkha dialect, written in Cyrillic, is predominant, while in Inner Mongolia, the language is more dialectally diverse and is written in the traditional Mongolian script. In the discussion of grammar to follow, the variety of Mongolian treated is Standard Khalkha Mongolian (i.e., the standard written language as formalized in the writing conventions and in the school grammar), but much of what is to be said is also valid for vernacular (spoken) Khalkha and other Mongolian dialects, especially Chakhar.
Mongolian has vowel harmony and a complex syllabic structure for a Mongolic language that allows clusters of up to three consonants syllable-finally. It is a typical agglutinative language that relies on suffix chains in the verbal and nominal domains. While
Tajik, Tajik Persian, or Tajiki, (sometimes written Tadjik or Tadzhik; тоҷикӣ, تاجیکی, tojikī [tɔːdʒɪˈkiː]) is a variety of modern Persian spoken in Central Asia. Historically Tajiks called their language zabani farsī (Persian: زبان فارسی), meaning "Persian language"; the term zabani tajikī, or "Tajik language", was introduced in the 20th century by the Soviets. Most speakers of Tajik live in Tajikistan and Uzbekistan. Tajik is the official language of Tajikistan.
The dialect has diverged from Persian as spoken in Afghanistan and Iran, as a result of political borders, geographical isolation, the standardization process, and the influence of Russian and neighboring Turkic languages. The standard language is based on the north-western dialects of Tajik (region of old major city of Samarkand), which have been somewhat influenced by the neighboring Uzbek language as a result of geographical proximity. Tajik also retains numerous archaic elements in its vocabulary, pronunciation, and grammar that have been lost elsewhere in the Persophone world, in part due to its relative isolation in the mountains of Central Asia.
The most important historically Tajik/Persian-speaking cities of
Albanian (gjuha shqipe, pronounced [ˈɟuha ˈʃcipɛ], or shqip Albanian pronunciation: [ʃcip]) is an Indo-European language spoken by approximately 7.3 million people, primarily in Albania and Kosovo but also in other areas of the Balkans in which there is a majority Albanian population, including western Macedonia, southern Montenegro, southern Serbia and northwestern Greece. Sicily, and Ukraine. Additionally, speakers of Albanian can be found elsewhere throughout the latter two countries resulting from a modern diaspora, originating from the Balkans, that also includes Scandinavia, Switzerland, Germany, United Kingdom, Turkey, Australia, New Zealand, Netherlands, Singapore, Brazil, Canada, and the United States.
The Albanian language is a distinct Indo-European language that does not belong to any other existing branch; the other extant Indo-European isolate is Armenian. Sharing lexical isoglosses with Greek, Balto-Slavic, and Germanic, the vocabulary of Albanian is quite distinct. Once hastily grouped with Germanic and Balto-Slavic by the merger of PIE *ǒ and *ǎ into *ǎ in a supposed "northern group", Albanian has proven to be distinct from these two, as this vowel shift is only
The Komi language (in Komi: Коми кыв, transliteration: Komi kyv [komi kɨv]) is a Uralic language spoken by the Komi peoples in the northeastern European part of Russia. Komi may be considered a single language with several dialects, or a group of closely related languages, making up one of the two branches of the Permic branch of the family. The other Permic language is Udmurt, to which Komi is closely related.
Of the several Komi dialects or languages, two major varieties are recognized, closely related to one another: Komi-Zyrian, the largest group, serves as the literary basis within the Komi Republic; and Komi-Yazva, spoken by a small, isolated group of Komi to the north-west of Perm Krai and south of the Komi Republic. Permyak (also called Komi-Permyak) is spoken in Komi-Permyak Okrug, where it has literary status.
The Komi-Zyrian language (Коми кыв - Komi kyv) or simply Komi, Zyrian or Zyryan, is spoken by the Komi-Zyrians in Komi Republic and some other parts of Russia. It is disputed whether Zyrian is a separate language or a Komi dialect, because of its affinity to the Komi-Permyak language. In 1994, Komi-Zyrian had about 285,000 speakers.
It was written in the form of Old
Czech ( /ˈtʃɛk/; čeština Czech pronunciation: [ˈt͡ʃɛʃcɪna]) is a West Slavic language with about 12 million native speakers; it is the majority language in the Czech Republic and spoken by Czechs worldwide. The language was known as Bohemian in English until the late 19th century. Czech is similar to and mutually intelligible with Slovak and, to a lesser extent, with Polish and Sorbian.
Czech is widely spoken by most inhabitants of the Czech Republic. As given by appropriate laws, courts and authorities can enact and write out documents and judgements in the Czech language (also, financial authorities in the Slovak language). Czech can also be used in all official proceedings in Slovakia as granted by Article 6 of Slovak Minority Language Act 184/1999 Zb.
According to article 37, paragraph 4 of Charter of Fundamental Rights and Basic Freedoms people who do not speak Czech have the right to get an interpreter in a court of law. Instructions for use in Czech must be added to all marketed goods. The right to one's own language is guaranteed by the Constitution for all national and ethnic minorities.
Czech is also one of the 23 official languages in the European Union (since May
Malayalam (pronounced /mələˈjɑːləm/; മലയാളം, malayāḷam , IPA: [mɐləjaːɭəm]), is a language spoken in India predominantly in the state of Kerala. It is one of the 22 scheduled languages of India with official language status in the state of Kerala and the union territories of Lakshadweep and Pondicherry. It belongs to the Dravidian family of languages, and was spoken approximately by 33 million people according to the 2001 census. Malayalam is also spoken in the neighboring states of Tamil Nadu and Karnataka; the Nilgiris, Kanyakumari and Coimbatore districts of Tamil Nadu, and the Dakshina Kannada, Mangalore and Kodagu districts of Karnataka.
Malayalam most likely originated from Middle Tamil (Sen-Tamil-Malayalam) in the 6th century. An alternative theory proposes a split in even more ancient times. Malayalam incorporated many elements from Sanskrit through the ages and today over eighty percent of the vocabulary of Malayalam in scholarly usage is from Sanskrit. Before Malayalam came into being, Old Tamil was used in literature and courts of a region called Tamilakam, including present day Kerala state, a famous example being Silappatikaram. Silappatikaram was written by Chera
Berber or the Berber languages (Berber: Tamaziɣt or Tamazight, ⵜⴰⵎⴰⵣⵉⵖⵜ [tæmæˈzɪɣt], [θæmæˈzɪɣθ]) are a family of similar or closely related languages and dialects indigenous to North Africa. Berber is spoken by large populations of Morocco and Algeria, and by smaller populations in Libya, Tunisia, eastern Mali, western and northern Niger, northern Burkina Faso, Mauritania, and the Egyptian Siwa Oasis. Large Berber-speaking migrant communities have lived in Western Europe since the 1950s. In 2001, Berber became a national language of Algeria in the constitution, and in 2011, Berber became an official language of Morocco, in the new Moroccan constitution.
Berber constitutes a branch of the Afroasiatic language family and has been attested since ancient times. The number of ethnic Berbers (whether they speak Berber or not) is much higher than the current number of actual Berber speakers in North Africa. The bulk of the populations of the Maghreb countries are considered to have Berber ancestors. In Algeria, for example, a majority of the population consists of Arabized Berbers.
There is a movement among speakers of the closely related "Northern Berber" varieties to unite them into a
Mahl (މަހަލް) or Mahal, also known locally as Maliku Bas, is an Indo-Aryan language spoken by the people of Minicoy Island (Maliku), in the Union Territory of Lakshadweep, India. It is a variant of Dhivehi, the official language of Maldives.
The Lakhshadweep Administration refers to Divehi-bas as Mahl. The origin of the word "Mahl" is due to a misunderstanding on the part of a British civil servant who came to Minicoy in the Twentieth Century during the time of the Indian Empire. The official asked a local what his language was and he said "Divehi-bas". The Englishman looked confused as he had never heard of this language. Noticing this, the islander said "Mahaldeebu" as he knew that people on the subcontinent referred to the kingdom to the south (the Maldives) by that name. The local name was and is Divehiraajje (Kingdom of Islands) and the language is Divehi-bas (language of the islands). The English official recorded the language of Minicoy as Mahl.
Mahal has a continuous written history of about eight hundred years. The earliest writings were on the Lōmāfānu (copper-plate grants) of the 12th and 13th centuries. Early inscriptions on coral stone have also been found. The oldest
Serbo-Croatian or Serbo-Croat, less commonly Bosnian/Croatian/Montenegrin/Serbian (BCMS), is a South Slavic language and the primary language of Bosnia and Herzegovina, Croatia, Montenegro, and Serbia. Since it has four standard variants, it is a pluricentric language. Its variants do differ slightly, as is the case with other pluricentric languages (English, French, Spanish, German, Hindustani and Portuguese, among others), but not to a degree which would justify considering them as different languages. The differences between the variants do not undermine the integrity of the system as a whole and do not hinder mutual intelligibility. Compared to the differences between the variants of English, German, French, Spanish, Hindustani or Portuguese, the distinctions between the variants of Serbo-Croatian are less significant.
The language was standardised in the mid 19th century, many decades before the first Yugoslavia was established. From the very beginning, it has had a pluricentric standardisation. Croats and Serbs differ in religion and have historically lived under different empires, and have adopted slightly different literary forms as their respective standard variants. Since
Maltese (Malti) is the national language of Malta, and a co-official language of the country alongside English, while also serving as an official language of the European Union, the only Semitic language so distinguished. Maltese is descended from Siculo-Arabic (the Arabic dialect that developed in Sicily, and later in Malta, between the end of the ninth century and the end of the thirteenth century). About half of the vocabulary is borrowed from standard Italian and Sicilian; English words make up between 6% and 20% of the Maltese vocabulary, according to different estimates (see below). It is the only Semitic language written in the Latin script in its standard form.
Malta was occupied by the Fatimids, who exerted 220 years of linguistic influence, from 870 to 1090 CE.
The oldest reference to Maltese comes from the Benedictine monks of Catania, who were unable to open a monastery in Malta, in 1364, because they could not understand the native language. In 1436, in the will of a certain Pawlu Peregrino, Maltese is first identified as lingua maltensi. The oldest known document in Maltese is "Il Cantilena" (Maltese: Xidew il-Qada) a poem from the 15th century written by Pietro
Ukrainian (украї́нська мо́ва / ukrayins'ka mova, [ukrɑˈjɪɲsʲkɑ ˈmɔwɑ], formerly Ruthenian - ру́ська, руси́нська мо́ва / rus'ka, rusyns'ka mova) is a member of the East Slavic subgroup of the Slavic languages. It is the official state language of Ukraine and the principal language of the Ukrainians. Written Ukrainian uses a variant of the Cyrillic script.
The Ukrainian language traces its origins to the Old East Slavic of the early medieval state of Kievan Rus'. Ukrainian is a lineal descendant of the colloquial language used in Kievan Rus' (10th–13th century). From 1804 until the Russian Revolution Ukrainian was banned from schools in the Russian Empire of which Ukraine was a part at the time. It has always maintained a sufficient base in Western Ukraine where the language was never banned in its folklore songs, itinerant musicians, and prominent authors.
The standard Ukrainian language is regulated by the National Academy of Sciences of Ukraine (NANU), particularly by its Institute for the Ukrainian Language, Ukrainian language-informatical fund, and Potebnya Institute of Language Studies. Ukrainian, Russian, Belarusian, and Rusyn have a high degree of mutual intelligibility.
Lithuanian (lietuvių kalba) is the official state language of Lithuania and is recognized as one of the official languages of the European Union. There are about 2.9 million native Lithuanian speakers in Lithuania and about 200,000 abroad. Lithuanian is a Baltic language, closely related to Latvian, although they are not mutually intelligible. It is written in a Latin alphabet. The Lithuanian language is believed to be the most conservative living Indo-European language, retaining many features of Proto-Indo-European now lost in other Indo-European languages.
Lithuanian still retains many of the original features of the nominal morphology found in some ancient Indo-European languages like Sanskrit and Latin, and has therefore been the focus of much study in the area of Indo-European linguistics. Studies in the field of comparative linguistics have shown it to be the most conservative living Indo-European language.
Lithuanian and other Baltic languages passed through a Proto-Balto-Slavic stage, from which Baltic languages retain numerous exclusive and non-exclusive lexical, morphological, phonological and accentual isoglosses in common with the Slavic languages, which represent
Danish (dansk, pronounced [d̥anˀsɡ̊] ( listen); dansk sprog, [ˈd̥anˀsɡ̊ ˈsb̥ʁɔʊ̯ˀ]) is a North Germanic language spoken by around six million people, principally in the country of Denmark. It is also spoken by 50,000 Germans of Danish ethnicity in the northern parts of Schleswig-Holstein, Germany, where it holds minority language status. Danish is a mandatory subject in school in the Danish crown territories of the Faroe Islands (where it is also an official language after Faroese) and Greenland (where, however, the only official language since 2009 is Kalaallisut), as well as the former crown holding of Iceland. There are also Danish language communities in Argentina, the United States and Canada. Danish is mutually intelligible with Norwegian and Swedish (see "Classification").
Danish, together with Swedish, derives from the East Norse dialect group, while the old Norwegian dialects before the influence of Danish and Bokmål is classified as a West Norse language together with Faroese and Icelandic. A more recent classification based on mutual intelligibility separates modern spoken Danish, Norwegian and Swedish into a Mainland Scandinavian group while Icelandic and Faroese are
Tlapanec is an indigenous Mexican language spoken by more than 98,000 Tlapanec people in the state of Guerrero. Like other Oto-Manguean languages, it is tonal and has complex inflectional morphology. The Tlapanec themselves currently refer to their language using the adjective Me'phaa.
Before much information was known about it, Tlapanec (sometimes written "Tlappanec" in earlier publications) was either considered unclassified or linked to the controversial Hokan language family. It is now definitively considered part of the Oto-Manguean language family, of which it forms its own branch along with the extinct and very closely related Subtiaba language of Nicaragua.
Me'phaa people temporarily move to other locations, including Mexico City, Morelos and various locations in the United States, for reasons of work.
Ethnologue lists four principal varieties of Tlapanec:
Other sources of information, including native speakers and the Instituto Nacional de Lenguas Indígenas of the Mexican government, identify eight or nine varieties, which have been given official status: Acatepec, Azoyú, Malinaltepec, Nancintla, Teocuitlapa, Tlacoapa, Zapotitlán Tablas (with Huitzapula sometimes
Blissymbols or Blissymbolics was conceived as an ideographic writing system called Semantography consisting of several hundred basic symbols, each representing a concept, which can be composed together to generate new symbols that represent new concepts. Blissymbols differ from most of the world's major writing systems in that the characters do not correspond at all to the sounds of any spoken language.
Blissymbols were invented by Charles K. Bliss (1897–1985), born Karl Kasiel Blitz in the Austro-Hungarian city of Czernowitz (at present the Ukrainian city of Chernivtsi), which had a mixture of different nationalities that “hated each other, mainly because they spoke and thought in different languages.” Bliss graduated as a chemical engineer at the Vienna University of Technology, and joined an electronics company as a research chemist.
As the German Army invaded Austria in 1938, he was sent to the concentration camps of Dachau and Buchenwald. His German wife Claire managed to get him released, and they finally became exiles in Shanghai, where Bliss had a cousin.
Bliss devised Blissymbols while a refugee at the Shanghai Ghetto and Sydney, from 1942 to 1949. He wanted to create an
Spanish (español) is a Romance language that originated in Spain. It is also called Castilian (castellano listen (help·info)) after the particular region of Spain, Castile, where it originated. According to Ethnologue, there are approximately 387 million people speaking Spanish as a native language (mainly using population data from 1995 or before), making it the second most spoken language by number of native speakers after Mandarin. Sources using more current figures estimate up to 420 million native speakers, 460 million as a first and second language, and more than 500 million people including the Spanish speakers as a foreign language. Mexico contains the largest population of Spanish speakers. Spanish is one of the six official languages of the United Nations, and is used as an official language by the European Union and Mercosur.
Spanish is a part of the Ibero-Romance group that evolved from several dialects of spoken Latin in central-northern Iberia around the ninth century and gradually spread with the expansion of the Kingdom of Castile (present northern Spain) into central and southern Iberia during the later Middle Ages. Early in its history, the Spanish vocabulary was
Tulu (IPA: [ˈt̪uɭu]; Tulu: , ತುಳು ಬಾಸೆ or തുളു ബാസെ Tuḷu bāse, [ˈt̪uɭu ˈbɒːsæ] ) is a language spoken by 1.95 million native speakers (1997) mainly in the southwest part of Indian state of Karnataka and a small part of northern Kerala, which is known as Tulu Nadu. It belongs to the Dravidian family of languages.
In India, 1.72 million people speak it as their native language (2001), increased by 10 percent over the 1991 census. According to one estimate reported in 2009, Tulu is currently spoken by three to five million native speakers in the world. Native speakers of Tulu are referred to as Tuluva or Tulu people.
Separated early from Proto-South Dravidian, Tulu has several features not found in Tamil–Kannada. For example, it has the pluperfect and the future perfect, like French or Spanish, but formed without an auxiliary verb.
Robert Caldwell, in his pioneering work A Comparative Grammar of the Dravidian or South-Indian Family of Languages, called this language "peculiar and very interesting". According to him, "Tuḷu is one of the most highly developed languages of the Dravidian family. It looks as if it had been cultivated for its own sake." The language has a lot of written
Spanglish refers to the blend (at different degrees) of Spanish and English, in the speech of people who speak parts of two languages, or whose normal language is different from that of the country where they live. The Hispanic population of the United States and the British population in Argentina use varieties of Spanglish. Sometimes the pidgin spoken in Spanish holiday resorts which are exposed to both Spanish and English is called Spanglish. The similar code switching used in Gibraltar is called Llanito. Spanglish may also be known by a regional name. Spanglish does not have one unified dialect and therefore lacks uniformity; Spanglish spoken in New York, Miami, Texas, and California can be different. In Texas and California a large Mexican population can be found and within that population are Chicanos or second-generation Mexican-Americans. Some of the Spanglish words used by Chicanos could be incomprehensible to Hispanics from Florida.
Spanglish is not a pidgin language. It is totally informal; there are no set hard-and-fast rules. There are two phenomena of Spanglish, borrowing and code-switching. English borrowed words will usually be adapted to Spanish phonology.
Ahtna or Ahtena is the Na-Dené language of the Ahtna ethnic group of the Copper River area of Alaska. The language is also known as Copper River or Mednovskiy. There are 80 speakers out of a population of 500, and the language is facing extinction but many younger people are learning it to try to keep it from extinction. The Ya Ne Dah Ah School in Chickaloon, Alaska teaches the Ahtna language as a part of its curriculum.
The Ahtna language consists of four different dialects, three of the four are still spoken today. Ahtna closely related to Dena'ina.
The similar name "Atnah" occurs in the journals of Simon Fraser and other early European diarists in what is now British Columbia as a reference to the Tsilhqot'in people, another Northern Athapaskan group.
There are four main dialect divisions and eight bands (tribal unions):
The comparison of some animal names in the three Athabaskan languages:
Kazakh (also Qazaq and variants, natively Қазақ тілі, Qazaq tili, قازاق ٴتىلى; pronounced [qɑˈzɑq tɘˈlɘ]) is a Turkic language which belongs to the Kipchak (or Western Turkic) branch of the Turkic languages, closely related to Nogai and Karakalpak.
Kazakh is an agglutinative language, and it employs vowel harmony.
The Kazakh language has its speakers (mainly Kazakhs) spread over a vast territory from the Tian Shan mountains to the western shore of Caspian Sea. Kazakh is the official state language of Kazakhstan, in which nearly 10 million speakers are reported to live (based on the CIA World Factbook's estimates for population and percentage of Kazakh speakers) . More than a million speakers reside in China's Xinjiang Uyghur Autonomous Region. The 2002 Russian Census reported 560,000 Kazakh speakers in Russia. Other sizable populations of Kazakh speakers live in Mongolia (fewer than 200,000). Large numbers exist elsewhere in Central Asia (mostly in Uzbekistan) and other parts of former Soviet Union, and in Afghanistan, Iran, Turkey and other countries. There are also some Kazakh speakers in Germany who immigrated from Turkey in the 1970s.
Today, Kazakh is written in Cyrillic in
The Moabite language is an extinct Canaanite language, spoken in Moab (modern day central-western Jordan) in the early first millennium BC. It was written using a variant of the Phoenician alphabet.
Most of our knowledge about Moabite comes from the Mesha Stele, which is the only known extensive text in this language. In addition there are the three line El-Kerak Inscription and a few seals. The main features distinguishing Moabite from fellow Canaanite languages such as Hebrew are: a plural in -în rather than -îm (e.g. mlkn "kings" for Biblical Hebrew məlākîm), like Aramaic and Arabic; retention of the feminine ending -at which Biblical Hebrew reduces to -āh (e.g. qryt "town", Biblical Hebrew qiryāh) but retains in the construct state nominal form (e.g.qiryát yisrael "town of Israel"); and retention of a verb form with infixed -t-, also found in Arabic and Akkadian (w-’ltḥm "I began to fight", from the root lḥm.)
Pite Sami, also known as Arjeplog Sami, is a Sami language traditionally spoken in Sweden and Norway. It is a critically endangered language that has only about 25–50 native speakers left and is now only spoken on the Swedish side of the border along the Pite River in the north of Arjeplog and Arvidsjaur and in the mountainous areas of the Arjeplog municipality.
Pite Sámi has 9 cases:
Pite Sami verbs conjugate for three grammatical persons:
Pite Sami has five grammatical moods:
Pite Sami verbs conjugate for three grammatical numbers:
Pite Sami verbs conjugate for two simple tenses:
and two compound tenses:
Pite Sami, like Finnish, the other Sámi languages and Estonian, has a negative verb. In Pite Sámi, the negative verb conjugates according to mood (indicative, imperative and optative), person (1st, 2nd and 3rd) and number (singular, dual and plural). This differs from some other the other Sami languages, e.g., from Northern Sami, which do not conjugate according to tense and other Sami languages, that do not use the optative.
For non-past indicative versions that have more than one form, the second one is from the dialect spoken around Björkfjället and the third is from the
Sindarin is a fictional language devised by J. R. R. Tolkien, and used in his secondary world, often called Middle-earth.
Sindarin is one of the many languages spoken by the immortal Elves, called the Eledhrim [ˈɛlɛðrim] or Edhellim [ɛˈðɛllim] in Sindarin.
Called in English "Grey-elvish" or "Grey-elven", it was the language of the Sindarin Elves of Beleriand. These were Elves of the Third Clan who remained behind in Beleriand after the Great Journey. Their language became estranged from that of their kin who sailed over sea. Sindarin derives from an earlier form of language called Common Telerin which itself had evolved from Common Eldarin, the tongue of the Eldar before their divisions, e.g. those Elves who decided to follow the Vala Oromë and undertook the Great March to Valinor. Even before that the Eldar Elves spoke the original speech of all Elves, or Primitive Quendian.
In the Third Age (the setting of The Lord of the Rings), Sindarin was the language most commonly spoken by most Elves in the Western part of Middle-earth. Sindarin is the language usually referred to as the elf-tongue or elven-tongue in The Lord of the Rings.
When the Quenya-speaking Noldor returned to
Southern Sami (Åarjelsaemien gïele) is the southwestern-most of the Sami languages. It is a seriously endangered language; the last strongholds of this language are the municipalities of Snåsa and Hattfjelldal in Norway. There are approximately 2000 people considered ethnically Southern Sami in Norway and Sweden, but only approximately 600 of them can fluently speak the language.
Southern Sami is one of the six Sami languages that has an official written language, but only a few books have been published for the language, one of which is a good-size Southern Sami–Norwegian dictionary.
Southern Sami uses the Latin script: A/a, B/b, D/d, E/e, F/f, G/g, H/h, I/i, (Ï/ï), J/j, K/k, L/l, M/m, N/n, O/o, P/p, R/r, S/s, T/t, U/u, V/v, Y/y, Æ/æ, Ø/ø, Å/å
An alternative orthography replaces Æ/æ with Ä/ä and Ø/ø with Ö/ö. The variants Ä/ä, Ö/ö are used in Sweden, Æ/æ, Ø/ø in Norway, in accordance with the usage in Swedish and Norwegian, based on computer or type writer availability. The Ï/ï represents a back version of I/i, many texts do not distinguish between the two.
C/c, Q/q, W/w, X/x, Z/z are used in words of foreign origin.
Southern Sami has two dialects, the northern and the southern
Sardinian (Sardinian: sardu, limba / lingua sarda, Italian: sardo, lingua sarda) is a Romance language spoken and written on most of the island of Sardinia (Italy). It is considered the most conservative of the Romance languages in terms of phonology and is noted for its Paleosardinian substratum.
Since 1997 Sardinian has been an official language on the island, together with other languages spoken there. It is a co-official language, jointly with Italian.
The Sardinian language is one of the principal elements of Sardinian cultural heritage, and there is activity dedicated to studying the language and acknowledging its importance; the recognition of the Sardinian language as a prominent element of the cultural identity is diffusely supported by the population.
The Sardinian language has recently been recognised, together with other local languages, as an official regional language by the Sardinian Region; it can therefore be used for official purposes on the island.
In the last decade, the Sardinian language has been legally recognized (with Albanian, Catalan, German, Greek, Slovene, Croatian, French, Franco-Provençal, Friulian, Ladin, and Occitan) by the Law 482-1999, yet its
Arbëreshë, also known as Arbërisht, is an ethnolect spoken by the Arbëreshë, the group of Albanian-speaking minorities in Italy.
Arbëresh derives from the Tosk dialect spoken in southern Albania, and is spoken in Southern Italy in the regions of Calabria, Molise, Puglia, Basilicata, Campania, Abruzzi, and Sicily. All dialects are closely related to each other but are not entirely mutually intelligible.
The Arbëresh language retains many archaisms of medieval Albanian from the pre-Ottoman invasion of Albania in the 15th century. It also retains Greek language elements, including vocabulary and pronunciation. It has also preserved some conservative features that were lost in mainstream Albanian Tosk. For example, it has preserved certain syllable-initial consonant clusters which have been simplified in Standard Albanian (cf. Arbërisht gluhë /ˈɡluxə/ ('language/tongue'), vs. Standard Albanian gjuhë /ˈɟuhə/).
Arbërisht was commonly called 'Albanese' (Albanian in the Italian language) in Italy until the 1990s. Until recently, Arbërisht speakers had only very imprecise notions about how related or unrelated their language was to Albanian. Until the 1980s Arbërisht was exclusively a
The Bunjevac dialect (bunjevački govor) or Bunjevac language (bunjevački jezik) is a name for Štokavian-Western Ikavian dialect used by members of the Bunjevci community. The speakers live in parts of the autonomous province of Vojvodina in Serbia as well as in southern parts of Croatia. The accent is purely Ikavian, with /i/ for the Common Slavic vowel yat. Its speakers largely use the Latin alphabet, as is illustrated in their locally published newspaper.
In the 2002 census results published by the Statistical Office of Serbia, Bunjevac was not listed among main languages spoken in Serbia, but those that declared that their language is Bunjevac were listed in category "other languages". For example, in the municipality of Subotica, the number of those listed as speaking "other languages" (presumably Bunjevac) was 8,914.
The status of the Bunjevac dialect / language is vague, and it is considered a dialect of Croatian by some Croatian linguists. According to the 2002 census in Serbia, some members of the Bunjevac ethnic community declared that their native language is Croatian or Serbian. This doesn't mean that they don't use this specific dialect, merely that they don't consider
Modern Swedish (Swedish: nysvenska) is the linguistic term used for the Swedish language from the Bible translation of 1526 to the development of a common national language around 1880. The period can further be divided into Early Modern Swedish (1526–1750) and Late Modern Swedish (1750–1880).
Early Modern Swedish was established in 1526 with a complete Swedish translation of the Bible. The translation followed the spoken word rather closely, as opposed to the more Latin-inspired way of writing commonly used in the Middle Ages.
The Vasa Bible is considered to be a reasonable compromise between old and new; while not adhering to the spoken language of its day it was not overly conservative in its use of old forms. Though it was not completely consistent in spelling, particularly when it came to vowels, it was a major step towards a more consistent Swedish orthography. It established the use of the letters "ä" and "ö" in place of the older "æ" and "ø" and introduced the completely new "å" in place of "o" in many words. It also introduced conventions such as using ck instead of kk in words like tacka; "thank". The ongoing rivalry with Denmark can be argued to have some influence on
Wetarese (Atauru, Adabe) is the language of Wetar, an island in the south Maluku, Indonesia, as well as the nearby islands Liran and Atauro, the latter belonging to East Timor.
On Atauro, there are three dialects of Wetarese spoken, Rahesuk in the north, Resuk in the southeast, and Raklungu in the southwest. A fourth dialect, Dadu'a, is spoken by the descendants of a group of Ataurans who moved to Timor, in the villages of Manatuto.
The language is closely related to Galoli, spoken on Timor on the north coast near Atauro. However, Wetarese has been more influenced by Malay.
Elfdalian or Övdalian (Övdalsk or Övdalską in Elfdalian, Älvdalska or Älvdalsmål in Swedish) is a linguistic variety of the Scandinavian language branch spoken in the old parish of Övdaln, which is located in the south-eastern part of Älvdalen Municipality in Northern Dalarna, Sweden.
Traditionally regarded as a Swedish dialect, Elfdalian is today regarded by several linguists as a separate language. As some other Dalecarlian vernaculars spoken north of the Lake Siljan, Elfdalian retains numerous old grammatical and phonological features that have not changed considerably since Old Norse and is considered to be the most conservative and best preserved vernacular within the Dalecarlian branch. Having developed in relative isolation since the Middle Ages, quite a few linguistic innovations are also present in the language.
Elfdalian has around 3,000 speakers and its existence is severely threatened. However, it is possible that it will receive an official status as a minority language in Sweden, which would entail numerous protections and encourage its use in schools and by writers and artists. The Swedish Parliament was due to address this issue in 2007, but apparently has not yet
British Sign Language (BSL) is the sign language used in the United Kingdom (UK), and is the first or preferred language of some deaf people in the UK; there are 125,000 deaf adults in the UK who use BSL plus an estimated 20,000 children. The language makes use of space and involves movement of the hands, body, face and head. Many thousands of people who are not deaf also use BSL, as hearing relatives of deaf people, sign language interpreters or as a result of other contact with the British deaf community.
Records exist of a sign language existing within deaf communities in Britain as far back as 1570. British sign language has evolved, as all languages do, from these origins by modification, invention and importation. Thomas Braidwood, an Edinburgh teacher, founded 'Braidwood's Academy for the Deaf and Dumb' in 1760 which is recognised as the first school for the deaf in Britain. His pupils were the sons of the well-to-do. His early use of a form of sign language, the combined system, was the first codification of what was to become British Sign Language. Joseph Watson was trained as a teacher of the Deaf under Thomas Braidwood and he eventually left in 1792 to become the
Gujarati (Gujarati: ગુજરાતી Gujarātī) is an Indo-Aryan language, and part of the greater Indo-European language family. It is derived from a language called Old Gujarati (1100–1500 AD) which is the ancestor language of the modern Gujarati and Rajasthani languages. It is native to the Indian state of Gujarat, where it is the chief language, and to the adjacent union territories of Daman and Diu and Dadra and Nagar Haveli.
There are about 65.5 million speakers of Gujarati worldwide, making it the 26th most spoken native language in the world. Along with Romany and Sindhi, it is among the most western of Indo-Aryan languages. Gujarati was the first language of Mohandas Karamchand Gandhi, the "Father of the Nation of India", and Sardar Vallabhbhai Patel, the "Iron Man of India." Other prominent personalities whose first language is or was Gujarati include Swami Dayananda Saraswati, Morarji Desai, Narsinh Mehta, Dhirubhai Ambani, and J. R. D. Tata. & Muhammad Ali Jinnah the "Father of the Nation of Pakistan"
Gujarati (also having been variously spelled as Gujerati, Gujarathi, Guzratee, Guujaratee, Gujrathi, and Gujerathi) is a modern Indo-Aryan language evolved from Sanskrit. The
The Noongar ( /ˈnʊŋɑː/; alternatively spelt Nyungar, Nyoongar, Nyoongah, Nyungah, or Noonga) are an Indigenous Australian people who live in the south-west corner of Western Australia, from Geraldton on the west coast to Esperance on the south coast. Traditionally, they inhabited the region from Jurien Bay to the southern coast of Western Australia, and east to what is now Ravensthorpe and Southern Cross. The Noongar traditionally spoke dialects of the Noongar language, a member of the large Pama-Nyungan language family, but generally today speak Australian Aboriginal English, a dialect of the English language interspersed with Noongar words and grammar.
Prior to the arrival of Europeans, the Noongar population has been variously estimated at between 6,000 and some tens of thousands. Colonization by the British resulted in both violence and new diseases, taking a heavy toll on the population. Nowadays, however, according to the Noongar themselves, they number more than 28,000. The 2001 census figures (ABS) showed that 21,000 people identified themselves as indigenous in the south-west of Western Australia. In 2006, the community claimed to number over 28,000 people. Today, most of
Swiss German (German: Schweizerdeutsch, Alemannic German: Schwyzerdütsch, Schwiizertüütsch, Schwizertitsch) is any of the Alemannic dialects spoken in Switzerland and in some Alpine communities in Northern Italy. Occasionally, the Alemannic dialects spoken in other countries are grouped together with Swiss German, as well, especially the dialects of Liechtenstein and Austrian Vorarlberg, which are closely associated to Switzerland's.
Linguistically, Swiss German forms no unity. The linguistic division of Alemannic is rather into Low, High and Highest Alemannic, varieties of all of which are spoken both inside and outside of Switzerland. The reason "Swiss German" dialects constitute a special group is their almost unrestricted use as a spoken language in practically all situations of daily life, whereas the use of the Alemannic dialects in the other countries is restricted or even endangered.
The dialects of Swiss German must not be confused with Swiss Standard German, the variety of Standard German used in Switzerland.
Unlike most regional languages in modern Europe, Swiss German is the spoken everyday language of all social levels in industrial cities, as well as in the
Wakhi is an Indo-European language in the branch of Eastern Iranian language family and is intimately related to other Southeastern Iranian languages in the Pamir languages group.
Wakhi is one of several languages that belong to the Pamir language group. A reflection of this is the fact that the Wakhi people are occasionally called Pamiris. The origin of this language is Wakhan in Afghanistan and it is, according to sources, more than four thousand years old. It is spoken by the inhabitants the Wakhan Corridor of Afghanistan, parts of Gilgit–Baltistan (the former NAs) of Pakistan, Gorno-Badkhshan (mountainous-Badakhshan, in Russian) region of Tajakistan, and Xinjiang in western China. The Wakhi use the self-appellation ‘Xik’ (ethnic) and suffix it with ‘wor’/’war’ to denote their language as ‘Xik-wor’ themselves. The noun ‘Xik’ comes from ‘oxik’ (an inhabitant of ‘Ox’, for Wakhan, in Wakhi. There are other equivalents for the name Wakhi (Anglicised) or Wakhani (Arabic and Persian), Vakhantsy (Russian), Gojali/Gojo (Dingrik-wor/Shina), Guyits/Guicho (Borushaski), Wakhigi/Wakhik-war (Kivi-wor/Khow-wor) and Cert (Turki). The language belongs, as yet to be confirmed according to
Kildin Sami (also spelled Sámi or Saami; formerly Lappish) is a Sami language spoken by approximately 600 people on the Kola Peninsula in northwestern Russia. Kildin Sami is written using an official Cyrillic script.
Although about 600 people throughout the whole Kola Peninsula claim Kildin Sami as their native language, the area around Lovozero has the highest concentration of speakers. It is the largest of the Eastern Sami languages by virtue of the number of its speakers. Its future, however, appears to not be as bright as that of Skolt Sami or Inari Sami because the language is used actively by only very few people today. The Sami languages closest to Kildin are Ter Sami and Akkala Sami. The latter is sometimes considered to be a dialect of Kildin Sami.
Kildin Sami is written in an extended version of Cyrillic since the 1980s. The alphabet has three variants with some minor differences in certain letters, mostly in Ҋ vs. Ј and ’ (apostrophe) vs. Һ. Whereas the dictionary of Sammallahti/Xvorostuxina (1991) uses Ҋ and ’ (apostrophe), Kuruč at al. 1985 uses Ј and Һ. The third orthographic variant, used, e.g., by Kert (1986), has neither of these letters.
Note that the letters Ӓ,
Xhosa (/ˈkoʊsə/; Xhosa: isiXhosa [isikǁʰóːsa]) is one of the official languages of South Africa. Xhosa is spoken by approximately 7.9 million people, or about 18% of the South African population. Like most Bantu languages, Xhosa is a tonal language, that is, the same sequence of consonants and vowels can have different meanings when said with a rising or falling or high or low intonation. One of the most distinctive features of the language is the prominence of click consonants; the word "Xhosa" begins with a click.
Xhosa is written using a Latin alphabet. Three letters are used to indicate the basic clicks: c for dental clicks, x for lateral clicks, and q for post-alveolar clicks (for a more detailed explanation, see the table of consonant phonemes, below). Tones are not indicated in the written form.
Xhosa is the southernmost branch of the Nguni languages, which include Swati, Northern Ndebele and Zulu. There is some mutual intelligibility with the other Nguni languages, all of which share many linguistic features. Nguni languages are in turn part of the much larger group of Bantu languages, and as such Xhosa is related to languages spoken across much of Africa.
Xhosa is the most
Archi is a Northeast Caucasian language spoken by the 1,200 Archis in the village of Archib, southern Dagestan, Russia and the six surrounding smaller villages. It is used at home and the speakers have positive attitude toward it.
It is unusual for its many phonemes and for its contrast between several voiceless velar lateral fricatives and voiceless and ejective velar lateral affricates and a voiced velar lateral fricative. It is an ergative–absolutive language with four noun classes and has a remarkable morphological system with huge paradigms and irregularities on all levels. Mathematically, there are 1,502,839 possible forms that can be derived from a single verb root.
The classification of the Archi language has not been definitively established. Peter von Uslar felt it should be considered a variant of Avar, but Roderich von Erckert saw it is closer to Lak. The language has also been considered as a separate entity that could be placed somewhere between Avar and Lak. The Italian linguist Alfredo Trombetti placed Archi within the Avar-Ando-Dido group, but today the most widely recognized opinion follows that of the Soviet scholar Bokarev, who regards Archi as one of the
Celtiberian or Northeastern Hispano-Celtic is an extinct Indo-European language of the Celtic branch spoken by the Celtiberians in an area of the Iberian Peninsula lying between the headwaters of the Duero, Tajo, Júcar and Turia rivers and the Ebro river. This language is directly attested in nearly 200 inscriptions dated in the 2nd century BC and the 1st century BC, mainly in Celtiberian script, a direct adaptation of the northeastern Iberian script, but also in Latin alphabet. The longest extant Celtiberian inscriptions are those on three Botorrita plaques, bronze plaques from Botorrita near Zaragoza, dating to the early 1st century BC, labelled Botorrita I, III and IV (Botorrita II is in the Latin language).
Celtiberian was a Celtic language that shows the characteristic sound changes that define Celtic languages such as:
Enough has been preserved to show that the Celtiberian language could be called Q-Celtic (like Goidelic), and not P-Celtic like Gaulish. For some, this has served to confirm that the legendary invasion of Ireland by the Milesians, preserved in the Lebor Gabála Érenn, actually happened.
Since Brythonic is P-Celtic too, but as an Insular Celtic language more
Dari is a Northwestern Iranian ethnolect spoken as a first language by an estimated 8,000 to 15,000 Zoroastrians in and around the cities of Yazd and Kerman in central Iran. The ethnolect is often overlooked by linguists because the region is predominantly Muslim and because Dari is primarily spoken (rarely written).
Speakers of Dari may also be found among the Irani community of South Asia with major concentrations in Karachi, Pakistan and Bombay, India.
Dari is also known as Gabri (sometimes Gavrŭni or Gabrōni), or Behdināni. Dari has numerous dialects. It is incomprehensible to speakers of standard Persian.
Genealogically, Dari is a member of the Northwestern Iranian language subfamily, which includes several other closely related languages, for instance, Kurdish, Zazaki and Balochi. These Northwestern Iranian languages are a branch of the larger Western Iranian language group, which is in turn a subgroup of the Iranian language family.
Indigenous speakers of Dari prefer the name 'Dari', but among Iran's Muslims, the language is also known as 'Gabri'. This latter name can be considered offensive as it carries a pejorative connotation (cf. Gabr for details).
The Dari language has
Estonian (eesti keel; pronounced [ˈeːsti ˈkeːl] ( listen)) is the official language of Estonia, spoken natively by about 1.1 million people in Estonia and tens of thousands in various migrant communities. It belongs to the Finnic branch of the Uralic language family.
One distinctive feature that has caused a great amount of interest among linguists is what is traditionally seen as three degrees of phoneme length: short, long, and "overlong", such that /toto/, /toˑto/ and /toːto/ are distinct. In actuality, the distinction is not purely in the phoneme length, and the underlying phonological mechanism is still disputed.
The two different historical Estonian languages (sometimes considered dialects), the North and South Estonian languages, are based on the ancestors of modern Estonians' migration into the territory of Estonia in at least two different waves, both groups speaking considerably different Finnic vernaculars. Modern standard Estonian has evolved on the basis of the dialects of Northern Estonia.
The domination of Estonia after the Northern Crusades, from the 13th century to 1918 by Denmark, Germany, Sweden, and Russia delayed indigenous literacy in Estonia.
Ulster Scots or Ulster-Scots (Ulstèr-Scotch) generally refers to the dialects of the Scots language spoken in parts of Ulster in Ireland. Some definitions of Ulster Scots may also include Standard English spoken with an Ulster Scots accent. This is a situation like that of Lowland Scots and Scottish Standard English – where lexical items have been re-allocated to the phoneme classes that are nearest to the equivalent standard classes. Ulster Scots has been influenced by Hiberno-English, particularly Mid Ulster English, and by Ulster Irish. Ulster Scots has also influenced Mid Ulster English, which is the dialect of most people in Ulster. As a result of the competing influences of English and Scots, varieties of Ulster Scots can be described as 'more English' or 'more Scots'.
The Scots language was brought to Ulster during the early 17th century, when large numbers of Scots speakers arrived from Scotland during the Hamilton and Montgomery Settlements and the Ulster Plantation. The earliest Scots writing in Ulster dates from that time, and 20th century, written Scots from Ulster was almost identical with that of Scotland. However, since the revival of interest in the Ulster dialects
Bhojpuri ( pronunciation (help·info)) (Devnagri: भोजपूरी ; Nastaliq:بھوجپوري) is a predominant language spoken in parts of northern-eastern India. It is spoken in the western part of state of Bihar, the northwestern part of Jharkhand, and the Purvanchal region of Uttar Pradesh (UP), as well as adjoining parts of the Nepal Terai. Bhojpuri is also spoken widely in Guyana, Suriname, Fiji, and Mauritius. The variant of Bhojpuri of the Surinamese Hindustanis is also referred to as Sarnami Hindustani, Sarnami Hindi or just Sarnami and has experienced considerable Creole and Dutch lexical influence. More Indians in Suriname know Bhojpuri compared to Guyana and Trinidad where the language is largely forgotten. Bhojpuri language is also spoken in Karachi, Pakistan
Bhojpuri is part of the Eastern-Hindi languages which once extended from whole Bihar to Lower Doab of Purvanchal. While the rest of Bihar and Uttar Pradesh slowly adopted the new Hindi standard (Khari Boli), the language remained strong in the areas between Patna and Benaras. It is quite similar to Awadhi Language. Some scholar trace the literacy history of Bhojpuri from Siddha Sahitya itself, as early as 8th century A.D. Kabir’s
Portuguese ( português (help·info) or língua portuguesa) is a Romance language. It is the official language of Portugal, Brazil, Mozambique, Angola, Cape Verde, Guinea-Bissau, and São Tomé and Príncipe. Portuguese has co-official status (alongside the indigenous language) in Macau in East Asia, East Timor in South East Asia and in Equatorial Guinea in Central Africa; Portuguese speakers are also found in Goa, Daman and Diu in India.
With a total of 236 million speakers, Portuguese is the 6th most spoken language in the world, the 3rd most spoken language in the western hemisphere, and the most spoken language in the southern hemisphere.
Spanish author Miguel de Cervantes once called Portuguese "the sweet language" and Spanish playwright Lope de Vega referred to it as "sweet", while the Brazilian writer Olavo Bilac poetically described it as a última flor do Lácio, inculta e bela (the last flower of Latium, uneducated and beautiful). Portuguese is also termed "the language of Camões", after one of Portugal's greatest literary figures, Luís Vaz de Camões.
In March 2006, the Museum of the Portuguese Language, an interactive museum about the Portuguese language, was founded in São
The Lak language (лакку маз, lakːu maz) is a Northeast Caucasian language forming its own branch within this family. It is the language of the Lak people from the Russian autonomous republic of Dagestan, where it is one of six standardized languages. It is spoken by about 157,000 people.
In the past it was also referred to as Gazi-Qumuq (Гьази-Кьумукь), which was transliterated via Russian as Kazi-Kumukh (Кази-Кумук) or Kasi-Kumukh (Каси-Кумук), in which Gazi or Kasi means 'warrior for faith'. The standard language is based on the dialect of the city of Kumukh. This city should not be confused with the ethnic group, also present in the Caucasus. Other major dialects are the Arakul, Shadnin, Khosrekh, and Vitskhi (Wicxi, Vicxi) dialects.
Lak has throughout the centuries adopted a number of loanwords from Arabic, Turkish, Persian, and Russian. Ever since Dagestan was part of the USSR and later Russia, the largest portion of loanwords have come from Russian, especially political and technical vocabulary.
There is a newspaper and broadcasting station in Lak.
Until 1928, the Arabic script was used to write the language. Afterwards it was written with a Latin alphabet for ten years, and
Punjabi (Gurmukhi: ਪੰਜਾਬੀ Shahmukhi: پنجابی, Also Devanagari: पंजाबी) is an Indo-Aryan language spoken by inhabitants of the historical Punjab region (north western India and eastern Pakistan). In Pakistan, Punjabi is the most widely spoken native tongue. Punjabi can be subdivided into two varieties, Eastern Punjabi in both India and Pakistan, and Western Punjabi solely in Pakistan. There are some 104 million (2008) native speakers of the Punjabi language; an estimated 76 million in Pakistan (2008) and 28 million in India (2001), and millions in the UK, Canada and Persian Gulf countries, making it the 10th most widely spoken language in the world. Native speakers of the Punjabi language and their respective diaspora are referred to as ethnic Punjabis.
The Punjabi language has different dialects, spoken in the different sub-regions of greater Punjab. The Majhi dialect is Punjabi's prestige dialect and shared by both countries. This dialect is considered as textbook Punjabi and is spoken in the historical region of Majha, centralizing in Lahore and Amritsar.
Along with the Hindko and Western Pahari languages, Punjabi is unusual among modern Indo-European languages because it is a
Vlach (rumînjeašće/vlaški) is uncodified Eastern Romance language of the Vlachs of Serbia. It consists of two main dialects.
Serbian statistics list Vlach as minority language in Serbia and it's recognized by the Serbian Law. In the 2002 census, 40,054 people in Serbia declared themselves ethnic Vlachs and 54,818 people declared themselves native speakers of the Vlach language.
The Vlach language have official status, but it is not standardized.
Serbian Vlachs, as well as Aromanians are old Balkan nationalities that basically speak a Latin language or dialect. It varies from region to region because of geographical distance. They are the remnants of the interchange between the Greek and Latin worlds. They are Vlachs, Armani, Sarkatsani, but not Dacians which are ancestors of Romanians.
Its two main variants, Ungurjan and Caran, are close to some Romanian dialects with stronfg influence from Serbian. But to a well trained Vlach, even Vlahika in Greece is easily understood. They write out the spoken language in Greek, so if one knows the Greek alphabet one can easily decipher what is being said.
Their language was isolated from Romanian (actually, the base of the Romanian language
Esperanto (help·info) is the most widely spoken constructed international auxiliary language. Its name derives from Doktoro Esperanto ("Esperanto" translates as "one who hopes"), the pseudonym under which L. L. Zamenhof published the first book detailing Esperanto, the Unua Libro, on July 26, 1887. Zamenhof's goal was to create an easy-to-learn and politically neutral language that transcends nationality and would foster peace and international understanding between people with different regional and/or national languages.
Estimates of Esperanto speakers range from 10,000 to 2,000,000 active or fluent speakers, as well as perhaps a thousand native speakers, that is, people who learned Esperanto from birth as one of their native languages. Esperanto has a notable presence in over a hundred countries. Usage is highest in Europe, East Asia, and South America. The first World Congress of Esperanto was organized in France in 1905. Since then congresses have been held in various countries every year with the exception of years in which there were world wars. Although no country has adopted Esperanto officially, Esperanto was recommended by the French Academy of Sciences in 1921 and
Greek (ελληνικά IPA [eliniˈka] ellīnika or ελληνική γλώσσα IPA [eliniˈci ˈɣlosa]) ellīnikī glōssa is an independent branch of the Indo-European family of languages. Native to the southern Balkans Western Asia Minor and the Aegean, it has the longest documented history of any Indo-European language, spanning 34 centuries of written records. Its writing system has been the Greek alphabet for the majority of its history; other systems, such as Linear B and the Cypriot syllabary, were previously used. The alphabet arose from the Phoenician script, and was in turn the basis of the Latin, Cyrillic, Coptic, and many other writing systems.
The Greek language holds an important place in the histories of Europe, the more loosely defined Western world, and Christianity; the canon of ancient Greek literature includes works of monumental importance and influence for the future Western canon, such as the epic poems Iliad and Odyssey. Greek was also the language in which many of the foundational texts of Western philosophy, such as the Platonic dialogues and the works of Aristotle, were composed; the New Testament of the Christian Bible was written in Koiné Greek. Together with the Latin texts
Tosk is the southern dialect of the Albanian language. The line of demarcation between Tosk and Gheg (the northern dialect) is the Shkumbin River. Tosk is the basis of the standard Albanian language.
Tosk may also refer to the Tosk-speaking Albanian population of southern Albania, subgroups of which include the Myzeqars of Myzeqe, Labs of Labëria and Chams of Çamëria. The Arvanites of Greece and Arbëreshë of Italy are descendants of Tosk-speaking settlers, as are the original inhabitants of Mandritsa in Bulgaria.
Tosk, in its narrowest sense, may be applied to the people of Toskëria, the region to the north of the Vjosë river and south of the Shkumbin river, in the territory of Fier County. However, the name Toskëria itself is often used to name entire Tosk-speaking parts of Albania, in contrast to northern Gegëria.
Aramaic is a group of languages belonging to the Afroasiatic language phylum. The name of the language is based on the name of Aram, an ancient region in central Syria. Within this family, Aramaic belongs to the Semitic family, and more specifically, is a part of the Northwest Semitic subfamily, which also includes Canaanite languages such as Hebrew and Phoenician. Aramaic script was widely adopted for other languages and is ancestral to both the Arabic and modern Hebrew alphabets.
During its 3,000-year written history, Aramaic has served variously as a language of administration of empires and as a language of divine worship. It was the day-to-day language of Israel in the Second Temple period (539 BC – 70 AD), the language spoken by Jesus, the language of large sections of the biblical books of Daniel and Ezra, and is the main language of the Talmud. However, Jewish Aramaic was different from the other forms both in lettering and grammar. Parts of the Dead Sea Scrolls are in Jewish Aramaic showing the unique Jewish lettering, related to the unique Hebrew language.
Aramaic's long history and diverse and widespread use has led to the development of many divergent varieties which
Neapolitan (autonym: nnapulitano; Italian: napoletano), also known as Southern Italian and Neapolitan–Calabrian, is the language of southern continental Italy, including the city of Naples. It is named not after the city, but after the Kingdom of Naples, which once covered most of this area and of which Naples was the capital.
On October 14, 2008 a law by the Region of Campania stated that the Neapolitan language was to be protected. It has been recognized by UNESCO as a language and a heritage.
Neapolitan has had a significant influence on the intonation of Rioplatense Spanish, of the Buenos Aires region of Argentina.
The Neapolitan dialects are distributed throughout most of continental southern Italy, historically united during the Kingdom of Naples and the Kingdom of the Two Sicilies, specifically southern Lazio (Gaeta and Sora districts), southern Marche, Abruzzo, Molise, Basilicata, Campania (Naples), northern and central Apulia, and northernmost Calabria. The dialects are part of a varied dialect continuum, so the varieties in southern Lazio, Marche, Abruzzo, Molise, Apulia, Lucania and Calabria can typically be recognizable as regional groups of dialects. In eastern Abruzzo
Occitan (English pronunciation: /ˈɒksɪˌtæn/; French pronunciation: [ɔk.si'tɑ̃]; Occitan: [utsiˈta]), known also as Lenga d'òc by its native speakers (Occitan: [ˈleŋɡɔ ˈðɔ(k)]; French: Langue d'oc), is a Romance language spoken in southern France, Italy's Occitan Valleys, Monaco, and Spain's Val d'Aran: the regions sometimes known unofficially as Occitania. It is also spoken in the linguistic enclave of Guardia Piemontese (Calabria, Italy). Occitan is a descendant of the spoken Latin language of the Roman Empire, as are languages such as Italian, Portuguese, Spanish, Romanian and Sardinian. It is an official language in Catalonia, (known as Aranese in Val d'Aran). Occitan's closest relative is Catalan. Since September 2010, the Parliament of Catalonia has considered Aranese Occitan to be the officially preferred language for use in the Val d'Aran.
The term Provençal (Occitan: provençal, provençau or prouvençau, IPA: [pruβenˈsal, pʀuveⁿˈsaw]) may be used as a traditional synonym for Occitan but, nowadays, “Provençal” is mainly understood as an Occitan dialect spoken in Provence.
The long-term survival of Occitan is in question. According to the UNESCO Red Book of Endangered
The Rumsen language (also known as Rumsien, San Carlos Costanoan and Carmeleno) is one of eight Ohlone languages, historically spoken by the Rumsen people of Northern California. The Rumsen language was spoken from the Pajaro River to Point Sur, and on the lower courses of the Pajaro, as well as on the Salinas and Carmel Rivers, and the region of the present-day cities of Salinas, Monterey and Carmel.
One of eight languages within the Ohlone branch of the Utian family, it became one of two important native languages spoken at the Mission San Carlos Borroméo de Carmelo founded in 1770, the other being the Esselen language.
The last fluent speaker of Rumsen (and probably the last fluent speaker of any Costanoan language) was Isabel Meadows, who died in 1939. The Bureau of American Ethnology linguist John Peabody Harrington conducted very extensive fieldwork with Meadows in the last several years of her life. These notes, still mostly unpublished, now constitute the foundation for current linguistic research and revitalization efforts on the Rumsen language.
Dialects of the Rumsen language were spoken by four independent local tribes, including the Rumsen themselves, the Ensen of the
Friulan ( furlan (help·info) or affectionately marilenghe in Friulan, friulano in Italian, furlanisch in German, furlanščina in Slovene; also Eastern Ladin), is a Romance language belonging to the Rhaeto-Romance family, spoken in the Friuli region of northeastern Italy. Friulan has around 800,000 speakers, the vast majority of whom also speak Italian. It is sometimes called Eastern Ladin, since it shares the same roots as Ladin, although over the centuries it has diverged under the influence of surrounding languages, including German, Italian, Venetian, and Slovene. Documents in Friulan are attested from the 11th century, and poetry and literature dating as far back as 1300. By the 20th century, there was a revival of interest in the language, which has continued to this day.
A question which causes many debates is the influence of the Latin spoken in Aquileia and surrounding areas. Some claim that it had peculiar features that later passed into Friulan. Epigraphs and inscriptions from that period show some variants if compared to the standard Latin language, but most of these are common to other areas of the Roman Empire; often it is cited that Fortunatianus, bishop of Aquileia
Galatian is an extinct Celtic language once spoken in Galatia in Asia Minor (modern Turkey) from the 3rd century BC up to at least the 4th century AD, although ancient sources suggest it was still spoken in the 6th century.
Of the language only a few glosses and brief comments in classical writers and scattered names on inscriptions survive. Altogether they add up to about 120 words, mostly personal names ending in -riks (cf. Gaulish -rix/-reix, Old Irish ri, Gothic language -reiks, Old Frankish rik, Latin rex) "king", some ending in -marus, dative -mari (cf. Gaulish -maros, Old Irish mor, Welsh mawr) "great", and tribal names like Ambitouti (Old Irish imm- "around", Old Irish tuath "tribe"), and a lexical item drunemeton "place of assembly" (cf. Old Irish drui "druid", Old Irish neimed "holy place"). Galatian is a Continental Celtic language contemporary and closely related to the Gaulish language. St. Jerome wrote in a comment to "the epistle of St Paul to the Galatians" that the Galatians spoke the same language as the Treveri (Trier).
Kanuri is a dialect continuum spoken by some four million people, as of 1987, in Nigeria, Niger, Chad and Cameroon, as well as small minorities in southern Libya and by a diaspora in Sudan. It belongs to the Western Saharan subphylum of Nilo-Saharan. Kanuri is the language associated with the Kanem and Bornu empires which dominated the Lake Chad region for a thousand years.
The basic word order of Kanuri sentences is subject–object–verb. It is typologically unusual in simultaneously having postpositions and post-nominal modifiers – for example, "Bintu's pot" would be expressed as nje Bintu-be, "pot Bintu-of".
Kanuri has three tones: high, low, and falling. It has an extensive system of consonant weakening (for example, sa- "they" + -buna "have eaten" > za-wuna "they have eaten".
Traditionally a local lingua franca, its usage has declined in recent decades. Most first-language speakers speak Hausa or Arabic as a second language.
Kanuri is spoken mainly in lowlands of the Lake Chad basin, with speakers in Cameroon, Chad, Niger, Nigeria and Sudan.
Ethnologue divides Kanuri into the following languages, while many linguists (e.g. Cyffer 1998) regard them as dialects of a single
Galician /ɡəˈlɪʃən/ (galego IPA: [ɡaˈleɣo]) is a language of the Western Ibero-Romance branch. It is mainly spoken in Galicia, an autonomous community located in northwestern Spain, where it is co-official with Spanish language. The language is also spoken in some border zones of the neighbouring Spanish regions of Asturias and Castile and León, as well as Galician immigrant communities in Latin America.
Galician is closest in relation to the Portuguese language, the two splitting from the former Galician-Portuguese language after Portugal became an independent kingdom. Due to the Celtic identity of the Galician people, the language features many words of Celtic and Proto-Celtic origin.
It is regulated in Spain by the Royal Galician Academy, yet independent organisations such as the Galician Association of Language and the Galician Academy of the Portuguese Language see it as a dialect of Galician-Portuguese.
Modern Galician and its southern sibling, Portuguese, originated from a common mediaeval ancestor called variously by modern linguists as Galician-Portuguese or Mediaeval Galician or Old Galician or Old Portuguese. This common ancestral stage developed in the territories of
Hindi, or more precisely Modern Standard Hindi and also known as Manak Hindi, High Hindi, Nagari Hindi, and Literary Hindi, is a standardised and sanskritised register of the Hindi-Urdu language. It is the native language of "a relatively small number of speakers" in Delhi, Haryana, Western Uttar Pradesh, northeastern Madhya Pradesh, and parts of eastern Rajasthan, and is one of the official languages of the Republic of India.
Colloquial Hindi is mutually intelligible with another register of Hindi-Urdu called (Modern Standard) Urdu. Mutual intelligibility decreases in literary and specialized contexts which rely on educated vocabulary. Speakers of both Hindi and Urdu frequently assert that they are distinct languages, despite the fact that native speakers generally cannot tell the colloquial languages apart. The number of native speakers of Standard Hindi is unclear. According to the 2001 Indian census, 258 million people in India reported their native language to be "Hindi". However, this includes large numbers of speakers of Hindi languages other than Standard Hindi; as of 2009, the best figure Ethnologue could find for Khariboli dialect (the basis of Hindustani) was a 1991
British English (or BrEn, BrE, BE, en-UK or en-GB) is the broad term used to distinguish the forms of the English language used in the United Kingdom from forms used elsewhere. The Oxford English Dictionary applies the term to English "as spoken or written in the British Isles; esp[ecially] the forms of English usual in Great Britain", reserving "Hiberno-English" for the "English language as spoken and written in Ireland". Nevertheless, Hiberno-English forms part of the broad British English continuum. Others, such as the Cambridge Academic Content Dictionary, define it as the "English language as it spoken and written in England.".
There are slight regional variations in formal written English in the United Kingdom. For example, although the words wee and little are interchangeable in some contexts, wee (as an adjective) is almost exclusively written by some people from some parts of northern Britain (and especially Scotland) or from Northern Ireland, whereas in Southern England and Wales, little is used predominantly. Nevertheless, there is a meaningful degree of uniformity in written English within the United Kingdom, and this could be described by the term British English. The
Chaldean Neo-Aramaic is a Northeastern Neo-Aramaic dialect. Chaldean Neo-Aramaic is spoken on the plain of Mosul in northern Iraq, as well as by the Chaldean communities worldwide. Most speakers are Chaldean Catholics.
Chaldean Neo-Aramaic is one of a number of modern Northeastern Aramaic languages spoken in the region of Kurdistan, between Lake Urmia in Iranian Azerbaijan in northern Iraq near Dohuk and near the Turkish border. Jews and Christians speak different dialects of Aramaic that are often mutually unintelligible. The Christian dialects have been heavily influenced by Classical Syriac, the literary language of Syriac Christianity in antiquity. Therefore Christian Neo-Aramaic has a dual heritage: literary Syriac and colloquial Eastern Aramaic. The Christian dialects are often called Soureth, or Syriac. In Iraqi Arabic. Chaldean Neo-Aramaic is spoken mostly in the mountainous regions in far north Iraq.
Before the schism of 1552, most Christians in this region were members of the Church of the East. When schism split the church, most of the Christians of the region opted for communion with the Roman Catholic Church and became members of the Chaldean Catholic Church.
The Hawaiian language (Hawaiian: ʻŌlelo Hawaiʻi) is a Polynesian language that takes its name from Hawaiʻi, the largest island in the tropical North Pacific archipelago where it developed. Hawaiian, along with English, is an official language of the state of Hawaii. King Kamehameha III established the first Hawaiian-language constitution in 1839 and 1840.
For various reasons, including Territorial legislation establishing English as the official language in schools, the number of native speakers of Hawaiian gradually decreased during the period from the 1830s to the 1950s. Hawaiian was essentially displaced by English on six of the seven inhabited islands. As of 2001, native speakers of Hawaiian amounted to under 0.1% of the statewide population. Linguists are worried about the fate of this and other endangered languages.
Nevertheless, from about 1949 to the present, there has been a gradual increase in attention to, and promotion of, the language. Public Hawaiian-language immersion preschools called Pūnana Leo were started in 1984; other immersion schools followed soon after. The first students to start in immersion preschool have now graduated from college and many are fluent
Interlingua ( /ɪntərˈlɪŋɡwə/; ISO 639 language codes ia, ina) is an international auxiliary language (IAL), developed between 1937 and 1951 by the International Auxiliary Language Association (IALA). It ranks among the top three most widely used IALs (after Esperanto and perhaps Ido), and is the most widely used naturalistic IAL: in other words, its vocabulary, grammar and other characteristics are largely derived from natural languages. Interlingua was developed to combine a simple, mostly regular grammar with a vocabulary common to the widest possible range of languages, making it unusually easy to learn, at least for those whose native languages were sources of Interlingua's vocabulary and grammar. Conversely, it is used as a rapid introduction to many natural languages. Interlingua literature maintains that (written) Interlingua is comprehensible to the hundreds of millions of people who speak a Romance language, though it is actively spoken by only a few hundred.
The name Interlingua comes from the Latin words inter, meaning between, and lingua, meaning tongue or language. These morphemes are identical in Interlingua. Thus, Interlingua would be "between language", or
Nobiin, or Mahas, is a Northern Nubian language of the Nilo-Saharan phylum. ‘Nobiin’ is the genitive form of Nòòbíí ‘Nubian" and literally means ‘(language) of the Nubians". Another term used is Noban tamen, meaning ‘the Nubian language’.
Nubian peoples immigrated into the Nile Valley from the southwest, where other Nubian languages are still spoken, at least 2,500 years ago, and Old Nubian, the language of the Nubian kingdoms, is considered ancestral to Nobiin. Nobiin is a tonal language with contrastive vowel and consonant length. The basic word order is subject–object–verb.
Nobiin is currently spoken along the banks of the Nile river in southern Egypt and northern Sudan by approximately 495,000 Nubians. Present-day Nobiin speakers are almost universally bilingual in local varieties of Arabic—Egyptian and Sudanese. Many Nobiin-speaking Nubians were forced to relocate in 1963–1964 to make room for the construction of the Aswan High Dam at Aswan, Egypt and for the upstream Lake Nasser.
There is no standardized orthography for Nobiin. It has been written in both Latinized and Arabic scripts; also, recently there have been efforts to revive the Old Nubian alphabet. This article
Inari Sámi (anarâškielâ) is a Uralic, Sami language spoken by the Inari Sami of Finland. It has approximately 300 speakers, the majority of whom are middle-aged or older and live in the municipality of Inari. According to the Sami Parliament of Finland 269 persons used Inari Sami as their first language. It is the only Sami language that is spoken exclusively in Finland. The language is classified as being seriously endangered as few children learn the language.
The first book in Inari Sámi was Anar sämi kiela aapis kirje ja doctor Martti Lutherus Ucca katkismus, which was written and translated by Edvard Wilhelm Borg in 1859. The written history of modern Inari Sámi, however, is said to begin with Lauri Arvid Itkonen's translation of the history of the Bible in 1906, although he had already translated some other books into Inari Sámi before that (Martin Luther and John Charles Ryles). After that, Inari Sami was mainly published in books written by linguists, Frans Äimä and Erkki Itkonen, in particular. For many years, very little literature was written in Inari Sami, although Sämitigge has funded and published a lot of books, etc., in recent years.
Since 1992, Finland's Sami have
Ume Sami is a Sami language spoken in Sweden and (formerly) in Norway. It is a moribund language with only about 10 native speakers left which used to be spoken mainly along the Ume River in the south of present-day Arjeplog, in Sorsele and Arvidsjaur.
Unlike its southern neighbor Southern Sami, Ume Sami has consonant gradation. However, gradation is more limited than it is in the more northern Sami languages, because it occurs only after long vowels or diphthongs. Consonants following a short vowel do not participate in gradation.
The verbs in Ume Sami have three persons:
There are three grammatical numbers: singular, dual and plural.
Ume Sami has two grammatical moods:
Ume Sami, like Finnish, the other Sámi languages and Estonian, has a negative verb. In Ume Sami, the negative verb conjugates according to mood (indicative and imperative), person (1st, 2nd and 3rd) and number (singular, dual and plural).
Ume Sami is one of the four Sami languages that do not have an official written language, although it was the first Sami language to be written extensively. The New Testament was published in Ume Sami in 1755 and the first Bible in Sami was also published in Ume Sami, in 1811.
Sicilian (lu sicilianu, Italian: lingua siciliana, also known as Siculu or Calabro-Sicilian) is a Romance language. It is spoken on the island of Sicily and its satellite islands; in southern and central Calabria (where it is called Southern Calabro); in the southern parts of Apulia, the Salento (where it is known as Salentino); and Campania, on the Italian peninsula, where it is called Cilentano (Gordon, 2005). Ethnologue (see below for more detail) describes Sicilian as being "distinct enough from Standard Italian to be considered a separate language" (Gordon). Some assert that Sicilian represents the oldest Romance language derived from Vulgar Latin (Privitera, 2004), but this is not a widely-held view amongst linguists, and is sometimes strongly criticized (2004, p. 151).
Sicilian is currently spoken by the majority of the inhabitants of Sicily and by emigrant populations around the world. The latter are found in the countries which attracted large numbers of Sicilian immigrants during the course of the past century or so, especially the United States, Canada, Australia and Argentina. In the past two or three decades, large numbers of Sicilians were also attracted to the
Macedonian (македонски јазик, makedonski jazik, pronounced [maˈkɛdɔnski ˈjazik] ( listen)) is a South Slavic language spoken as a first language by approximately 2–3 million people principally in the region of Macedonia but also in the Macedonian diaspora. It is the official language of the Republic of Macedonia and holds the status of official minority language in parts of Albania, Romania and Serbia.
Standard Macedonian was implemented as the official language of the Socialist Republic of Macedonia in 1945 and has since developed a thriving literary tradition. Most of the codification was formalized during the same period.
Macedonian dialects form a continuum with Bulgarian dialects; together in turn they form a broader continuum with Serbo-Croatian through the transitional Torlakian dialects. The name of the Macedonian language is a matter of political controversy in Greece as is its distinctiveness in Bulgaria.
The Macedonian language belongs to the eastern group of the South Slavic branch of Slavic languages in the Indo-European language family, together with Bulgarian. The modern Macedonian language is unrelated to the Ancient Macedonian language. Macedonian's closest
Montenegrin (Crnogorski jezik, Црногорски језик) is an incipient standardized register of the Serbo-Croatian language as spoken by Montenegrins used as the official language of Montenegro. The same subdialect of Shtokavian is also the basis of standard Bosnian, Croatian and Serbian, so all are mutually intelligible and are a single language by that criterion despite being distinct national standards.
The idea of a Montenegrin standard language separate from Serbian appeared in 1990s and gained traction in 2000s via proponents of Montenegrin independence. Montenegrin became the official language of Montenegro with the ratification of a new constitution on 22 October 2007. The Montenegrin standard is still emerging. Its orthography was established 10 July 2009 with the addition of two letters to the alphabet, though grammar and a school curriculum are yet to be approved.
In January 2008, the government of Montenegro formed the Council for the Codification of the Montenegrin Language, which aims to standardize the Montenegrin language according to international norms. Proceeding documents will, after verification, become a part of the educational programme in Montenegrin schools.
Basque (endonym: Euskara, IPA: [eus̺ˈkaɾa]) is the ancestral language of the Basque people, who inhabit the Basque Country, a region spanning an area in northeastern Spain and southwestern France. It is spoken by 27% of Basques in all territories (714,136 out of 2,648,998). Of these, 663,035 live in the Spanish part of the Basque country and the remaining 51,100 live in the French part.
In academic discussions of the distribution of Basque in Spain and France, it is customary to refer to three ancient provinces in France and four Spanish provinces. Native speakers are concentrated in a contiguous area including parts of the Spanish Autonomous Communities of the Basque Country and Navarre and in the western half of the French Département of Pyrénées-Atlantiques. The Basque Autonomous Community is an administrative entity within the binational ethnographic Basque Country incorporating the traditional Spanish provinces of Biscay, Gipuzkoa, and Álava, which retain their existence as politico-administrative divisions.
These provinces and many areas of Navarre are heavily populated by ethnic Basques, but the Euskara language had, at least until the 1990s, all but disappeared from most of
Fataluku (also known as Dagaga, Dagoda', Dagada) is a Papuan language spoken by approximately 30,000 people of Fataluku ethnicity in the eastern areas of East Timor, especially around Lospalos and a dialect of it, Oirata, is spoken in Kisar, Moluccas in Indonesia. It is a Papuan language, and is usually considered a Trans–New Guinea language. It is given the status of a national language under the constitution.
The letter 'c' and the letter combination 'tx' are pronounced as 'ch'.
Hó, Rau. Yes.
Upe, Kapare. No.
Lulue. Thank you.
Tali lulue. Thank you very much.
Helupai, Aka natxuni You're welcome.
Ant ivi nere. Excuse me.
Ó lai'i. Hello.
Nita tana fale. Goodbye. Ia toto. " take care"
Ihani koice. See you later.
Naunop irauni. Good morning.
Vacu hici /Meucia irauni. Good afternoon.
Mua koun irauni. Good evening.
Muna koun irauni. Good night.
Eceremu. think or guest.
Lemara. go home.
Old French (franceis, françois, romanz; Modern French ancien français) was the Romance dialect continuum spoken in territories that span roughly the northern half of modern France and parts of modern Belgium and Switzerland from the 9th century to the 14th century. It was then known as the langue d'oïl (oïl language) to distinguish it from the langue d'oc (Occitan language, also then called Provençal), whose territory bordered that of Old French to the south. The Norman dialect was also spread to England, Ireland, the Kingdom of Sicily and the Principality of Antioch in the Levant.
Gaulish, maybe the only survivor of the continental Celtic languages in Roman times, slowly became extinct during the long centuries of Roman dominion. Only several dozen words (perhaps 200, if we add Gaulish etymology) survive in modern French, for example chêne, ‘oak tree’ and charrue ‘plough'; Delamarre (2003, pp. 389–90) lists 167.
Despite attempts to explain some phonetic changes by the action of a Gaulish substratum, only one of them is sure, because this fact is clearly attested in the Gaulish language epigraphy, e.g. : on the potteries of la Graufesenque (1st c. AD), there is the Latin word (from
Persian (locally known as فارسی fārsi, دری darī, and тоҷикӣ تاجیکی tojikī) is an Iranian language within the Indo-Iranian branch of the Indo-European languages. It is primarily spoken in Iran, Afghanistan, Tajikistan, and countries which historically came under Persian influence. The Persian language is classified as a continuation of Middle Persian, the official religious and literary language of Sassanid Persia, itself a continuation of Old Persian, the language of the Persian Empire in the Achaemenid era. Persian is a pluricentric language and its grammar is similar to that of many contemporary European languages.
Persian has ca. 110 million native speakers, holding official status respectively in Iran, Afghanistan and Tajikistan. For centuries Persian has also been a prestigious cultural language in Central Asia, South Asia, and Western Asia.
Persian has had a considerable, mainly lexical influence on neighboring languages, particularly the Turkic languages in Central Asia, Caucasus, and Anatolia, neighboring Iranian languages, as well as Armenian, and Indo-Aryan languages, especially Urdu. It also exerted some influence on Arabic, particularly Iraqi Arabic, while borrowing
Northern or North Sami (Sámegiella or Davvisámegiella, formerly Davvisámi or Davvisaami; disapproved exonym Lappish or Lapp) is the most widely spoken of all Sami languages. The speaking area of Northern Sami covers the northern parts of Norway, Sweden and Finland. The number of Northern Sami speakers is estimated to be somewhere between 15,000 and 25,000. About 2000 of these live in Finland and between 5000 and 6000 in Sweden.
Northern Sami was first described by Knud Leem (En lappisk Grammatica efter den Dialect, som bruges af Field-Lapperne udi Porsanger-Fiorden) in 1748 and in dictionaries in 1752 and 1768. One of Leem's fellow grammaticians was Anders Porsanger, who studied at the Trondheim Cathedral School and other schools, but who was unable to publish his work on Sami due to racist attitudes at the time. Unfortunately, the majority of his work has disappeared.
The roots of the current orthography for Northern Sami were laid by Rasmus Rask who, after discussions with Nils Vibe Stockfleth, published Ræsonneret lappisk sproglære efter den sprogart, som bruges af fjældlapperne i Porsangerfjorden i Finmarken. En omarbejdelse af Prof. Knud Leems Lappiske grammatica in 1832. Rask
Acadian French (French: Français acadien), is a regionalized dialect of Canadian French. It is spoken by the francophone population of the Canadian province of New Brunswick, by small minorities in areas in the Gaspé region of eastern Quebec, by small groups of francophones in Nova Scotia and Prince Edward Island, in the Magdalen Islands in the Gulf of St. Lawrence, and a small swath of the northernmost portion (St. John's Valley) of the U.S state of Maine. The remaining majority of predominately-francophone Quebec speak Quebec French.
Since there was no linguistic contact with France from the late eighteenth century until the twentieth century, Acadian French retained features that died out during the French standardization efforts of the nineteenth century. That can be seen in examples like:
Many aspects of Acadian French (vocabulary, alveolar "r", etc.) are still common in rural areas in the West of France. Speakers of Metropolitan French and even of other Canadian dialects sometimes have minor difficulties understanding Acadian French.
See also Chiac, a variety with strong English influence, and Saint Mary's Bay French, a distinct variety of Acadian French spoken around Clare
Aragonese ( /ˌærəɡɒˈniːz/; aragonés: [aɾaɡoˈnes] in Aragonese) is a Romance language spoken by between 10,000 and 30,000 people throughout the valleys of the Pyrenees in Aragon, Spain, mainly in the comarcas of Somontano de Barbastro, Jacetania, Alto Gállego, Sobrarbe, and Ribagorza. It is the only modern language that developed from medieval Navarro-Aragonese.
While informally known as fabla ("talk" or "speech"), Aragonese is also commonly referred to by the names of its numerous local dialects which arose from fragmentation the language underwent over the centuries.
Aragonese originated in the early Middle Ages, as one of many Latin dialects developed in the Pyrenees on top of a strong Basque-like substratum. The original Kingdom of Aragon (formed by the counties of Aragon, Sobrarbe and Ribagorza) was progressively expanded from the mountain ranges towards the South, pushing the Moors farther south in the Reconquista and spreading the Aragonese language.
The dynastic union of the Catalan Counties and the Kingdom of Aragon—which formed the Aragonese Crown in the twelfth century—did not result in a merging of the language forms of the two territories; Catalan continued to be spoken
Italian ( italiano (help·info) or lingua italiana) is a Romance language spoken mainly in Europe: Italy, Switzerland, San Marino, Vatican City, by minorities in Malta, Monaco, Croatia, Slovenia, France, Libya, Eritrea, and Somalia, and by immigrant communities in the Americas and Australia. Many speakers are native bilinguals of both standardised Italian and other regional languages.
According to the Bologna statistics of the European Union, Italian is spoken as a mother tongue by 65 million people in the EU (13% of the EU population), mainly in Italy, and as a second language by 14 million (3%). Including the Italian speakers in non-EU European countries (such as Switzerland and Albania) and on other continents, the total number of speakers is more than 85 million.
In Switzerland, Italian is one of the four official languages; it is studied and learned in all the confederation schools and spoken, as mother tongue, in the Swiss cantons of Ticino and Grigioni and by the Italian immigrants that are present in large numbers in German- and French-speaking cantons. It is also the official language of San Marino, as well as the primary language of Vatican City. It is co-official in
Udmurt (удмурт кыл, udmurt kyl) is an Uralic language, part of the Permic subgroup, spoken by the Udmurt natives of the Russian constituent republic of Udmurtia, where it is coofficial with Russian. It is written in the Cyrillic script with five additional characters (Ӝ/ӝ, Ӟ/ӟ, Ӥ/ӥ, Ӧ/ӧ, and Ӵ/ӵ). Together with Komi and Komi-Permyak languages, it constitutes the Permic grouping. Among outsiders, it has traditionally been referred to by its Russian exonym, Votyak. Udmurt has borrowed vocabulary from the neighboring languages Tatar and Russian.
Ethnologue estimates 550,000 native speakers (77%) in an ethnic population of 750,000 in the former USSR (1989 census).
The Udmurt alphabet is based on the Russian Cyrillic alphabet:
The language does not distinguish between long and short vowels and does not have vowel harmony.
The consonants /f x t͡s/ are restricted to loanwords, and are traditionally replaced by /p k t͡ɕ/ respectively.
Udmurt is an agglutinating language. It uses affixes to express possession, to specify mode, time, and so on.
Depending on the style, about 10 to 30 percent of the Udmurt lexicon consists of loanwords. Many loanwords are from the Tatar
Tetum (also Tetun) is an Austronesian language spoken on the island of Timor. It is spoken in Belu Regency in Indonesian West Timor, and across the border in East Timor, where it is one of the two official languages. In East Timor a creolized form, Tetun Dili, is widely spoken fluently as a second language; without previous contact, Tetum and Tetun Dili are unintelligible. Besides the grammatical simplification involved in creolization, Tetun Dili has been greatly influenced by the vocabulary of Portuguese, the other official language of East Timor.
Tetum has four dialects:
Tetun-Belu and Tetun-Terik are not spoken or well understood outside their home territories. Tetun-Prasa is the form of Tetum that is spoken throughout East Timor. Although Portuguese was the official language of Portuguese Timor until 1975, Tetun-Prasa has always been the predominant lingua franca in the eastern part of the island.
In the fifteenth century, before the arrival of the Portuguese, Tetum had spread through central and eastern Timor as a contact language under the aegis of the Belunese-speaking Kingdom of Wehali, at that time the most powerful kingdom in the island. The Portuguese (present in Timor
Torlakian or Torlak (Serbo-Croatian pronunciation: [tɔ̌rlaːk]) is a name given to the group of South Slavic dialects of southeastern Serbia (southern Kosovo – Prizren), northeastern Republic of Macedonia (Kumanovo, Kratovo and Kriva Palanka dialects), western Bulgaria (Belogradchik–Godech–Tran-Breznik), which is intermediate between Serbo-Croatian, Bulgarian and Macedonian.
Some linguists classify it as an Old Shtokavian dialect or as a fourth dialect of Serbo-Croatian along with Shtokavian, Chakavian, and Kajkavian. Others classify it as a western Bulgarian dialect, in which case it is called Transitional. Torlakian is not standardized, and its subdialects significantly vary in some features.
Speakers of the dialectal group are primarily ethnic Serbs, Bulgarians, and Macedonians. There are also smaller ethnic communities of Croats (the Krashovani) in Romania and Slavic Muslims (the Gorani) in southern Kosovo.
The Torlakian dialects are intermediate between Bulgarian, Macedonian and Serbo-Croatian, and have been variously described, in whole or in parts, as belonging to each of these three neighbouring languages. In the nineteenth century, their classification was hotly contested
Hawaii Pidgin English, Hawaii Creole English, HCE, or simply Pidgin, is a creole language based in part on English used by many residents of Hawaii. Although English and Hawaiian are the co-official languages of the state of Hawaii, Pidgin is used by many Hawaiian residents in everyday conversation and is often used in advertising toward Hawaiians. The ISO 639-3 language code for Hawaii Pidgin (Hawaii Creole English) is hwc. In the Hawaiian language, "Hawaiian Creole English" is called "ʻōlelo paʻi ʻai", which literally means "pounding-taro language".
Pidgin (or Hawaii Creole) originated as a form of communication used between English speaking residents and non-English speaking immigrants in Hawaii. It supplanted the pidgin Hawaiian used on the plantations and elsewhere in Hawaii. It has been influenced by many languages, including Portuguese, Hawaiian, and Cantonese. As people of other language backgrounds were brought in to work on the plantations, such as Japanese, Filipinos, and Koreans, Pidgin acquired words from these languages. Japanese loanwords in Hawaii lists some of those words originally from Japanese. It has also been influenced to a lesser degree by Spanish spoken by
Kalaw Lagaw Ya (Kala Lagaw Ya), or the Western Torres Strait Language (several other names, see below), is the language indigenous to all the central and western Torres Strait Islands, Queensland, Australia. On some islands it has now largely been replaced by Brokan (Torres Strait Creole English).
Before colonisation in the 1870s–1880s, it was the major lingua franca of the area in both Australia and Papua, and is still widely spoken by neighbouring Papuans and by some Aboriginal people. However many second (or nth) language speakers it has is unknown. It also has a "cut-it-short" (simplified/foreigner) form, as well as a pidginised form. The simplified form is fairly prevalent on Badu [Badhu] (Kala Lagaw Ya territory) and neighbouring Moa, particularly among younger people.
The language is known by several names besides Kalaw Lagaw Ya, most of which (including Kalaw Lagaw Ya), are strictly speaking names of dialects, spelling variants, dialect variants, and the like - and include translations of the English terms, Western Island Language and Central Island Language. The following list includes most of the commonest:
In literature on the language the abbreviations KLY (Kalaw Lagaw
Modern Standard Arabic (MSA; Arabic: اللغة العربية الفصحى al-luġatu l-ʿarabiyyatu l-fuṣḥā "the most eloquent Arabic language"), Standard Arabic, or Literary Arabic is the standard and literary variety of Arabic used in writing and in most formal speech.
Most western scholars distinguish two standard (al-)fuṣḥā (الفصحى) varieties of the Arabic language: the Classical Arabic (CA) (اللغة العربية التراثية) of the Qur'an and early Islamic (7th to 9th centuries) literature, and Modern Standard Arabic (MSA) (اللغة العربية المعيارية الحديثة), the standard language in use today. The modern standard language is based on the Classical language. Most Arabs consider the two varieties to be two registers of one language, although the two registers can be described in Arabic as فصحى العصر fuṣḥā l-ʿaṣr (MSA) and فصحى التراث fuṣḥā t-turāṯ (CA).
Classical Arabic, also known as Qur'anic Arabic, is the language used in the Qur'an as well as in numerous literary texts from Umayyad and Abbasid times (7th to 9th centuries).
Classical Arabic is often considered to be the parent language of all the spoken varieties of Arabic, but recent scholarship, such as Clive Holes' (2004), shades this view, showing
Nahuatl (Nahuatl pronunciation: [ˈnaːwatɬ] ( listen), with stress on the first syllable) is a group of related languages and dialects of the Nahuan branch of the Uto-Aztecan language family. Altogether they are spoken by an estimated 1.5 million Nahua people, most of whom live in Central Mexico. All Nahuan languages are indigenous to Mesoamerica.
Nahuatl has been spoken in Central Mexico since at least the 7th century AD. It was the language of the Aztecs, who dominated what is now central Mexico during the Late Postclassic period of Mesoamerican history. During the centuries preceding the Spanish conquest of Mexico, the Aztec Empire had expanded to incorporate most of Mexico, and its influence caused the variety of Nahuatl spoken by the residents of Tenochtitlan becoming a prestige language in Mesoamerica. At the conquest, with the introduction of the Latin alphabet, Nahuatl also became a literary language and many chronicles, grammars, works of poetry, administrative documents and codices were written in the 16th and 17th centuries. This early literary language based on the Tenochtitlan variety has been labeled Classical Nahuatl and is among the most studied and best documented
Pāli (also Pāḷi) is a Middle Indo-Aryan language (of Prakrit group) of the Indian subcontinent. It is best known as the language of many of the earliest extant Buddhist scriptures, as collected in the Pāḷi Canon or Tipitaka, and as the liturgical language of Theravada Buddhism.
The word Pāli itself signifies "line" or "(canonical) text". This name for the language seems to have its origins in commentarial traditions, wherein the Pāli (in the sense of the line of original text quoted) was distinguished from the commentary or vernacular translation that followed it in the manuscript. As such, the name of the language has caused some debate among scholars of all ages; the spelling of the name also varies, being found with both long "ā" [ɑː] and short "a" [a], and also with either a retroflex [ɭ] or non-retroflex [l] "l" sound. Both the long ā and retroflex ḷ are seen in the ISO 15919/ALA-LC rendering, Pāḷi; however, to this day there is no single, standard spelling of the term, and all four possible spellings can be found in textbooks. R. C. Childers translates the word as "series" and states that the language "bears the epithet in consequence of the perfection of its grammatical
Lule Sami (julevsámegiella) is a Uralic, Sami language spoken in Lule Lappmark, i.e., around the Lule River, Sweden and in the northern parts of Nordland county in Norway, especially Tysfjord municipality, where Lule Sami is an official language. It is written in the Latin script, having an official alphabet.
With 1,500 to 2,000 speakers it is the second largest of all Sami languages. It is reported that the number of native speakers is in sharp decline among the younger generations. The language has, however, been standardised in 1983 and elaborately cultivated ever since.
Lule Sámi has seven cases:
Like the other Uralic languages, the nominative singular is unmarked and indicates the subject of a predicate. The nominative plural is also unmarked and is always formally the same as the genitive singular.
The genitive singular is unmarked and looks the same as the nominative plural. The genitive plural is marked by an -j. The genitive is used:
The accusative is the direct object case and it is marked with -v in the singular. In the plural, its marker is -t, which is preceded by the plural marker -j.
The inessive marker is -n in the singular and the plural, when it is then preceded
Shughni is one of the Pamir languages of the Southeastern Iranian language group. Its distribution is in the Gorno-Badakhshan Autonomous Province in Tajikistan and Badakhshan Province in Afghanistan.
Shughni tends towards SOV word order, distinguishes a masculine and a feminine gender in nouns and some adjectives as well as the 3rd person singular of verbs. Shughni distinguishes between an absolutive and an oblique case in its system of pronouns. The Rushani dialect is noted for a typologically unusual 'double-oblique' construction, also called a 'transitive case', in the past tense.
Rushani, Bartangi, Oroshor (Roshorvi), Khufi, and Shughni proper are considered to be dialects. However, Bartangi and Khufi are quite distinct, and may be separate languages.
Yiddish (ייִדיש yidish or אידיש idish, literally "Jewish") is a High German language of Ashkenazi Jewish origin, spoken in many parts of the world. It developed as a fusion of Hebrew and Aramaic into German dialects with the infusion of Slavic and traces of Romance languages. It is written in the Hebrew alphabet.
The language originated in the Ashkenazi culture that developed from about the 10th century in the Rhineland and then spread to Central and Eastern Europe and eventually to other continents. In the earliest surviving references to it, the language is called לשון־אַשכּנז (loshn-ashknez = "language of Ashkenaz") and טײַטש (taytsh, a variant of tiutsch, the contemporary name for the language otherwise spoken in the region of origin, now called Middle High German). In common usage, the language is called מאַמע־לשון (mame-loshn, literally "mother tongue"), distinguishing it from Biblical Hebrew and Aramaic, which are collectively termed לשון־קודש (loshn-koydesh, "holy tongue"). The term "Yiddish" did not become the most frequently used designation in the literature of the language until the 18th century.
For a significant portion of its history, Yiddish was the primary spoken
Enga (also Caga, Tsaga, Tchaga) is a language of the East New Guinea Highlands that is spoken by approximately 180,000 people in Enga Province, Papua New Guinea. It has the largest body of speakers of any native language in New Guinea.
Irish (Gaeilge), also known as Irish Gaelic, is a Goidelic language of the Indo-European language family, originating in Ireland and historically spoken by the Irish people. Irish is now spoken as a first language by a minority of Irish people, as well as being a second language of a larger proportion of the population. Irish enjoys constitutional status as the national and first official language of the Republic of Ireland. It is an official language of the European Union and an officially recognised minority language in Northern Ireland.
Irish was the predominant language of the Irish people for most of their recorded history, and they brought their Gaelic speech with them to other countries, notably Scotland and the Isle of Man, where it gave rise to Scottish Gaelic and Manx. It has the oldest vernacular literature in Western Europe. In the Elizabethan era the Gaelic language was viewed as something barbarian and as a threat to all things English in Ireland. There was then enacted a systematic campaign to destroy all things Irish, including the Irish language. Irish forms of dress were banned and no form of Irish names was recognised (something which still exists to this day in
Knaanic (also called Canaanic, Leshon Knaan, Judaeo-Czech, or Judaeo-Slavic) is an extinct West Slavic Jewish language, formerly spoken in the lands of the Western Slavs, notably the Czech lands, but also the lands of modern Poland, Lusatia and other Sorbian regions. It became extinct in the Late Middle Ages.
The name comes from the land of Knaan, a geo-ethnological term denoting the Jewish populations living east of the Elbe river (as opposed to the Ashkenazi Jews living to the West of it, or the Sephardi Jews of Iberian Peninsula). As such, the land is often simply translated as Slavonia, or Slavic Europe.
The term is derived from ancient Canaan (Hebrew כנען "kəna'an"). The term Canaan was used by Jews in Europe for the Slavic peoples, as a punning reference to the so-called "curse of Canaan" (Genesis 9:25), that Canaan shall "be a slave".
The language became extinct some time in the Middle Ages, possibly due to expansion of the Ashkenazi culture and their own Yiddish language based on German. This hypothesis is often backed up with a large number of Yiddish loanwords of Slavic origin, many of which were no longer in use in Slavic languages themselves at the time of the Ashkenazi
Latin (/ˈlætɪn/; Latin: lingua latīna; IPA: [ˈlɪŋɡʷa laˈtiːna]) is an Italic language originally spoken in Latium and Ancient Rome. Along with most European languages, it is a descendant of the ancient Proto-Indo-European language. It originated in the Italian peninsula. Although it is considered a dead language, many students, scholars, and members of the Christian clergy speak it fluently, and it is still taught in some primary and secondary and many post-secondary educational institutions around the world. Recently, there has been some revival of interest in American schools. Latin is still used in the creation of new words in modern languages of many different families, including English, and in biological taxonomy. Latin and its daughter Romance languages are the only surviving languages of the Italic language family. Other languages of the Italic branch are attested in the inscriptions of early Italy, but were assimilated to Latin during the Roman Republic.
The extensive use of elements from vernacular speech by the earliest authors and inscriptions of the Roman Republic make it clear that the original, unwritten language of the Roman Monarchy was an only partially deducible
Romanesco or Romanesque is a regional language or sociolect subsumed within the Italian language spoken in Rome. It is part of the Central Italian dialects and is thus genetically closer to the Tuscan dialect and Standard Italian.
There exist a few notable grammatical and idiomatic differences from Standard Italian. Rich in expressions and sayings, Romanesco is used for informal communication by most natives of Rome, often in a mix with Italian.
As shown by several medieval manuscripts, the medieval Roman dialect was more similar to southern dialect, such as those spoken in Naples. In the 16th century, it received a strong influence of the Tuscan dialect (from which modern Italian derives) after the immigration of people from that region in the wake of the Sack of Rome (1527). Therefore current Romanesco has grammar and roots rather different from other dialects in the Latium region; further, usually Romanesco is fully understandable for other Italian speakers.
Romanesco also influenced the dialect of the area of modern Latina, which was reclaimed in the early 1920 and mostly populated by immigrants from northern Italy: it became the local dialect as it was spoken by the small but
Vietnamese (tiếng Việt) is the national and official language of Vietnam. It is the mother tongue of Vietnamese people (Kinh), and of about three million overseas Vietnamese. It also is spoken as a first or second language by many ethnic minorities of Vietnam. It is part of the Austro-Asiatic language family of which it has the most speakers by a significant margin (several times larger than the other Austro-Asiatic languages put together). Much of Vietnamese vocabulary has been borrowed from Chinese, and it was formerly written using the Chinese writing system, albeit in a modified format and was given vernacular pronunciation. As a byproduct of French colonial rule, the language displays some influence from French, and the Vietnamese alphabet (quốc ngữ) in use today is a Latin alphabet with additional diacritics for tones and certain vowels and consonants.
As the national language of the major ethnic group, Vietnamese is spoken throughout Vietnam by the Vietnamese people, as well as by ethnic minorities. It also is spoken in overseas Vietnamese communities, most notably in the United States, where it has more than one million speakers and is the seventh most-spoken language (it
Äynu (or Aini, Ejnu, Abdal) is a Turkic language spoken in western China known in various spelling as Aini, Aynu, Ainu, Eyni or by the Uyghur Abdal (ئابدال), in Russian sources Эйну́, Айну, Абдал, by the Chinese as Ainu. Some linguists call it a mixed language, having a mostly Turkic grammar, essentially Uyghur, but a mainly Iranian vocabulary. Other linguists argue that it does not meet the technical requirements of a mixed language. It is spoken by the Äynu, a nomadic people. The Äynu people call their language Äynú (ئەينۇ) [ɛjˈnu].
Äynu is spoken in Western China in the Xinjiang Uyghur Autonomous Region on the edge of the Taklimakan Desert in the Tarim Basin.
The only speakers of Äynu are adult men. Uyghur is spoken with outsiders and with women, who do not speak Äynu. Uyghur is spoken at home when it is not necessary to disguise one's speech.
Äynu numerals are borrowed from Persian:
1 yäk, 2 du, 3 si, 4 čar, 5 pänǰ, 6 šäš, 7 häp(t), 8 häš(t), 9 noh, 10 dah, 20 bist, 100 säd, 1000 hazar
The Armenian language (հայերէն in TAO or հայերեն in RAO, Armenian pronunciation: [hɑjɛˈɾɛn]—hayeren) is an Indo-European language spoken by the Armenian people. It is the official language of the Republic of Armenia as well as in the region of Nagorno-Karabakh. The language is also widely spoken by Armenian communities in the Armenian diaspora. It has its own script, the Armenian alphabet, and is of interest to linguists for its distinctive phonological developments within Indo-European.
Linguists classify Armenian as an independent branch of the Indo-European language family. Armenian shares a number of major innovations with Greek, and some linguists group these two languages together with Phrygian and the Indo-Iranian family into a higher-level subgroup of Indo-European which is defined by such shared innovations as the augment. More recently, others have proposed a Balkan grouping including Greek, Armenian, Phrygian and Albanian.
Armenian has a long literary history, with a fifth-century Bible translation as its oldest surviving text. Its vocabulary has been heavily influenced by Western Middle Iranian languages, particularly Parthian, and to a lesser extent by Greek, Latin,
Bulgarian (български език, pronounced: [ˈbɤ̞ɫɡɐrski ɛˈzik]) is an Indo-European language, a member of the Southern branch of the Slavic language family.
Bulgarian, along with the closely related Macedonian language (collectively forming the East South Slavic languages), has several characteristics that set it apart from all other Slavic languages: changes include the elimination of case declension, the development of a suffixed definite article (see Balkan language area) and the lack of a verb infinitive; but it retains and has further developed the Proto-Slavic verb system. Various evidential verb forms exist to express unwitnessed, retold, and doubtful action. Estimates of the number of people around the world who speak Bulgarian fluently range from about 9 million to 12 million.
The development of the Bulgarian language may be divided into several periods.
Bulgarian was the first "Slavic" language attested in writing. As Slavic linguistic unity lasted into late antiquity, in the oldest manuscripts this language was initially referred to as языкъ словяньскъ, "the Slavic language". In the Middle Bulgarian period this name was gradually replaced by the name языкъ блъгарьскъ, the
Hakka is one of the major Chinese subdivisions or varieties and is spoken natively by the Hakka people in southern China and the island of Taiwan and throughout the diaspora areas of East Asia, Southeast Asia and around the world.
Due to its original usage in scattered isolated regions where communication is limited to the local area, the Hakka language has developed numerous variants or dialects, spoken in Guangdong, Fujian, Jiangxi, Guangxi, Sichuan, Hunan, and Guizhou provinces, including Hainan island, Singapore and Taiwan. Hakka is not mutually intelligible with Mandarin, Wu, Minnan, or other branches of Chinese. It is most closely related to Gan, and is sometimes classified as a variety of Gan.
Taiwan, where Hakka language is the mother tongue of a significant minority of the island's residents, is an important world center for study of the language. Pronunciation differences exist between the Taiwanese Hakka dialect and China's Guangdong Hakka dialect, and even in Taiwan two local varieties of Hakka exist within that dialect.
The Moi-yen/Moi-yan (梅縣, Pinyin: Méixiàn) dialect of northeast Guangdong in China has been taken as the "standard" dialect by the Chinese government.
Ido ( /ˈiːdoʊ/) is a language created to be a universal second language for speakers of diverse backgrounds. Ido was specifically designed to be grammatically, orthographically, and lexicographically regular, and above all easy to learn and use. In this sense, Ido is classified as a constructed international auxiliary language.
Ido was created in 1907 out of a desire to reform perceived flaws in Esperanto, a language that had been created for the same purpose 20 years earlier. The name of the language traces its origin to the Esperanto word ido, meaning "offspring", since the language is a "descendant" of Esperanto. After its inception, Ido gained support from some in the Esperanto community, but following the sudden death in 1914 of one of its most influential proponents, Louis Couturat, it declined in popularity. There were two reasons for this: first, the emergence of further schisms arising from competing reform projects; and second, a general lack of awareness of Ido as a candidate for an international language. These obstacles weakened the movement and it was not until the rise of the Internet that it began to regain momentum.
Ido uses the same 26 letters as the English
Kurdish (Kurdish: Kurdî or کوردی) is a dialect continuum spoken by the Kurds in western Asia. It is part of the Iranian branch of the Indo-Iranian group of Indo-European languages.
The Kurdish language has about 35 million potential speakers today. According to the CIA World Factbook, 10% of the total population of Iran speaks Kurdish. The CIA factbook estimates the number to be between 25–30 million.
Kurdish is not a unified standard language but a discursive construct of languages spoken by ethnic Kurds, referring to a group of speech varieties that are not necessarily mutually intelligible unless there has been considerable prior contact between their speakers. The second official language of Iraq, referred to only as 'Kurdish' in political documents, is in fact an academic and standardized version of the Sorani dialect of a branch of languages spoken by Kurds.
The written literary output in Kurdic languages was confined mostly to poetry until the early 20th century, when a general written literature began to be developed. In its written form today "Kurdish" has two regional standards, namely Kurmanji in the northern parts of the geographical region of Kurdistan, and Sorani
Nepal Bhasa (नेपाल भाषा, Nēpāl bhāṣā, also known as Newāh Bhāy) is one of the major languages of Nepal. It was Nepal's administrative and day-to-day language from the 14th to the late 18th centuries. Nepal Bhasa is spoken today as a mother tongue by the Newars, the indigenous inhabitants of Nepal Mandala, which consists of the Kathmandu Valley and surrounding regions.
Outside Nepal, Nepal Bhasa is also spoken in India, particularly in Sikkim where it is one of the 11 official languages.
Nepal Bhasa is classified as a Sino-Tibetan language, but it has been greatly influenced by Indo-Aryan languages.
The earliest occurrences of the name Nepal Bhasa can be found in the manuscripts Narad Sanhita, dated 1380 AD, and Amarkosh, dated 1389 AD. Since then, the name has been used widely on inscriptions, manuscripts, documents and books.
In the 1920s, the name of the language known as Khaskura, Gorkhali or Parbatiya was changed to Nepali, and Nepal Bhasa began to be officially referred to as Newari while the Newars continued using the original term. Similarly, the term Gorkhali in the former national anthem entitled "Shreeman Gambhir" was changed to Nepali in 1951.
On 8 September 1995,
Skolt Sami (sääˊmǩiõll) is a Uralic, Sami language spoken by approximately 400 speakers in Finland, mainly in Sevettijärvi, and approximately 20–30 speakers of the Njuõˊttjäuˊrr (Notozero) dialect in an area surrounding Lake Lovozero in Russia. Skolt Sami used to also be spoken on the Neiden area of Norway, although it has died out there. It is written using a Roman orthography that was made official in 1973.
Skolt Sami was spoken in four villages on Finnish territory prior to the Second World War. In Petsamo, Skolt Sami was spoken in Suonikylä and the village of Petsamo. This area was ceded to Russia in the Second World War, and the Skolts were evacuated to the villages of Inari, Sevettijärvi and Nellim in the Inari municipality.
Skolt Sami is spoken by approximately 400 individuals, nearly all of whom live in Finland; very few speakers remain today on the Russian side. On the Finnish side of the border, the language is recognized by the government as one of the official Sami languages used in Lapland and can thus be used by anyone conducting official business in that area. It is an official language in the municipality of Inari, and elementary schools there offer courses in the
Ter Sami is the easternmost of the Sami languages. It was traditionally spoken in the northeastern part of the Kola Peninsula, but now it is a moribund language; in 2004, only ten speakers were left. By 2010, the number of speakers had decreased to two.
In the end of the 19th century, there were six Ter Sami villages in the eastern part of the Kola Peninsula, with a total population of approximately 450. In 2004, there were approximately 100 ethnic Ter Sami of whom two elderly persons speak the language; the rest have shifted their language to Russian.
The rapid decline in the number of speakers was caused by Soviet collectivisation, during which use of the language was prohibited in schools and homes in the 1930s, and the largest Ter Sami village, Jokanga, was declared "perspectiveless" and its inhabitants were forced to move to the Gremikha military base.
There are no educational materials or facilities in Ter Sami, and the language has no standardized orthography. The language is incompletely studied and documented; text specimens, audio recordings as well as dictionaries for linguistic purposes exist, but no grammatical description is available.
The earliest known documentation
Slovene or Slovenian (slovenski jezik or slovenščina, not to be confused with slovenčina, the native name of Slovak) belongs to the group of South Slavic languages. It is spoken by approximately 2.5 million speakers worldwide, the majority of whom live in Slovenia. It is the first language of about 1.85 million people and is one of the 23 official and working languages of the European Union.
Standard Slovene is the national standard language that was formed in the 18th century, mostly based on Upper and Lower Carniolan dialect groups, the latter being a dialect spoken by Primož Trubar. Since Prekmurje dialect has been omitted from the formation of the standard that was finalized in the 19th and 20th centuries, its speakers still feel disconnected from it and use the dialect more widely than in other regions. In some regions of the Slovene Lands, where the compulsory schooling was in German and Italian, i.e. in the Austrian state of Carinthia and in case of the Slovene minority in Italy, the dialects are more preserved. For example, Resian and Torre (Ter) dialects in the Italian Province of Udine differ most from other Slovene dialects.
The distinctive characteristics of Slovene are
Chicomuceltec (also Chikomuselteko or Chicomucelteco; archaically, Cotoque) is a Mayan language formerly spoken in the region defined by the municipios of Chicomuselo, Mazapa de Madero, and Amatenango de la Frontera in Chiapas, Mexico, as well as some nearby areas of Guatemala. By the 1970s–80s it had become extinct, with recent reports in Mayanist literature finding that there are no living native speakers. Communities of contemporary Chicomucelteco descendants, numbering approximately 1500 persons in Mexico and 100 in Guatemala are Spanish-speakers.
Chicomuceltec was formerly sometimes called Cakchiquel Mam, although it is only distantly related to the Cakchiquel or Mam, being much closer to Wastek (Huastec).
The Chicomuceltec language was first documented in modern linguistic literature as a distinct language in the late 19th century, where it appeared in an account published by linguist Karl Sapper of his travels in northern Mesoamerica 1888–95.
Chicomuceltec's relationship with Wastek was established in the late 1930s (Kroeber 1939), which concluded via word-list comparisons with other Mayan languages that it bore a higher degree of affinity with Wastek than other Mayan
Classical Arabic (CA), also known as Quranic Arabic, is the form of the Arabic language used in literary texts from Umayyad and Abbasid times (7th to 9th centuries). It is based on the Medieval dialects of Arab tribes. Modern Standard Arabic (MSA) is the direct descendant used today throughout the Arab World in writing and in formal speaking, for example, prepared speeches, some radio broadcasts, and non-entertaining content. While the lexis and stylistics of Modern Standard Arabic are different from Classical Arabic, the morphology and syntax have remained basically unchanged (though MSA uses a subset of the syntactic structures available in CA). The vernacular dialects, however, have changed more dramatically. In the Arab world, little distinction is made between CA and MSA, and both are normally called al-fuṣḥā (الفصحى) in Arabic, meaning 'the clearly spoken one' or the 'language of eloquence'.
Because the Qur'an is written in Classical Arabic, the language is considered by most Muslims to be sacred. It is mostly the language in which Muslims recite their prayers, regardless of what language they use in everyday life.
Classical Arabic has its origins in the central and northern
Cree (Nēhiyawēwin / ᓀᐦᐃᔭᐍᐏᐣ; also known as Cree–Montagnais, Cree–Montagnais–Naskapi) is an Algonquian language spoken by approximately 117,000 people across Canada, from the Northwest Territories and Alberta to Labrador, making it the aboriginal language with the highest number of speakers in Canada. Despite numerous speakers within this wide-ranging area, the only region where Cree has any official status is in the Northwest Territories, alongside eight other aboriginal languages.
Endonyms are Nēhiyawēwin ᓀᐦᐃᔭᐍᐏᐣ (Plains Cree), Nīhithawīwin (Woods Cree), Nēhinawēwin and Nehirâmowin (Atikamekw), Nehilawewin (Western Montagnais, Piyekwâkamî dialect), Leluwewn (Western Montagnais, Betsiamites dialect), Innu-Aimûn (Eastern Montagnais), Iynu-Ayamûn (Southern Inland East Cree), Iyiyiw-Iyimiwin (Northern East Cree).
The Cree dialect continuum can be divided by many criteria. Dialects spoken in northern Ontario and the southern James Bay, Lanaudière, and Mauricie regions of Quebec make a distinct difference between /ʃ/ (sh as in she) and /s/, while those to the west (where both are pronounced /s/) and east (where both are pronounced either /ʃ/ or /h/) do not. In several dialects,
Jarawara (also Jaruara, Jaruára, Yarawara) is a dialect of Madi, an Arauan language spoken in Amazonas, Brazil. Jarawara is spoken by approximately 155 people.
The glottal stop [ʔ] has a limited distribution.
The liquid /r/ may be realized as a trill [r], flap [ɾ], or lateral [l]. The palatal stop /ɟ/ may be realized as a semivowel [j].
The glottal fricative /h̃/ is nasalized. See rhinoglottophilia.
Māori or te reo Māori (pronounced [ˈmaːɔɾi, tɛ ˈɾɛɔ ˈmaːɔɾi]), commonly te reo ("the language"), is the language of the indigenous population of New Zealand, the Māori. It has the status of an official language in New Zealand. Linguists classify it within the Eastern Polynesian languages as being closely related to Cook Islands Māori, Tuamotuan, Rapanui and Tahitian; somewhat less closely to Hawaiian and Marquesan; and more distantly to the languages of Western Polynesia, including Samoan, Tokelauan, Niuean and Tongan. According to a 2001 survey on the health of the Māori Language, the number of very fluent adult speakers was about 9% of the Māori population, or 29,000 adults.
New Zealand has three official languages – English, Māori and New Zealand Sign Language. Māori gained this status with the passing of the Māori Language Act in 1987. Most government departments and agencies have bilingual names; for example, the Department of Internal Affairs Te Tari Taiwhenua, and places such as local government offices and public libraries display bilingual signs and use bilingual stationery. The New Zealand Post recognises Māori place-names in postal addresses. Dealings with government
Norwegian (norsk) is a North Germanic language spoken primarily in Norway, where it is the official language. Together with Swedish and Danish, Norwegian forms a continuum of more or less mutually intelligible local and regional variants (see Danish language).
These Scandinavian languages together with the Faroese language and Icelandic language, as well as some extinct languages, constitute the North Germanic languages (also called Scandinavian languages). Faroese and Icelandic are hardly mutually intelligible with Norwegian in their spoken form, because continental Scandinavian has diverged from them.
As established by law and governmental policy, there are two official forms of written Norwegian – Bokmål (literally "book tongue") and Nynorsk (literally "new Norwegian"). The Norwegian Language Council is responsible for regulating the two forms, and recommends the terms "Norwegian Bokmål" and "Norwegian Nynorsk" in English. Two other written forms without official status also exist: Riksmål ("national language"), which is to a large extent the same language as Bokmål, but somewhat closer to the Danish language, is regulated by the Norwegian Academy, which translates it as
The Pennsylvania German language (usually referred to as Pennsylvania Dutch, or simply as Dutch, in American English; usually referred to in Pennsylvania German as Deitsch, Pennsylvania Deitsch or Pennsilfaanisch Deitsch) is a variety of West Central German possibly spoken by more than 250,000 people in North America. It has traditionally been the language of the Pennsylvania Dutch, descendants of late 17th and early 18th century immigrants to the US states of Pennsylvania, Maryland, Virginia and North Carolina from southern Germany, eastern France and Switzerland. Although for many, the term 'Pennsylvania Dutch' is often taken to refer to the Old Order Amish and related groups exclusively, the term should not imply a connection to any particular religious group. The Amish and Mennonites originally made up only a small percentage of the Pennsylvania German population.
In this context, the word "Dutch" does not refer to the Dutch people or their descendants. Instead it is probably left over from an archaic sense of the English word "Dutch"; compare German Deutsch ('German'), Dutch Duits ('German'), Diets ('Dutch'), which once referred to any people speaking a non-peripheral
Tobelo (Indonesian: Bahasa Tobelo) is a West Papuan language spoken on the eastern Indonesian island of Halmahera and on parts of several neighboring islands. The Tobelo-speaking heartland is in the district (Indonesian kecematan) of Tobelo, located on the western shore of Kao Bay. The district capital, also known as Tobelo, serves as a regional commercial and administrative center and is the largest settlement on Halmahera.
Six principal dialects are generally recognized (Voorhoeve 1988):
The Tugutil or "Forest Tobelo" may constitute an additional dialectal variant, but this has not been satisfactorily documented.
In addition, based on cognition percentages in a basic vocabulary list, Voorhoeve 1988 identifies five other North Halmaheran language varieties as dialects of the putative Northeast Halmaheran language. Together with Tobelo, these varieties are Galela, Loloda, Modole, Pagu and Tabaru. Most speakers consider these to be distinct languages from Tobelo, although they do acknowledge a certain degree of mutual intelligibility.
Most all of the 50 languages of Maluku have some sort of directional system. At the least the up/down distinction seems to be an areal feature. But
The Laz language (ლაზური ნენა, lazuri nena; Georgian: ლაზური ენა, lazuri ena, or ჭანური ენა, č'anuri ena, also chanuri ena; Turkish: Lazca) is a Kartvelian language spoken by the Laz people on the Southeast shore of the Black Sea. It is estimated that there are around 30,000 native speakers of Laz in Turkey, in a strip of land extending from Melyat to the Georgian border (officially called Lazistan until 1925), and about 2,000 in Georgia.
Laz is one of the four South Caucasian languages. Along with Mingrelian, it forms the Zan branch of this language family. The two languages are very closely related, to the extent that some linguists refer to Mingrelian and Laz as dialects or regional variants of a single Zan language, a view held officially in the Soviet era and still so in Georgia today. In general, however, Laz and Mingrelian are classed as separate languages, due both to the long-standing separation of their communities of speakers (500 years) and to a lack of mutual intelligibility.
The ancient kingdom of Colchis was located in the same region the Laz speakers are found in today, and its inhabitants probably spoke an ancestral version of the language. Colchis was the setting
Bambara, also known as Bamana, and Bamanankan by speakers of the language, is a language spoken in Mali, and to a lesser extent Burkina Faso, Senegal by as many as six million people (including second language users). The Bambara language is the language of people of the Bambara ethnic group, numbering about 4,000,000 people, but serves also as a lingua franca in Mali (it is estimated that about 80 percent of the population speak it as a first or second language). It is a Subject–object–verb language and has two tones.
Bambara is a language/dialect of the Manding language cluster, a cluster of languages whose ethnic-speakers generally trace their cultural history to the ancient city of Manding, where modern-day Kita, Mali now exists. Bambara is classified as part of the larger, very broad Mandé group. Dialects of Manding are generally considered (among native speakers) to be mutually intelligible – dependent on exposure or familiarity with dialects between speakers – and spoken by approximately 20 million people in the countries Burkina Faso, Senegal, Guinea Bissau, Guinea, Liberia, Côte d'Ivoire and The Gambia.
It uses seven vowels a, e, ɛ, i, o, ɔ and u (the letters approximate
The extinct Dacian language developed from Proto-Indo-European (PIE), possibly in the Carpathian region sometime in the period 3000-1500 BC. The language was probably extinct by AD 600. In the 1st century AD, it was probably the predominant language of the ancient regions of Dacia and Moesia and possibly of some surrounding regions.
While there is unanimous agreement among scholars that Dacian was an Indo-European language, there are divergent opinions about its place within the IE family: (1) Dacian was a dialect of the extinct Thracian language, or vice versa e.g. Baldi (1983) and Trask (2000). (2) Dacian was a language distinct from Thracian but related to it, belonging to the same branch of the Indo-European family (a "Thraco-Dacian", or "Daco-Thracian" branch has been theorised by some linguists). (A dated view, now largely rejected, considered the extinct Phrygian language also to belong to the same branch as Dacian and Thracian). (3) Dacian was a language unrelated to either Thracian or Phrygian, each of these languages belonging to different branches of IE e.g. Georgiev (1977) and Duridanov (1985).
The Dacian language is poorly documented. Unlike for Phrygian, which is
Hindi-Urdu (हिंदी उर्दू, ہندی اردو) is an Indo-Aryan language and the lingua franca of North India and Pakistan. It is also known as Hindustani (हिन्दुस्तानी, ہندوستانی, Hindustānī, IPA: [ɦɪ̃n̪d̪ʊsˈt̪aːni], literally: "of Hindustan"), and historically, as Hindavi or Rekhta. It derives primarily from the Khariboli dialect of Delhi, and incorporates a large amount of vocabulary from Persian, Arabic, Sanskrit and Turkic. It is a pluricentric language, with two official forms, Standard Hindi and Standard Urdu, which are standardized registers of it. The colloquial languages are all but indistinguishable, and even the official standards are nearly identical in grammar, though they differ in literary conventions and in academic and technical vocabulary, with Urdu retaining stronger Persian, Central Asian and Arabic influences, and Hindi relying more heavily on Sanskrit. Before the Partition of British India, the terms Hindustani, Urdu, and Hindi were synonymous; all covered what would be called Urdu and Hindi today. The term Hindustani is still used for the colloquial language and lingua franca of India and Pakistan, for example for the language of Bollywood films, as well as for several
Indonesian (Bahasa Indonesia) is the official language of Indonesia. It is a standardized register of Malay, an Austronesian language which has been used as a lingua franca in the Indonesian archipelago for centuries.
Indonesia is the fourth most populous nation in the world. Of its large population, the number of people who speak Indonesian fluently is fast approaching 100%, making Indonesian, and thus Malay, one of the most widely spoken languages in the world.
Most Indonesians, aside from speaking the national language, are often fluent in another regional language (examples include Javanese, Sundanese and Madurese) which are commonly used at home and within the local community. Most formal education, as well as nearly all national media and other forms of communication, are conducted in Indonesian. In East Timor, which was an Indonesian province from 1975 to 1999, Indonesian is recognised by the constitution as one of the two working languages (the other being English), alongside the official languages of Tetum and Portuguese.
The Indonesian name for the language is Bahasa Indonesia (literally "the language of Indonesia"). This term is occasionally found in English. Indonesian
Thai (ภาษาไทย Phasa Thai [pʰāːsǎː tʰāj] ( listen)), more precisely Central Thai or Siamese, is the national and official language of Thailand and the native language of the Thai people, Thailand's dominant ethnic group. Thai is a member of the Tai group of the Tai–Kadai language family. Some words in Thai are borrowed from Pali, Sanskrit and Old Khmer. It is a tonal and analytic language. Thai also has a complex orthography and relational markers. Thai is mutually intelligible with Lao. Native speakers of these two languages can understand one another without great difficulty.
Thai is the official language of Thailand, spoken by over 20 million people (2000), Standard Thai is based on the register of the educated classes of Bangkok. In addition to Siamese Thai, Thailand is home to other related Tai languages, including:
Most speakers of dialects and minority languages speak Central Thai as well, since it is the language used in schools and universities all across the kingdom.
Numerous languages not related to Thai are spoken within Thailand. Near Laos and Burma, ethnic minority hill tribes people speak Hmong–Mien (Yao), Karen, Lisu, and others. Near Cambodia many communities speak
Toki Pona is a constructed language, first published online in mid-2001. It was designed by translator and linguist Sonja Elen Kisa of Toronto.
Toki Pona is a minimal language. Like a pidgin, it focuses on simple concepts and elements that are relatively universal among cultures. Kisa designed Toki Pona to express maximal meaning with minimal complexity. The language has 14 phonemes and 123 root words. It is not designed as an international auxiliary language but is instead inspired by Taoist philosophy, among other things.
The language is designed to shape the thought processes of its users, in the style of the Sapir–Whorf hypothesis in Zen-like fashion. This goal, together with Toki Pona's deliberately restricted vocabulary, has led some to feel that the language, whose name literally means "simple language", "good language", or "goodspeak", resembles George Orwell's fictional language Newspeak.
Sonja Elen Kisa is a translator (English, French and Esperanto) and linguist living in Toronto, Ontario, Canada. In addition to designing Toki Pona, Kisa has translated parts of the Tao Te Ching into English and Esperanto.
Kisa officially uses letters of the Latin alphabet to represent
Jamaican Patois, known locally as Patois (Patwa or Patwah) or Jamaican, and called Jamaican Creole by linguists, is an English-lexified creole language with West African influences spoken primarily in Jamaica and the Jamaican diaspora. It is not to be confused with Jamaican English nor with the Rastafarian use of English. The language developed in the 17th century, when slaves from West and Central Africa were exposed to, learned and nativized the vernacular and dialectal forms of English spoken by their masters: British English, Scots and Hiberno-English. Jamaican Patois features a creole continuum (or a linguistic continuum)—meaning that the variety of the language closest to the lexifier language (the acrolect) cannot be distinguished systematically from intermediate varieties (collectively referred to as the mesolect) nor even from the most divergent rural varieties (collectively referred to as the basilect). Jamaicans themselves usually refer to their dialect as patois, a French term without a precise linguistic definition.
Significant Jamaican-speaking communities exist among Jamaican expatriates in Miami, New York City, Toronto, Hartford, Washington, D.C., Nicaragua, Costa
Standard Chinese or Modern Standard Chinese, also known as Mandarin, Putonghua, Guoyu and Modern Standard Mandarin, is a standardized variety of Chinese and the official language of the People's Republic of China and the Republic of China (Taiwan), and is one of the four official languages of Singapore.
The phonology of Standard Chinese or Standard Mandarin is based on the Beijing dialect, but its vocabulary is drawn from the large and diverse group of Chinese dialects spoken across northern, central and southwestern China, a linguistic area whose varieties are collectively known as Mandarin Chinese. The grammar is standardized to the body of modern literary works that define written vernacular Chinese (originally Baihua), the colloquial alternative to Classical Chinese developed around the turn of the 20th century in China. The name "Mandarin" originally referred to the language used by the imperial court in Beijing and sometimes by imperial officials elsewhere (simplified Chinese: 官话; traditional Chinese: 官話; pinyin: Guānhuà; literally "speech of officials"), and as such was adopted as a synonym for Modern Standard Chinese in the 20th century, but the term became ambiguous as its
Arabic (العربية al-ʻarabīyah or عربي/عربى ʻarabī ) ( [al ʕarabijja] (help·info) or ( [ʕarabi] (help·info)) is a name applied to the descendants of the Classical Arabic language of the 6th century AD. This includes both the literary language and the spoken Arabic varieties.
The literary language is called Modern Standard Arabic or Literary Arabic. It is currently the only official form of Arabic, used in most written documents as well as in formal spoken occasions, such as lectures and news broadcasts. In 1912, Moroccan Arabic was official in Morocco for some time, before Morocco joined the Arab League.
The spoken Arabic varieties are spoken in a wide arc of territory stretching across the Middle East and North Africa.
Arabic languages are Central Semitic languages, most closely related to Hebrew, Aramaic, Ugaritic and Phoenician. The standardized written Arabic is distinct from and more conservative than all of the spoken varieties, and the two exist in a state known as diglossia, used side-by-side for different societal functions.
Some of the spoken varieties are mutually unintelligible, both written and orally, and the varieties as a whole constitute a sociolinguistic language.
Early Modern English (sometimes abbreviated to EModE) is the stage of the English language used from the beginning of the Tudor period until the English Interregnum and Restoration, or from the transition from Middle English in the late 15th century to the transition to Modern English during the mid to late 17th century.
Prior to and following the accession of James I to the English throne in 1603 the emerging English standard began to influence the spoken and written Middle Scots of Scotland.
Modern readers of English are generally able to understand texts written in the late phase of the Early Modern English period (e.g. the first edition of the King James Bible and the works of William Shakespeare), while texts from the earlier phase (such as Le Morte d'Arthur) may present more difficulties. The Early Modern English of the early 17th century forms the base of the grammatical and orthographical conventions that survive in Modern English.
The change from Middle English to Early Modern English was not just a matter of vocabulary or pronunciation changing: it was the beginning of a new era in the history of English. An era of linguistic change in a language with large variations in
The Frisian languages are a closely related group of Germanic languages, spoken by about 500,000 members of Frisian ethnic groups, who live on the southern fringes of the North Sea in the Netherlands and Germany. The Frisian languages are the second closest living languages to English, after Scots. However, modern English and Frisian are mostly unintelligible to each other. Frisian languages bear similarities to Low German, Dutch (from which many Frisian words have been borrowed), and Danish. Additional shared linguistic characteristics between the Great Yarmouth area, Friesland, and Denmark are likely to have resulted from the close trading relationship these areas maintained during the centuries-long Hanseatic League of the Late Middle Ages.
There are three varieties of Frisian: West Frisian, Saterland Frisian, and North Frisian. Some linguists consider these three varieties, despite their mutual unintelligibility, to be dialects of one single Frisian language, while others consider them to be three separate languages, as do their speakers. Of the three, the North Frisian language especially is further segmented into several strongly diverse dialects. Stadsfries is not Frisian,
The Gaulish (also Gallic) language is an extinct Celtic language that was spoken by the Gauls, a people who inhabited the region known as Gaul (Cisalpine and Transalpine) from the Iron Age through the Roman period. It was historically spoken through what are now mainly France, Northern Italy, Switzerland, eastern Belgium, Luxembourg and western Germany before being supplanted by Vulgar Latin and various Germanic languages from around the 4th century onwards. Gaulish is paraphyletically grouped with Celtiberian as Continental Celtic. Lepontic is considered to be either a dialect of or a language closely related to Gaulish. Galatian is the form of Gaulish spoken in Asia Minor after 281 BC.
Gaulish is a P-Celtic language, though some inscriptions (e.g. the Coligny Calendar) potentially show Q-Celtic characteristics (however, this is a matter of debate among Celticists). Gaulish has a very close relationship to Insular Celtic (Goidelic and Brythonic), and many forms are identical in the two. Epigraphical remains have been uncovered across all of what used to be Roman Gaul, which covered modern France, as well as parts of Switzerland, Italy, Germany, and Belgium.
Oriya (ଓଡ଼ିଆ oṛiā), officially spelled Odia, is an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family. It is mainly spoken in the Indian states of Orissa and in parts of West Bengal, Jharkhand, Chhattisgarh and Andhra Pradesh. Oriya is one of the many official languages in India; it is the official language of Orissa and the second official language of Jharkhand. Oriya is the predominant language of Orissa, where Oriya speakers comprise around 83.33% of the population according to census surveys.
Outside Orissa, there are also significant Oriya-speaking populations in other linguistic regions, such as the Midnapore district of West Bengal, the East Singhbhum, West Singhbhum Seraikela Kharsawan district, Simdega, Gumla, Khunti, Ranchi district of Jharkhand, the Srikakulam, Vizianagaram & Vishakhapatnam District of Andhra Pradesh, eastern districts of Chhattisgarh state. Due to the increasing migration of labour, the west Indian state of Gujarat also has a significant Oriya speaking population with Surat being the city with the second largest Oriya-speaking population in India. The Oriya-speaking people are also found in significant numbers in
Cupeño is an extinct Uto-Aztecan language, formerly spoken by the Cupeño people of Southern California, USA, who now speak English. Roscinda Nolasquez was the last native speaker of Cupeño.
The language was originally spoken in Cupa, Wilaqalpa, and Paluqla, San Diego County, California, and later and around the Pala Indian Reservation.
Cupeño is an agglutinative language, where words use suffix complexes for a variety of purposes with several morphemes strung together.
Cupeño inflects its verbs for transitivity, tense, aspect, mood, person, number, and evidentiality.
Evidentiality is expressed in Cupeño with clitics, which generally appear near the beginning of the sentence. =ku'ut 'reportative' (mu=ku'ut 'and it is said that...') =am 'mirative' =$he 'dubitative'
There are two inflected moods, realis =pe and irrealis =e'p.
The pronominals of Cupeño appear in many different forms and structures. The following appear attached only to past-tense verbs.
Future simple verbs are unmarked. Past simple verbs have past-tense pronouns; past imperfect add the imperfect modifier shown below.
/ɛ/ and /o/ appear largely in Spanish loanwords, but also as allophones of /ə/ in native Cupeño words.
Quenya (pronounced [ˈkʷwɛnja]) is a fictional language devised by J. R. R. Tolkien, and used in his Secondary world, often called Middle-earth.
Quenya is one of the many Elvish languages spoken by the immortal Elves, called Quendi in Quenya. The tongue actually called Quenya was in origin the speech of two clans of Elves living in Eldamar ("Elvenhome"), the Noldor and the Vanyar. Quenya translates as simply "language", or in contrast to other tongues that the Elves met later in their long history "elf-language". In the Second Age the Wise of Númenor learned the Quenya tongue. In the Third Age, (the time of the setting of The Lord of the Rings) Quenya was no longer a living language for the Noldor of Middle-earth. Exilic Quenya was learned at an early age by all Elves of Noldorin origin, and it continued to be used in spoken and written form, but their mother-tongue was another Elven-tongue, Sindarin.
Tolkien began with devising the language at around 1910 and re-structured the grammar four times until Quenya reached its final state. The vocabulary however remained relatively stable throughout the creation process. Also the name of the language itself was repeatedly changed by
Australian English (AusE, AuE, AusEng, en-AU) is a major variety of the English language and is used throughout Australia. Although English has no official status in the Constitution, Australian English is Australia's de facto official language and is the first language of the majority of the population.
Australian English started diverging from British English after the founding of the colony of New South Wales in 1788 and was recognised as being different from British English by 1820. It arose from the intermingling of children of early settlers from a great variety of mutually intelligible dialectal regions of the British Isles and quickly developed into a distinct variety of English.
The earliest form of Australian English was first spoken by the children of the colonists born into the colony of New South Wales. This very first generation of children created a new dialect that was to become the language of the nation. The Australian-born children in the new colony were exposed to a wide range of different dialects from all over Britain and Ireland, in particular from Ireland and South East England.
The native-born children of the colony created the new dialect from factors
Avestan ( /əˈvɛstən/) is an East Iranian language known only from its use as the language of Zoroastrian scripture, i.e. the Avesta, from which it derives its name. Its area of composition comprised – at least – Arachosia/Sīstān, Herat, Merv and Bactria. The Yaz culture has been regarded as a likely archaeological reflection of early East Iranian culture as described in the Avesta. Its status as a sacred language has ensured its continuing use for new compositions long after the language had ceased to be a living language.
Avestan, which is associated with northeastern Iran and Old Persian, which belongs to the southwest, plus Gathic or the Old Avestan (language of the Gathas, the Hymns of Zarathushtra – 2nd millennium BC) are the only languages (of what must have been a great variety) to have left written traces; they together constitute what is called the Old Iranian Languages. The Old Iranian language group is a branch of the Indo-Iranian language group, which is in turn a branch of the Indo-European language group.
Iranian languages are traditionally classified as "eastern" or "western", and within this framework Avestan is classified as Eastern Iranian. But this distinction is
Dutch ( Nederlands (help·info)) is a West Germanic language and the native language of most of the population of the Netherlands, and about sixty percent of the populations of Belgium and Suriname, the three member states of the Dutch Language Union. Most speakers live in the European Union, where it is a first language for about 23 million and a second language for another 5 million people. It also holds official status in the Caribbean island nations of Aruba, Curaçao, and Sint Maarten, while historical minorities remain in parts of France and Germany, and to a lesser extent, in Indonesia, and up to half a million native Dutch speakers may be living in the United States, Canada, and Australia. The Cape Dutch dialects of Southern Africa have been standardised into Afrikaans, a mutually intelligible daughter language of Dutch which today is spoken to some degree by an estimated total of 15 to 23 million people in South Africa and Namibia.
Dutch is closely related to German and English and is said to be between them. Apart from not having undergone the High German consonant shift, Dutch—like English—has mostly abandoned the grammatical case system, is relatively unaffected by the
Faroese, (in Faroese: føroyskt, pronounced [ˈføːʐɪst]), is an Insular Nordic language spoken as a native language by about 45,000 people in the Faroe Islands and about 25,000-30,000 Faroese people in Denmark and elsewhere. It is one of four languages descended from the Old West Norse language spoken in the Middle Ages, the others being Icelandic, Norwegian and the extinct Norn, which is thought to have been mutually intelligible with Faroese. Faroese and Icelandic, its closest extant relative, are not mutually intelligible in speech, but the written languages resemble each other quite closely, largely owing to Faroese' etymological orthography.
Around AD 900 the language spoken in the Faroes was Old Norse, which Norwegian settlers had brought with them during the time of the landnám that began in AD 825. However, many of the settlers were not from present-day Norway but descendants of Norwegian settlers in the Irish Sea. In addition, native Norwegian settlers often married women from Norse Ireland, Orkney, or Shetland before settling in the Faroe Islands and Iceland. As a result, the Irish language influenced both Faroese and Icelandic. There is some debatable evidence of Irish
Huarijio (Huarijío in Spanish; also spelled Guarijío, Varihío, and Warihío) is an Uto-Aztecan language of the states of Chihuahua and Sonora in northwestern Mexico. It is spoken by around 5,000 people, most of whom are monolinguals.
The language has two variants, known as Mountain Guarijio (guarijío de la sierra) and River Guarijio (guarijío del río). The mountain variant is chiefly spoken in the eastern portion of the municipality of Uruachi (with a small number of speakers in Moris to the north and Chínipas to the south) and around Arechuyvo, in the state of Chihuahua. The river variant is found to the southwest: most speakers inhabit the Río Mayo basin to the north of San Bernardo in the Sonoran municipality of Álamos.
Speakers of Mountain Guarijio self-identify as warihó and call River Guarijio speakers macurawe or makulái. River Guarijio speakers call themselves warihío and call Mountain speakers "tarahumaras". Contact between the two groups is scant and, although the linguistic differences between the two are slight, speakers report that mutual comprehension is difficult.
Guarijio is an agglutinative language, where words use suffix complexes for a variety of purposes with
Highest Alemannic (Hegschtalemannisch) is a branch of Alemannic German and is often considered to be part of the German language, even though mutual intelligibility with Standard German and other non-Alemannic German dialects is very limited.
Highest Alemannic dialects are spoken in alpine regions of Switzerland: In the Bernese Oberland, in the German-speaking parts of the Canton of Fribourg, in the Valais (see Walliser German) and in the Walser settlements (mostly in Switzerland, but also in Italy and in Austria; see Walser German). In the West, the South and the South-East, they are surrounded by Romance languages; in the North, by High Alemannic dialects. In the Swiss canton of Graubünden (Grisons) only the Walser exclaves in the Romansh part and the Prättigau, Schanfigg and Davos are Highest Alemannic; the Rhine Valley with Chur and Engadin are High Alemannic.
The distinctive feature of the Highest Alemannic dialects is the lack of hiatus diphthongization, for instance [ˈʃniː.ə(n)] 'to snow', [ˈb̥uː.ə(n)] 'to build' vs. High Alemannic [ˈʃnei̯jə], [ˈb̥ou̯wə].
Many High Alemannic dialects have different verbal plural endings for all three persons, for instance wir singe(n) 'we
Hurrian is a conventional name for the language of the Hurrians (Khurrites), a people who entered northern Mesopotamia around 2300 BC and had mostly vanished by 1000 BC. Hurrian was the language of the Mitanni kingdom in northern Mesopotamia, and was likely spoken at least initially in Hurrian settlements in Syria. It is generally believed that the speakers of this language originally came from the Armenian mountains and spread over southeast Anatolia and northern Mesopotamia at the beginning of the 2nd millennium BC.
Hurrian is an ergative, agglutinative language that, together with Urartian, constitutes the Hurro-Urartian family. I.M. Diakonoff and S. Starostin see similarities between Hurrian and the Northeast Caucasian languages, and thus place it in the Alarodian family. Examples of the proposed phonological correspondences are PEC *l- > Hurrian t-, PEC *-dl- > Hurrian -r- (Diakonoff & Starostin).
Some scholars, such as I. J. Gelb and E. A. Speiser, tried to equate Hurrians and "Subarians".
The earliest Hurrian text fragments consist of lists of names and places from the end of the third millennium BC. The first full texts date to the reign of king Tish-atal of Urkesh, at the
Kannada (/'kʌnnəɖɑː/) , or Kanarese /kænəˈriːz/, is a language spoken in India predominantly in the state of Karnataka. Kannada, whose native speakers are called Kannadigas (Kannaḍigaru) and number roughly 70 million, is one of the 40 most spoken languages in the world. It is one of the scheduled languages of India and the official and administrative language of the state of Karnataka.
The Kannada language is written using the Kannada script, which evolved from the 5th century Kadamba script. Kannada is attested epigraphically from about one and a half millennia, and literary Old Kannada flourished in the 6th century Ganga dynasty and during 9th century Rashtrakuta Dynasty. With an unbroken literary history of over a thousand years, the excellence of Kannada literature continues into the present day. Works of Kannada literature have received eight Jnanpith awards and fifty-six Sahitya Akademi awards.
Based on the recommendations of the Committee of Linguistic Experts, appointed by the Ministry of Culture, the Government of India officially recognised Kannada as a classical language. In July 2011, a centre for the study of classical Kannada was established under the aegis of Central
The Marwari language (Mārwāṛī;मारवाड़ी), also variously Marvari, Marwadi, Marvadi), is spoken in the Indian state of Rajasthan. Marwari is also found in the neighboring state of Gujarat and Haryana and in Eastern Pakistan. With some 13.2 million speakers (as of 1997, ca.), it is the largest language by number of speakers of the Marwari subgroup of the Rajasthani language. There are 13 million speakers in Rajasthan and rest 200,000 in Eastern Pakistan. There about 23 dialects of the Marwari Language.
It is popularly written in Devanagari script, as is Hindi, Sanskrit and Nepali. Marwari currently has no official status as a language of education and government. There has been a push in the recent past for the national government to recognize this language and give it a scheduled status. The state of Rajasthan recognizes Rajasthani as a language.
In Pakistan, there are two varieties of Marwari. They may or may not be close enough to Indian Marwari to be considered the same language.
The Marwari language was used in the recent Indian movie, Paheli, where it was mixed with Hindi so it is understandable to the main stream (Hindi speakers) audience. Marwari is still spoken widely in and
Ngunnawal or Gundungurra is an Australian Aboriginal language, the traditional language of the Ngunawal and Gandangara peoples. There are contradictory claims as to whether they are one language or two. The name Burragorang is applied to either.
Ngunawal is generally classified with the Ngarigo language of the limestone plains of Monaro/Maneroo in NSW to ACT across the Monaro tableland through to the Australian Alps of NSW/VIC Snowy Mountains. They fall within the tentative (and perhaps geographic) Yuin–Kuric group of the Pama–Nyungan family.
Some meanings for Ngunawal words:
Tamil (தமிழ், tamiḻ, [t̪ɐmɨɻ] , alternative spelling: Thamizh) is a Dravidian language spoken predominantly by Tamil people of South India and North-east Sri Lanka. It has official status in the Indian state of Tamil Nadu and in the Indian union territory of Puducherry. Tamil is also a national language of Sri Lanka and an official language of Singapore. It is one of the 22 scheduled languages of India and is the first language that was declared a classical language by the government of India in 2004. Tamil is also spoken by significant minorities in Malaysia, USA and Mauritius as well as emigrant communities around the world.
Tamil is one of the longest surviving classical languages in the world according to available evidence. It is also the only Indian language other than Sanskrit to be considered ancient and authentically original in its form and rich literature It has been described as "the only language of contemporary India which is recognizably continuous with a classical past" and having "one of the richest literatures in the world". Tamil literature has existed for over 3600 years. The earliest epigraphic records found on rock edicts and hero stones date from around the
Umbrian is an extinct Italic language formerly spoken by the Umbri in the ancient Italian region of Umbria. Within the Italic languages it is closely related to the Oscan group and is therefore associated with it in the group of Osco-Umbrian languages. Since that classification was first formulated a number of other languages in ancient Italy were discovered to be more closely related to Umbrian. Therefore a group was devised to contain them, the Umbrian languages.
Umbrian is known from about 30 inscriptions dated from the 7th through 1st centuries BC. The largest cache by far is the Iguvine Tablets, nine inscribed bronze tablets found in 1444 in an underground chamber at Gubbio (ancient Iguvium). Two have since disappeared. The remaining seven contain notes on the ceremonies and statutes for priests of the ancient pagan religion in the region. Sometimes they are called the Eugubian tablets after the medieval name of Iguvium/Eugubium. The tablets contain 4000-5000 words.
Other minor inscriptions are from Todi, Assisi and Spoleto.
The Iguvine tablets were written in two alphabets. The older, the Umbrian alphabet, like other Old Italic alphabets, was derived from the Etruscan
Volapük (/ˈvɒləpʊk/; [volaˈpyk]), also Volapuk, is a constructed language, created in 1879–1880 by Johann Martin Schleyer, a Roman Catholic priest in Baden, Germany. Schleyer felt that God had told him in a dream to create an international language. Volapük conventions took place in 1884 (Friedrichshafen), 1887 (Munich) and 1889 (Paris). The first two conventions used German, and the last conference used only Volapük. In 1889, there were an estimated 283 clubs, 25 periodicals in or about Volapük, and 316 textbooks in 25 languages; at that time the language claimed nearly a million adherents. Volapük was largely displaced in the late 19th and early 20th centuries, specifically by Esperanto, Ido and Interlingua.
Note: ä, ö and ü do not have alternative forms such as the ae, oe and ue of German.
There are no diphthongs; each vowel letter is pronounced separately.
The author Alfred A. Post mentions in his Comprehensive Volapük Grammar some additional letters created by Schleyer:
Polysyllabic words are always stressed on the final vowel; for example, neai "never" is pronounced [ne.a.ˈi]. (However, the question clitic "-li" does not affect the stress of the word it attaches to.) Where
Hebrew (/ˈhiːbruː/) (עִבְרִית ʿIvrit, [ʔivˈʁit] (help·info) or [ʕivˈɾit] (help·info)) is a West Semitic language of the Afroasiatic language family. Culturally, it is considered by Jews and other ethnic or religious groups as the language of the Jewish people, although other Jewish languages had originated among diaspora Jews, and the Hebrew language was also used by non-Jewish groups, such as the ethnically related Samaritans.
Modern Hebrew is one of the two official languages of Israel (the other being Arabic), while Classical Hebrew is used for prayer or study in Jewish communities around the world. The earliest examples of written Hebrew date from the 10th century BCE to the late Second Temple period, after which the language developed into Mishnaic Hebrew.
Ancient Hebrew is also the liturgical tongue of the Samaritans, while modern Hebrew or Arabic is their vernacular, though today only about 700 Samaritans remain. As a foreign language it is studied mostly by Jews and students of Judaism and Israel, archaeologists and linguists specializing in the Middle East and its civilizations, by theologians, and in Christian seminaries.
The core of the Torah (the first five books of
Kituba is a widely used lingua franca in Central Africa. It is a creole language based on Kikongo, a family of closely related Bantu languages. It is an official language in Republic of the Congo and Democratic Republic of the Congo.
It is not entirely accurate to call Kituba a creole language as it lacks the distinction between superstrate and substrate influence that is typical of creole development.
Kituba is known by many names among its speakers. In the Republic of Congo it is called Munukutuba or Kituba. The former is a grammatically incorrect phrase which means literally "I to speak". The latter means simply "speech". The name Kituba is used in the constitution of the Republic of Congo.
In the Democratic Republic of the Congo it is called Kikongo ya leta (i.e. Kikongo of the state administration), but it is often called in short Kikongo, especially out of the region of ethnic Bakongo people. The constitution of the Democratic Republic of Congo lists Kikongo as one of the national languages. In fact, it refers to Kikongo ya leta (i.e. Kituba), because a translation of the constitution itself is written in Kituba but no translation exists in Kikongo!
There are also other
Old Prussian (Prussian: Prūsiskan or Prūsiskai Bilā) is an extinct Baltic language, once spoken by the Old Prussians, the indigenous peoples of Prussia (not to be confused with the later and much larger German state of the same name), now north-eastern Poland and the Kaliningrad Oblast of Russia.
The language is called “Old Prussian” to avoid confusion with the German dialects Low Prussian and High Prussian, and the adjective “Prussian”, which is also often used to relate to the later German state. The Old Prussian name for the nation, not being Latinized, was Prūsa.
Old Prussian began to be written down in the Latin alphabet in about the 13th century. A small amount of literature in the language survives.
In addition to Prussia proper the original territory of the Old Prussians may also have included eastern parts of Pomerelia (some parts of the region east of the Vistula River). The language may have also spoken much further east and south in what became Polesia and part of Podlasia with the conquests by Rus and Poles starting in the 10th century and by the German colonisation of the area which began in the 12th century. According to the archaeologist Marija Gimbutas, the entire
The Sotho language, also known as Sesotho, Southern Sotho, or Southern Sesotho, is a Bantu language spoken primarily in South Africa, where it is one of the 11 official languages, and in Lesotho, where it is the national language. It is an agglutinative language which uses numerous affixes and derivational and inflexional rules to build complete words.
Sotho is a Southern Bantu language, belonging to the Niger–Congo language family within the Sotho languages branch of Zone S (S.30). It is most closely related to other major languages in the Sotho–Tswana language group: Tswana (Setswana), the Northern Sotho languages (Sesotho sa Leboa), Kgalagari (SheKgalagari) and Lozi (Silozi). Sesotho is, and has always been, the name of the language in the language itself, and this term has come into wider use in English since the 1980s, especially in South African English and in Lesotho. Sesotho is the autoglottonym or name of the language used by its native speakers as defined by the United Nations. Sotho is the heteroglottonym. It is also sometimes referred to as Southern Sotho, principally to distinguish it from Northern Sotho.
The Sotho languages are in turn closely related to other
The Walser language (German: Walserdeutsch), also known as Walliser German (locally Wallisertiitsch), is a group of Highest Alemannic dialects spoken in Walser settlements in parts of Switzerland, Italy, Liechtenstein, and Austria and in the German-speaking part of the Canton of Wallis (French: Valais), in the uppermost Rhône valley.
The terms Walser and Walliser are geographic; there is no linguistic divide. Specific Walser dialects can be traced to eastern or western dialects of Wallis canton. Conservative Walser dialects are more similar to the respective groups of Wallis dialects than to neighboring Walser dialects.
The German-speaking immigration to the Wallis started in the 8th century from the canton of Bern. There were presumably two different immigration routes that led to two main groups of Walliser dialects. In the 12th or 13th century, the Walliser began to settle other parts of the alps. These new settlements are known as Walser migration. In many of these settlements, people still speak Walser.
The dialects are difficult to understand for other Swiss Germans (called Üsserschwyzer 'outer Swiss' by the Walliser). This is because in the isolated valleys of the high
The Wastek (Huastec) language is a Mayan language of Mexico, spoken by the Huastecs living in rural areas of San Luis Potosí and northern Veracruz. Though relatively isolated from them, it is related to the Mayan languages spoken further south and east in Mexico and Central America. According to the 2005 population census, there were about 150,000 speakers of Wastek in Mexico (some 90,000 in San Luis Potosí and some 50,000 in Veracruz). The language is called Teenek (with varying spellings) by its speakers, and this name has gained currency in Mexican national and international usage in recent years.
The now-extinct Chicomuceltec language was most closely related to Wastek.
The first linguistic description of the Huastec language accessible to Europeans was written by Andrés de Olmos, who also wrote the first grammatical descriptions of Nahuatl and Totonac.
Wastek-language programming is carried by the CDI's radio station XEANT-AM, based in Tancanhuitz de Santos, San Luis Potosí.
Huastec has three dialects, which have a time depth of no more than 400 years (Norcliffe 2003:3). It is spoken in a region of east-central Mexico known as the Huaxteca.
Instituto Nacional de Estadística,
The Belarusian language (беларуская мова, BGN/PCGN: byelaruskaya mova, Scientific: belaruskaja mova, łac.: biełaruskaja mova), sometimes referred to as White Ruthenian, is the language of the Belarusian people. It is an official language of Belarus, along with Russian, and is spoken abroad, chiefly in Russia, Ukraine, and Poland. Prior to Belarus gaining its independence from the Soviet Union in 1991, the language was known in English as Byelorussian or Belorussian, transliterating the Russian name, белорусский язык, or alternatively as White Ruthenian or White Russian. Following independence, it became known also as Belarusian.
Belarusian is one of the East Slavic languages, and shares many grammatical and lexical features with other members of the group. To some extent, Russian, Ukrainian and Belarusian are mutually intelligible. Its predecessor stage is known as Old Belarusian (14th to 17th centuries), in turn descended from Old East Slavic (10th to 13th centuries).
According to the 1999 Belarus Census, the Belarusian language is declared as a "language spoken at home" by about 3,686,000 Belarusian citizens (36.7% of the population) as of 1999. About 6,984,000 (85.6%) of
Bosnian (bosanski / босански [bɔ̌sanskiː]) is a standardized register of the Serbo-Croatian language, a South Slavic language, spoken by Bosniaks. As a standardized form of the Shtokavian dialect, it is one of the three official languages of Bosnia and Herzegovina.
The same subdialect of Shtokavian is also the basis of standard Croatian and Serbian, as well as Montenegrin, so all are mutually intelligible. Up until the dissolution of former SFR Yugoslavia, they were treated as a unitary Serbo-Croatian language, and that term is still used to refer to the common base (vocabulary, grammar and syntax) of what are today officially four national standards. The Bosnian standard uses both Latin and Cyrillic alphabets. Bosnian is notable amongst the three varieties of Serbo-Croatian for having an eclectic assortment of Arabic, Turkish and Persian loanwords, largely due to the language's interaction with those cultures through Islamic ties. This is historically corroborated by the introduction and use of Arebica as a successor script for the Bosnian language, replacing Bosančica upon the introduction of Islam; first amongst the elite, then amongst the public. The Bosnian language also
The Tamyen people (also spelled as Tamien, Thamien) are one of eight linguistic divisions of the Ohlone (Coastanoan) people groups of Native Americans who lived in Northern California. The Tamyen lived throughout the Santa Clara Valley. The use of the name Tamyen is on record as early as 1777, it comes from the Ohlone name for the location of the first Mission Santa Clara (Mission Santa Clara de Thamien) on the Guadalupe River. Father Pena mentioned in a letter to Junipero Serra that the area around the mission was called Thamien by the native people. The missionary fathers erected the mission on January 17, 1777 at the native village of So-co-is-u-ka.
The Tamyen people spoke the Tamyen language, a Northern Ohlone language, which has been extinct since possibly the early 19th century. "Tamyen", also called Santa Clara Costanoan, has been extended to mean the Santa Clara Valley Indians, as well as for the language they spoke. Tamyen is listed as one of the Costanoan language dialects in the Utian family. It was the primary language that Natives spoke at the first and second Mission Santa Clara (both founded in 1777). Linguistically, it is thought that Chochenyo, Tamyen and Ramaytush
Urartian, Vannic, and (in older literature) Chaldean (Khaldian, or Haldian) are conventional names for the language spoken by the inhabitants of the ancient kingdom of Urartu that was located in the region of Lake Van, with its capital near the site of the modern town of Van, in the Armenian Highland, modern-day Eastern Anatolia region of Turkey. It was probably spoken by the majority of the population around Lake Van and in the areas along the upper Zab valley.
First attested in the 9th century BCE, Urartian ceased to be written after the fall of the Urartian state in 585 BCE, and presumably it became extinct due to the fall of Urartu. It must have been replaced by an early form of Armenian, perhaps during the period of Achaemenid Persian rule, although it is only in the fifth century CE that the first written examples of Armenian appear.
Urartian was an ergative, agglutinative language, which belongs to neither the Semitic nor the Indo-European families but to the Hurro-Urartian family (whose only other known member is Hurrian). It survives in many inscriptions found in the area of the Urartu kingdom, written in the Assyrian cuneiform script. There have been claims of a separate
Volscian was a Sabellic Italic language, which was spoken by the Volsci and closely related to Oscan and Umbrian.
It is attested in an inscription found in Velitrae (Velletri), dating probably from early in the 3rd century BC; it is cut upon a small bronze plate (now in the Naples Museum), which must have once been fixed to some votive object, dedicated to the god Declunus (or the goddess Decluna). The language of this inscription is clear enough to show the very marked peculiarities that rank it close to the language of the Iguvine Tables. It shows on the one hand the labialization of the original velar q (Volscian pis = Latin quis), and on the other hand it palatalizes the guttural c before a following i (Volscian facia Latin faciat). Like Umbrian also, but unlike Latin and Oscan, it has degraded all the diphthongs into simple vowels (Volscian se parallel to Oscan svai; Volscian deue, Old Latin and Oscan deiuai or deiuoi). This phenomenon of what might have been taken for a piece of Umbrian text appearing in a district remote from Umbria and hemmed in by Latins on the north and Oscan-speaking Samnites on the south is a most curious feature in the geographical distribution of the
The Chinese language (汉语/漢語 Hànyǔ; 华语/華語 Huáyǔ; 中文 Zhōngwén) is a language or language family consisting of varieties which are mutually intelligible to varying degrees. Originally the indigenous languages spoken by the Han Chinese in China, it forms one of the branches of Sino-Tibetan family of languages. About one-fifth of the world's population, or over one billion people, speaks some variety of Chinese as their native language. Internal divisions of Chinese are usually perceived by their native speakers as dialects of a single Chinese language, rather than separate languages, although this identification is considered inappropriate by some linguists and sinologists.
Chinese is distinguished by its high level of internal diversity, although all varieties of Chinese are tonal and analytic. There are between 7 and 13 main regional groups of Chinese (depending on classification scheme), of which the most spoken, by far, is Mandarin (about 850 million), followed by Wu (90 million), Cantonese (Yue) (70 million) and Min (50 million). Most of these groups are mutually unintelligible, although some, like Xiang and the Southwest Mandarin dialects, may share common terms and some degree
The Hän language (Dawson, Han-Kutchin, Moosehide) is a Native American endangered language spoken in only two places: Eagle, Alaska and Dawson City, Yukon. There are only a few fluent speakers left (perhaps about 10), all of them elderly.
It is a member of the Athabaskan language family, which is part of the larger Na-Dené family. The name of the language is derived from the name of the people, "Hän Hwëch'in", which in the language means "people who live along the river", the river being the Yukon. There are currently efforts to revive the language locally.
The consonants of Hän in the standard orthography are listed below (with IPA notation in brackets):
Istriot is a Romance language spoken in the Western Region on the coast of the Istrian Peninsula, especially in the towns of Rovinj (Italian: Rovigno) and Vodnjan (Italian: Dignano), on the upper northern part of the Adriatic Sea, in Croatia.
Istriot is a Romance language related to the Ladin populations of the Alps. According to the Italian linguist Matteo Bartoli, the Ladin area extended until 1000 CE, from southern Istria to Friuli and eastern Switzerland.
Its classification remains unclear, due to the specificities of the language, which has always had a very limited number of speakers. Istriot can be viewed:
When Istria was a region of the Kingdom of Italy, Istriot was considered by the authorities as a sub-dialect of Venetian.
Its speakers never referred to it as "Istriot". Traditionally, it had six names after the six towns where it was spoken. In Vodnjan it was named "Bumbaro", in Bale "Vallese", in Rovinj "Rovignese", in Šišan "Sissanese", in Fažana "Fasanese" and in Galižana "Gallesanese". The term Istriot was coined by the 19th century Italian linguist Graziadio Isaia Ascoli.
There are about 1,000 speakers left making it an endangered language.
Ladin (Italian: Ladino German: Ladinisch), is a language consisting of a group of dialects (which some consider part of a unitary Rhaeto-Romance language) mainly spoken in the Dolomite Mountains in Northern Italy in the provinces South Tyrol, Trentino and Belluno. It is closely related to the Swiss Romansh and Friulian.
A standard written variety of Ladin (Ladin Dolomitan) has been developed by the Office for Ladin Language Planning as a common communication tool across the whole Ladin-speaking region, but it is not popular among Ladin speakers.
Ladin should not be confused with Ladino (also called Judeo-Spanish), which is a Romance dialect of Spanish.
Ladin is recognized as a minority language in 54 Italian municipalities belonging to the provinces of South Tyrol, Trentino and Belluno. Around 92,000 people live in this area, but it is not possible to assess the exact number of Ladin speakers, because only in the provinces of South Tyrol and Trentino the inhabitants declare their native language on the occasion of the general census of the population, which takes place every ten years.
In the 2011 census, 20,548 inhabitants of South Tyrol declared Ladin as their native language. It
Mandalorian (Mando￢ﾀﾙa) is a fictional language spoken by the Mandalorian of Star Wars. It is currently being developed into a working language by Karen Traviss.
Portions exist in the Republic Commando video game and its spin-off books, ''Republic Commando: Hard Contact'' , ''Republic Commando: Triple Zero'', Republic Commando: True Colors, and Republic Commando: Order 66. Online samples include fan-created dictionaries, web logs by Traviss, and two downloadable guides from StarWars.com. A recent article in Star Wars Insider contained a guide to the language, and online audio clips have been released as well.
The language started as Republic Commando game lyrics by Jesse Harlin at LucasArts, and it is from that that Traviss has begun developing the language.
Mandalorian grammar is simple and bears many similarities to English/Galactic Basic. It is primarily a spoken language, so ease of pronunciation is a key factor. One notable feature of the fictional language is that it has no gender, relying on context to convey the proper message.
In keeping with the people who speak it, they have no word for hero, but have a dozen for stab. They have numerous expressions for numerous
Novial [nov- ("new") + IAL, International Auxiliary Language] is a constructed international auxiliary language (IAL) intended to facilitate international communication and friendship, without displacing anyone's native language. It was devised by Professor Otto Jespersen, a Danish linguist who was previously involved in the Ido movement, and subsequently in the development of Interlingua.
Its vocabulary is based largely on the Germanic and Romance languages and its grammar is influenced by English.
Novial was first introduced in Jespersen's book An International Language in 1928. It was updated in his dictionary Novial Lexike in 1930, and further modifications were proposed in the 1930s, but the language became dormant with Jespersen's death in 1943. In the 1990s, with the revival of interest in constructed languages brought on by the Internet, some people rediscovered Novial.
Novial was first described in Jespersen’s book An International Language (1928). Part One of the book discusses the need for an IAL, the disadvantages of ethnic languages for that purpose, and common objections to constructed IALs. He also provides a critical overview of the history of constructed IALs with
The Slavic dialects of Greece are the dialects of Macedonian and Bulgarian spoken by minority groups in the regions of Macedonia and Thrace in northern Greece. Usually, these dialects are classified as either Bulgarian or Pomak in Thrace, transitional dialects in East Macedonia, and Macedonian in Central and West Macedonia. Until the official codification of the Macedonian language in 1944 many linguists considered all the dialects to be Bulgarian. This has remained the predominant opinion in Bulgarian linguistics and dialectology and the position of the Bulgarian governments, and it is also supported by some international scholars.
Slavic tribes began settling in the region of Macedonia and Thrace in the 6th century. In the following centuries these two peoples mixed together. During Ottoman rule, most of the Orthodox-Bulgarian population of Macedonia had not formed a national identity separate from the Bulgarians in Moesia and Thrace and were instead identified through their religious affiliation. In the Middle Ages and later, until the 20th century, the Slav-speaking population of Aegean Macedonia was identified mostly as Bulgarian or Greek. The Muslim Slavic-speakers in Western
Coast Tsimshian, known by its speakers as Sm'álgyax, is a Tsimshianic language spoken by the Tsimshian nation in northwestern British Columbia and southeastern Alaska. Sm'algyax means literally "real or true language."
Strictly speaking, Tsimshian is not a language indigenous to Alaska, but has been spoken there since missionary William Duncan moved to Metlakatla on Annette Island in 1887 and took some of the native Canadians with him. A few Tsimshian also live in Ketchikan.
There is much debate over which family the Tsimshianic languages belong to. Many scholars believe that they are part of the controversial Penutian language stock, which includes languages spoken throughout the Pacific Northwest and California. Though probable, the existence of a Penutian stock has yet to be definitively proven. Some linguists still maintain that the Tsimshianic family is not closely related to any North American language.
The linguist Tonya Stebbins estimated the number of speakers of Coast Tsimshian in 2001 as around 400 and in 2003 as 200 or fewer (see references below). Whichever figure is more accurate, she added in 2003 that most speakers are over 70 in age and very few are under 50. About
Aequian is an extinct language presumed spoken by the people the Romans termed Aequi and Aequicoli living in the Alban hills of northeast Latium and the central Apennines east of them during the early and middle Roman Republic; that is, approximately from the 5th to the 3rd century BC, when they were defeated by the armies of Rome and were subsequently Romanized. As the area was heavily colonized by Latin speakers from Rome, most of the inscriptions from there are in Latin. Two undated inscriptions appear to be in a different dialect, termed Aequian by the scholars with the presumption that in fact they represent the language of the entire pre-Roman tribe. Not enough text survives to deduce any more than that it belonged to the Italic branch of the Indo-European language family.
Aequian is scantily documented by two inscriptions. Conway's publication of Italic inscriptions adds a gloss, several place names and several dozen personal names, but it does not distinguish which of these are certainly endonyms and which are Latin exonyms in use by the Latin-speaking population. It is possible that they would have been the same in both cases, but such a hypothesis remains unproven.
Balochi is a Northwestern Iranian language. It is the principal language of the Baloch of Balochistan It is also spoken as a second language by some Brahui. It is designated as one of nine official languages of Pakistan.
The Balochi vowel system has at least eight vowels: five long vowels and three short vowels. The long vowels are /aː/, /eː/, /iː/, /oː/, and /uː/. The short vowels are /a/, /i/ and /u/. The short vowels have more centralized phonetic qualities than the long vowels.
Southern Balochi (at least as spoken in Karachi) also has nasalized vowels, most importantly /ẽː/ and /ãː/.
The following consonants are common to both Western Balochi and Southern Balochi. The place of articulation of the consonants /s/, /z/, /n/, /ɾ/ and /l/ is claimed to be alveolar in Western Balochi, while at least the /ɾ/ is claimed to be dental in Southern Balochi. The stops /t/ and /d/ are claimed to be dental in both dialects.
In addition, /f/ is listed for Southern Balochi, but is found in few words. /x/ (voiceless velar fricative) in some loanwords in Southern Balochi corresponding to /χ/ (voiceless uvular fricative) in Western Balochi; and /ɣ/ (voiced velar fricative) in some loanwords in
Brahui (Urdu: براہوی) or Brahvi (براوی) is a Dravidian language spoken by Brahui people of Pakistan, Afghanistan and Iran, and expatriate communities in Qatar, United Arab Emirates, and Iraq. It is isolated from the nearest Dravidian-speaking neighbour population by a distance of more than 1,500 kilometres (930 mi).
Brahui is spoken in the southwest region of Pakistan, as well as regions of Afghanistan and Iran which border Pakistan; however, many members of the ethnic group no longer speak Brahui. The 2005 edition of Ethnologue reports that there are some 2.2 million speakers; 90% of those live in Pakistan, mainly in the region of Balochistan.
Brahui belongs, with Kurukh (Oraon) and Malto, to the northern and Kannada , to the sourthern subfamily of the Dravidian family of languages. It has been influenced by the Iranian languages spoken in the area, especially Balochi.
Brahui language is seen as a recent migrant language to its present region. Scholars accept that Brahui could only have migrated to Balochistan from central India after 1000 AD. The absence of any older Iranian (Avestan) loanwords in Brahui supports this hypothesis. The main Iranian contributor to Brahui vocabulary,
Kyrgyz or Kirgiz, also Kirghiz, Kyrghiz, Qyrghiz (Кыргызча or Кыргыз тили, قىرعىز تىلى, Kırgızça or Kırgız tili) is a Turkic language and one of the two official languages of Kyrgyzstan, the other being Russian. It is a member of the Kazakh-Nogai subgroup of the Kypchak languages, and modern day language convergence has resulted in an increasing degree of mutual intelligibility between Kyrgyz and Kazakh.
Kyrgyz is spoken by about 4 million people in Kyrgyzstan, China, Afghanistan, Kazakhstan, Tajikistan, Turkey, Uzbekistan, Pakistan and Russia. Kyrgyz was originally written in the Turkic runes, gradually replaced by an Arabic alphabet (in use until 1928 in USSR, still in use in China). Between 1928 and 1940, the Latin-based Uniform Turkic Alphabet was used. In 1940 due to general Soviet policy, a Cyrillic alphabet eventually became common and has remained so to this day, though some Kyrgyz still use the Arabic alphabet. When Kyrgyzstan became independent following the Soviet Union's collapse in 1991, there was a popular idea among some Kyrgyz people to make transition to the Latin alphabet (taking in mind a version closer to the Turkish alphabet, not the original alphabet of
Yucatec Maya (Yukatek Maya in the revised orthography of the Academia de Lenguas Mayas de Guatemala), called Màaya t'àan (lit. "Maya speech") by its speakers, is a Mayan language spoken in the Yucatán Peninsula and northern Belize. To native speakers, it is known only as Maya - "Yucatec" is a tag linguists use to distinguish it from other Mayan languages (such as K'iche' and Itza' Maya).
In the Mexican states of Yucatán, some parts of Campeche, Tabasco, Chiapas, and Quintana Roo, Maya remains many speakers' first language today, with approximately 800,000 to 1.2 million speakers.
A characteristic feature of Yucatec Mayan (and all Mayan languages) is the use of ejective consonants - /pʼ/, /tʼ/, /kʼ/. Often referred to as glottalized consonants, they are produced at the same place of oral articulation as their non-ejective stop counterparts - /p/, /t/, /k/. However, the release of the lingual closure is preceded by a raising of the closed glottis to increase the air pressure in the space between the glottis and the point of closure, resulting in a release with a characteristic popping sound. These sounds are written using an apostrophe after the letter to distinguish them from the
Northeastern Mandarin is the dialect of Mandarin Chinese spoken in what historically was Manchuria. It is very similar to the Beijing dialect upon which Standard Chinese (Pǔtōnghuà) is based.
The dialect is spoken by people in the Northeastern part of Mainland China; areas like Liaoning (except its southern part from Dalian to Dandong where Jiao Liao Mandarin is spoken), Jilin, and Heilongjiang provinces. With over 100 million people living in the Northeastern part of China, the Northeastern Mandarin-speaking population is quite large. Like other Mandarin dialects, differences between Northeastern Mandarin and other forms arise from the wide geographical distribution and cultural diversity of northern China. A form of Northeastern Mandarin (with some words from Udege and Nanai) is apoken by the Taz people nearby in the far east of Russia, primarily in Primorsky Krai. Overseas, Northeastern Mandarin is being spoken in increasingly larger communities in the Chinatowns of New York City in the United States.
Northeastern Mandarin can be subdivided into regional sub-dialects named for major cities where there might be slight differences.
Usually, speakers of Northeastern Mandarin
In linguistics, Old Dutch (or Old Low Franconian) denotes the Frankish dialects spoken and written in the Low Countries during the Early Middle Ages. It is regarded as the primary stage in the development of a separate Dutch language. It evolved from Old Frankish around the 6th century and in turn evolved into Middle Dutch around the 12th century.
Old Dutch was spoken by the populace which erstwhile occupied present-day Netherlands, northern Belgium, parts of northern France, and the Lower Rhine and Westphalia regions of Germany. The inhabitants of present-day Dutch provinces—including Groningen, Friesland and the coast of North Holland—spoke Old Frisian, while those of the east (Achterhoek, Overijssel and Drenthe) exercised Old Saxon.
Old West Low Franconian and Old East Low Franconian (cf. Limburgian) are very closely related, the divergence being that the latter shares more traits with neighboring Middle Franconian (e.g., Ripuarian and Mosel Franconian) while having fewer similarities to Old Saxon. While both forms of Low Franconian were instrumental to the framing of Middle Dutch, Old East Low Franconian did not contribute much to standard Dutch, which is based on the
Votic or Votian (vađđa ceeli or maaceeli – also written vaďďa tšeeli, maatšeeli) is the language spoken by the Votes of Ingria. It is closely related to Estonian and belongs to the Finnic branch of the Uralic languages. Votic is spoken only in Krakolye and Luzhitsy, two villages in Kingiseppsky District, and is close to extinction (Language death). In 1989 there were 62 speakers left, the youngest born in 1938. In its 24 December 2005 issue, The Economist wrote that there are only approximately 20 speakers left. Some linguists have claimed that Votic is actually a dialect of Estonian.
In the 19th century it was already declining in favour of Russian (there were around 1,000 speakers of the language by the start of the World War I), but its decline was accelerated as Joseph Stalin took power. WWII had a devastating effect on the Votic language, with the number of speakers considerably decreased as a result of military offensives, forced migration to Finland under the Nazi regime, and the Stalinist policy of "dispersion" immediately after the war. Since then, the Votes have, as far as possible, concealed their Votic identity, pretending to be Russians in the predominantly Russian
Pashto (پښتو, Pax̌to, IPA: [paʂˈto, paçˈto, puxˈto]; also spelled Pukhto or Pushto), also known as Afghani, is the native language of the Pashtun people of South-Central Asia. Pashto is a member of the Eastern Iranian languages group, and is descended from Avestan, the oldest preserved Iranian language. Pashto is spoken in Afghanistan, Pakistan as well as by the Pashtun diaspora around the world.
Pashto belongs to the Northeastern Iranic branch of the Indo-Iranian language family, although Ethnologue lists it as Southeastern Iranian. The number of Pashtuns or Pashto-speakers is estimated 50-60 million people world wide. Pashto is one of the two official languages of Afghanistan (the other being Dari Persian), and a regional language in western and northwestern Pakistan.
As the national language of Afghanistan, Pashto is primarily spoken in the east, south and southwest, but also in some northern and western parts of the country. The exact numbers of speakers are unavailable, but different estimates show that Pashto is the mother tongue of 35-60% of the total population of Afghanistan.
In Pakistan, Pashto is a provincial language, spoken as a first language by about 15.42% of
Assyrian Neo-Aramaic (also known as Assyrian, Aisorski, Assyrianci, Assyriski, Lishana Aturaya, Neo-Syriac, Sooreth, Suret, Sureth, or Suryaya Swadaya) is a Neo-Aramaic dialect, spoken by an estimated 220,000 people (1994 SIL estimate), formerly in the area between Lake Urmia, north-western Iran, northern Iraq, north-eastern Syria and Siirt, south-eastern Turkey, but now more widely throughout the Assyrian–Chaldean–Syriac diaspora.
Ethnologue estimates that as of the mid 1990s, about 80,000 speakers lived in the Assyrian homeland in the Middle East, while the majority of speakers lived abroad, most of them in the United States or in Europe. Most speakers are members of the Assyrian Church of the East and the Ancient Church of the East. Assyrian Neo-Aramaic is to a considerable extent mutually intelligible with Chaldean Neo-Aramaic and to a lesser extent with Turoyo.
The Neo-Aramaic languages evolved from Middle Aramaic by the 13th century. The division of the Assyrian from Chaldean Neo-Aramaic was a consequence of the religious schism of 1552 which led to the formation of the Assyrian Church of the East.
Assyrian Neo-Aramaic is one of a number of modern Eastern Aramaic dialects
Azerbaijani or Azeri (Azərbaycanca, Azərbaycan dili) is a language belonging to the Turkic language family, spoken in southwestern Asia by the Azerbaijani people, primarily in Azerbaijan and northwestern Iran. Azerbaijani is member of the Oghuz branch of the Turkic languages and is closely related to Turkish, Qashqai and Turkmen. Turkish and Azerbaijani are known to closely resemble each other, and the native speaker of one language is able to understand the other, though it is easier for a speaker of Azerbaijani to understand Turkish than the other way around.
Today′s Azerbaijani languages evolved from the Eastern Oghuz branch of Western (Oghuz) Turkic which spread to Southwestern Asia during medieval Turkic migrations, and has been heavily influenced by Persian. Arabic also influenced the language, but Arabic words were mainly transmitted through the intermediary of literary Persian.
Azerbaijani gradually supplanted the Iranian languages in what is now northern Iran (most notably the Tat, Azari, and Middle Persian varieties), and a variety of Caucasian languages in the Caucasus, particularly Udi. By the beginning of the 16th century, it had become the dominant language of the
Nedebang (ISO 639-3 nec) is a Papuan language spoken in the villages of Balungada and Baulang in the eastern district of Pantar island in the Alor archipelago of Indonesia. There are also Nedebang speakers in Air Panas, administratively part of Balungada but located 1 km from the main village.
The logonym Nedebang is widely recognized by speakers, though some prefer the logonym Klamu. The former refers to the name of an ancestral village located on a ridge above the area in which the speakers now reside. This area is reportedly still used for gardening and for traditional ceremonies. The term Klamu refers to a tribe (Indonesian suku). The Klamu people moved from Nedebang to the coast prior to Indonesian independence, probably in the 1930s. Religion plays a significant role in the social and linguistic dynamics of the region. With the exception of Air Panas, the people of Balungada are Christian, and most residents are of Klamu descent. In contrast, the villages of Baulang and Air Panas are Islamic and contain significant populations of Austronesian-speakers who have migrated from Baranusa. The language thus appears to be more viable in Balungada. There appears to be no strong
The Northern Ndebele language, isiNdebele, Sindebele, or Ndebele is an African language belonging to the Nguni group of Bantu languages, and spoken by the Ndebele or Matabele people of Zimbabwe.
isiNdebele is related to the Zulu language spoken in South Africa. This is because the Ndebele people of Zimbabwe descend from followers of the Zulu leader Mzilikazi, who left KwaZulu in the early nineteenth century during the Mfecane.
The Northern and Southern Ndebele languages are not variants of the same language; though they both fall in the Nguni group of Bantu languages, Northern Ndebele is essentially a dialect of Zulu, and the older Southern Ndebele language falls within a different subgroup. The shared name is due to contact between Mzilikazi's people and the original Ndebele, through whose territory they crossed during the Mfecane.
Pronunciation of Ndebele words is relatively easy in comparison to many languages because the vowels are quite constant, with each vowel having basically one sound, and the accent is usually on the penultimate syllable.
There are five basic vowel sounds; a, o, u are very constant and e and i have only slight variation
a is pronounced like a in father;
Polari (or alternatively Parlare, Parlary, Palare, Palarie, Palari; from Italian parlare, "to talk") is a form of cant slang used in Britain by actors, circus and fairground showmen, merchant navy sailors, criminals, prostitutes, and the gay subculture. It was popularised in the 1960s by camp characters Julian and Sandy in the popular BBC radio show Round the Horne. There is some debate about its origins, but it can be traced back to at least the 19th century, and possibly the 16th century. There is a longstanding connection with Punch and Judy street puppet performers who traditionally used Polari to converse.
Polari is a mixture of Romance (Italian or Mediterranean Lingua Franca), London slang, backslang, rhyming slang, sailor slang, and thieves' cant. Later it expanded to contain words from the Yiddish language, from the U.S. forces (present in the UK during World War II) and from 1960s drug users. It was a constantly developing form of language, with a small core lexicon of about 20 words (including bona, ajax, eek, cod, naff, lattie, nanti, omi, palone, riah, zhoosh (tjuz), TBH, trade, vada), and over 500 other lesser-known words. A Channel Four television documentary revealed
Scythian languages ( /ˈsɪθiən/ or /ˈsɪðiən/) refers to all the languages spoken by all the peoples of a vast region of Eurasia named Scythia extending from the Vistula river in East Europe to Sakastan and Mongolia in Central Asia during ancient times. Included also are some languages of eastern Iran and the Central Asian subcontinent. These peoples were at some time by some ancient authors designated as "Scythians" with the form of the name customarily known to them. Many languages must have been spoken by the Scythians. Their modes of subsistence varied from sedentary and agricultural to nomadic and pastoral. They were both warriors and merchants.
The dominant ethnic groups among the Scythians, however, were nomadic pastoralists of Central Asia and the Pontic-Caspian steppe. Fragments of their speech known from inscriptions and words quoted in ancient authors as well as analysis of their names indicate that it was of the Indo-European language family, was Indo-Iranian, Iranian and most specifically Eastern Iranian. Further classification is uncertain and elusive. Alexander Lubotsky summarizes the known linguistic landscape as follows:
The vast majority of Scythological scholars
The Yoruba language (natively èdè Yorùbá) is a Niger–Congo language spoken in West Africa. The number of speakers of Yoruba was estimated at around 20 million in the 1990s. The native tongue of the Yoruba people, is spoken, among other languages, in Nigeria, Benin, and Togo and in communities in other parts of Africa, Europe and the Americas. A variety of the language, Lucumi, from olukunmi is used as the liturgical language of the Santeria religion of Cuba, Puerto Rico, and the Dominican Republic. It is most closely related to the Owo Itsekiri language spoken in the Niger-Delta and Igala spoken in central Nigeria.
Yoruba is classified within the Edekiri languages, which together with the isolate Igala form the Yoruboid group of languages within the Volta-Niger branch of the Niger-Congo phylum. The linguistic unity of the Niger-Congo phylum dates to deep prehistory, estimates ranging around 15 kya (the end of the Upper Paleolithic). The Atlantic–Congo core of this group would have formed roughly 8,000 years ago. The Benue-Congo branch (which also includes the Bantoid branch) separated from Atlantic-Congo around the 5th millennium BC, ultimately spreading out in the Bantu expansion,
Cajun French (sometimes called Louisiana Regional French) is a variety of dialects of the French language spoken primarily in Louisiana, specifically in the southern and southwestern parishes.
While historically other Louisiana French dialects, including Colonial or Plantation Society French, have been spoken in the state, these are now considered to have largely merged with the original Cajun dialects. However, there are still significant populations of Louisiana Creoles, from White-Americans, African-Americans, and Native American tribes who continue to speak this variety of French. Parishes where this dialect is found include, but are not limited to, Avoyelles, Iberia, Pointe Coupée, St. Martin, St. Landry, St. Mary, St. Tammany, Terrebonne, Plaquemines, and other parishes south of Orleans.
Cajun French is not the same as Louisiana Creole. Cajun French is almost solely derived from Acadian French as it was spoken in the French colony of Acadia (located in what are now the Maritime provinces of Canada and in Maine) at the time of the expulsion of the Acadians in the mid-18th century; however, a significant amount of cultural vocabulary is derived from Spanish, German, Portuguese,
Icelandic ( íslenska (help·info)) is a North Germanic language, the main language of Iceland. Its closest relative is Faroese.
Icelandic is an Indo-European language belonging to the North Germanic or Nordic branch of the Germanic languages. Historically, it was the westernmost of the Indo-European languages prior to the colonisation of the Americas. Icelandic, Faroese, Norn, and Norwegian formerly comprised West Nordic; Danish and Swedish comprised East Nordic. The Nordic languages are now divided into Insular Nordic and mainland Scandinavian languages. Norwegian is now grouped with Danish and Swedish because of its mutual intelligibility with those languages due to its heavy influence from them over the last millennium, particularly from Danish.
Most Western European languages have greatly reduced levels of inflection, particularly noun declension. In contrast, Icelandic retains a four-case synthetic grammar comparable to, but considerably more conservative and synthetic than, German. It is inappropriate to compare the grammar of Icelandic to that of the more conservative Baltic, Slavic, and Indic languages of the Indo-European family, many of which retain six or more cases,
Bengali (Bengali: বাংলা Bangla [ˈbaŋla] ( listen)) is an eastern Indo-Aryan language. It is native to the region of eastern South Asia known as Bengal, which comprises present day Bangladesh, the Indian state of West Bengal, and parts of the Indian states of Tripura and Assam. It is written using the Bengali script. With about 193 million native and about 230 million total speakers, Bengali is one of the most spoken languages (ranked sixth) in the world. The National song and the National anthem of India, and the National anthem of Bangladesh were composed in Bengali.
Along with other Eastern Indo-Aryan languages, Bengali evolved circa 1000–1200 AD from the Magadhi Prakrit, which developed from a dialect or group of dialects that were close to, but different from, Vedic and Classical Sanskrit. It is now the primary language spoken in Bangladesh and is the second most commonly spoken language in India.
With a rich literary tradition arising from the Bengali Renaissance, Bengali binds together a culturally diverse region and is an important contributor to Bengali nationalism. In former East Bengal (today Bangladesh), the strong linguistic consciousness led to the Bengali Language
Haida (X̲aat Kíl, X̲aadas Kíl, X̲aayda Kil) is the language of the Haida people. It contains seven vowels and well over 30 consonants.
The first documented contact between the Haida and Europeans was in 1774, on Juan Pérez's exploratory voyage. At this time Haidas inhabited the Queen Charlotte Islands, Dall Island, and Prince of Wales Island. The precontact Haida population was about 15,000; the first smallpox epidemic came soon after initial contact, reducing the population to about 10,000 and depopulating a large portion of the Ninstints dialect area.
The next epidemic came in 1862, causing the population to drop to 1,658. Venereal disease and tuberculosis further reduced the population to 588 by 1915. This dramatic decline lead to the merger of villages, the final result being three Haida villages: Masset (merged 1876), Skidegate (merged 1879), and Hydaburg (merged 1911).
The Fraser Canyon Gold Rush of 1858 lead to a boom in the town of Victoria, and Southern Haida began traveling there annually, mainly for the purpose of selling their women. Although a Haida-based trade pidgin may have been in use in the 1830s, the Haida were using Chinook jargon when they visited Victoria.
Inuktitut (Inuktitut syllabics: ᐃᓄᒃᑎᑐᑦ) [i.nuk.ti.'tut] or Eastern Canadian Inuktitut, Eastern Canadian Inuit language is the name of some of the Inuit languages spoken in Canada. It is spoken in all areas north of the tree line, including parts of the provinces of Newfoundland and Labrador, Quebec, to some extent in northeastern Manitoba as well as the territories of Nunavut, the Northwest Territories, and traditionally on the Arctic Ocean coast of Yukon.
It is recognised as an official language in Nunavut and the Northwest Territories. It also has legal recognition in Nunavik—a part of Québec—thanks in part to the James Bay and Northern Québec Agreement, and is recognised in the Charter of the French Language as the official language of instruction for Inuit school districts there. It also has some recognition in Nunatsiavut—the Inuit area in Labrador—following the ratification of its agreement with the government of Canada and the province of Newfoundland and Labrador. The Canadian census reports that there are roughly 35,000 Inuktitut speakers in Canada, including roughly 200 who live regularly outside of traditionally Inuit lands.
For more information on the relationship
Norn is an extinct North Germanic language that was spoken in the Northern Isles (Orkney and Shetland) off the north coast of mainland Scotland and in Caithness in the far north of the Scottish mainland. After Orkney and Shetland were pledged to Scotland by Norway in 1468/69 it was gradually replaced by Scots.
Norse settlement in the islands probably began in the early 9th century. These settlers are believed to have arrived in very substantial numbers and like those who migrated to Iceland and the Faroe Islands it is probable that most came from the west coast of Norway. Shetland toponymy bears some resemblance to that of northwest Norway, while Norn vocabulary implies links with more southerly Norwegian regions.
Orkney and Shetland were pledged to James III in 1468 and 1469 respectively, and it is with these pledges that the replacement of Norn with Scots is most associated. However, the decline of Norse speech in Orkney probably began in 1379 when the earldom passed into the hands of the Sinclairs, and Scots had superseded Norse as the language of prestige on the island by the early 15th century. In Shetland the transition began later, but by the end of the 15th century both
Waimoa or Waima'a is a spoken by about 3,000 people in northeast East Timor. Waimoa proper is reported to be mutually intelligible with neighboring Kairui and Midiki, with 5,000 speakers total.
The classification of Waimoa is unclear. Structurally, it is Malayo-Polynesian. However, its vocabulary is largely Papuan, similar to that of Makasae.
Waimoa is one of only two (possibly) Austronesian languages reported to have a set of ejective stops, the other being Yapese:
However, these sounds have also been described as post-glottalized.
Andalusian Arabic (also known as Andalusi Arabic, Spanish Arabic, or Moorish Arabic) was a variety of the Arabic language spoken in Al-Andalus, the regions of the Iberian Peninsula (modern Spain and Portugal) under Muslim rule. It became an extinct language in Iberia after the expulsion of the Moriscos, which took place over a century after the Reconquista by Christian Spain. Andalusi Arabic is still used in Andalusi music and has significantly influenced the dialects of such towns as Fez, Rabat, Nedroma, Tlemcen, Blida, Cherchell, Tangiers, Tetouan, etc., which welcomed Moriscos refugees. It also exerted some influence on Mozarabic, Spanish (particularly Andalusian), Ladino, Catalan, Portuguese, and the Moroccan Arabic dialect.
Andalusian Arabic appears to have spread rapidly and been in general oral use in most parts of Al-Andalus between the ninth and fifteenth centuries. The number of speakers is estimated to have peaked at around 5-7 million speakers around the eleventh and twelfth centuries before dwindling as a consequence of the gradual but relentless takeover by the Christians. In 1502, the Muslims of Granada were forced to choose between conversion and exile; those who
Breton (Brezhoneg) is a Celtic language spoken in Brittany (Breton: Breizh; French: Bretagne), France. Breton is a Brythonic language, descended from the Celtic British language brought from Great Britain to Armorica by migrating Britons during the Early Middle Ages. Like the other Brythonic languages, Welsh and Cornish, it is classified as an Insular Celtic language. Breton is most closely related to Cornish, as both are thought to have evolved from a Southwestern Brythonic protolanguage. The other regional language of Brittany, Gallo, is a Langue d'oïl derived from Latin and is consequently relatively close to French.
Having declined from more than one million speakers around 1950 to about 200,000 in the first decade of the 21st century, of whom 61% are more than 60 years old, Breton is classified as "severely endangered" by the UNESCO Atlas of the World's Languages in Danger. However, the number of children attending bilingual classes has risen 27% between 2006 and 2011 to 14,129.
Breton is spoken in Lower Brittany, roughly to the west of a line linking Plouha and La Roche-Bernard (east of Vannes). It comes from a Brythonic language community (see image) that once extended from
Coptic or Coptic Egyptian (ⲘⲉⲧⲢⲉⲙ̀ⲛⲭⲏⲙⲓ Met Remenkēmi) is the current stage of the Egyptian language, a northern Afro-Asiatic language spoken in Egypt until at least the 17th century. Egyptian began to be written using the Greek alphabet in the 1st century. The new writing system became the Coptic script, an adapted Greek alphabet with the addition of six or seven signs from the demotic script to represent Egyptian sounds the Greek language did not have. Several distinct Coptic dialects are identified, the most prominent of which are Sahidic and Bohairic.
Coptic and Demotic Egyptian are grammatically closely akin to Late Egyptian, which was written in the Hieroglyphic script. Coptic flourished as a literary language from the 2nd to 13th centuries, and its Bohairic dialect continues to be the liturgical language of the Coptic Orthodox Church of Alexandria. It was supplanted by Egyptian Arabic as a spoken language toward the early modern period, though revitalization efforts have been underway since the 19th century.
The native name of the language is ⲙⲛⲧⲣⲙⲛⲕⲏⲙⲉ (ment rəm ən kēme) in the Sahidic dialect and ⲙⲉⲧⲣⲉⲙⲛ̀ⲭⲏⲙⲓ (met rem ən kēmi) in Bohairic. The particle prefix ment-/met- is
Japanese (日本語, Nihongo, [nihoŋɡo] ( listen)) is a language spoken by over 120 million people in Japan and in Japanese emigrant communities. It is a member of the Japonic (or Japanese-Ryukyuan) language family, which has a number of proposed relationships with other languages, none of which have gained wide acceptance among historical linguists (see Classification of Japonic).
Japanese is an agglutinative language and a mora-timed language. It has a relatively small sound inventory, and a lexically significant pitch-accent system. It is distinguished by a complex system of honorifics reflecting the nature of Japanese society, with verb forms and particular vocabulary to indicate the relative status of the speaker, the listener, and persons mentioned in conversation. Japanese vowels are pure.
The Japanese language is written with a combination of three scripts: Chinese characters called kanji (漢字), and two syllabic (or moraic) scripts made of modified Chinese characters, hiragana (ひらがな or 平仮名) and katakana (カタカナ or 片仮名). The Latin script is also often used in modern Japanese, especially for company names and logos, advertising, and when entering Japanese text into a computer. Arabic
Khazar was the Turkic language spoken by the Khazars, a semi-nomadic Turkic people from Central Asia. It is also referred to as Khazarian, Khazaric, or Khazari.
The language is extinct and written records are almost non-existent. The 10th-century Kievan letter contains the only extant record of the Khazar language, the so-called "runiform recognitio", interpreted as the single word-phrase "I have read (it)." This appears to be a sign of approval from a Khazar magistrate.
Extant texts written in the Khazar Khaganate during the 10th century are primarily in the Hebrew language.
The linguistic affiliation of the Khazars has been disputed. Khazar was a Turkic language, however, different scholars take different views whether it belonged to the Oghur ("lir") or the Oghuz ("shaz") branch of the language family.
Arab scholars of the Middle Ages classified Khazar as similar to, yet distinct from, the type of Turkic spoken by other Turks with whom they were familiar, such as the Oghuz Turks. They noted, however, that both the Khazar tongue and the more common forms of Turkic were widely spoken in Khazaria.
The consensus among scholars had long been and still is that the Khazars spoke an
Marathi (IPA: [ˈmərɑːʈʰiː]; मराठी Marāṭhī) is a southern Indo-Aryan language spoken by the Marathi people. It is the official language of Maharashtra and Goa and is one of the 23 official languages of India. It is the 19th most spoken language in the world. There were 72 million speakers in 2001. Marathi has the fourth largest number of native speakers in India Marathi has some of the oldest literature of all modern Indo-European, Indic languages, dating from about 1000 AD. The major dialects of Marathi are called Standard Marathi and Warhadi Marathi. There are a few other sub-dialects like Ahirani, Dangi, Vadvali, Samavedi, Khandeshi, and Malwani. Standard Marathi is the official language of the State of Maharashtra.
Marathi is primarily spoken in Maharashtra and parts of neighbouring states of Gujarat, Madhya Pradesh, Goa, Karnataka, Chattisgarh and Andhra Pradesh, union-territories of Daman-diu and Dadra Nagar Haveli. The cities of Baroda, Surat, and Ahmedabad (Gujrat), Belgaum (Karnataka), Indore, Gwalior (Madhya Pradesh), Hyderabad (Andhra Pradesh) and Tanjore (Tamil Nadu) each have sizable Marathi-speaking communities. Marathi is also spoken by Maharashtrian emigrants
Old Irish (sometimes called Old Gaelic) is the name given to the oldest form of the Goidelic languages for which extensive written texts are extant. It was used from the 6th to the 10th centuries, by which time it had developed into Middle Irish. Old Irish is thus the ancestor of Modern Irish, Scottish Gaelic, and Manx. Broadly speaking, the grammar and sound systems of the modern languages are simpler than those of Old Irish.
Contemporary Old Irish scholarship is still greatly influenced by the works of a small number of scholars active in the late 19th and early 20th centuries, among them Rudolf Thurneysen (1857–1940) and Osborn Bergin (1873–1950).
Old Irish was the only member of the Goidelic/Gaelic branch of the Celtic languages which is in turn a sub-family of the wider Indo-European language family that also includes Slavonic, Italic/Romance, Indo-Aryan, Germanic sub-families and a few less well known ones. Old Irish is the ancestor of all the modern Goidelic Langues; they are Modern Irish, Scottish Gaelic and Manx.
A still older form of Irish is known as Primitive Irish. Fragments of Primitive Irish, mainly personal names, are known from inscriptions on stone written in the
Polish (język polski, polszczyzna) is a language of the Lechitic subgroup of West Slavic languages, used throughout Poland (being that country's official language) and by Polish minorities in other countries. Its written standard is the Polish alphabet, which has several additions to the letters of the basic Latin script.
Despite the pressure of non-Polish administrations in Poland, who have often attempted to suppress the Polish language, a rich literature has developed over the centuries, and the language is currently the largest, in terms of speakers, of the West Slavic group. It is also the second most widely spoken Slavic language, after Russian and ahead of Ukrainian.
Poland is one of the most linguistically homogeneous European countries; nearly 97% of Poland's citizens declare Polish as their mother tongue. Elsewhere, ethnic Poles constitute large minorities in Lithuania, Belarus, and Ukraine: Polish is the most widely used minority language in Lithuania's Vilnius County (26% of the population, according to the 2001 census results) and is found elsewhere in southeastern Lithuania; in Ukraine it is most common in the Lviv and Lutsk regions, while in Western Belarus it is
Proto-Norse (also Proto-Scandinavian, Primitive Norse, Proto-Nordic, Ancient Nordic, Old Scandinavian, Proto-North Germanic and North Proto-Germanic) was an Indo-European language spoken in Scandinavia that is thought to have evolved as a northern dialect of Proto-Germanic over the first centuries AD. It is the earliest stage of a characteristically North Germanic language, and the language attested in the oldest Scandinavian Elder Futhark inscriptions, spoken ca. from the 3rd to 7th centuries (corresponding to the late Roman Iron Age and the early Germanic Iron Age). It evolved into the dialects of the Old Norse language at the beginning of the Viking Age.
Old Norse had a stress accent which fell on the first syllable. Several scholars have proposed that Proto-Norse also had a separate pitch accent, which was inherited from Proto-Indo-European and has evolved into the tonal accents of modern Swedish and Norwegian, which in turn have evolved into the stød of modern Danish. Another recently advanced theory is that each Proto-Norse long syllable and every other short syllable received stress, marked by pitch, eventually leading to the development of the Swedish and Norwegian tonal
Welsh (Cymraeg or y Gymraeg, pronounced [kəmˈrɑːɨɡ, ə ɡəmˈrɑːɨɡ]) is a member of the Brythonic branch of the Celtic languages spoken natively in Wales, by some along the Welsh border in England, and in Y Wladfa (the Welsh colony in Chubut Province, Argentina). Historically, it has also been known in English as "Cambrian", "Cambric" and "Cymric".
A 2004 survey by the Welsh Language Board indicated that 21.7% of the population of Wales (611,000 people) were able to speak the language, compared with 20.8% in the 2001 Census. Of those 611,000 Welsh speakers, 57% (315,000) considered themselves fluent, and 78% (477,000) considered themselves fluent or "fair" speakers. 62% of speakers (340,000) claimed to speak the language daily, including 88% of fluent speakers.
A greeting in Welsh is one of 55 languages included on the Voyager Golden Record chosen to be representative of Earth in NASA's Voyager program launched in 1977. The greetings are unique to each language, with the Welsh greeting being Iechyd da i chwi yn awr ac yn oesoedd which translates into English as "Good health to you now and forever".
The Welsh Language Measure 2011 gave the Welsh language official status in Wales,
The Chinantec languages belong to the Chinantecan branch of the Oto-Manguean family. Ethnologue lists 14 partially mutually unintelligible varieties of Chinantec. The languages are spoken by the indigenous Chinantec people that live in Oaxaca and Veracruz, Mexico, especially in the districts of Cuicatlán, Ixtlán de Juárez, Tuxtepec and Choapan.
Egland & Bartholomew (1978) established fourteen Chinantec languages on the basis of 80% mutual intelligibility. Ethnologue found that one that had not been adequately compared (Tlaltepusco) was not distinct, but split another (Lalana from Tepinapa). At a looser criterion of 70% intelligibility, Lalana–Tepinapa, Quiotepec–Comaltepec, Palantla–Valle Nacional, and geographically distant Chiltepec–Tlacoatzintepec would be languages, reducing the count to ten. Leolao (Latani) is the most divergent.
A typical Chinantecan phoneme inventory distinguishes 7 vowels /i, e, a, u, o, ɨ, ø/ and consonants /p f b m θ d t ts s r l n k ŋ ʔ h/. Vowels can be nasalized, except usually /u/ and /ø/.
Chinantec is a tonal language and some dialects (Usila Chinantec) have five register tones, an uncommon trait in the world's languages. Whistled language is common.
Karuk or Karok is an endangered language of northwestern California. It is the traditional language of the Karuk people, most of whom now speak English. The name is derived from the word Káruk, which means 'upriver' (Campbell 1997:397).
William Bright documented the Karuk language and produced a grammar of it in 1957. Revitalization of the language followed. According to Census 2000, there are 55 people between the ages of 5 and 17 who can speak Karuk, including 10 with limited English proficiency.
Karuk is a language isolate, sharing few if any similarities with other nearby languages. Historically, the American linguist Edward Sapir proposed it be classified as part of the Hokan family he hypothesized. However, little evidence supports this proposal. As Bright wrote, "The Karok language is not closely or obviously related to any other (in the area), but has been classified as a member of the northern group of Hokan languages, in a subgroup which includes Chimariko and the Shasta languages, spoken in the same general part of California as Karok itself."
Karuk is a polysynthetic language known for its method of arranging old and new information: "...skilled Karuk speakers use
Kashubian or Cassubian (Kashubian: kaszëbsczi jãzëk, pòmòrsczi jãzëk, kaszëbskò-słowińskô mòwa; Polish: język kaszubski, język pomorski, język kaszubsko-słowiński) is one of the Lechitic languages, a subgroup of the Slavic languages.
Kashubian is assumed to have evolved from the language spoken by some tribes of Pomeranians called Kashubians, in the region of Pomerania, on the southern coast of the Baltic Sea between the Vistula and Oder rivers.
It is closely related to Slovincian and both are dialects of Pomeranian. Many linguists, in Poland and elsewhere, consider it a divergent dialect of Polish, although now it is usually recognized as the closest existing relative of Polish. Dialectal diversity is so great within Kashubian that a speaker of southern Kashubian has considerable difficulty in understanding a speaker of the northernmost dialects.
Like Polish, Kashubian includes about 5% loanwords from Low German, such as kùńszt (art), and some from High German. Other sources of loanwords include the Baltic languages, Russian and Polish. In dialects of Kashubian a schwa occurs.
The earliest printed documents in Kashubian date from the end of the 16th century. The modern orthography
Luxembourgian or Luxembourgish (from Lëtzebuergesch; French: Luxembourgeois, German: Luxemburgisch, Dutch: Luxemburgs, Walloon: Lussimbordjwès) is a High German language that is spoken mainly in Luxembourg. About 320,000 people worldwide speak Luxembourgish.
Luxembourgish belongs to the West Central German group of High German languages and is the primary example of a Moselle Franconian language.
Luxembourgish is the national language of Luxembourg and one of three administrative languages, alongside French and German.
Luxembourgish is also spoken in the Arelerland region of Belgium (part of the Province of Luxembourg) and in small parts of Lorraine in France.
In the German Eifel and Hunsrück regions, and in Lorraine, similar local Moselle Franconian dialects of German are spoken. Furthermore, the language is spoken by a few descendants of Luxembourg immigrants in the United States, and another similar Moselle Franconian dialect is spoken by ethnic Germans long settled in Transylvania, Romania (Siebenbürgen).
Moselle Franconian dialects outside the Luxembourg state border tend to have far fewer French loan words, and these mostly remain from the French occupation under Napoléon
Pannonian Rusyn (руски язик or руска бешеда) or simply Rusyn (or Ruthenian) is a Slavic language or dialect spoken by Pannonian Rusyns in north-western Serbia (Bačka region) and eastern Croatia (therefore also called Yugoslavo-Ruthenian, Vojvodina-Ruthenian or Bačka-Ruthenian). It is similar to West Slavic languages, Slovak in particular, but has East Slavic phonetics and vocabulary. It has been influenced by surrounding South Slavic languages (Serbian and Croatian). Pannonian Rusyn is one of the official languages of the Serbian Autonomous Province of Vojvodina.
While it is classified as a microlanguage by the Serbian authors, it is considered a Ukrainian dialect in Ukraine, and simply as a Rusyn (Ruthenian) language dialect by Slovaks and northern Ruthenians.
Like the northern Rusyn language, it constitutes a mixture of some Eastern Slovak dialects and East Slavic features (namely, Russian Church Slavonic, Russian and Old Ruthenian). This mixture is because these Rusyns emigrated to Bačka from Eastern Slovakia and Western Ukraine around the middle of the 18th century. Like most modern Ruthenians, they are Greek Catholics and therefore have closer ties with Ukraine. The language
The Sarikoli language (also Sariqoli, Selekur, Sarikul, Sariqul, Sariköli) is a member of the Pamir subgroup of the Southeastern Iranian languages spoken by Tajiks in China. It is officially referred to in China as the "Tajik language", although it is different from the language spoken in Tajikistan.
Sarikoli is officially referred to as "Tajik" (塔吉克语 Tǎjíkèyǔ) in China. However, it is not closely related to Tajik as spoken in Tajikistan. It is also referred to as Tashkorghani, after the ancient capital of the Sarikoli kingdom (now a county of Xinjiang); however, this usage is not widespread among scholars.
The earliest written accounts in English, from the 1870s, generally use the name "Sarikoli".
The number of speakers is around 20,000; most reside in the Taxkorgan Tajik Autonomous County in southern Xinjiang Province, China. The Chinese name for the Sarikoli language, as well as the usage of Sarikol as a toponym, is Sèlēikùěr (色勒库尔). Speakers in China typically use Uyghur and Chinese to communicate with people of other ethnic groups in the area. The rest are found in the Pakistani-controlled sector of Kashmir, closely hugging the Pakistan-Chinese international borders.
West Low German, also known as Low Saxon (German: Niedersächsisch, Dutch: Nedersaksisch), is a group of dialects spoken in the northwest of Germany and adjacent territories of The Netherlands and Denmark. Together with East Low German it forms the Low German dialect group (German: Niederdeutsch or Plattdeutsch, Dutch: Nederduits).
The language area comprises the North German states of Lower Saxony, North Rhine-Westphalia (the Westphalian part), Bremen, Hamburg, Schleswig-Holstein and Saxony-Anhalt (the northwestern areas around Magdeburg) as well as the northeast of The Netherlands (i.e. Dutch Low Saxon, spoken in Groningen, Drenthe, Overijssel and northern Gelderland) and the Schleswigsch dialect spoken by the North Schleswig Germans in the southernmost part of Denmark.
In the south the Benrath line and Uerdingen line isoglosses form the border with the area, where West Central German variants of High German are spoken.
While Dutch is classified as a Low Franconian language, the Dutch Low Saxon varieties, which are also defined as Dutch dialects, form a dialect continuum with the Westphalian language. They consist of:
Arvanitika also known Arvanitic (Arvanitika: αρbε̰ρίσ̈τ arbërisht, Greek: αρβανίτικα arvanitika) is the variety of Albanian traditionally spoken by the Arvanites, a population group in Greece. Arvanitika is today an endangered language, as its speakers have been shifting to the use of Greek and most younger members of the community no longer speak it.
The name Arvanítika and its native equivalent Arbërisht are derived from the ethnonym Arvanites, which in turn comes from the toponym Arbëna (Greek: Άρβανα), which in the Middle Ages referred to a region in what is today Albania (Babiniotis 1998). Its native equivalents (Arbërorë, Arbëreshë and others) formerly were the self-designation of Albanians in general. In the past Arvanitika had sometimes been described as "Graeco-Albanian" and the like (e.g. Furikis, 1934), although today many Arvanites consider such names offensive, as they identify nationally and ethnically as Greeks and not Albanians.
Arvanitika was brought to southern Greece during the late Middle Ages by settlers from what is today Albania. Arvanitika is also closely related to Arbëresh, the dialect of Albanian in Italy, which largely goes back to Arvanite settlers from
The Burmese language (Burmese: မြန်မာဘာသာ; pronounced: [mjəmà bàðà]; MLCTS: myanma bhasa) is the official language of Burma. Burmese is the native language of the Bamar and related sub-ethnic groups of the Bamar, as well as that of some ethnic minorities in Burma like the Mon. Burmese is spoken by 32 million as a first language and as a second language by 10 million, particularly ethnic minorities in Burma and those in neighboring countries. (Although the constitution officially recognizes the English name of the language as the Myanmar language, most English speakers continue to refer to the language as Burmese.)
Burmese is a tonal, pitch-register, and syllable-timed language, largely monosyllabic and analytic language, with a subject–object–verb word order. It is a member of the Tibeto-Burman language family, which is a subfamily of the Sino-Tibetan family of languages. The language uses the Burmese script, derived from the Old Mon script and ultimately from the Brāhmī script.
Burmese language, literary and spoken, is called မြန်မာဘာသာ (mranma bhasa [mjəmà bàðà]), with ဘာသာ (from Pali bhasa, "language"). The language is classified into two categories. One is formal, used in
Corsican (corsu or lingua corsa) is an Italo-Dalmatian Romance language spoken and written on the islands of Corsica (France) and northern Sardinia (Italy). Corsican was long the vernacular language alongside Italian, official language in Corsica until 1859, then it was replaced by French owing to the conquest of the island by France in 1768. Over the next two centuries, the use of French grew to the extent that, by the Liberation in 1945, all islanders had a working knowledge of French. The twentieth century saw a wholesale language shift, with islanders changing their language practices to the extent that there were no monolingual Corsican speakers left by the 1960s. By 1990, an estimated 50% of islanders had some degree of proficiency in Corsican, and a small minority, perhaps 10%, used Corsican as a first language.
The January 2007 estimated population of the island was 281,000, while the figure for the March 1999 census, when most of the studies – though not the linguistic survey work referenced in this article – were performed, was about 261,000 (see under Corsica). Only a fraction of the population at either time spoke Corsican with any fluency. The 2001 population of
Konkani (Devanāgarī: कोंकणी, Kōṅkaṇī),is an Indo-Aryan language belonging to the Indo-European family of languages and is spoken on the western coast of India. It has approximately 3.6 million speakers. It is one of the official languages of India, the official language of the Indian state of Goa, and a minority language in Karnataka and northern Kerala (Kasaragod district).
Konkani is a member of the southern Indo-Aryan language group. It retains elements of the old Indo-European language structure and shows similarities with both western and eastern Indo-Aryan languages.
It is quite possible that Old Konkani was just referred to as Prakrit by its speakers. Reference to the name Konkani is not found in literature prior to 14th century. We have first reference to the name Konkani in the abhanga 263,of the 14th century Marathi saint poet, Namadeva(1270–1350). Konkani has been known by a variety of names: canarim, concanim, gomantaki, bramana, goani. It is called amchi bhas (our language) by native speakers, and govi or Goenchi bhas by others. Learned Marathi speakers tend to call it Gomantaki.
Konkani was commonly referred to as lingua canarim by the Portuguese. while it was also
Western Pantar, also known by the name of one of its dialects, Lamma, is a Papuan language spoken in the western part of Pantar island in the Alor archipelago of Indonesia. Western Pantar is spoken widely in the region by about 10,000 speakers.
There are three primary dialects.
Dialect differences are primarily lexical:
The Western Pantar consonant inventory includes: voiced and voiceless stops /p t k ’/ and /b d g/; voiceless fricatives /s h/; nasals /n m ng/; trill /r/ and lateral /l/; and glides/w/ and /y/.
The glottal fricative /h/ is very lightly articulated. It occurs in both word-initial and word-medial positions. Words which begin with a glottal fricative can be difficult to distinguish from vowel-initial forms, which actually begin with a glottal stop.
Consonants /p t k b d ɡ s m n l/ contrast in length with longer (geminate) counterparts (written double).
Western Pantar and the other non-Austronesian languages of Alor and Pantar comprise the Alor–Pantar language family. This family is often itself included within the Timor–Alor–Pantar family, a larger grouping which includes some (though perhaps not all) of the non-Austronesian languages of Timor Island.
The TAP group is
Old Norse is a North Germanic language that was spoken by inhabitants of Scandinavia and inhabitants of their overseas settlements during the Viking Age, until about 1300.
Proto-Norse developed into Old Norse by the 8th century, and Old Norse began to develop into the modern North Germanic languages in the mid- to late 14th century, ending the language phase known as Old Norse. These dates, however, are not absolute, since written Old Norse is found well into the 15th century.
Old Norse was divided into three dialects: Old East Norse, Old West Norse, and Old Gutnish. Old West and East Norse formed a dialect continuum, with no clear geographical boundary between them. For example, Old East Norse traits were found in eastern Norway, although Old Norwegian is classified as Old West Norse, and Old West Norse traits were found in western Sweden. Most speakers spoke Old East Norse in what is present day Denmark and Sweden. Old Gutnish, the more obscure dialectal branch, is sometimes included in the Old East Norse dialect due to geographical associations. It developed its own unique features and shared in changes to both other branches.
The 12th century Icelandic Gray Goose Laws state
Russian (ру́сский язы́к, russkiy yazyk, pronounced [ˈruskʲɪj jɪˈzɨk]) is a Slavic language spoken primarily in Russia, Belarus, Ukraine, Kazakhstan, and Kyrgyzstan. It is an unofficial but widely spoken language in Moldova, Latvia, Estonia, and to a lesser extent, the other countries that were once constituent republics of the USSR. Russian belongs to the family of Indo-European languages and is one of three living members of the East Slavic languages. Written examples of Old East Slavonic are attested from the 10th century onwards.
It is the most geographically widespread language of Eurasia and the most widely spoken of the Slavic languages. It is also the largest native language in Europe, with 144 million native speakers in Russia, Ukraine and Belarus. Russian is the 8th most spoken language in the world by number of native speakers and the 4th by total number of speakers. The language is one of the six official languages of the United Nations.
Russian distinguishes between consonant phonemes with palatal secondary articulation and those without, the so-called soft and hard sounds. This distinction is found between pairs of almost all consonants and is one of the most
Tagalog (/təˈɡɑːlɒɡ/; Tagalog pronunciation: [tɐˈɡaːloɡ]) is an Austronesian language spoken as a first language by a third of the population of the Philippines and as a second language by most of the rest. It is the first language of the Philippine region IV (CALABARZON and MIMAROPA), of Bulacan and of Metro Manila. Its standardized form, commonly called Filipino, is the national language and one of two official languages of the Philippines, the other being English. It is related to other Philippine languages such as Ilokano, Bisayan, Kapampangan.
The word Tagalog derived from tagailog, from tagá- meaning "native of" and ílog meaning "river". Thus, it means "river dweller". Very little is known about the history of the language. However, according to linguists such as Dr. David Zorc and Dr. Robert Blust, the Tagalogs originated, along with their Central Philippine cousins, from Northeastern Mindanao or Eastern Visayas. The first written record of Tagalog is in the Laguna Copperplate Inscription, written in 900, using fragments of the language along with Sanskrit, Malay, and Javanese. Meanwhile, the first known book to be written in Tagalog is the Doctrina Cristiana (Christian
Taiwanese Hokkien (臺灣福建話 or 臺灣閩南語), commonly known as Taiwanese (Tâi-oân-oē 臺灣話 or Tâi-gí 臺語), is the Hokkien dialect of Min Nan as spoken by about 70% of the population of Taiwan. The largest linguistic group in Taiwan, in which Hokkien is considered a native language, is known as Hoklo or Holo (Hō-ló). The correlation between language and ethnicity is generally true, though not absolute, as some Hoklo speak Hokkien poorly while some non-Hoklo speak Hokkien fluently. Pe̍h-ōe-jī (POJ) is a popular orthography for this variant of Hokkien.
Taiwanese Hokkien is generally similar to Amoy. Minor differences only occur in terms of vocabulary. Like Amoy, Taiwanese Hokkien is based on a mixture of Zhangzhou and Quanzhou speech. Due to the mass popularity of Hokkien entertainment media from Taiwan, Taiwanese Hokkien has grown to become the more influential Hokkien dialect of Min Nan, especially since the 1980s. Along with Amoy, the Taiwanese prestige dialect (based on the Tâi-lâm variant) is regarded as ‘standard Hokkien.’
Taiwanese Hokkien is a variant of Min Nan, closely related to the Amoy dialect. It is often seen as a Chinese dialect within the larger Sinitic language family. On the
Hausa (هَوُسَ, Yaren Hausa – language of the Hausa [people]) is the Chadic language with the largest number of speakers, spoken as a first language by about 25 million people, and as a second language by about 18 million more, an approximate total of 43 million people. Hausa is one of Africa's largest spoken languages after Arabic, French, English, Portuguese and Swahili.
Hausa belongs to the West Chadic languages subgroup of the Chadic languages group, which in turn is part of the Afro-Asiatic language family.
Along with the Chadic language branch, the Afro-Asiatic language family also includes 4 other branches:
Native speakers of Hausa, the Hausa people are mostly to be found in Niger and in the north of Nigeria, but the language is used as a trade language across a much larger swathe of West Africa (Benin, Ghana, Cameroon, Togo, Côte d'Ivoire etc.), Central Africa (Chad, Central African Republic, Equatorial Guinea) and northwestern Sudan, particularly amongst Muslims. Radio stations like BBC, Radio France Internationale, China Radio International, Voice of Russia, Voice of America, Deutsche Welle, and IRIB broadcast in Hausa. It is taught at universities in Africa and around the
Alemannic (German: Alemannisch (help·info)) is a group of dialects of the Upper German branch of the Germanic language family. It is spoken by approximately ten million people in seven countries: Switzerland, Germany, Austria, Liechtenstein, France, Italy, and Venezuela. The name derives from the ancient Germanic alliance of tribes known as the Alamanni ("all men").
Alemannic itself comprises a dialect continuum, from the Highest Alemannic spoken in the mountainous south to Swabian in the relatively flat north, with more of the characteristics of Standard German the farther north one goes.
Some linguists and organisations that differentiate between languages and dialects primarily on the grounds of mutual intelligibility, such as SIL International and UNESCO, describe Alemannic as one or several independent languages. ISO 639-3 distinguishes four languages: gsw (Swiss German), swg (Swabian German), wae (Walser German) and gct (Alemán Coloniero, spoken since 1843 in Venezuela).
At this level, the distinction between a language and a dialect is frequently considered a cultural and political question, in part because linguists have failed to agree on a clear standard. Standard German
Amharic (Amharic: አማርኛ amarəñña) is a Semitic language spoken in Ethiopia. It is the second most-spoken Semitic language in the world, after Arabic, and the official working language of the Federal Democratic Republic of Ethiopia. Thus, it has official status and is used nationwide. Amharic is also the official or working language of several of the states within the federal system. It has been the working language of government, the military, and of the Ethiopian Orthodox Tewahedo Church throughout medieval and modern times. Outside Ethiopia, Amharic is the language of some 2.7 million emigrants. It is written using Amharic Fidel, ፊደል, which grew out of the Ge'ez abugida—called, in Ethiopian Semitic languages, ፊደል fidel ("alphabet", "letter", or "character") and አቡጊዳ abugida (from the first four Ethiopic letters, which gave rise to the modern linguistic term abugida).
There is no agreed way of transliterating Amharic into Roman characters. The Amharic examples in the sections below use one system that is common, though not universal, among linguists specializing in Ethiopian Semitic languages. The Amharic ejective consonants correspond to the Proto-Semitic "emphatic consonants",
Gothic is an extinct Germanic language that was spoken by the Goths. It is known primarily from the Codex Argenteus, a 6th-century copy of a 4th-century Bible translation, and is the only East Germanic language with a sizable text corpus. All others, including Burgundian and Vandalic, are known, if at all, only from proper names that survived in historical accounts, and from loan-words in other languages such as Portuguese, Spanish and French.
As a Germanic language, Gothic is a part of the Indo-European language family. It is the earliest Germanic language that is attested in any sizable texts, but lacks any modern descendants. The oldest documents in Gothic date back to the 4th century. The language was in decline by the mid-6th century, due, in part, to the military defeat of the Goths at the hands of the Franks, the elimination of the Goths in Italy, and geographic isolation (in Spain the Gothic language lost its last and probably already declining function as a church language when the Visigoths converted to Catholicism in 589). The language survived as a domestic language in the Iberian peninsula (modern Spain and Portugal) as late as the 8th century, and in the lower Danube
Igbo (Igbo: Asụsụ Igbo), or Igbo proper, is a native language of the Igbo people, an ethnic group primarily located in southeastern Nigeria. There are approximately 20 million speakers that are mostly in Nigeria and are primarily of Igbo descent. Igbo is a national language of Nigeria. It is written in the Latin script, which was introduced by British colonialists. Secret societies such as the Ekpe use the Nsibidi symbols which were invented by the Ejagham and were used to represent other languages like Efik.
There are over 20 Igbo dialects. There is apparently a degree of dialect leveling occurring. A standard literary language was developed in 1972 based on the Owerri (Isuama) and Umuahia (such as Ohuhu) dialects, though it omits the nasalization and aspiration of those varieties. There are related Igboid languages as well that are sometimes considered dialects of Igbo, the most divergent being Ekpeye. Some of these, such as Ika, have separate standard forms.
The first book to publish Igbo words was Geschichte der Mission der Evangelischen Bruder auf den Carabischen (German: History of the Evangelistic Mission of the Brothers in the Caribbean), published in 1777. Shortly after
Mazandarani (مازِرونی) or Tabari (تبری) (Also known as: Mazaniki) is an Iranian language of the Northwestern branch, spoken mainly in Iran's Mazandaran, Gilan and Golestan provinces. As a member of the Northwestern branch (the northern branch of Western Iranian), genetically speaking it is rather closely related to Gilaki, and more distantly related to Persian, which belongs to the Southwestern branch.
The name Mazandarani (and variants of it) derives from the name of the historical region of Mazandaran (Mazerun in Mazandarani), which was part of former Kingdom of Tapuria. People traditionally call their language Gileki, the same as Gilekis do. Gileki consist of two morphemes : Gil + postfix ki. The name Tapuri (or Tabari) which was the name of an ancient language of somewhere in former Tapuria, Nowadays becomes prevalent into youth groups instead of Gileki. However, Gilan and Mazanderan were part of the same state known as Tapuria which its national language was known as Gileki.
Among the living Iranian languages, Mazandarani has one of the longest written traditions, from the tenth to the fifteenth century. This status was achieved during the long reign of the independent and
Middle Dutch is a collective name for a number of closely related West Germanic dialects (whose ancestor was Old Dutch) which were spoken and written between 1150 and 1500. There was at that time as yet no overarching standard language, but they were all mutually intelligible.
In historic literature Diets and Middle Dutch (Middelnederlands) are used interchangeably to describe this whole group of dialects from which later standard Dutch would be derived. Although already at the beginning several Middle-Dutch variations were present, the similarities between the different regional languages were much stronger than their differences, especially for written languages and various literary works of that time today are often very readable for modern Dutch speakers, Dutch being a rather conservative language. By many non-linguists Middle Dutch is often referred to as Diets.
Within Middle Dutch, five large groups can be distinguished, all believed to be mutually intelligible:
Limburgish and Low Saxon gradated into Middle High German and Middle Low German, respectively. These two areas border directly the German language area in the narrow sense (i.e., today's Germany). The dialect
Pictish is a term used for the extinct language or languages thought to have been spoken by the Picts, the people of northern and central Scotland in the Early Middle Ages. The idea that a distinct Pictish language was perceived at some point is attested clearly in Bede's early 8th-century Historia ecclesiastica gentis Anglorum, which names Pictish as a language distinct from both Welsh and Gaelic.
There is virtually no direct attestation of Pictish, short of a limited number of place names and names of people found on monuments and the contemporary records in the area controlled by the Kingdom of the Picts.
The term "Pictish" was used by Jackson (1955), and followed by Forsyth (1997), to mean the language spoken mainly north of the Forth-Clyde line in the Early Middle Ages. They use the term "Pritennic" to refer to the proto-Pictish language spoken in this area during the Iron Age.
The evidence of place names and personal names argues strongly that at some point at least some of the people in the Pictish area spoke Insular Celtic languages related to the more southerly Brythonic languages. Columba, a Gael, used an interpreter in Pictland according to Bede (however, more recent
The Swahili language or Kiswahili is a Bantu language spoken by various ethnic groups that inhabit several large stretches of the Mozambique Channel coastline from northern Kenya to northern Mozambique, including the Comoros Islands. It is also spoken by ethnic minority groups in Somalia. Although only five million people speak Swahili as their mother tongue, it is used as a lingua franca in much of East Africa, meaning the total number of speakers exceeds 60 million. Swahili serves as a national, or official language, of five nations: Tanzania, Kenya, Uganda, the Comoros and the Democratic Republic of the Congo.
Some Swahili vocabulary is derived from Arabic through more than twelve centuries of contact with Arabic-speaking inhabitants of the coast of southeastern Africa. It has also incorporated Persian, German, Portuguese, English, and French words into its vocabulary through contact during the past five centuries.
Scholars lack sufficient historical or archaeological evidence to allow them to state exactly when and where either the Swahili language or the Swahili culture emerged. Nevertheless, it is assumed that the Swahili speaking people have occupied their present
Telugu (తెలుగు telugu, IPA: [t̪eluɡu]) is a South-Central Dravidian language primarily spoken in the state of Andhra Pradesh, India, where it is an official language. It is also spoken in the neighbouring states of Chattisgarh, Karnataka, Maharashtra, Orissa and Tamil Nadu, and is spoken in the bordering city of Yanam, in the neighboring territory of Pondicherry. According to the 2001 Census of India, Telugu is the language with the third largest number of native speakers in India (74 million), thirteenth in the Ethnologue list of most-spoken languages worldwide, and most spoken Dravidian language. It is one of the twenty-two scheduled languages of the Republic of India and one of the four classical languages.
Telugu has the second best script in the world after Hangeul . Telugu is the only Indian language in the list. Telugu was influenced by Sanskrit and Prakrit. Telugu borrowed several features of Sanskrit that have subsequently been lost in Sanskrit's daughter languages such as Hindi and Bengali, especially in the pronunciation of some vowels and consonants. It has also been influenced by Urdu only around Hyderabad city.
Telugu is written in a Brahmic alphabet.
The etymology of
The Etruscan language was spoken and written by the Etruscan civilization, in what is present-day Italy, in the ancient region of Etruria (modern Tuscany plus western Umbria and northern Latium) and in parts of Lombardy, Veneto, and Emilia-Romagna (where the Etruscans were displaced by Gauls). Etruscan was superseded completely by Latin, leaving only a few documents and some loanwords in Latin, such as persona (from Etruscan φersu), and some place-names, such as Roma.
Etruscan literacy was widespread over the Mediterranean shores, as evidenced by about 13,000 inscriptions (dedications, epitaphs, etc.), most fairly short, but some of considerable length. They date from about 700 BC.
The Etruscans had a rich literature, as noted by Latin authors. Unfortunately, only one book (now unreadable) has survived. By AD 100, Etruscan had been replaced by Latin.
Only a few educated Romans with antiquarian interests, such as Varro, could read Etruscan. The last person known to have been able to read Etruscan was the Roman emperor Claudius (10 BC – AD 54), the author of a treatise in twenty volumes on the Etruscans, Tyrrenikà (now lost), who compiled a dictionary (also lost) by interviewing the
Finnish ( suomi (help·info), or suomen kieli) is the language spoken by the majority of the population in Finland (92% as of 2006) and by ethnic Finns outside Finland. It is one of the two official languages of Finland and an official minority language in Sweden. In Sweden, both standard Finnish and Meänkieli, a Finnish dialect, are spoken. The Kven language, a Finnish dialect, is spoken in Northern Norway.
Finnish is the eponymous member of the Finnic language family and is typologically between fusional and agglutinative languages. It modifies and inflects nouns, adjectives, pronouns, numerals and verbs, depending on their roles in the sentence.
Finnish is a member of the Finnic group of the Uralic family of languages. The Finnic group also includes Estonian and other minority languages spoken around the Baltic Sea.
Finnish demonstrates an affiliation with the Uralic languages in several respects including:
Several theories exist as to the geographic origin of Finnish and the other Uralic languages, but the most widely held view is that they originated as a Proto-Uralic language somewhere in the boreal forest belt around the Ural Mountains region and/or the bend of the middle
The Inuit language is traditionally spoken across the North American Arctic and to some extent in the subarctic in Labrador. The related Yupik languages are spoken in western and southern Alaska and Russian Far East, particularly the Diomede Islands, but is severely endangered in Russia today and is spoken only in a few villages on the Chukchi Peninsula. The Inuit live primarily in three countries: Greenland (a constituent of the Kingdom of Denmark), Canada (specifically the Nunatsiavut region of Labrador, the Nunavik region of Quebec, Nunavut, and the Northwest Territories), and the United States (specifically the state of Alaska).
The total population of Inuit speaking their traditional language is difficult to assess with precision, since most counts rely on self-reported census data that may not accurately reflect usage or competence. Greenland census estimates place the number of speakers of Inuit dialects there at roughly 50,000, while Canadian estimates are at roughly 35,000. These two countries count the bulk of speakers of Inuit language variants, although about 7,500 Alaskans speak Inuit dialects out of a population of over 13,000 Inuit.
The Eskimo languages have a few
Lepontic is an extinct Alpine language that was spoken in parts of Rhaetia and Cisalpine Gaul (what is now Northern Italy) between 550 and 100 BC. Lepontic is attested in inscriptions found in an area centered around Lugano, Switzerland, and including the Lake Como and Lake Maggiore areas of Italy.
Lepontic is a Celtic language. While some recent scholarship (e.g. Eska 1998) has tended to consider it simply as an early form of Cisalpine Gaulish (or Cisalpine Celtic), thus a dialect of the Gaulish language, the majority opinion since Lejeune 1971 continues to view it as a distinct Continental Celtic language, thus not a Gaulish dialect. Within this latter view, the earlier inscriptions found within a 50 km radius of Lugano are considered Lepontic, while the later ones, to the immediate south of this area are considered Cisalpine Gaulish.
Lepontic was assimilated first by Gaulish, with the settlement of Gaulish tribes north of the River Po, and then by Latin, after the Roman Republic gained control over Gallia Cisalpina during the late 2nd and 1st century BC.
The majority view (e.g. Lejeune 1971, Koch 2008) is that Lepontic is a distinct Continental Celtic language. A minority
Libyan Arabic (Lībi ليبي; also known as Sulaimitian Arabic) is a collective term for the closely related varieties of Arabic spoken in Libya. It can be divided into two major dialect areas; the eastern centred in Benghazi and Bayda, and the western centred in Tripoli and Misrata. The eastern variety extends beyond the borders to the east into western Egypt.
The transcription of Libyan Arabic into Latin script poses a few problems. First, there is not one standard transcription in use even for Standard Arabic. The use of IPA alone is not sufficient as it obscures some points that can be better understood if several different allophones in Libyan Arabic are transcribed using the same symbol. On the other hand, Standard Arabic transcription schemes, while providing good support for representing Arabic sounds that are not normally represented by the Latin script, do not list symbols for other sounds found in Libyan Arabic. Therefore, to make this article more legible, DIN 31635 is used with a few additions to render phonemes particular to Libyan Arabic. These additions are as follow:
Two major historical events have shaped the Libyan dialect: the Hilalian-Sulaimi migration, and the
The Mixtecan languages constitute a branch of the Otomanguean language family of Mexico. They include the Trique (or Triqui) languages, spoken by about 24,500 people; Cuicatec, spoken by about 15,000 people; and the large expanse of Mixtec languages, spoken by about 511,000 people. The relationship between Trique, Cuicatec, and Mixtec, is an open question. Research by Terrence Kaufman in the 1980s supported grouping Cuicatec and Mixtec together, but it apparently remains unpublished.
Longacre (1957) reconstructed the following consonant inventory for proto-Mixtecan:
The Nauruan language (dorerin Naoero) is an Austronesian language spoken in Nauru. It is estimated that it has 7,000 speakers. Almost all speakers are bilingual in English.
It is a member of the Micronesian family of Austronesian languages. Its ISO 639 codes are 'na' and 'nau'. Nauruan is a UN recognized language.
Nauruan has 16–17 consonant phonemes. Nauruan makes phonemic contrasts between velarized and palatalized labial consonants (which are also geminated). Velarization is not apparent before long back vowels and palatalization is not apparent before non-low front vowels.
Between a vowel and word-final /mˠ/, an epenthetical [b] appears.
There are 12 phonemic vowels (six long, six short). In addition to the allophony in the following table from Nathan (1974), a number of vowels reduce to [ə]:
Non-open vowels (that is, all but /aa/, /a/, /ɑɑ/ /ɑ/) become non-syllabic when preceding another vowel, as in /e-oeeoun/ → [ɛ̃õ̯ɛ̃õ̯ʊn] ('hide').
Stress is on the penultimate syllable when the final syllable ends in a vowel, on the last syllable when it ends in a consonant, and initial with reduplications.
In the Nauruan written language, 17 letters were originally used:
The letters c, f,
Otomi (English pronunciation: /ˌoʊtəˈmiː/, in Spanish spelling Otomí Spanish: [otoˈmi]) is an Oto-Manguean language and one of the indigenous languages of Mexico, spoken by approximately 240,000 indigenous Otomi people in the central altiplano region of Mexico. Otomi is variously considered a dialect continuum or a group of closely related languages, since many of the varieties are not mutually intelligible. The word Hñähñu [hɲɑ̃hɲṹ] has been proposed as an endonym, but since it represents the usage of a single dialect it has not gained wide currency. Linguists have classified the modern dialects into three dialect areas: the Northwestern dialects spoken in Querétaro, Hidalgo and Guanajuato; the Southwestern dialects spoken in the State of Mexico; and the Eastern dialects spoken in the highlands of Veracruz, Puebla, and eastern Hidalgo and in villages in Tlaxcala and Mexico states.
Like all other Oto-Manguean languages, Otomi is a tonal language and most varieties distinguish three tones. Nouns are marked only for possessor; plural number is marked with a definite article and by a verbal suffix, and some dialects maintain dual number marking. There is no case marking. Verb
The Philistine language (/ˈfɪləstiːn/, /-staɪn/ or /fɨˈlɪstɨn/, /-tiːn/) is the extinct language of the Philistines, spoken— and rarely inscribed— along the coastal strip of southwestern Canaan. Very little is known about the language, of which a handful of words survive as cultural loan-words in Hebrew, describing specifically Philistine institutions, like the seranim, the "lords" of the Philistine Pentapolis, or the ’argáz receptacle that occurs in 1 Samuel 6 and nowhere else, or the title padî.
There is not enough information of the language of the Philistines to relate it securely to any other languages: possible relations to Indo-European languages, even Mycenaean Greek, support the independently-held theory that immigrant Philistines originated among "sea peoples". There are hints of non-Semitic vocabulary and onomastics, but the inscriptions, not clarified by some modern forgeries, are enigmatic: a number of inscribed miniature "anchor seals" have been found at various Philistine sites. On the other hand, evidence from the slender corpus of brief inscriptions from Iron Age IIA-IIB Tell es-Safi. demonstrates that at some stage during the local Iron Age, the Philistines
Romansh (also spelled Romansch, Rumants(c)h, or Romanche; Romansh: rumantsch (help·info)/ rumauntsch (help·info)/ romontsch (help·info)/rumàntsch; German: Rätoromanisch; Italian: Romancio) is a Rhaeto-Romance language descended from the Vulgar Latin spoken by the Roman era occupiers of the region. It is closely related to French, Occitan, and Lombard, as well as the other Romance languages to a lesser extent. Romansh is one of the four national languages of Switzerland, along with German, French and Italian.
In the 2000 Swiss census, 35,095 people (of which 27,038 in the canton of Grisons) indicated Romansh as the language of "best command", and 61,815 also as a "regularly spoken" language. Spoken by around 0.9% of Switzerland's 7.7 million inhabitants, Romansh is Switzerland's least-used national language in terms of number of speakers and the tenth most spoken language in Switzerland overall.
Romansh is a Romance language descending from Vulgar Latin, the spoken language of the Roman Empire. Within the Romance languages, Romansh stands out through its peripheral location, which manifests itself through several archaic features. Another distinguishing feature is the
Seraiki (Siraiki, Saraiki) is a cluster of southern Lahnda (southwestern Punjabi) dialects: Multani spoken in Multan and Muzaffargarh districts, Riyasati spoken in Bahawalpur, Rahim yar khan and Lodhran districts, and Derawali spoken in Dera Ghazi Khan and Rajanpur of Punjab Province and Deri Ismail Khan, Tank and Banu districts of Khyber Pakhtunkhwa. In Sindh Sureli which is bit effected by Sindhi language is spoken in whole upper Sindh which comprise of Kashmore,Ghotki,Sukhar,Jacobabad,Shikarpur,Larkana,Naushahro feroz, Qambar shahdadkot,Dadu and Kherpur Districts. In Balochistan the Khetrani of Daroug, Rakni and Barkhan District and Jadgali which is also called Jattaki of Jafferabad, Naseerabad and Jhalmagsi districts and Frakhi of Sibi District and Jafri of Musa khel district.Some times Thali dialect of Bhakker and Layyah districts is also included to this cluster but that is more closer to Shahpuri dialect of central Punjab.
Total number of saraiki dialect speaking districts in Punjab are seven, in Sindh are ten, in Balouchistan are six and in Khyber Pakhtunkhwa are three. Overall total in Pakistan is twenty six.
Standard Punjabi (Majhi), South Punjabi (Saraiki)and North
Serbian (Serbian Cyrillic: српски , Latin: srpski, pronounced [sr̩̂pskiː]) is a standardized register of the Serbo-Croatian language spoken by Serbs, mainly in Serbia, Bosnia and Herzegovina, Montenegro, Croatia, and Macedonia. It is official in Serbia and one of the official languages of Bosnia and Herzegovina, and is the principal language of the Serbs.
The dialect serving as the basis for the main literary and standard language is Shtokavian, which is also the basis of standard Croatian, Bosnian, and Montenegrin. In particular, Serbian is standardized around Šumadija-Vojvodina and Eastern Herzegovinian subdialects of Shtokavian. The Torlakian dialect of Serbian is spoken in southeast Serbia, and is not standardized, as it represents transitional form to Macedonian and Bulgarian.
Serbo-Croatian is the only European language with active digraphia, using both Cyrillic and Latin alphabets. The Bosnian and Serbian varieties use both alphabets while the Croatian variety uses only the Latin alphabet. The Serbian Cyrillic alphabet was devised in 1814 by Serbian linguist Vuk Karadžić, who created the alphabet on phonemic principles. The Latin alphabet was designed by Croatian linguist
Slovak ( slovenský jazyk (help·info), slovenčina, not to be confused with slovenski jezik or slovenščina, the native name of the Slovene language), is an Indo-European language that belongs to the West Slavic languages (together with Czech, Polish, Silesian, Kashubian, and Sorbian).
Slovak is the official language of Slovakia, where it is spoken by 5 million people. There are also Slovak speakers in the United States, the Czech Republic, Serbia, Ireland, Romania, Poland, Canada, Hungary, Croatia, the United Kingdom, Australia, Austria, and Ukraine.
Slovak uses a Latin script with small modifications that include the four diacritics (ˇ, ´, ¨, ˆ; see Pronunciation) placed above certain letters.
The primary principle of Slovak spelling is the phonemic principle. The secondary principle is the morphological principle: forms derived from the same stem are written in the same way even if they are pronounced differently. An example of this principle is the assimilation rule (see below). The tertiary principle is the etymological principle, which can be seen in the use of i after certain consonants and of y after other consonants, although both i and y are pronounced the same way. Finally
Lower Sorbian (Dolnoserbski) is a Slavic minority language spoken in eastern Germany in the historical province of Lower Lusatia, today part of Brandenburg. It is one of the two literary Sorbian languages, the other being Upper Sorbian.
Lower Sorbian is spoken in and around the city of Cottbus in Brandenburg. Signs in this region are usually bilingual, and Cottbus has a Gymnasium where one language of instruction is Lower Sorbian. It is a heavily endangered language. Most native speakers are in the oldest generation today.
The phonology of Lower Sorbian has been greatly influenced by contact with German, especially in Cottbus and larger towns. For example, German-influenced pronunciation tends to have a voiced uvular fricative [ʁ] instead of the alveolar trill [r], and a "clear" [l] that is not especially palatalized instead of [lʲ]. In villages and rural areas German influence is less marked, and the pronunciation is more "typically Slavic".
The consonant phonemes of Lower Sorbian are as follows:
Lower Sorbian has both final devoicing and regressive voicing assimilation:
The postalveolar fricative /ʃ/ is assimilated to [ɕ] before /t͡ɕ/:
The vowel phonemes are as follows:
The Swazi or Swati language (Swazi: siSwati [siswatʼi]; Zulu: isiSwazi [isiswaz̤i]) is a Bantu language of the Nguni group spoken in Swaziland and South Africa by the Swazi people. The number of speakers is estimated to be in the region of 3 million. The language is taught in Swaziland and some South African schools in Mpumalanga and KaNgwane areas. Swazi is an official language of Swaziland, (along with English), and is also one of the eleven official languages of South Africa.
Although the preferred term is "Swati" among native speakers, in English it is generally referred to as Swazi: this is taken from the Zulu name for the language, isiSwazi. Swazi is most closely related to the other "Tugela" Nguni language, Phuthi; but is also very close to the "Zunda" Nguni languages: Zulu, Southern Ndebele, Northern Ndebele, and Xhosa.
Swazi spoken in Swaziland (eSwatini) can be divided into four dialects corresponding to the four administrative regions of the country: Hhohho, Lubombo, Manzini, and Shiselweni.
Swazi has at least two varieties: the standard, prestige variety spoken mainly in the north, centre and southwest of the country, and a less prestigious variety spoken elsewhere.
Swedish ( svenska (help·info)) is a North Germanic language, spoken by approximately 10 million people, predominantly in Sweden and parts of Finland, especially along its coast and on the Åland islands. It is largely mutually intelligible with Norwegian and Danish (see Classification). Along with the other North Germanic languages, Swedish is a descendant of Old Norse, the common language of the Germanic peoples living in Scandinavia during the Viking Era. It is currently the largest of the North Germanic languages by numbers of speakers.
Standard Swedish, used by most Swedish people, is the national language that evolved from the Central Swedish dialects in the 19th century and was well established by the beginning of the 20th century. While distinct regional varieties descended from the older rural dialects still exist, the spoken and written language is uniform and standardized. Some dialects differ considerably from the standard language in grammar and vocabulary and are not always mutually intelligible with Standard Swedish. These dialects are confined to rural areas and are spoken primarily by small numbers of people with low social mobility. Though not facing imminent