Malayalam script
Malayalam script is a Brahmic script used commonly to write the Malayalam language, which is the principal language of Kerala, India, spoken by 45 million people in the world. Malayalam script is also widely used for writing Sanskrit texts in Kerala. Like many other Indic scripts, it is an alphasyllabary, a writing system that is partially “alphabetic” and partially syllable-based. The modern Malayalam alphabet has 15 vowel letters, 42 consonant letters, and a few other symbols. The Malayalam script is a Vatteluttu alphabet extended with symbols from the Grantha alphabet to represent Indo-Aryan loanwords.
The script is also used to write several minority languages such as Paniya, Betta Kurumba, and Ravula. The Malayalam language itself was historically written in several different scripts.
Overview
Characteristics
The basic characters can be classified as follows:- Vowels
- # Independent vowel letters
- # Dependent vowel signs
- Consonant letters
- ക് /k/ = /k/ which is a consonant sound
- ക ka = ക് /k/ + അ vowel sign a
- കി ki = ക് /k/ + ഇ vowel sign i
- കു ku = ക് /k/ + ഉ vowel sign u
- കൈ kai = ക് /k/ + ഐ vowel sign ai
- ക ka = ക് /k/ + അ vowel sign a
History
Malayalam was first written in the Vatteluttu alphabet, an ancient script of Tamil. However, the modern Malayalam script evolved from the Grantha alphabet, which was originally used to write Sanskrit. Both Vatteluttu and Grantha evolved from the Tamil-Brahmi, but independently.Vatteluttu alphabet
Vattezhuthu is a script that had evolved from Tamil-Brahmi and was once used extensively in the southern part of present-day Tamil Nadu and in Kerala.Malayalam was first written in Vattezhuthu. The Vazhappally inscription issued by Rajashekhara Varman is the earliest example, dating from about 830 CE. In the Tamil country, the modern Tamil script had supplanted Vattezhuthu by the 15th century, but in the Malabar region, Vattezhuthu remained in general use up to the 17th century, or the 18th century. A variant form of this script, Kolezhuthu, was used until about the 19th century mainly in the Kochi area and in the Malabar area. Another variant form, Malayanma, was used in the south of Thiruvananthapuram.
Grantha
According to Arthur Coke Burnell, one form of the Grantha alphabet, originally used in the Chola dynasty, was imported into the southwest coast of India in the 8th or 9th century, which was then modified in course of time in this secluded area, where communication with the east coast was very limited. It later evolved into Tigalari-Malayalam script was used by the Malayali, Havyaka Brahmins and Tulu Brahmin people, but was originally only applied to write Sanskrit. This script split into two scripts: Tigalari and Malayalam. While Malayalam script was extended and modified to write vernacular language Malayalam, the Tigalari was written for Sanskrit only. In Malabar, this writing system was termed Arya-eluttu, meaning “Arya writing”.Vatteluttu was in general use, but was not suitable for literature where many Sanskrit words were used. Like Tamil-Brahmi, it was originally used to write Tamil, and as such, did not have letters for voiced or aspirated consonants used in Sanskrit but not used in Tamil. For this reason, Vatteluttu and the Grantha alphabet were sometimes mixed, as in the Manipravalam. One of the oldest examples of the Manipravalam literature, Vaishikatantram, dates back to the 12th century, where the earliest form of the Malayalam script was used, which seems to have been systematized to some extent by the first half of the 13th century.
Thunchaththu Ezhuthachan, a poet from around the 17th century, used Arya-eluttu to write his Malayalam poems based on Classical Sanskrit literature. For a few letters missing in Arya-eluttu, he used Vatteluttu. His works became unprecedentedly popular to the point that the Malayali people eventually started to call him the father of the Malayalam language, which also popularized Arya-eluttu as a script to write Malayalam. However, Grantha did not have distinctions between e and ē, and between o and ō, as it was as an alphabet to write a Sanskrit language. The Malayalam script as it is today was modified in the middle of the 19th century when Hermann Gundert invented the new vowel signs to distinguish them.
By the 19th century, old scripts like Kolezhuthu had been supplanted by Arya-eluttu – that is the current Malayalam script. Nowadays, it is widely used in the press of the Malayali population in Kerala.
Malayalam and Tigalari are sister scripts are descended from Grantha alphabet. Both share similar glyphic and orthographic characteristics.
Orthography reform
In 1971, the Government of Kerala reformed the orthography of Malayalam by a government order to the education department. The objective was to simplify the script for print and typewriting technology of that time, by reducing the number of glyphs required. In 1967, the government appointed a committee headed by Sooranad Kunjan Pillai, who was the editor of the Malayalam Lexicon project. It reduced number of glyphs required for Malayalam printing from around 1000 to around 250. Above committee's recommendations were further modified by another committee in 1969.This proposal was later accepted by major newspapers in January 1971. The reformed script came into effect on 15 April 1971, by a government order released on 23 March 1971.
Recommendations by the committees
Use non-ligating vowel signs for ''u'', ''ū'', and ''r̥''
In the traditional orthography, that had been taught in the primary education till that time, any consonant or consonant ligature followed by the vowel sign of u, ū, or r̥ are represented by a cursive consonant-vowel ligature. The glyph of each consonant had its own way of ligating with these vowel signs. This irregularity was simplified in the reformed script. As per that, a vowel sign or the consonant sign would always have a disconnected symbol that does not fuse with the base consonant.Examples:
- ku: → കു
- kū: → കൂ
- kr̥: → കൃ
- nu: → നു
- śu: → ശു
Reph is replaced with ''Chillu ṟa''
- rkka: ൎക്ക → ർക്ക
Split uncommon conjuncts with ''Chandrakkala''
- ഗ് g + ദ da = gda: -> ഗ്ദ
- ല് l + ത ta = lta: -> ല്ത
- ശ് ś + ന na = śna: -> ശ്ന
- ശ് ś + മ ma = śma: -> ശ്മ
- The ligature śma is required as an additional letter. For examples, ശ്മശാനം, śmaśanam, is the word for cemetery.
Use non-ligating sign for conjoining ''ra''
- kra: ക്ര → ക്ര
- kru: ക്രു → ക്രു
Current status
Malayalam letters
Vowels
Vowel letters and vowel signs
The following tables show the independent vowel letters and the corresponding dependent vowel signs of the Malayalam script, with romanizations in ISO 15919, transcriptions in the International Phonetic Alphabet.r̥, r̥̄, l̥, l̥̄, used to write Sanskrit words, are treated as vowels. They are called semi-vowels and are phonetically closer to vowels in Malayalam and in Classical Sanskrit where Panini, the Sanskrit grammarian, groups them with vowel sounds in his sutras.. The letters and signs for r̥̄, l̥, l̥̄ are very rare, and are not considered as part of the modern orthography.
The vowel signs ā, i, ī are placed to the right of a consonant letter to which it is attached. The vowel signs e, ē, ai are placed to the left of a consonant letter. The vowel signs o and ō consist of two parts: the first part goes to the left of a consonant letter and the second part goes to the right of it. In the reformed orthography, the vowel signs u, ū, r̥ are simply placed to the right of the consonant letter, while they often make consonant-vowel ligatures in the traditional orthography.
It is important to note the vowel duration as it can be used to differentiate words that would otherwise be the same. For example, means "earthenware pot" while means "time" or "season".
Anusvaram
An anusvaram, or an anusvara, originally denoted the nasalization where the preceding vowel was changed into a nasalized vowel, and hence is traditionally treated as a kind of vowel sign. In Malayalam, however, it simply represents a consonant after a vowel, though this may be assimilated to another nasal consonant. It is a special consonant letter, different from a "normal" consonant letter, in that it is never followed by an inherent vowel or another vowel. In general, an anusvara at the end of a word in an Indian language is transliterated as ṁ in ISO 15919, but a Malayalam anusvara at the end of a word is transliterated as m without a dot.Visargam
A visargam, or [|visarga], represents a consonant after a vowel, and is transliterated as ḥ. Like the anusvara, it is a special symbol, and is never followed by an inherent vowel or another vowel.Consonants
Basic consonant letters
The following tables show the basic consonant letters of the Malayalam script, with romanizations in ISO 15919, transcriptions in IPA, and Unicode CHARACTER NAMES. The character names used in the report of the Government of Kerala committee are shown in lowercase italics when different from Unicode character names. Those alternative names are based on the traditional romanization used by the Malayali people. For example, tha in “Thiruvananthapuram” is neither ISO tha nor Unicode THA, but tha in this sense. The ISCII character names are given in parentheses when different from the above.The consonants are retroflex. In Malayalam, they are produced by touching the underside of the tip of the tongue to the front part of the hard palate.
യ ya YA | ര ra RA | ല la LA | വ va VA |
ള ḷa LLA hard la | ഴ ḻa LLLA/ṛ /ɽ/ zha | റ ṟa, ṯa RRA | ഩ ṉa NNNA | ഺ ṯa TTTA |
- Dental nasal or alveolar nasal, depending on the word.
- Alveolar tap.
- The tip of the tongue almost touches the teeth, forward than the English l.
- .
- .
- Voiceless apico-palatal approximant.
- Dental sibilant fricative.
- Apico-palatal.
- This glyph is a ligature, but is sometimes listed as a basic unit. Often pronounced.
- Voiced apico-palatal approximant. This consonant is usually described as, but also can be approximated by.
- Repetition of this letter often represents a geminated voiceless alveolar plosive, ; chillu-n + this letter often represents ; otherwise alveolar trill . Optionally, may be transliterated as ṯṯa instead of ṟṟa, as nṯa instead of nṟa.
- Corresponds to Tamil ṉa ன. Used rarely in scholarly texts to represent the alveolar nasal, as opposed to the dental nasal. In ordinary texts both are represented by na ന.
- Used rarely in scholarly texts to represent the voiceless alveolar plosive, as opposed to the voiceless dental plosive represented by ta ത. In ordinary texts this sound is represented by ṟa റ.
Chillus
Six independent chillu letters had been encoded in Unicode 5.1., three additional chillu letters were encoded with the publication of Unicode 9.0.
Letter | Unicode name | Base | Remarks |
ൺ | CHILLU NN | ṇa ണ | |
ൻ | CHILLU N | na ന | |
ർ | CHILLU RR | ra ര | Historically from ra, not from ṟa റ. |
ൽ | CHILLU L | la ല | Historically from ta ത. |
ൾ | CHILLU LL | ḷa ള | Historically from ṭa ട |
ൿ | CHILLU K | ka ക | |
ൔ | CHILLU M | ma മ | Not in modern use |
ൕ | CHILLU Y | ya യ | Not in modern use |
ൖ | CHILLU LLL | llla ഴ | Not in modern use |
Chandrakkala
The virama in Malayalam is called candrakkala, it has two functions:- As virama: used to suppress the inherent vowel
- As samvruthokaram: represent the “half-u” sound
As virama
Half-u
At the end of a word, the same symbol sometimes represents a very short vowel, known as “half-u”, or “samvruthokaram”, or kuṯṯiyal ukaram. The exact pronunciation of this vowel varies from dialect to dialect, but it is approximately or, and transliterated as ŭ. Optionally, a vowel sign u is inserted, as in നു്. According to one author, this alternative form is historically more correct, though the simplified form without a vowel sign u is common nowadays. This means that the same spelling ന് may represent either n or nŭ depending on the context. Generally, it is nŭ at the end of a word, and n elsewhere; നു് always represents nŭ.The virama of Tigalari script behave similarly to Malayalam. Virama has three functions: to suppress the inherent vowel ; to form conjunct consonants; to represent the half-u.
Note:
Srinidhi A and Sridatta A made comments on the proposals of Cibu Johny et al.
Ligatures
Consonant ligatures
Like in other Indic scripts, a virama is used in the Malayalam script to cancel—or “kill”—the inherent vowel of a consonant letter and represent a consonant without a vowel, so-called a “dead” consonant. For example,- ന is a consonant letter na,
- ് is a virama; therefore,
- ന് represents a dead consonant n.
Generally, when a dead consonant letter C1 and another consonant letter C2 are conjoined, the result may be either:
- A fully conjoined ligature of C1+C2;
- Half-conjoined—
- * C1-conjoining: a modified form of C1 attached to the original form of C2
- * C2-conjoining: a modified form of C2 attached to the full form of C1; or
- Non-ligated: full forms of C1 and C2 with a visible virama.
Common consonant ligatures
Several consonant-consonant ligatures are used commonly even in the new orthography.kka | ṅka | ṅṅa | cca | ñca | ñña | ṭṭa | ṇṭa | ṇṇa | tta | nta | nna | ppa | mpa | mma | |
Non-ligated | ക്ക | ങ്ക | ങ്ങ | ച്ച | ഞ്ച | ഞ്ഞ | ട്ട | ണ്ട | ണ്ണ | ത്ത | ന്ത | ന്ന | പ്പ | മ്പ | മ്മ |
Ligated | ക്ക | ങ്ക | ങ്ങ | ച്ച | ഞ്ച | ഞ്ഞ | ട്ട | ണ്ട | ണ്ണ | ത്ത | ന്ത | ന്ന | പ്പ | മ്പ | മ്മ |
The ligature mpa മ്പ was historically derived from npa ന്പ. The ligatures cca, bba, yya, and vva are special in that a doubled consonant is denoted by a triangle sign below a consonant letter.
cca | bba | yya | vva | |
Non-ligated | ച്ച | ബ്ബ | യ്യ | വ്വ |
Ligated | ച്ച | ബ്ബ | യ്യ | വ്വ |
Consonant + ''ya'', ''va'', ''la'', ''ra''
- The consonant letter ya is generally C2-conjoining after a consonant in both orthographies. For example,
- k ക് + ya യ = kya ക്യ
- p പ് + ya യ = pya പ്യ
- Similarly, va after a consonant takes a post-base form:
- k ക് + va വ = kva ക്വ
- p പ് + va വ = pva പ്വ
- The consonant letter la after a consonant traditionally takes a below-base form. These forms are used also in the new orthography, though some fonts do not support them.
- k ക് + la ല = kla ക്ല
- p പ് + la ല = pla പ്ല
- l ല് + la ല = lla ല്ല
- A consonant letter ra after a consonant usually takes a pre-base form in the reformed orthography, while this combination makes a fully conjoined ligature in the traditional orthography.
- k ക് + ra ര = kra ക്ര
- p പ് + ra ര = pra പ്ര
''nṯa'' and ''ṯṯa''
nṯa | ṯṯa | |
Non-ligated | ന്റ | റ്റ |
Ligated | ന്റ | റ്റ |
Digraph | ൻറ | ററ |
In those two ligatures, a small ṟa റ is written below the first letter. Alternatively, the letter ṟa is sometimes written to the right of the first letter, making a digraph. The spelling ൻറ is therefore read either nṟa or nṯa depending on the word. Similarly, ററ is read either ṟaṟa or ṯṯa.
Dot reph
In the traditional orthography, a dead consonant r before a consonant sometimes takes an above-base form, known as a dot reph, which looks like a short vertical line or a dot. Generally, a chillu-r is used instead of a dot reph in the reformed orthography.- r ര് + ga ഗ = rga
- r ര് + ja ജ = rja
Consonant-vowel ligatures
Numeral System
Malayalam numeral system is archaic and no longer commonly used. Instead, the common Hindu-Arabic numeral system is followed.0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 100 | 1000 | |||
൦ | ൧ | ൨ | ൩ | ൪ | ൫ | ൬ | ൭ | ൮ | ൯ | ൰ | ൱ | ൲ | ൳ | ൴ | ൵ |
Number "11" is written as "൰൧" and not "൧൧". "32" is written as "൩൰൨" similar to the Tamil numeral system.
11 | 20 | 21 | 30 | 110 | 10,099 |
൦ | ൨൰ | ൨൰൧ | ൩൰ | ൱൰ | ൰൲൯൰൯ |
Suppose the number is "2013". It is read in Malayalam as "രണ്ടായിരത്തി പതിമൂന്ന്". It is split into :
- രണ്ട് : 2 - ൨
- ആയിരത്തി : 1000 - ൲
- പത്ത് : 10 - ൰
- മൂന്ന് : 3 - ൩
Other symbols
Unicode
Malayalam script was added to the Unicode Standard in October, 1991 with the release of version 1.0.Block
The Unicode block for Malayalam is U+0D00–U+0D7F:Chillus in Unicode
For example, avan അവൻ is written as a അ + va വ + chillu-n ൻ, where chillu-n represents the n sound without a vowel. In other Indic scripts, the same word would be possibly written as a + va + na + virama. However, in Malayalam script, that sequence represents a different word, avanŭ അവന്, and is not interchangeable with avan. This is because in modern Malayalam script, the sign for a virama also works as the sign for a vowel ŭ at the end of a word, and is not able to cleanly “kill” the inherent vowel in this case.To differentiate a pure consonant and a consonant with ŭ, zero-width joiner and zero-width non-joiner were used before Unicode 5.1. However, this system was problematic. Among other things, glyph variants specified by ZWJ or ZWNJ are supposed to be non-semantic, whereas a chillu and the same consonant followed by a ŭ are often semantically different. After a long debate, Nine chillu letters now have their own code points since Unicode 9.0, though applications should also be prepared to handle data in the representation specified in Unicode 5.0. This means, fonts should display chillus in both sequences; while an input method should output standard chillus.
The ligature nṯa ന്റ is very common and supported by most Malayalam fonts in one way or another, but exactly how it should be encoded was not clear in Unicode 5.0 and earlier, and two incompatible implementations are currently in use. In Unicode 5.1, the sequence to represent it was explicitly redefined as chillu-n + virama + ṟa.