Here is a list of languages for which the necessary support files have been provided. Please note that in some cases, the character set implemented may be just what is needed for the language, in which case, the full character set as specified in Sanskrit will not be available for this language. However, the supported subset of consonants, vowels and conjuncts is entirely adequate for all practical purposes.
The Sanskrit character set is very rich in conjuncts and in the present implementation, more than 550 conjuncts are included. In addition to this, the present implementation of Sanskrit provides for displaying text with Vedic marks. Numerals are in Sanskrit.
The Hindi character set does not include many of the three consonant conjuncts commonly used in Sanskrit. But the special characters in the script such as those derived from Persian are also included. Devanagari numerals are supported.
Tamil has no conjunct consonants. Hence conjunct characters from other languages are transliterated and displayed. The numerals are also Tamil specific.
The implemented character set is close to that of Hindi. Kannada numerals are supported.
The Telugu implementation supports a large number of conjuncts. As of now, the screen definition of the characters needs improvement. Telugu has some special consonants not found in Sanskrit. The Telugu character set implemented is also very similar to that of Hindi. The numerals are however only the Roman numerals.
Malayalam implementation is very close to the Sanskrit character set. This language is rich in conjuncts and there are some consonants here not found in Sanskrit. Malayalam numerals are supported.
About 40% of the conjuncts used in Sanskrit are implemented. Oriya has a few consonants which it shares with Bengali and Assamese. Oriya numerals are supported.
Many conjuncts are implemented but the set is not complete. Bengali has a few consonants which differ from Sanskrit. The numerals are in Bengali.
A fairly straightforward implementation. There are not many conjuncts. Numerals are in Roman.
Again a fairly straightforward implementation. A smaller set of conjuncts is supported. Numerals are in Roman.
The languages not included in the above actually use the script of one of the included languages. Hence the support files are not separately available for these languages. Urdu is also included in the list of national languages. The support for Urdu will be included in the near future.
The system supports the following languages as well.
The Japanese Hiragana character set has 14 consonants and 5 vowels. Interestingly, this is phonetic but differs from Indian languages in that Matras(vowel marks) are not used and each consonant-vowel combination has a unique shape.
Though in current practice, vowel marks are not used with the Hebrew characters, traditional Hebrew writing uses marks known as vowel points. The IITM system supports a fairly complete set of vowel points as found in the Hebrew Bible.
The implementation of the Greek character set supports the rough and the smooth breathing marks as well as the three accents (acute, grave and circumflex). Combinations of the accent marks and the breathing marks are also supported along with the iota subscript for the long vowels.
For the benefit of those who cannot directly read Indian scripts but are involved in research on Indian documents and manuscripts, some help is provided through this script which is basically Roman script with internationally accepted phonetic marks. In this implementation, one can correctly read all the consonants, conjuncts and vowel combinations in the superset of the Indian language characters mentioned earlier. Keyboard entry with IPA will result in the corresponding Indian language character appearing in transliterated form.