Browse Codepoints by Script
See writing scripts and systems how they are used around the world. Click on a script name for additional information.
-
Adlam: 92 code points
92 characters are encoded in this script.
The Adlam script is a script used to write Fulani. The name Adlam is an acronym derived from the first four letters of the alphabet, standing for Alkule Dandayɗe Leñol Mulugol, which means "the alphabet that protects the peoples from vanishing". It is one of many indigenous scripts developed for specific languages in West Africa.
-
Afaka: 0 code points
0 characters are encoded in this script.
The Afaka script is a syllabary of 56 letters devised in 1910 for the Ndyuka language, an English-based creole of Suriname. The script is named after its inventor, Afáka Atumisi. It continues to be used to write Ndyuka in the 21st century, but the literacy rate in the language for all scripts is under 10%.
-
Caucasian Albanian: 56 code points
56 characters are encoded in this script.
The Caucasian Albanian script was an alphabetic writing system used by the Caucasian Albanians, one of the ancient Northeast Caucasian peoples whose territory comprised parts of the present-day Republic of Azerbaijan and Dagestan.
-
Ahom: 65 code points
65 characters are encoded in this script.
The Ahom script or Tai Ahom Script is an abugida that is used to write the Ahom language, a dormant Tai language undergoing revival spoken by the Ahom people till the late 18th-century, who established the Ahom kingdom and ruled the eastern part of the Brahmaputra valley between the 13th and the 18th centuries. The old Ahom language today survives in the numerous manuscripts written in this script currently in institutional and private possession.
-
Arabic: 1421 code points
1421 characters are encoded in this script.
The Arabic script is the writing system used for Arabic and several other languages of Asia and Africa. It is the second-most widely used alphabetic writing system in the world, the second-most widely used writing system in the world by number of countries using it, and the third-most by number of users.
-
Nastaliq Arabic: 0 code points
0 characters are encoded in this script.
Nastaliq, also romanized as Nastaʿlīq or Nastaleeq, is one of the main calligraphic hands used to write the Perso-Arabic script and it is used for some Indo-Iranian languages, predominantly Classical Persian, Kashmiri, Punjabi (Shahmukhi) and Urdu. It is often used also for Ottoman Turkish poetry, but rarely for Arabic. Nastaliq developed in Iran from naskh beginning in the 13th century and remains widely used in Iran, India, Afghanistan, Pakistan, and other countries for written poetry and as a form of art.
-
Imperial Aramaic: 31 code points
31 characters are encoded in this script.
The ancient Aramaic alphabet was used to write the Aramaic languages spoken by ancient Aramean pre-Christian tribes throughout the Fertile Crescent. It was also adopted by other peoples as their own alphabet when empires and their subjects underwent linguistic Aramaization during a language shift for governing purposes — a precursor to Arabization centuries later — including among the Assyrians and Babylonians who permanently replaced their Akkadian language and its cuneiform script with Aramaic and its script, and among Jews, but not Samaritans, who adopted the Aramaic language as their vernacular and started using the Aramaic alphabet, which they call "Square Script", even for writing Hebrew, displacing the former Paleo-Hebrew alphabet. The modern Hebrew alphabet derives from the Aramaic alphabet, in contrast to the modern Samaritan alphabet, which derives from Paleo-Hebrew.
-
Armenian: 97 code points
97 characters are encoded in this script.
The Armenian alphabet or, more broadly, the Armenian script, is an alphabetic writing system developed for Armenian and occasionally used to write other languages. It was developed around 405 CE by Mesrop Mashtots, an Armenian linguist and ecclesiastical leader. The script originally had 36 letters. Eventually, two more were adopted in the 13th century. In reformed Armenian orthography (1920s), the ligature և ev is also treated as a letter, bringing the total number of letters to 39.
-
Avestan: 64 code points
64 characters are encoded in this script.
The Avestan alphabet is a writing system developed during Iran's Sasanian era (226–651 CE) to render the Avestan language.
-
Balinese: 127 code points
127 characters are encoded in this script.
The Balinese script, natively known as Aksara Bali and Hanacaraka, is an abugida used in the island of Bali, Indonesia, commonly for writing the Austronesian Balinese language, Old Javanese, and the liturgical language Sanskrit. With some modifications, the script is also used to write the Sasak language, used in the neighboring island of Lombok. The script is a descendant of the Brahmi script, and so has many similarities with the modern scripts of South and Southeast Asia. The Balinese script, along with the Javanese script, is considered the most elaborate and ornate among Brahmic scripts of Southeast Asia.
-
Bamum: 657 code points
657 characters are encoded in this script.
The Bamum scripts are an evolutionary series of six scripts created for the Bamum language by Ibrahim Njoya, King of Bamum. They are notable for evolving from a pictographic system to a semi-syllabary in the space of fourteen years, from 1896 to 1910. Bamum type was cast in 1918, but the script fell into disuse around 1931. A project began around 2007 to revive the Bamum script.
-
Bassa Vah: 36 code points
36 characters are encoded in this script.
Bassa Vah, also known as simply Vah is an alphabetic script for writing the Bassa language of Liberia. As an old system nearing extinction in the 1900s, it was rediscovered among Bassa in Brazil and the West Indies, then revived in Liberia, by Thomas Flo Lewis. Type was cast for it, and an association for its promotion was formed in Liberia in 1959. It is not used today and has been classified as a failed script.
-
Batak: 56 code points
56 characters are encoded in this script.
The Batak script is a writing system used to write the Austronesian Batak languages spoken by several million people on the Indonesian island of Sumatra. The script may be derived from the Kawi and Pallava script, ultimately derived from the Brahmi script of India, or from the hypothetical Proto-Sumatran script influenced by Pallava.
-
Bengali: 114 code points
114 characters are encoded in this script.
The Bengali–Assamese script, sometimes also known as Eastern Nagari, is an eastern Brahmic script, primarily used today for the Bengali and Assamese language spoken in eastern South Asia. It evolved from Gaudi script, also the common ancestor of the Odia and Trihuta scripts. It is commonly referred to as the Bengali script by Bengalis and the Assamese script by the Assamese, while in academic discourse it is sometimes called Eastern-Nāgarī. Three of the 22 official languages of the Indian Republic—Bengali, Assamese, and Meitei—commonly use this script in writing; Bengali is also the official and national language of Bangladesh.
-
Bhaiksuki: 97 code points
97 characters are encoded in this script.
Bhaiksuki is a Brahmi-based script that was used around the 11th and 12th centuries CE. It used to be known in English as the "Arrow-Headed Script" or "Point-Headed Script," while an older designation, "Sindhura," had been used in Tibet for at least three centuries. Records showing usage of the script mainly appeared in the present-day states of Bihar and West Bengal in India, and in regions of Bangladesh. Records have also been located in Tibet, Nepal, and Burma.
-
Bliss: 0 code points
0 characters are encoded in this script.
Blissymbols or Blissymbolics is a constructed language conceived as an ideographic writing system called Semantography consisting of several hundred basic symbols, each representing a concept, which can be composed together to generate new symbols that represent new concepts. Blissymbols differ from most of the world's major writing systems in that the characters do not correspond at all to the sounds of any spoken language.
-
Bopomofo: 122 code points
122 characters are encoded in this script.
Bopomofo, also called Zhuyin Fuhao, or simply Zhuyin, is a transliteration system for Standard Chinese and other Sinitic languages. It is commonly used in Taiwan. It consists of 37 characters and five tone marks, which together can transcribe all possible sounds in Mandarin Chinese.
-
Brahmi: 115 code points
115 characters are encoded in this script.
Brahmi is a writing system from ancient India that appeared as a fully developed script in the 3rd century BCE. Its descendants, the Brahmic scripts, continue to be used today across South and Southeastern Asia.
-
Braille: 256 code points
256 characters are encoded in this script.
Braille is a tactile writing system used by people who are visually impaired. It can be read either on embossed paper or by using refreshable braille displays that connect to computers and smartphone devices. Braille can be written using a slate and stylus, a braille writer, an electronic braille notetaker or with the use of a computer connected to a braille embosser.
-
Buginese: 31 code points
31 characters are encoded in this script.
The Lontara script, also known as the Bugis script, Bugis-Makassar script, or Urupu Sulapa’ Eppa’ "four-cornered letters", is one of Indonesia's traditional scripts developed in the South Sulawesi and West Sulawesi region. The script is primarily used to write the Buginese language, followed by Makassarese and Mandar. Closely related variants of Lontara are also used to write several languages outside of Sulawesi such as Bima, Ende, and Sumbawa. The script was actively used by several South Sulawesi societies for day-to-day and literary texts from at least mid-15th Century CE until the mid-20th Century CE, before its function was gradually supplanted by the Latin alphabet. Today the script is taught in South Sulawesi Province as part of the local curriculum, but with very limited usage in everyday life.
-
Buhid: 22 code points
22 characters are encoded in this script.
Surat Buhid is an abugida used to write the Buhid language. As a Brahmic script indigenous to the Philippines, it closely related to Baybayin and Hanunó'o. It is still used today by the Mangyans, found mainly on island of Mindoro, to write their language, Buhid, together with the Filipino latin script.
-
Chakma: 91 code points
91 characters are encoded in this script.
The Chakma Script, also called Ajhā pāṭh, Ojhapath, Ojhopath, Aaojhapath, is an abugida used for the Chakma language, and recently for the Pali language.
-
Canadian Aboriginal: 726 code points
726 characters are encoded in this script.
Canadian syllabic writing, or simply syllabics, is a family of writing systems used in a number of indigenous Canadian languages of the Algonquian, Inuit, and (formerly) Athabaskan language families. These languages had no formal writing system previously. They are valued for their distinctiveness from the Latin script and for the ease with which literacy can be achieved. For instance, by the late 19th century the Cree had achieved what may have been one of the highest rates of literacy in the world. Syllabics are an abugida, where glyphs represent consonant–vowel pairs, determined by the rotation of the glyphs. They derive from the work of linguist and missionary James Evans.
-
Carian: 53 code points
53 characters are encoded in this script.
The Carian alphabets are a number of regional scripts used to write the Carian language of western Anatolia. They consisted of some 30 alphabetic letters, with several geographic variants in Caria and a homogeneous variant attested from the Nile delta, where Carian mercenaries fought for the Egyptian pharaohs. They were written left-to-right in Caria and right-to-left in Egypt.
-
Cham: 83 code points
83 characters are encoded in this script.
The Cham script is a Brahmic abugida used to write Cham, an Austronesian language spoken by some 245,000 Chams in Vietnam and Cambodia. It is written horizontally left to right, just like other Brahmic abugidas.
-
Cherokee: 182 code points
182 characters are encoded in this script.
The Cherokee syllabary is a syllabary invented by Sequoyah in the late 1810s and early 1820s to write the Cherokee language. His creation of the syllabary is particularly noteworthy as he was illiterate until its creation. He first experimented with logograms, but his system later developed into the syllabary. In his system, each symbol represents a syllable rather than a single phoneme; the 85 characters provide a suitable method for writing Cherokee. The letters resemble characters from other scripts, such as Latin, Greek, Cyrillic, and Glagolitic, however, these are not used to represent the same sounds.
-
Chorasmian: 28 code points
28 characters are encoded in this script.
-
Cirth: 0 code points
0 characters are encoded in this script.
The Cirth is a semi‑artificial script, based on real‑life runic alphabets, one of several scripts invented by J. R. R. Tolkien for the constructed languages he devised and used in his works. Cirth is written with a capital letter when referring to the writing system; the letters themselves can be called cirth.
-
Coptic: 173 code points
173 characters are encoded in this script.
The Coptic script is the script used for writing the Coptic language, the most recent development of Egyptian. The repertoire of glyphs is based on the uncial Greek alphabet, augmented by letters borrowed from the Egyptian Demotic. It was the first alphabetic script used for the Egyptian language. There are several Coptic alphabets, as the script varies greatly among the various dialects and eras of the Coptic language.
-
Cypro Minoan: 101 code points
101 characters are encoded in this script.
The Cypro-Minoan syllabary (CM), more commonly called the Cypro-Minoan Script, is an undeciphered syllabary used on the island of Cyprus and at its trading partners during the late Bronze Age and early Iron Age. The term "Cypro-Minoan" was coined by Arthur Evans in 1909 based on its visual similarity to Linear A on Minoan Crete, from which CM is thought to be derived. Approximately 250 objects—such as clay balls, cylinders, and tablets which bear Cypro-Minoan inscriptions, have been found. Discoveries have been made at various sites around Cyprus, as well as in the ancient city of Ugarit on the Syrian coast. It is thought to be somehow related to the later Cypriot syllabary.
-
Cypriot: 112 code points
112 characters are encoded in this script.
The Cypriot or Cypriote syllabary is a syllabic script used in Iron Age Cyprus, from about the 11th to the 4th centuries BCE, when it was replaced by the Greek alphabet. It has been suggested that the script remained in use as late as the 1st century BC. A pioneer of that change was King Evagoras of Salamis. It is thought to be descended from the Cypro-Minoan syllabary, itself a variant or derivative of Linear A. Most texts using the script are in the Arcadocypriot dialect of Greek, but also one bilingual inscription was found in Amathus.
-
Cyrillic: 521 code points
521 characters are encoded in this script.
The Cyrillic script, Slavonic script or simply Slavic script is a writing system used for various languages across Eurasia. It is the designated national script in various Slavic, Turkic, Mongolic, Uralic, Caucasian and Iranic-speaking countries in Southeastern Europe, Eastern Europe, the Caucasus, Central Asia, North Asia, and East Asia, and used by many other minority languages.
-
Old Cyrillic: 0 code points
0 characters are encoded in this script.
The Early Cyrillic alphabet, also called classical Cyrillic or paleo-Cyrillic, is an alphabetic writing system that was developed in Medieval Bulgaria in the Preslav Literary School during the late 9th century. It is used to write the Church Slavonic language, and was historically used for its ancestor, Old Church Slavonic. It was also used for other languages, but between the 18th and 20th centuries was mostly replaced by the modern Cyrillic script, which is used for some Slavic languages, and for East European and Asian languages that have experienced a great amount of Russian cultural influence.
-
Devanagari: 221 code points
221 characters are encoded in this script.
Devanagari is an Indic script used in the northern Indian subcontinent. Also simply called Nāgari, it is a left-to-right abugida, based on the ancient Brāhmi script. It is one of the official scripts of the Republic of India and Nepal. It was developed and in regular use by the 8th century CE and achieved its modern form by 1200 CE. The Devanāgari script, composed of 48 primary characters, including 14 vowels and 34 consonants, is the fourth most widely adopted writing system in the world, being used for over 120 languages.
-
Dhives Akuru: 72 code points
72 characters are encoded in this script.
Dhives Akuru, later called Dhivehi Akuru is a script formerly used for the Maldivian language. The name can be alternatively spelled Dives Akuru or Divehi Akuru using the ISO 15919 Romanization scheme, as the "d" is unaspirated.
-
Dogri: 82 code points
82 characters are encoded in this script.
The Dogri script is a writing system originally used for writing the Dogri language in Jammu and Kashmir in the northern part of the Indian subcontinent.
-
Deseret: 80 code points
80 characters are encoded in this script.
The Deseret alphabet is a phonemic English-language spelling reform developed between 1847 and 1854 by the board of regents of the University of Deseret under the leadership of Brigham Young, the second president of the Church of Jesus Christ of Latter-day Saints. George D. Watt is reported to have been the most actively involved in the development of the script's novel characters, which were used to replace those of Isaac Pitman's English phonotypic alphabet. He was also the "New Alphabet's" first serious user. The script gets its name from the word deseret, a hapax legomenon in the Book of Mormon, which is said to mean "honeybee" in the only verse it is used in.
-
Duployan: 154 code points
154 characters are encoded in this script.
The Duployan shorthand, or Duployan stenography, was created by Father Émile Duployé in 1860 for writing French. Since then, it has been expanded and adapted for writing English, German, Spanish, Romanian, Latin, Danish, and Chinook Jargon. The Duployan stenography is classified as a geometric, alphabetic stenography and is written left-to-right in connected stenographic style. The Duployan shorthands, including Chinook writing, Pernin's Universal Phonography, Perrault's English Shorthand, the Sloan-Duployan Modern Shorthand, and Romanian stenography, were included as a single script in version 7.0 of the Unicode Standard / ISO 10646
-
Demotic: 0 code points
0 characters are encoded in this script.
Demotic is the ancient Egyptian script derived from northern forms of hieratic used in the Nile Delta. The term was first used by the Greek historian Herodotus to distinguish it from hieratic and hieroglyphic scripts. By convention, the word "Demotic" is capitalized in order to distinguish it from demotic Greek.
-
Hieratic: 0 code points
0 characters are encoded in this script.
Hieratic is the name given to a cursive writing system used for Ancient Egyptian and the principal script used to write that language from its development in the third millennium BCE until the rise of Demotic in the mid-first millennium BCE. It was primarily written in ink with a reed brush on papyrus.
-
Egyptian Hieroglyphs: 5105 code points
5105 characters are encoded in this script.
Egyptian hieroglyphs were the formal writing system used in Ancient Egypt for writing the Egyptian language. Hieroglyphs combined ideographic, logographic, syllabic and alphabetic elements, with more than 1,000 distinct characters. Cursive hieroglyphs were used for religious literature on papyrus and wood. The later hieratic and demotic Egyptian scripts were derived from hieroglyphic writing, as was the Proto-Sinaitic script that later evolved into the Phoenician alphabet. Egyptian hieroglyphs are the ultimate ancestor of the Phoenician alphabet, the first widely adopted phonetic writing system. Moreover, owing in large part to the Greek and Aramaic scripts that descended from Phoenician, the majority of the world's living writing systems are descendants of Egyptian hieroglyphs—most prominently the Latin and Cyrillic scripts through Greek, and the Arabic and Brahmic scripts through Aramaic.
-
Elbasan: 42 code points
42 characters are encoded in this script.
The Elbasan alphabet is a mid 18th-century alphabetic script created for the Albanian language Elbasan Gospel Manuscript, also known as the Anonimi i Elbasanit, which is the only document written in it. The document was created at St. Jovan Vladimir's Church in central Albania, but is preserved today at the National Archives of Albania in Tirana. The alphabet, like the manuscript, is named after the city of Elbasan, where it was invented, and although the manuscript isn't the oldest document written in Albanian, Elbasan is the oldest out of seven known original alphabets created for Albanian. Its 59 pages contain Biblical content written in a script of 40 letters, of which 35 frequently recur and 5 are rare.
-
Elymaic: 23 code points
23 characters are encoded in this script.
The Elymaic alphabet is a right-to-left, non-joining abjad. It is derived from the Aramaic alphabet. Elymaic was used in the ancient state of Elymais, which was a semi-independent state of the 2nd century BCE to the early 3rd century CE, frequently a vassal under Parthian control, in the present-day region of Khuzestan, Iran (Susiana).
-
Ethiopic: 524 code points
524 characters are encoded in this script.
Geʽez is a script used as an abugida (alphasyllabary) for several Afro-Asiatic and Nilo-Saharan languages of Ethiopia and Eritrea. It originated as an abjad and was first used to write the Geʽez language, now the liturgical language of the Ethiopian Orthodox Tewahedo Church, the Eritrean Orthodox Tewahedo Church, the Eritrean Catholic Church, the Ethiopian Catholic Church, and Haymanot Judaism of the Beta Israel Jewish community in Ethiopia. In the languages Amharic and Tigrinya, the script is often called fidäl (ፊደል), meaning "script" or "letter". Under the Unicode Standard and ISO 15924, it is defined as Ethiopic text.
-
Garay: 72 code points
72 characters are encoded in this script.
The Garay alphabet was designed in 1961, as a transcription system "[marrying] African sociolinguistic characteristics" according to its inventor, Assane Faye. This alphabet has 25 consonants and 14 vowels. It is used in particular for the writing of the Wolof language, spoken mostly in Senegal, although it is more often written in the Latin alphabet and to a lesser extent in the Arabic (Wolofal) alphabet. It is written from right to left, and distinguishes letter case.
-
Chutsuri: 0 code points
0 characters are encoded in this script.
-
Georgian: 178 code points
178 characters are encoded in this script.
-
Glagolitic: 144 code points
144 characters are encoded in this script.
The Glagolitic script is the oldest known Slavic alphabet. It is generally agreed that it was created in the 9th century for the purpose of translating liturgical texts into Old Church Slavonic by Saint Cyril, a monk from Thessalonica. He and his brother Saint Methodius were sent by the Byzantine Emperor Michael III in 863 to Great Moravia to spread Christianity there. After the deaths of Cyril and Methodius, their disciples were expelled and they moved to the First Bulgarian Empire instead. The Early Cyrillic alphabet, which developed gradually in the Preslav Literary School by Greek alphabet scribes who incorporated some Glagolitic letters, gradually replaced Glagolitic in that region. Glagolitic remained in use alongside Latin in the Kingdom of Croatia and alongside Cyrillic until the 14th century in the Second Bulgarian Empire and the Serbian Empire, and later mainly for cryptographic purposes.
-
Gundjala Gondi: 66 code points
66 characters are encoded in this script.
The Gunjala Gondi lipi or Gunjala Gondi script is a script used to write the Gondi language, a Dravidian language spoken by the Gond people of northern Telangana, eastern Maharashtra, southeastern Madhya Pradesh, and Chhattisgarh. Approximately a dozen manuscripts in the script were recovered from Gunjala, a Gond village in Adilabad district of Telangana, by a team of researchers from the University of Hyderabad, led by Professor Jayadheer Tirumala Rao. The script and preliminary font were unveiled in early 2014.
-
Gondi: 77 code points
77 characters are encoded in this script.
Gondi has typically been written in Devanagari script or Telugu script, but native scripts are in existence. A Gond by the name of Munshi Mangal Singh Masaram designed a Brahmi-based script in 1918, and in 2006, a native script that dates up to 1750 has been discovered by a group of researchers from the University of Hyderabad.
-
Gothic: 32 code points
32 characters are encoded in this script.
The Gothic alphabet is an alphabet used for writing the Gothic language. It was developed in the 4th century AD by Ulfilas, a Gothic preacher of Cappadocian Greek descent, for the purpose of translating the Bible.
-
Grantha: 116 code points
116 characters are encoded in this script.
The Grantha script is a classical South Indian Brahmic script, found particularly in Tamil Nadu and Kerala. Originating from the Pallava script, the Grantha script is related to Tamil and Vatteluttu scripts. The modern Malayalam script of Kerala is a direct descendant of the Grantha script. The Southeast Asian and Indonesian scripts such as Thai and Javanese respectively, as well as South Asian Tigalari and Sinhala scripts, are derived or closely related to Grantha through the early Pallava script. The Pallava script or Pallava Grantha emerged in the 4th century CE and was used until the 7th century CE, in India. This early Grantha script was used to write Sanskrit texts, inscriptions on copper plates and stones of Hindu temples and monasteries. It was also used for classical Manipravalam – a language that is a blend of Sanskrit and Tamil. From it evolved Middle Grantha by the 7th century, and Transitional Grantha by about the 8th century, which remained in use until about the 14th century. Modern Grantha has been in use since the 14th century and into the modern era, to write classical texts in Sanskrit and Dravidian languages. It is also used to chant hymns and in traditional Vedic schools.
-
Greek: 531 code points
531 characters are encoded in this script.
The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It is derived from the earlier Phoenician alphabet, and was the earliest known alphabetic script to have distinct letters for vowels as well as consonants. In Archaic and early Classical times, the Greek alphabet existed in many local variants, but, by the end of the 4th century BC, the Euclidean alphabet, with 24 letters, ordered from alpha to omega, had become standard and it is this version that is still used for Greek writing today.
-
Gujarati: 105 code points
105 characters are encoded in this script.
The Gujarati script is an abugida for the Gujarati language, Kutchi language, and various other languages. It is one of the official scripts of the Indian Republic. It is a variant of the Devanagari script differentiated by the loss of the characteristic horizontal line running above the letters and by a number of modifications to some characters.
-
Gurung Khema: 59 code points
59 characters are encoded in this script.
The Khema script, also known as Gurung Khema, Khema Phri, Khema Lipi, is used to write the Gurung language. The Language Commission of Nepal recognizes Khema as the official script of Gurung.
-
Gurmukhi: 94 code points
94 characters are encoded in this script.
Gurmukhī is an abugida developed from the Laṇḍā scripts, standardized and used by the second Sikh guru, Guru Angad (1504–1552). Commonly regarded as a Sikh script, Gurmukhi is used in Punjab, India as the official script of the Punjabi language.
-
Han Bopomofo: 0 code points
0 characters are encoded in this script.
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts. Each script is given both a four-letter code and a numeric code.
-
Hangul: 11775 code points
11775 characters are encoded in this script.
The Korean alphabet, known as Hangul or Hangeul in South Korea and Chosŏn'gŭl in North Korea, is the modern writing system for the Korean language. The letters for the five basic consonants reflect the shape of the speech organs used to pronounce them. They are systematically modified to indicate phonetic features. The vowel letters are systematically modified for related sounds, making Hangul a featural writing system. It has been described as a syllabic alphabet as it combines the features of alphabetic and syllabic writing systems.
-
Han: 99338 code points
99338 characters are encoded in this script.
Chinese characters are logographs used to write the Chinese languages and others from regions historically influenced by Chinese culture. Chinese characters have a documented history spanning over three millennia, representing one of the four independent inventions of writing accepted by scholars; of these, they comprise the only writing system continuously used since its invention. Over time, the function, style, and means of writing characters have evolved greatly. Unlike letters in alphabets that reflect the sounds of speech, Chinese characters generally represent morphemes, the units of meaning in a language. Writing a language's entire vocabulary requires thousands of different characters. Characters are created according to several different principles, where aspects of both shape and pronunciation may be used to indicate the character's meaning.
-
Hanunoo: 23 code points
23 characters are encoded in this script.
Hanunoo, also rendered Hanunó'o, is one of the scripts indigenous to the Philippines and is used by the Mangyan peoples of southern Mindoro to write the Hanunó'o language.
-
Han Simplified: 0 code points
0 characters are encoded in this script.
Simplified Chinese characters are one of two standardized character sets widely used to write the Chinese language, with the other being traditional characters. Their mass standardization during the 20th century was part of an initiative by the People's Republic of China (PRC) to promote literacy, and their use in ordinary circumstances on the mainland has been encouraged by the Chinese government since the 1950s. They are the official forms used in mainland China and Singapore, while traditional characters are officially used in Hong Kong, Macau, and Taiwan.
-
Han Traditional: 0 code points
0 characters are encoded in this script.
Traditional Chinese characters are a standard set of Chinese character forms used to write Chinese languages. In Taiwan, the set of traditional characters is regulated by the Ministry of Education and standardized in the Standard Form of National Characters. These forms were predominant in written Chinese until the middle of the 20th century, when various countries that use Chinese characters began standardizing simplified sets of characters, often with characters that existed before as well-known variants of the predominant forms.
-
Hatran: 26 code points
26 characters are encoded in this script.
Hatran Aramaic designates a Middle Aramaic dialect, that was used in the region of Hatra and Assur in northeastern parts of Mesopotamia, approximately from the 3rd century BC to the 3rd century CE. Its range extended from the Nineveh Plains in the centre, up to Tur Abdin in the north, Dura-Europos in the west and Tikrit in the south.
-
Hebrew: 136 code points
136 characters are encoded in this script.
The Hebrew alphabet, known variously by scholars as the Ktav Ashuri, Jewish script, square script and block script, is traditionally an abjad script used in the writing of the Hebrew language and other Jewish languages, most notably Yiddish, Ladino, Judeo-Arabic, and Judeo-Persian. In modern Hebrew, vowels are increasingly introduced. It is also used informally in Israel to write Levantine Arabic, especially among Druze. It is an offshoot of the Imperial Aramaic alphabet, which flourished during the Achaemenid Empire and which itself derives from the Phoenician alphabet.
-
Hiragana: 433 code points
433 characters are encoded in this script.
Hiragana is a Japanese syllabary, part of the Japanese writing system, along with katakana as well as kanji.
-
Anatolian Hieroglyphs: 583 code points
583 characters are encoded in this script.
Anatolian hieroglyphs are an indigenous logographic script native to central Anatolia, consisting of some 500 signs. They were once commonly known as Hittite hieroglyphs, but the language they encode proved to be Luwian, not Hittite, and the term Luwian hieroglyphs is used in English publications. They are typologically similar to Egyptian hieroglyphs, but do not derive graphically from that script, and they are not known to have played the sacred role of hieroglyphs in Egypt. There is no demonstrable connection to Hittite cuneiform.
-
Pahawh Hmong: 127 code points
127 characters are encoded in this script.
Pahawh Hmong is an indigenous semi-syllabic script, invented in 1959 by Shong Lue Yang, to write two Hmong languages, Hmong Daw (Hmoob Dawb White Miao) and Hmong Njua AKA Hmong Leng (Moob Leeg Green Miao).
-
Nyiakeng Puachue Hmong: 71 code points
71 characters are encoded in this script.
Nyiakeng Puachue Hmong is an alphabet script devised for White Hmong and Green Hmong in the 1980s by Reverend Chervang Kong for use within his United Christians Liberty Evangelical Church. The church, which moved around California, Minnesota, Wisconsin, North Carolina, Colorado, and many other states, has used the script in printed material and videos. It is reported to have some use in Laos, Thailand, Vietnam, France, and Australia.
-
Katakana Or Hiragana: 0 code points
0 characters are encoded in this script.
Kana are syllabaries used to write Japanese phonological units, morae. In current usage, kana most commonly refers to hiragana and katakana. It can also refer to their ancestor magana , which were Chinese characters used phonetically to transcribe Japanese ; and hentaigana, which are historical variants of the now-standard hiragana.
-
Old Hungarian: 112 code points
112 characters are encoded in this script.
The Old Hungarian script or Hungarian runes is an alphabetic writing system used for writing the Hungarian language. Modern Hungarian is written using the Latin-based Hungarian alphabet. The term "old" refers to the historical priority of the script compared with the Latin-based one. The Old Hungarian script is a child system of the Old Turkic alphabet.
-
Indus: 0 code points
0 characters are encoded in this script.
The Indus script, also known as the Harappan script and the Indus Valley Script, is a corpus of symbols produced by the Indus Valley Civilisation. Most inscriptions containing these symbols are extremely short, making it difficult to judge whether or not they constituted a writing system used to record a Harappan language, any of which are yet to be identified. Despite many attempts, the "script" has not yet been deciphered. There is no known bilingual inscription to help decipher the script, which shows no significant changes over time. However, some of the syntax varies depending upon location.
-
Old Italic: 39 code points
39 characters are encoded in this script.
The Old Italic scripts are a family of ancient writing systems used in the Italian Peninsula between about 700 and 100 BC, for various languages spoken in that time and place. The most notable member is the Etruscan alphabet, which was the immediate ancestor of the Latin alphabet used by more than 100 languages today, including English. The runic alphabets used in Northern Europe are believed to have been separately derived from one of these alphabets by the 2nd century AD.
-
Javanese: 91 code points
91 characters are encoded in this script.
Javanese script is one of Indonesia's traditional scripts developed on the island of Java. The script is primarily used to write the Javanese language, but in the course of its development has also been used to write several other regional languages such as Sundanese and Madurese, the regional lingua franca Malay, as well as the historical languages Kawi and Sanskrit. It heavily influenced the Balinese script from which the system for Sasak developed. Javanese script was actively used by the Javanese people for writing day-to-day and literary texts from at least the mid-16th century CE until the mid-20th century CE, before it was gradually supplanted by the Latin alphabet. Today, the script is taught in the Yogyakarta Special Region as well as the provinces of Central Java and East Java as part of the local curriculum, but with very limited function in everyday use.
-
Japanese: 0 code points
0 characters are encoded in this script.
The modern Japanese writing system uses a combination of logographic kanji, which are adopted Chinese characters, and syllabic kana. Kana itself consists of a pair of syllabaries: hiragana, used primarily for native or naturalized Japanese words and grammatical elements; and katakana, used primarily for foreign words and names, loanwords, onomatopoeia, scientific names, and sometimes for emphasis. Almost all written Japanese sentences contain a mixture of kanji and kana. Because of this mixture of scripts, in addition to a large inventory of kanji characters, the Japanese writing system is considered to be one of the most complicated currently in use.
-
Jurchen: 0 code points
0 characters are encoded in this script.
The Jurchen script was the writing system used to write the Jurchen language, the language of the Jurchen people who created the Jin Empire in northeastern China in the 12th–13th centuries. It was derived from the Khitan script, which in turn was derived from Chinese. The script has only been decoded to a small extent.
-
Kayah Li: 48 code points
48 characters are encoded in this script.
The Kayah Li alphabet is used to write the Kayah languages Eastern Kayah Li and Western Kayah Li, which are members of Karenic branch of the Sino-Tibetan language family. They are also known as Red Karen and Karenni. Eastern Kayah Li is spoken by about 26,000 people, and Western Kayah Li by about 100,000 people, mostly in the Kayah and Karen states of Myanmar, but also by people living in Thailand.
-
Katakana: 375 code points
375 characters are encoded in this script.
Katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji and in some cases the Latin script.
-
Kawi: 87 code points
87 characters are encoded in this script.
The Kawi script or the Old Javanese script is a Brahmic script found primarily in Java and used across much of Maritime Southeast Asia between the 8th century and the 16th century. The script is an abugida, meaning that characters are read with an inherent vowel. Diacritics are used, either to suppress the vowel and represent a pure consonant, or to represent other vowels.
-
Kharoshthi: 68 code points
68 characters are encoded in this script.
Kharosthi script, also known as the Gandhari script, was an ancient Indian script used by various peoples from the north-western outskirts of the Indian subcontinent to Central Asia via Afghanistan. An abugida, it was introduced by the middle of the 3rd century BCE, possibly during the 4th century BCE, and remained in use until it died out in its homeland around the 3rd century CE.
-
Khmer: 146 code points
146 characters are encoded in this script.
Khmer script is an abugida (alphasyllabary) script used to write the Khmer language, the official language of Cambodia. It is also used to write Pali in the Buddhist liturgy of Cambodia and Thailand.
-
Khojki: 85 code points
85 characters are encoded in this script.
Khojkī, Khojakī, or Khwājā Sindhī, is a script used formerly and almost exclusively by the Khoja community of parts of the Indian subcontinent, including Sindh, Gujarat, and Punjab. However, this script also had a further reach and was used by members of Ismaili communities from Burma to East and South Africa. The Khojki script is one of the earliest forms of written Sindhi. The name "Khojki" is likely derived from the Persian word khoja, which means "master", or "lord".
-
Khitan Large: 0 code points
0 characters are encoded in this script.
The Khitan large script was one of two writing systems used for the now-extinct Khitan language. It was used during the 10th–12th centuries by the Khitan people, who had created the Liao Empire in north-eastern China. In addition to the large script, the Khitans simultaneously also used a functionally independent writing system known as the Khitan small script. Both Khitan scripts continued to be in use to some extent by the Jurchens for several decades after the fall of the Liao dynasty, until the Jurchens fully switched to a script of their own. Examples of the scripts appeared most often on epitaphs and monuments, although other fragments sometimes surface.
-
Khitan Small: 472 code points
472 characters are encoded in this script.
The Khitan small script was one of two writing systems used for the now-extinct Khitan language. It was used during the 10th–12th century by the Khitan people, who had created the Liao Empire in present-day northeastern China. In addition to the small script, the Khitans simultaneously also used a functionally independent writing system known as the Khitan large script. Both Khitan scripts continued to be in use to some extent by the Jurchens for several decades after the fall of the Liao dynasty, until the Jurchens fully switched to a script of their own. Examples of the scripts appeared most often on epitaphs and monuments, although other fragments sometimes surface.
-
Kannada: 107 code points
107 characters are encoded in this script.
The Kannada script is an abugida of the Brahmic family, used to write Kannada, one of the Dravidian languages of South India especially in the state of Karnataka. It is one of the official scripts of the Indian Republic. Kannada script is also widely used for writing Sanskrit texts in Karnataka. Several minor languages, such as Tulu, Konkani, Kodava, Beary and Sanketi also use alphabets based on the Kannada script. The Kannada and Telugu scripts share very high mutual intellegibility with each other, and are often considered to be regional variants of single script. Other scripts similar to Kannada script are Sinhala script, and Old Peguan script (used in Burma).
-
Korean Mixed: 0 code points
0 characters are encoded in this script.
Korean mixed script is a form of writing the Korean language that uses a mixture of the Korean alphabet or hangul and hanja, the Korean name for Chinese characters. The distribution on how to write words usually follows that all native Korean words, including suffixes, particles, and honorific markers are generally written in hangul and never in hanja. Sino-Korean vocabulary or hanja-eo, either words borrowed from Chinese or created from Sino-Korean roots, were generally always written in hanja, although very rare or complex characters were often substituted with hangul. Although the Korean alphabet was introduced and taught to people beginning in 1446, most literature until the early twentieth century was written in literary Chinese known as hanmun.
-
Kpelle: 0 code points
0 characters are encoded in this script.
The Kpelle syllabary was invented c. 1935 by Chief Gbili of Sanoyie, Liberia. It was intended for writing the Kpelle language, a member of the Mande group of Niger-Congo languages spoken by about 490,000 people in Liberia and around 300,000 people in Guinea at that time.
-
Kirat Rai: 58 code points
58 characters are encoded in this script.
Kirat Rai is a left-to-right abugida, based on the Sumhung Lipi of 1920s, used to write the Bantawa language in the Indian state of Sikkim. Kirat Rai is composed of 31 primary characters, including seven vowels, one of which (/a/) is inherent in all consonants, 31 consonants, a virama to cancel the inherent vowel, and a vowel carrier to be used in combination with the vowel diacritics for writing word-initial vowels.
-
Kaithi: 89 code points
89 characters are encoded in this script.
Kaithi, also called Kayathi or Kayasthi, is a historical Brahmic script that was used widely in parts of Northern and Eastern India, primarily in the present-day states of Uttar Pradesh, Jharkhand and Bihar. In particular, it was used for writing legal, administrative and private records. It was used for a variety of Indo-Aryan languages, including Angika, Awadhi, Bhojpuri, Hindustani, Maithili, Magahi, and Nagpuri.
-
Tai Tham: 127 code points
127 characters are encoded in this script.
Tai Tham script is an abugida writing system used mainly for a group of Southwestern Tai languages i.e., Northern Thai, Tai Lü, Khün and Lao; as well as the liturgical languages of Buddhism i.e., Pali and Sanskrit. It is historically known as Tua Tham. In Thailand and Myanmar, the script is often referred to as Lanna script in relation to the historical kingdom of Lan Na situating in the Northern region of modern day Thailand and a part of Shan state in Myanmar. Local people in Northern Thailand also call the script as Tua Mueang in parallel to Kam Mueang, a local name for Northern Thai language. In Laos and Isan region of Thailand, a variation of Tai Tham script, often dubbed Lao Tham, is also known by the locals as To Tham Lao or Yuan script. Tai Tham script is traditionally written on a dried palm leaf as a palm-leaf manuscript.
-
Lao: 83 code points
83 characters are encoded in this script.
Lao script or Akson Lao is the primary script used to write the Lao language and other minority languages in Laos. Its earlier form, the Tai Noi script, was also used to write the Isan language, but was replaced by the Thai script. It has 27 consonants, 7 consonantal ligatures, 33 vowels, and 4 tone marks.
-
Latin Fraktur: 0 code points
0 characters are encoded in this script.
Fraktur is a calligraphic hand of the Latin alphabet and any of several blackletter typefaces derived from this hand. It is designed such that the beginnings and ends of the individual strokes that make up each letter will be clearly visible, and often emphasized; in this way it is often contrasted with the curves of the Antiqua (common) typefaces where the letters are designed to flow and strokes connect together in a continuous fashion. The word "Fraktur" derives from Latin frāctūra, built from frāctus, passive participle of frangere, which is also the root for the English word "fracture". In non-professional contexts, the term "Fraktur" is sometimes misused to refer to all blackletter typefaces – while Fraktur typefaces do fall under that category, not all blackletter typefaces exhibit the Fraktur characteristics described above.
-
Latin Gaelic: 0 code points
0 characters are encoded in this script.
Gaelic type is a family of Insular script typefaces devised for printing Early Modern Irish. It was widely used from the 16th century until the mid-18th century in Scotland and the mid-20th century in Ireland, but is now rarely used. Sometimes, all Gaelic typefaces are called Celtic or uncial although most Gaelic types are not uncials. The "Anglo-Saxon" types of the 17th century are included in this category because both the Anglo-Saxon types and the Gaelic/Irish types derive from the insular manuscript hand.
-
Latin: 1555 code points
1555 characters are encoded in this script.
The Latin script, also known as the Roman script, is a writing system based on the letters of the classical Latin alphabet, derived from a form of the Greek alphabet which was in use in the ancient Greek city of Cumae in Magna Graecia. The Greek alphabet was altered by the Etruscans, and subsequently their alphabet was altered by the Ancient Romans. Several Latin-script alphabets exist, which differ in graphemes, collation and phonetic values from the classical Latin alphabet.
-
Leke: 0 code points
0 characters are encoded in this script.
The Leke script, previously known as Karen Chicken Scratch script, is an abugida used to write the Pwo Karen language and Sgaw language in Burma (Myanmar) and Thailand. It has 25 consonants, 17 vowels and 3 tones. The script also has a unique set of numerals and punctuation, such as a full stop (period) and a comma.
-
Lepcha: 74 code points
74 characters are encoded in this script.
The Lepcha script, or Róng script, is an abugida used by the Lepcha people to write the Lepcha language. Unusually for an abugida, syllable-final consonants are written as diacritics.
-
Limbu: 69 code points
69 characters are encoded in this script.
The Limbu script is used to write the Limbu language. It is a Brahmic type abugida.
-
Linear A: 386 code points
386 characters are encoded in this script.
Linear A is a writing system that was used by the Minoans of Crete from 1800 BC to 1450 BC. Linear A was the primary script used in palace and religious writings of the Minoan civilization. It evolved into Linear B, which was used by the Mycenaeans to write an early form of Greek. It was discovered by the archaeologist Sir Arthur Evans in 1900. No texts in Linear A have yet been deciphered. Evans named the script "Linear" because its characters consisted simply of lines inscribed in clay, in contrast to the more pictographic characters in Cretan hieroglyphs that were used during the same period.
-
Linear B: 268 code points
268 characters are encoded in this script.
Linear B is a syllabic script that was used for writing in Mycenaean Greek, the earliest attested form of the Greek language. The script predates the Greek alphabet by several centuries, the earliest known examples dating to around 1400 BC. It is adapted from the earlier Linear A, an undeciphered script perhaps used for writing the Minoan language, as is the later Cypriot syllabary, which also recorded Greek. Linear B, found mainly in the palace archives at Knossos, Kydonia, Pylos, Thebes and Mycenae, disappeared with the fall of Mycenaean civilization during the Late Bronze Age collapse. The succeeding period, known as the Greek Dark Ages, provides no evidence of the use of writing.
-
Lisu: 53 code points
53 characters are encoded in this script.
The Fraser or Old Lisu script is an artificial abugida for the Lisu language invented around 1915 by Sara Ba Thaw, a Karen preacher from Myanmar, and improved by the missionary James O. Fraser. It is a single-case (unicameral) alphabet. It was also used for the Naxi language, e.g. in the 1932 Naxi Gospel of Mark, and used in the Zaiwa or Atsi language, e.g. in the 1938 Atsi Gospel of Mark.
-
Lycian: 30 code points
30 characters are encoded in this script.
The Lycian alphabet was used to write the Lycian language of the Asia Minor region of Lycia. It was an extension of the Greek alphabet, with half a dozen additional letters for sounds not found in Greek. It was largely similar to the Lydian and the Phrygian alphabets.
-
Lydian: 29 code points
29 characters are encoded in this script.
Lydian script was used to write the Lydian language. Like other scripts of Anatolia in the Iron Age, the Lydian alphabet is based on the Phoenician alphabet. It is related to the East Greek alphabet, but it has unique features.
-
Mahajani: 62 code points
62 characters are encoded in this script.
Mahajani is a Laṇḍā mercantile script that was historically used in northern India for writing accounts and financial records in Marwari, Hindi and Punjabi. It is a Brahmic script and is written left-to-right. Mahajani refers to the Hindi word for 'bankers', also known as 'sarrafi' or 'kothival' (merchant).
-
Makasar: 25 code points
25 characters are encoded in this script.
The Makasar script, also known as Ukiri' Jangang-jangang or Old Makasar script, is a historical Indonesian writing system that was used in South Sulawesi to write the Makassarese language between the 17th and 19th centuries until it was supplanted by the Lontara Bugis script.
-
Mandaic: 30 code points
30 characters are encoded in this script.
The Mandaic alphabet is a writing system primarily used to write the Mandaic language. It is thought to have evolved between the second and seventh century CE from either a cursive form of Aramaic or from Inscriptional Parthian. The exact roots of the script are difficult to determine. It was developed by members of the Mandaean faith of Lower Mesopotamia to write the Mandaic language for liturgical purposes. Classical Mandaic and its descendant Neo-Mandaic are still in limited use. The script has changed very little over centuries of use.
-
Manichaean: 52 code points
52 characters are encoded in this script.
The Manichaean script is an abjad-based writing system rooted in the Semitic family of alphabets and associated with the spread of Manichaeism from southwest to central Asia and beyond, beginning in the third century CE. It bears a sibling relationship to early forms of the Pahlavi scripts, both systems having developed from the Imperial Aramaic alphabet, in which the Achaemenid court rendered its particular, official dialect of Aramaic. Unlike Pahlavi, the Manichaean script reveals influences from the Sogdian alphabet, which in turn descends from the Syriac branch of Aramaic. The Manichaean script is so named because Manichaean texts attribute its design to Mani himself. Middle Persian is written with this alphabet.
-
Marchen: 68 code points
68 characters are encoded in this script.
The Marchen script was a Brahmic abugida which was used for writing the extinct Zhangzhung language. It was derived from the Tibetan script.
-
Maya: 0 code points
0 characters are encoded in this script.
Maya script, also known as Maya glyphs, is historically the native writing system of the Maya civilization of Mesoamerica and is the only Mesoamerican writing system that has been substantially deciphered. The earliest inscriptions found which are identifiably Maya date to the 3rd century BCE in San Bartolo, Guatemala. Maya writing was in continuous use throughout Mesoamerica until the Spanish conquest of the Maya in the 16th and 17th centuries. Though modern Mayan languages are almost entirely written using the Latin alphabet rather than Maya script, there have been recent developments encouraging a revival of the Maya glyph system.
-
Medefaidrin: 91 code points
91 characters are encoded in this script.
-
Mende Kikakui: 213 code points
213 characters are encoded in this script.
The Mende Kikakui script is a syllabary used for writing the Mende language of Sierra Leone.
-
Meroitic Cursive: 90 code points
90 characters are encoded in this script.
-
Meroitic Hieroglyphs: 33 code points
33 characters are encoded in this script.
-
Malayalam: 127 code points
127 characters are encoded in this script.
Malayalam script is a Brahmic script used commonly to write Malayalam, which is the principal language of Kerala, India, spoken by 45 million people in the world. It is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry by the Malayali people. It is one of the official scripts of the Indian Republic. Malayalam script is also widely used for writing Sanskrit texts in Kerala.
-
Modi: 89 code points
89 characters are encoded in this script.
Modi is a script used to write the Marathi language, which is the primary language spoken in the state of Maharashtra, India. There are multiple theories concerning its origin. The Modi script was used alongside the Devanagari script to write Marathi until the 20th century when the Balbodh style of the Devanagari script was promoted as the standard writing system for Marathi.
-
Mongolian: 178 code points
178 characters are encoded in this script.
The traditional Mongolian script, also known as the Khudam Mongol bichig, was the first writing system created specifically for the Mongolian language, and was the most widespread until the introduction of Cyrillic in 1946. It is traditionally written in vertical lines Top-Down, right across the page. Derived from the Old Uyghur alphabet, it is a true alphabet, with separate letters for consonants and vowels. It has been adapted for such languages as Oirat and Manchu. Alphabets based on this classical vertical script continue to be used in Mongolia and Inner Mongolia to write Mongolian, Xibe and, experimentally, Evenki.
-
Moon System: 0 code points
0 characters are encoded in this script.
The Moon System of Embossed Reading is a writing system for the blind, using embossed symbols mostly derived from the Latin script. It is claimed by its supporters to be easier to understand than braille, though it is mainly used by people who have lost their sight as adults, and thus already have knowledge of the shapes of letters.
-
Mro: 43 code points
43 characters are encoded in this script.
The Mru script is an indigenous, messianic script for the Mru language. In the 1980s Menlay Murang created the religion of Khrama and with it a new script.
-
Meetei Mayek: 79 code points
79 characters are encoded in this script.
The Meitei script, also known as the Kanglei script or the Kok Sam Lai script, after its first three letters is an abugida in the Brahmic scripts family used to write the Meitei language, the official language of Manipur, Assam and one of the 22 official languages of India. It is first known from engravings on 6th century CE coins and copper plate inscriptions. as verified by the various publications of the National Sahitya Akademi. It was used until the 18th century, when it was replaced by the Bengali alphabet. A few manuscripts survive. In the 20th century, the script was revived and is again being used. Beginning in 2021, the Government of Manipur began to use the Meitei alongside the Bengali-Assamese script, per the Manipur Official Language (Amendment) Act, 2021.
-
Multani: 48 code points
48 characters are encoded in this script.
Multani is a Brahmic script originating in the Multan region of Punjab and in northern Sindh, Pakistan. It was used to write Saraiki, often considered a dialect of Lahnda group of languages. The script was used for routine writing and commercial activities. Multani is one of four Landa scripts whose usage was extended beyond the mercantile domain and formalized for literary activity and printing; the others being Gurmukhi, Khojki, and Khudabadi. Although Multani is now obsolete, it is a historical script in which written and printed records exist. It was also known as Karikki and as Sarai.
-
Myanmar: 244 code points
244 characters are encoded in this script.
The Mon–Burmese script is an abugida that derives from the Pallava Grantha script of southern India and later of Southeast Asia. It is the basis of the alphabets used for modern Burmese, Mon, Shan, Rakhine, Jingpho and Karen.
-
Nag Mundari: 42 code points
42 characters are encoded in this script.
Mundari Bani is the writing system created for the Mundari language, spoken in eastern India. Mundari is an Austroasiatic language. Mundari Bani has 27 letters and five diacritics, the forms of which are intended to evoke natural shapes. The script is written from left to right.
-
Nandinagari: 86 code points
86 characters are encoded in this script.
Nandināgarī is a Brahmic script derived from the Nāgarī script which appeared in the 7th century AD. This script and its variants were used in the central Deccan region and south India, and an abundance of Sanskrit manuscripts in Nandināgarī have been discovered but remain untransliterated. Some of the discovered manuscripts of Madhvacharya of the Dvaita Vedanta school of Hinduism are in Nandināgarī script.
-
Ancient North Arabian: 32 code points
32 characters are encoded in this script.
Ancient North Arabian (ANA) is a collection of scripts and a language or family of languages under the North Arabian languages branch along with Old Arabic that were used in north and central Arabia and south Syria from the 8th century BCE to the 4th century CE. The term "Ancient North Arabian" is defined negatively. It refers to all of the South Semitic scripts except Ancient South Arabian (ASA) regardless of their genetic relationships.
-
Nabataean: 40 code points
40 characters are encoded in this script.
The Nabataean script is an abjad that was used to write Nabataean Aramaic and Nabataean Arabic from the second century BC onwards. Important inscriptions are found in Petra, the Sinai Peninsula, and other archaeological sites including Abdah and Mada'in Saleh in Saudi Arabia.
-
Prachalit: 97 code points
97 characters are encoded in this script.
Prachalit, also known as Newa, Newar, Newari, or Nepāla lipi is a type of abugida script developed from the Nepalese scripts, which are a part of the family of Brahmic scripts descended from Brahmi script. It is used to write Nepal Bhasa, Sanskrit and Pali. Various publications are still published in this script including the Sikkim Herald the bulletin of the Sikkim government.
-
Dongba: 0 code points
0 characters are encoded in this script.
The Dongba, Tomba or Tompa or Mo-so symbols are a system of pictographic glyphs used by the ²dto¹mba of the Naxi people in southern China. In the Naxi language it is called ²ss ³dgyu 'wood records' or ²lv ³dgyu 'stone records'. The first artifacts with this script on them originate from approximately 30 AD.
-
Geba: 0 code points
0 characters are encoded in this script.
Geba is a syllabic script for the Naxi language. It is called ¹Ggo¹baw in Naxi, adapted as Geba, 哥巴, in Chinese. Some glyphs resemble the Yi script, and some appear to be adaptations of Chinese characters. Geba is used only to transcribe mantras, and there are few texts, though it is sometimes used to annotate dongba pictographs. Geba's phonetics can vary depending on who is using it. Symbols do not have fixed phonetic values, and they may have the same phonetic values as well.
-
Nko: 67 code points
67 characters are encoded in this script.
NKo (ߒߞߏ), also spelled N'Ko, is an alphabetic script devised by Solomana Kanté in 1949, as a modern writing system for the Manding languages of West Africa. The term NKo, which means I say in all Manding languages, is also used for the Manding literary standard written in the NKo script.
-
Nushu: 397 code points
397 characters are encoded in this script.
Nüshu is a syllabic script derived from Chinese characters that was used exclusively among ethnic Yao women in Jiangyong County in Hunan province of southern China for several centuries before almost going extinct. From the early 21st century there have been official efforts to revitalize the script, as well as indications of renewed interest among the wider public.
-
Ogham: 29 code points
29 characters are encoded in this script.
Ogham is an Early Medieval alphabet used primarily to write the early Irish language, and later the Old Irish language. There are roughly 400 surviving orthodox inscriptions on stone monuments throughout Ireland and western Britain, the bulk of which are in southern Munster. The largest number outside Ireland are in Pembrokeshire, Wales.
-
Ol Chiki: 48 code points
48 characters are encoded in this script.
The Ol Chiki script, also known as Ol Chemetʼ, Ol Ciki, Ol, and sometimes as the Santali alphabet is the official writing system for Santali, an Austroasiatic language recognized as an official regional language in India. It was invented by Pandit Raghunath Murmu in 1925, and is one of the official scripts of the Indian Republic. It has 30 letters, the design of which is intended to evoke natural shapes. The script is written from left to right, and has two styles. Unicode does not maintain a distinction between these two, as is typical for print and cursive variants of a script. In both styles, the script is unicameral.
The shapes of the letters are not arbitrary, but reflect the names for the letters, which are words, usually the names of objects or actions representing conventionalized form in the pictorial shape of the characters.
-
Ol Onal: 46 code points
46 characters are encoded in this script.
The Ol Onal, also known as also known as Bhumij Lipi or Bhumij Onal, is an alphabetic writing system for the Bhumij language. Ol Onal script was created between 1981 and 1992 by Ol Guru Mahendra Nath Sardar. Ol Onal script is used to write the Bhumij language in some parts of West Bengal, Jharkhand, Orissa, and Assam.
-
Old Turkic: 75 code points
75 characters are encoded in this script.
The Old Turkic script was the alphabet used by the Göktürks and other early Turkic khanates from the 8th to 10th centuries to record the Old Turkic language.
-
Oriya: 97 code points
97 characters are encoded in this script.
The Odia script is a Brahmic script used to write primarily Odia language and others including Sanskrit and other regional languages. It is one of the official scripts of the Indian Republic. The script has developed over more than 1000 years from a variant of Siddhaṃ script which was used in Eastern India, where the characteristic top line transformed into a distinct round umbrella shape due to the influence of palm leaf manuscripts and also being influenced by the neighbouring scripts from the Western and Southern regions.
-
Osage: 76 code points
76 characters are encoded in this script.
The Osage script is a new script promulgated in 2006 and revised 2012–2014 for the Osage language. Because Latin orthographies were subject to interference from English conventions among Osage students who were more familiar with English than with Osage, in 2006 the director of the Osage Language Program, Herman Mongrain Lookout, decided to create a distinct script by modifying or fusing Latin letters. This Osage script has been in regular use on the Osage Nation ever since.
-
Osmanya: 40 code points
40 characters are encoded in this script.
The Osmanya alphabet, also known as Far Soomaali and, in Arabic, as al-kitābah al-ʿuthmānīyah, is an alphabetic script created to transcribe the Somali language. It was invented by Osman Yusuf Kenadid, the son of Sultan Yusuf Ali Kenadid and brother of Sultan Ali Yusuf Kenadid of the Sultanate of Hobyo.
-
Old Uyghur: 28 code points
28 characters are encoded in this script.
The Old Uyghur alphabet was a Turkic script used for writing Old Uyghur, a variety of Old Turkic spoken in Turpan and Gansu that is the ancestor of the modern Western Yugur language. The term "Old Uyghur" used for this alphabet is misleading because Qocho, the Uyghur (Yugur) kingdom created in 843, originally used the Old Turkic alphabet. The Uyghur adopted this "Old Uyghur" script from local inhabitants when they migrated into Turfan after 840. It was an adaptation of the Aramaic alphabet used for texts with Buddhist, Manichaean and Christian content for 700–800 years in Turpan. The last known manuscripts are dated to the 18th century. This was the prototype for the Mongolian and Manchu alphabets. The Old Uyghur alphabet was brought to Mongolia by Tata-tonga.
-
Palmyrene: 32 code points
32 characters are encoded in this script.
The Palmyrene alphabet was a historical Semitic alphabet used to write Palmyrene Aramaic. It was used between 100 BCE and 300 CE in Palmyra in the Syrian desert. The oldest surviving Palmyrene inscription dates to 44 BCE. The last surviving inscription dates to 274 CE, two years after Palmyra was sacked by Roman Emperor Aurelian, ending the Palmyrene Empire. Use of the Palmyrene language and script declined, being replaced with Greek and Latin.
-
Pau Cin Hau: 57 code points
57 characters are encoded in this script.
The Pau Cin Hau scripts, known as Pau Cin Hau lai, or Zo tual lai in Zomi, are two scripts, a logographic script and an alphabetic script created by Pau Cin Hau, a Zomi religious leader from Chin State, Burma. The logographic script consists of 1,050 characters, which is a traditionally significant number based on the number of characters appearing in a religious text. The alphabetic script is a simplified script of 57 characters, which is divided into 21 consonants, 7 vowels, 9 final consonants, and 20 tone, length, and glottal marks. The original script was produced in 1902, but it is thought to have undergone at least two revisions, of which the first revision produced the logographic script.
-
Proto-Cuneiform: 0 code points
0 characters are encoded in this script.
The proto-cuneiform script was a system of proto-writing that emerged in Mesopotamia, eventually developing into the early cuneiform script used in the region's Early Dynastic I period. It arose from the token-based system that had already been in use across the region in preceding millennia. While it is known definitively that later cuneiform was used to write the Sumerian language, it is still uncertain what the underlying language of proto-cuneiform texts were.
-
Protoelamite: 0 code points
0 characters are encoded in this script.
The Proto-Elamite script is an early Bronze Age writing system briefly in use before the introduction of Elamite cuneiform.
-
Old Permic: 50 code points
50 characters are encoded in this script.
The Old Permic script, sometimes known by its initial two characters as Abur or Anbur, is a "highly idiosyncratic adaptation" of the Cyrillic script once used to write medieval Komi.
-
Phags Pa: 61 code points
61 characters are encoded in this script.
The Phagspa, ʼPhags-pa or ḥPʻags-pa script is an alphabet designed by the Tibetan monk and State Preceptor Drogön Chögyal Phagpa (1235–1280) for Kublai Khan, the founder of the Yuan dynasty (1271–1368) in China, as a unified script for the written languages within the Yuan. The actual use of this script was limited to about a hundred years during the Mongol-led Yuan dynasty, and it fell out of use with the advent of the Ming dynasty.
-
Inscriptional Pahlavi: 27 code points
27 characters are encoded in this script.
Inscriptional Pahlavi is the earliest attested form of Pahlavi scripts, and is evident in clay fragments that have been dated to the reign of Mithridates I. Other early evidence includes the Pahlavi inscriptions of Parthian coins and rock inscriptions of Sasanian emperors and other notables, such as Kartir the High Priest.
-
Psalter Pahlavi: 30 code points
30 characters are encoded in this script.
Psalter Pahlavi is a cursive abjad that was used for writing Middle Persian on paper; it is thus described as one of the Pahlavi scripts. It was written right to left, usually with spaces between words.
-
Book Pahlavi: 0 code points
0 characters are encoded in this script.
-
Phoenician: 29 code points
29 characters are encoded in this script.
The Phoenician alphabet is an abjad used across the Mediterranean civilization of Phoenicia for most of the 1st millennium BC. It was one of the first alphabets, and attested in Canaanite and Aramaic inscriptions found across the Mediterranean region. In the history of writing systems, the Phoenician script also marked the first to have a fixed writing direction—while previous systems were multi-directional, Phoenician was written horizontally, from right to left. It developed directly from the Proto-Sinaitic script used during the Late Bronze Age, which was derived in turn from Egyptian hieroglyphs.
-
Miao: 149 code points
149 characters are encoded in this script.
The Pollard script, also known as Pollard Miao or Miao, is an abugida loosely based on the Latin alphabet and invented by Methodist missionary Sam Pollard. Pollard invented the script for use with A-Hmao, one of several Miao languages spoken in southeast Asia. The script underwent a series of revisions until 1936, when a translation of the New Testament was published using it.
-
Inscriptional Parthian: 30 code points
30 characters are encoded in this script.
Inscriptional Parthian was a script used to write the Parthian language, the majority of the text found were from clay fragments. This script was used from the 2nd century CE to the 5th century CE or in the Parthian Empire to the early Sasanian Empire. During the Sasanian Empire it was mostly used for official texts.
-
Protosinaitic: 0 code points
0 characters are encoded in this script.
The Proto-Sinaitic script is a Middle Bronze Age writing system known from a small corpus of about 30-40 inscriptions and fragments from Serabit el-Khadim in the Sinai Peninsula, as well as two inscriptions from Wadi el-Hol in Middle Egypt. Together with about 20 known Proto-Canaanite inscriptions, it is also known as Early Alphabetic, i.e. the earliest trace of alphabetic writing and the common ancestor of both the Ancient South Arabian script and the Phoenician alphabet, which led to many modern alphabets including the Greek alphabet. According to common theory, Canaanites or Hyksos who spoke a Canaanite language repurposed Egyptian hieroglyphs to construct a different script.
-
Ranjana: 0 code points
0 characters are encoded in this script.
The Rañjanā script (Lantsa) is an abugida writing system which developed in the 11th century and until the mid-20th century was used in an area from Nepal to Tibet by the Newar people, the historic inhabitants of the Kathmandu Valley, to write Sanskrit and Newar. Nowadays it is also used in Buddhist monasteries in China, especially in the Tibetan Buddhist areas within the Tibet Autonomous Region, Sichuan, Yunnan, Qinghai and Gansu; Mongolia, and Japan. It is normally written from left to right but the Kutakshar form is written from top to bottom. It is also considered to be the standard Nepali calligraphic script.
-
Rejang: 37 code points
37 characters are encoded in this script.
The Rejang script is an abugida of the Brahmic family that is related to other scripts of the region, such as the Batak and Lontara scripts. Rejang is also a member of the closely related group of Ulu scripts that include the script variants of South Sumatra, Bengkulu, Lembak, Lintang, Lebong, and Serawai. Other closely related scripts that are sometimes included in the Surat Ulu group include the Ogan, Kerinci, and Lampung scripts. The script was in use prior to the introduction of Islam to the Rejang area; the earliest attested document appears to date from the mid-18th century CE. The Rejang script is sometimes also known as the KaGaNga script following the first three letters of the alphabet. The term KaGaNga was never used by the users of the script community, but it was coined by the British anthropologist Mervyn A. Jaspan (1926–1975) in his book Folk literature of South Sumatra. Redjang Ka-Ga-Nga texts. Canberra, The Australian National University 1964.
-
Hanifi Rohingya: 55 code points
55 characters are encoded in this script.
The Hanifi Rohingya script is a unified script for the Rohingya language. Rohingya today is written in three scripts, Hanifi, Arabic, and Latin (Rohingyalish). The Rohingya language was first written in the 19th century with a version of the Perso-Arabic script. In 1975, an orthographic Arabic script was developed and approved by the community leaders, based on the Urdu alphabet but with unique innovations to make the script suitable to Rohingya.
-
Rongorongo: 0 code points
0 characters are encoded in this script.
Rongorongo is a system of glyphs discovered in the 19th century on Easter Island that has the appearance of writing or proto-writing. Numerous attempts at decipherment have been made, but none have been successful. Although some calendrical and what might prove to be genealogical information has been identified, none of the glyphs can actually be read. If rongorongo does prove to be writing and to be an independent invention, it would be one of very few inventions of writing in human history.
-
Runic: 89 code points
89 characters are encoded in this script.
A rune is a letter in a set of related alphabets known as runic alphabets native to the Germanic peoples. Runes were used to write Germanic languages before they adopted the Latin alphabet, and for specialised purposes thereafter. In addition to representing a sound value, runes can be used to represent the concepts after which they are named (ideographs). Scholars refer to instances of the latter as Begriffsrunen. The Scandinavian variants are also known as fuþark, or futhark; this name is derived from the first six letters of the script, ⟨ᚠ⟩, ⟨ᚢ⟩, ⟨ᚦ⟩, ⟨ᚨ⟩/⟨ᚬ⟩, ⟨ᚱ⟩, and ⟨ᚲ⟩/⟨ᚴ⟩, corresponding to the Latin letters ⟨f⟩, ⟨u⟩, ⟨þ⟩/⟨th⟩, ⟨a⟩, ⟨r⟩, and ⟨k⟩. The Anglo-Saxon variant is known as futhorc, or fuþorc, due to changes in Old English of the sounds represented by the fourth letter, ⟨ᚨ⟩/⟨ᚩ⟩.
-
Samaritan: 62 code points
62 characters are encoded in this script.
The Samaritan Hebrew script, or simply Samaritan script is used by the Samaritans for religious writings, including the Samaritan Pentateuch, writings in Samaritan Hebrew, and for commentaries and translations in Samaritan Aramaic and occasionally Arabic.
-
Sarati: 0 code points
0 characters are encoded in this script.
Sarati is an artificial script, one of several scripts created by J. R. R. Tolkien. According to Tolkien's mythology, the Sarati alphabet was invented by the Elf Rúmil of Tirion.
-
Old South Arabian: 32 code points
32 characters are encoded in this script.
The Ancient South Arabian script branched from the Proto-Sinaitic script in about the late 2nd millennium BCE. It was used for writing the Old South Arabian languages Sabaic, Qatabanic, Hadramautic, Minaean Hasaitic, and Geʽez in Dʿmt. The earliest instances of the Ancient South Arabian script are painted pottery sherds from Raybun in Hadhramaut in Yemen, which are dated to the late 2nd millennium BCE. There are no letters for vowels, though some can be indicated via matres lectionis.
-
Saurashtra: 82 code points
82 characters are encoded in this script.
The Saurashtra script is an abugida script that is used by Saurashtrians of Tamil Nadu to write the Saurashtra language. The script is of Brahmic origin, although its exact derivation is not known; it was later reformed and standardized by T. M. Rama Rai. Its usage has declined, and the Tamil and Latin scripts are now used more commonly.
-
Sutton SignWriting: 672 code points
672 characters are encoded in this script.
Sutton SignWriting, or simply SignWriting, is a system of writing sign languages. It is highly featural and visually iconic, both in the shapes of the characters, which are abstract pictures of the hands, face, and body, and in their spatial arrangement on the page, which does not follow a sequential order like the letters that make up written English words. It was developed in 1974 by Valerie Sutton, a dancer who had, two years earlier, developed DanceWriting. Some newer standardized forms are known as the International Sign Writing Alphabet (ISWA).
-
Shavian: 49 code points
49 characters are encoded in this script.
The Shavian alphabet is a constructed alphabet conceived as a way to provide simple, phonemic orthography for the English language to replace the inefficiencies and difficulties of conventional spelling using the Latin alphabet. It was posthumously funded by and named after Irish playwright George Bernard Shaw and designed by Ronald Kingsley Read.
-
Sharada: 109 code points
109 characters are encoded in this script.
The Śāradā, Sarada or Sharada script is an abugida writing system of the Brahmic family of scripts. The script was widespread between the 8th and 12th centuries in the northwestern parts of Indian Subcontinent, for writing Sanskrit and Kashmiri. Although originally a signature Brahminical script created in the valley, it was more widespread throughout northwestern Indian subcontinent, and later became restricted to Kashmir, and is now rarely used, except by the Kashmiri Pandit community for religious purposes.
-
Sui: 0 code points
0 characters are encoded in this script.
The Sui script or Shuishu, is a logographic writing system with some pictographic characters that can be used to write the Sui language. However, traditionally only shamans were familiar with the writing system, and it is not utilized for everyday use by ordinary Sui people. This system is used for geomancy and divination purposes. There are at least 500 different Sui characters, known as le1 sui3 in the Sui language. According to tradition, these characters were created by ljok8 to2 qong5. Some of these characters are pictorial representations, such as of a bird or a fish, and a few are schematic representations of a characteristic quality, such a snail represented by a drawing of an inward curving spiral. Many of these characters appear to be borrowings from Chinese characters and are written backwards, apparently for increased supernatural power.
-
Siddham: 92 code points
92 characters are encoded in this script.
Siddhaṃ, also known in its later evolved form as Siddhamātṛkā, is a medieval Brahmic abugida, derived from the Gupta script and ancestral to the Nāgarī, Eastern Nagari, Tirhuta, Odia and Nepalese scripts.
-
Sidetic: 0 code points
0 characters are encoded in this script.
The Sidetic language is a member of the extinct Anatolian branch of the Indo-European language family known from legends of coins dating to the period of approximately the 5th to 3rd centuries BCE found in Side at the Pamphylian coast, and two Greek–Sidetic bilingual inscriptions from the 3rd and 2nd centuries BCE respectively. The Greek historian Arrian in his Anabasis Alexandri mentions the existence of a peculiar indigenous language in the city of Side. Sidetic was probably closely related to Lydian, Carian and Lycian.
-
Khudabadi: 81 code points
81 characters are encoded in this script.
Khudabadi was a script used to write the Sindhi language, generally used by some Sindhi Hindus even in the present-day. The script originates from Khudabad, a city in Sindh, and is named after it. It is also known as Hathvanki script. Khudabadi is one of the four scripts used for writing Sindhi, the others being Perso-Arabic, Khojki and Devanagari script. It was used by traders and merchants to record their information and rose to importance as the script began to be used to record information kept secret from other non-Sindhi groups.
-
Sinhala: 114 code points
114 characters are encoded in this script.
The Sinhala script, also known as Sinhalese script, is a writing system used by the Sinhalese people and most Sri Lankans in Sri Lanka and elsewhere to write the Sinhala language as well as the liturgical languages Pali and Sanskrit. The Sinhalese Akṣara Mālāva, one of the Brahmic scripts, is a descendant of the Ancient Indian Brahmi script. It is also related to the Grantha script.
-
Sogdian: 43 code points
43 characters are encoded in this script.
The Sogdian alphabet was originally used for the Sogdian language, a language in the Iranian family used by the people of Sogdia. The alphabet is derived from Syriac, a descendant script of the Aramaic alphabet. The Sogdian alphabet is one of three scripts used to write the Sogdian language, the others being the Manichaean alphabet and the Syriac alphabet. It was used throughout Central Asia, from the edge of Iran in the west, to China in the east, from approximately 100–1200 A.D.
-
Old Sogdian: 40 code points
40 characters are encoded in this script.
-
Sora Sompeng: 35 code points
35 characters are encoded in this script.
The Sorang Sompeng script is used to write Sora, a Munda language with 300,000 speakers in India. The script was created by Mangei Gomango in 1936 and is used in religious contexts.
-
Soyombo: 83 code points
83 characters are encoded in this script.
The Soyombo script is an abugida developed by the monk and scholar Zanabazar in 1686 to write Mongolian. It can also be used to write Tibetan and Sanskrit.
-
Sundanese: 72 code points
72 characters are encoded in this script.
Standard Sundanese script is a writing system which is used by the Sundanese people. It is built based on Old Sundanese script which was used from the 14th to the 18th centuries.
-
Sunuwar: 51 code points
51 characters are encoded in this script.
The Sunuwar alphabet is an alphabet developed by Krishna Bahadur Jentich in 1942, to write the Sunwar language, a member of the Kiranti language family spoken in Eastern Nepal, as in Sikkim. It is recognised in Sikkim and used as an official writing system. The alphabet has 33 letters, 10 numerals and 1 'auspicious sign'.
-
Syloti Nagri: 57 code points
57 characters are encoded in this script.
Sylheti Nagri or Sylheti Nāgarī, known in classical manuscripts as Sylhet Nagri as well as by many other names, is an Indic script of the Brahmic family. The script was historically used in the regions of Bengal and Assam, that were east of the Padma. It was primarily used in the eastern part of the Sylhet region, to document poetry known as puthis. In the course of the 20th century, it has lost much ground to the standardised Eastern Nagari script. Printing presses for Sylheti Nagri existed as late as into the 1970s, and in the 2000s, a Unicode font was created for the script.
-
Syriac: 119 code points
119 characters are encoded in this script.
The Syriac alphabet is a writing system primarily used to write the Syriac language since the 1st century AD. It is one of the Semitic abjads descending from the Aramaic alphabet through the Palmyrene alphabet, and shares similarities with the Phoenician, Hebrew, Arabic and Sogdian, the precursor and a direct ancestor of the traditional Mongolian scripts.
-
Tagbanwa: 20 code points
20 characters are encoded in this script.
Tagbanwa is one of the scripts indigenous to the Philippines, used by the Tagbanwa and the Palawan people as their ethnic writing system.
-
Takri: 80 code points
80 characters are encoded in this script.
The Tākri script is an abugida writing system of the Brahmic family of scripts. It is derived from the Sharada script of Kashmiri language. It is the sister script of Laṇḍā scripts. It has another variant Dogra Takri employed in Jammu region. There are numerous varieties present throughout Punjab. Until the late 1940s, the adapted version of the script was the official script for writing Dogri in the princely state of Jammu and Kashmir. Throughout the history, different kingdoms of what now forms Himachal Pradesh used their own variety to maintain their records. The Takri script used in Sirmour in Himachal Pradesh and in the adjacent region of Jaunsar-Bawar in Uttarakhand has some distinction.
-
Tai Le: 50 code points
50 characters are encoded in this script.
The Tai Le script, or Dehong Dai script, is a Brahmic script used to write the Tai Nüa language spoken by the Tai Nua people of south-central Yunnan, China. It is written in horizontal lines from left to right, with spaces only between clauses and sentences.
-
New Tai Lue: 83 code points
83 characters are encoded in this script.
New Tai Lue script, also known as Xishuangbanna Dai and Simplified Tai Lue, is an abugida used to write the Tai Lue language. Developed in China in the 1950s, New Tai Lue is based on the traditional Tai Tham alphabet developed c. 1200. The government of China promoted the alphabet for use as a replacement for the older script; teaching the script was not mandatory, however, and as a result many are illiterate in New Tai Lue. In addition, communities in Burma, Laos, Thailand and Vietnam still use the Tai Tham alphabet.
-
Tamil: 133 code points
133 characters are encoded in this script.
The Tamil script is an abugida script that is used by Tamils and Tamil speakers in India, Sri Lanka, Malaysia, Singapore, Indonesia and elsewhere to write the Tamil language. It is one of the official scripts of the Indian Republic. Certain minority languages such as Saurashtra, Badaga, Irula and Paniya are also written in the Tamil script.
-
Tangut: 6931 code points
6931 characters are encoded in this script.
The Tangut script is a logographic writing system, formerly used for writing the extinct Tangut language of the Western Xia dynasty. According to the latest count, 5863 Tangut characters are known, excluding variants. The Tangut characters are similar in appearance to Chinese characters, with the same type of strokes, but the methods of forming characters in the Tangut writing system are significantly different from those of forming Chinese characters. As in Chinese calligraphy, regular, running, cursive and seal scripts were used in Tangut writing.
-
Tai Viet: 72 code points
72 characters are encoded in this script.
The Tai Viet script is a Brahmic script used by the Tai Dam people and various other Thai people in Vietnam and Thailand.
-
Tai Yo: 0 code points
0 characters are encoded in this script.
The Lai Tay script, is a writing system used by the Tai Yo people of Quỳ Châu District, to write the Tai Yo language.
-
Telugu: 106 code points
106 characters are encoded in this script.
Telugu script, an abugida from the Brahmic family of scripts, is used to write the Telugu language, a Dravidian language spoken in the Indian states of Andhra Pradesh and Telangana as well as several other neighbouring states. It is one of the official scripts of the Indian Republic. The Telugu script is also widely used for writing Sanskrit texts and to some extent the Gondi language. It gained prominence during the Eastern Chalukyas also known as Vengi Chalukya era. It shares extensive similarities with the Kannada script, as both of them evolved from the Bhattiprolu and Kadamba scripts of the Brahmi family.
-
Tengwar: 0 code points
0 characters are encoded in this script.
The Tengwar script is an artificial script, one of several scripts created by J. R. R. Tolkien, the author of The Lord of the Rings.
-
Tifinagh: 63 code points
63 characters are encoded in this script.
Tifinagh is a script used to write the Berber languages. Tifinagh is descended from the ancient Libyco-Berber alphabet. The traditional Tifinagh, sometimes called Tuareg Tifinagh, is still favored by the Tuareg people of the Sahara desert in southern Algeria, northeastern Mali, northern Niger, and northern Burkina Faso for writing the Tuareg languages. Neo-Tifinagh is an alphabet developed by the Berber Academy by adopting Tuareg Tifinagh for use for Kabyle; it has been since modified for use across North Africa.
-
Tagalog: 25 code points
25 characters are encoded in this script.
Baybayin or Sulat Tagalog, also called Basahan, sometimes erroneously referred to as alibata, is a Philippine script widely used primarily in Luzon during the 16th and 17th centuries and prior to write Tagalog and to a lesser extent Visayan languages, Kampampangan, Ilocano, and several other Philippine languages.
-
Thaana: 66 code points
66 characters are encoded in this script.
Thaana, Tãnaa, Taana or Tāna is the present writing system of the Maldivian language spoken in the Maldives. Thaana has characteristics of both an abugida and a true alphabet, with consonants derived from indigenous and Arabic numerals, and vowels derived from the vowel diacritics of the Arabic abjad. Maldivian orthography in Thaana is largely phonemic.
-
Thai: 90 code points
90 characters are encoded in this script.
The Thai script is the abugida used to write Thai, Southern Thai and many other languages spoken in Thailand. The Thai script itself has 44 consonant symbols, 16 vowel symbols that combine into at least 32 vowel forms, four tone diacritics, and other diacritics.
-
Tibetan: 211 code points
211 characters are encoded in this script.
The Tibetan script is a segmental writing system, or abugida, derived from Brahmic scripts and Gupta script, and used to write certain Tibetic languages, including Tibetan, Dzongkha, Sikkimese, Ladakhi, Jirel and Balti. It was originally developed c. 620 by Tibetan minister Thonmi Sambhota for King Songsten Gampo.
-
Tirhuta: 97 code points
97 characters are encoded in this script.
The Tirhuta or Maithili script was the primary historical script for the Maithili language, as well as one of the historical scripts for Sanskrit. It is believed to have originated in the 13th century CE. It is very similar to Bengali–Assamese script, with most consonants being effectively identical in appearance. For the most part, writing in Maithili has switched to the Devanagari script, which is used to write neighbouring Central Indic languages to the west and north such as Hindi and Nepali, and the number of people with a working knowledge of Tirhuta has dropped considerably in recent years.
-
Tangsa: 89 code points
89 characters are encoded in this script.
-
Todrhi: 58 code points
58 characters are encoded in this script.
The Todhri alphabet is an 18th-century Albanian alphabetical writing system invented for writing the Albanian language by Theodhor Haxhifilipi, also known as Dhaskal Todhri.
-
Tolong Siki: 0 code points
0 characters are encoded in this script.
Tolong Siki is an alphabetic script made specifically for Kurux in 1999 by Narayan Oraon, a doctor. Many books and magazines have been published in Tolong Siki, and it was officially recognized by the state of Jharkhand in 2007. The Kurukh Literary Society of India has been instrumental in spreading the Tolong Siki script for Kurukh literature.
-
Toto: 32 code points
32 characters are encoded in this script.
-
Tulu-Tigalari: 99 code points
99 characters are encoded in this script.
Tigalari or Tulu script is a Southern Brahmic script which was used to write Tulu, Kannada, and Sanskrit languages. It was primarily used for writing Vedic texts in Sanskrit. It evolved from the Grantha script.
-
Ugaritic: 31 code points
31 characters are encoded in this script.
The Ugaritic writing system is a cuneiform abjad with syllabic elements used from around either 1400 BCE or 1300 BCE for Ugaritic, an extinct Northwest Semitic language. It was discovered in Ugarit, modern Ras Al Shamra, Syria, in 1928. It has 30 letters. Other languages, particularly Hurrian, were occasionally written in the Ugaritic script in the area around Ugarit, although not elsewhere.
-
Vai: 300 code points
300 characters are encoded in this script.
The Vai syllabary is a syllabic writing system devised for the Vai language by Momolu Duwalu Bukele of Jondu, in what is now Grand Cape Mount County, Liberia. Bukele is regarded within the Vai community, as well as by most scholars, as the syllabary's inventor and chief promoter when it was first documented in the 1830s. It is one of the two most successful indigenous scripts in West Africa in terms of the number of current users and the availability of literature written in the script, the other being N'Ko.
-
Visible Speech: 0 code points
0 characters are encoded in this script.
Visible Speech is a system of phonetic symbols developed by British linguist Alexander Melville Bell in 1867 to represent the position of the speech organs in articulating sounds. Bell was known internationally as a teacher of speech and proper elocution and an author of books on the subject. The system is composed of symbols that show the position and movement of the throat, tongue, and lips as they produce the sounds of language, and it is a type of phonetic notation. The system was used to aid the deaf in learning to speak.
-
Vithkuqi: 70 code points
70 characters are encoded in this script.
The Vithkuqi alphabet, also called Büthakukye or Beitha Kukju after the appellation applied to it by German Albanologist Johann Georg von Hahn, was an alphabetic script invented for writing the Albanian language between 1825 and 1845 by Albanian scholar Naum Veqilharxhi.
-
Warang Citi: 84 code points
84 characters are encoded in this script.
Warang Citi is a writing system invented by Lako Bodra for the Ho language spoken in East India. It is used in primary and adult education and in various publications.
-
Wancho: 59 code points
59 characters are encoded in this script.
Wancho script is an alphabet created between 2001 and 2012 by middle school teacher Banwang Losu in Longding district, Arunachal Pradesh for writing the Wancho language. Letters represent consonants and vowels. Conjunct consonants are not used. Tone is indicated with diacritical marks on vowel letters.
-
Woleai: 0 code points
0 characters are encoded in this script.
The Woleai or Caroline Island script, thought to have been a syllabary, was a partially Latin-based script indigenous to Woleai Atoll and nearby islands of Micronesia and used to write the Woleaian language until the mid-20th century. At the time the script was first noticed by Europeans, this part of Micronesia was known as the Caroline Islands, hence the name Caroline Island script.
-
Old Persian: 50 code points
50 characters are encoded in this script.
Old Persian cuneiform is a semi-alphabetic cuneiform script that was the primary script for Old Persian. Texts written in this cuneiform have been found in Iran, Armenia, Romania (Gherla), Turkey, and along the Suez Canal. They were mostly inscriptions from the time period of Darius I, such as the DNa inscription, as well as his son, Xerxes I. Later kings down to Artaxerxes III used more recent forms of the language classified as "pre-Middle Persian".
-
Cuneiform: 1234 code points
1234 characters are encoded in this script.
Cuneiform is a logo-syllabic writing system that was used to write several languages of the Ancient Near East. The script was in active use from the early Bronze Age until the beginning of the Common Era. Cuneiform scripts are marked by and named for the characteristic wedge-shaped impressions which form their signs. Cuneiform is the earliest known writing system and was originally developed to write the Sumerian language of southern Mesopotamia.
-
Yezidi: 60 code points
60 characters are encoded in this script.
-
Yi: 1246 code points
1246 characters are encoded in this script.
The Yi scripts are two scripts used to write the Yi languages; Classical Yi, and the later Yi syllabary. The script is historically known in Chinese as Cuan Wen or Wei Shu and various other names (夷字、倮語、倮倮文、畢摩文), among them "tadpole writing" (蝌蚪文).
-
Zanabazar Square: 72 code points
72 characters are encoded in this script.
Zanabazar's square script is a horizontal Mongolian square script, an abugida developed by the monk and scholar Zanabazar based on the Tibetan alphabet to write Mongolian. It can also be used to write Tibetan language and Sanskrit as a geometric typeface.
-
Inherited: 657 code points
657 characters are encoded in this script.
-
Mathematical Notation: 0 code points
0 characters are encoded in this script.
Mathematical notation consists of using symbols for representing operations, unspecified numbers, relations, and any other mathematical objects and assembling them into expressions and formulas. Mathematical notation is widely used in mathematics, science, and engineering for representing complex concepts and properties in a concise, unambiguous, and accurate way.
-
Emoji: 0 code points
0 characters are encoded in this script.
An emoji is a pictogram, logogram, ideogram, or smiley embedded in text and used in electronic messages and web pages. The primary function of modern emoji is to fill in emotional cues otherwise missing from typed conversation as well as to replace words as part of a logographic system. Emoji exist in various genres, including facial expressions, expressions, activity, food and drinks, celebrations, flags, objects, symbols, places, types of weather, animals and nature.
-
Symbol: 0 code points
0 characters are encoded in this script.
A symbol is a mark, sign, or word that indicates, signifies, or is understood as representing an idea, object, or relationship. Symbols allow people to go beyond what is known or seen by creating linkages between otherwise different concepts and experiences. All communication is achieved through the use of symbols: for example, a red octagon is a common symbol for "STOP"; on maps, blue lines often represent rivers; and a red rose often symbolizes love and compassion. Numerals are symbols for numbers; letters of an alphabet may be symbols for certain phonemes; and personal names are symbols representing individuals. The academic study of symbols is called semiotics.
-
Common: 9053 code points
9053 characters are encoded in this script.
-
Unknown: 66 code points
66 characters are encoded in this script.