ISO 15924

ISO 15924, Codes for the representation of names of scripts, defines two sets of codes for a number of writing systems (scripts). Each script is given both a four-letter code and a numeric one.[1] Script is defined as "set of graphic characters used for the written form of one or more languages".[1]

Where possible the codes are derived from ISO 639-2 where the name of a script and the name of a language using the script are identical (example: Gujarātī ISO 639 guj, ISO 15924 Gujr). Preference is given to the 639-2 Bibliographical codes, which is different from the otherwise often preferred use of the Terminological codes.[1]

4-letter ISO 15924 codes are incorporated into the Language Subtag Registry for IETF language tags and so can be used in file formats that make use of such language tags. For example, they can be used in HTML and XML to help Web browsers determine which typeface to use for foreign text. This way one could differentiate, for example, between Serbian written in the Cyrillic (sr-Cyrl) or Latin (sr-Latn) script, or mark romanized text as such.

Maintenance

ISO appointed the Unicode Consortium as the Registration Authority (RA) for the standard. The RA is responsible for appointing a registrar who works with a Joint Advisory Committee (JAC) in developing and implementing the standard. The registrar from 2004 to 2018 was Michael Everson, and from January 2019 the registrar has been Markus Scherer, a technical director of the Unicode Consortium.[2][3] The JAC consists of six members: one representative of the RA (Markus Scherer), one representative of ISO 639-2 (Randall K. Barry of the Library of Congress), one representative of ISO TC37 (Christian Galinski), one representative of ISO TC46 (Peeter Päll), and two representatives of ISO/IEC JTC 1/SC 2 (Rick McGowan and Ken Whistler, both also officers of the Unicode Consortium).[4]

Script codes

Numeric ranges

Special codes

  • Qaaa–Qabx (900–949): 50 Codes reserved for private use.
  • Zsye 993: Emoji
  • Zinh 994: Code for inherited script
  • Zmth 995: Mathematical notation
  • Zsym 996: Symbols
  • Zxxx 997: Code for unwritten languages
  • Zyyy 998: Code for undetermined script
  • Zzzz 999: Code for uncoded script

Exceptionally reserved codes

Two four letter codes are reserved at the request of the Common Locale Data Repository Project (CLDR):[7]

  • Root: Reserved for the language-neutral base of the CLDR locale tree
  • True: Reserved for the boolean value "true"

List of codes

This list of codes is from the ISO 15924 standard.[7]

Relations to other standards

The following standards are referred to as indispensable by ISO 15924.

  • ISO 639-2:1998 Codes for the representation of names of languages — Part 2: Alpha-3 code
  • ISO/IEC 9541-1:1991 Information technology — Font information interchange — Part 1: Architecture
  • ISO/IEC 10646-1:2000 Information technology — Universal Multiple-Octet Coded Character Set (UCS)

For definition of font and glyph the standard refers to

  • ISO/IEC 9541-1:1991

Around 150 scripts are defined in Unicode. Through a linkpin called "Property Value Alias", Unicode has made a 1:1 connection between a script defined, and its ISO 15924 standard. See Script (Unicode).

References

  1. ^ a b c Everson, Michael. "ISO 15924:2004". Retrieved 2011-06-21.
  2. ^ "ISO 15924 Registration Authority". Unicode Consortium. Retrieved 24 January 2019.
  3. ^ "Unicode Directors, Officers and Staff". Unicode Consortium. Retrieved 24 January 2019.
  4. ^ "Joint Advisory Committee ISO 15924/RA-JAC". Unicode Consortium. Retrieved 24 January 2019.
  5. ^ In July, 2010, Duployan shorthand was assigned code 755, even though the 700-799 range still carried its original designation of (unassigned). Shortly thereafter, Revision 1.1 clarified that codes in the 700s were reserved for "Shorthands and other notations", although that revision is only provisional until it can be confirmed by governing committees.
  6. ^ Everson, Michael (2004-01-09). "ISO 15924:2004 Information and documentation — Codes for the representation of names of scripts". Unicode Consortium.
  7. ^ a b "ISO 15924:2004 – Codes for the representation of names of scripts". Unicode. 2018.
  8. ^ a b c d e f g h i "Proposed New Scripts". Unicode Consortium. 2018-05-25. Retrieved 2018-09-12.
  9. ^ "Roadmap to the SMP". Unicode Consortium. 2018-08-08. Retrieved 2018-09-12.
  10. ^ Michael Everson (1997-09-18). "Proposal to encode Klingon in Plane 1 of ISO/IEC 10646-2".
  11. ^ The Unicode Consortium (2001-08-14). "Approved Minutes of the UTC 87 / L2 184 Joint Meeting".

External links

Ahom script

The Ahom script is an abugida that is used to write the Ahom language, a nearly-extinct (but being revived) Tai language spoken by the Ahom people who ruled eastern part of Brahmaputra valley—about one-third of the length of Brahmaputra valley—in the Indian state of Assam between the 13th and the 18th centuries. It is also called Tai Ahom Script.

Batak script

The Batak script, natively known as surat Batak, surat na sampulu sia (the nineteen letters), or si-sia-sia, is a writing system used to write the Austronesian Batak languages spoken by several million people on the Indonesian island of Sumatra. The script may be derived from the Kawi and Pallava script, ultimately derived from the Brahmi script of India, or from the hypothetical Proto-Sumatran script influenced by Pallava.

Bhaiksuki script

Bhaiksuki (Sanskrit: भैक्षुकी, Bhaiksuki:𑰥𑰹𑰎𑰿𑰬𑰲𑰎𑰱) is a Brahmi-based script that was used around the 11th and 12th centuries CE. It used to be known in English as the "Arrow-Headed Script" or "Point-Headed Script," while an older designation, "Sindhura," had been used in Tibet for at least three centuries. Records showing usage of the script mainly appeared in the present-day states of Bihar and West Bengal in India, and in regions of Bangladesh. Records have also been located in Tibet, Nepal, and Burma.

Demotic (Egyptian)

Demotic (from Ancient Greek: δημοτικός dēmotikós, "popular") is the ancient Egyptian script derived from northern forms of hieratic used in the Nile Delta, and the stage of the Egyptian language written in this script, following Late Egyptian and preceding Coptic. The term was first used by the Greek historian Herodotus to distinguish it from hieratic and hieroglyphic scripts. By convention, the word "Demotic" is capitalized in order to distinguish it from demotic Greek.

Elbasan script

The Elbasan script is a mid 18th-century alphabetic script used for the Albanian language. It was named after the city of Elbasan, where it was invented, and was used mainly in the area of Elbasan and Berat, and is the oldest original script used to write Albanian.

It was created for the "Elbasan Gospel Manuscript", also known as the Anonimi i Elbasanit ("the Anonymous of Elbasan"), which is the primary document associated with it.

The document was created at St. Jovan Vladimir's Church in central Albania, but is preserved today at the National Archives of Albania in Tirana. Its 59 pages contain Biblical content written in an alphabet of 40 letters, of which 35 frequently recur and 5 are rare. The name "Papa Totasi" (father Totasi) is written on the cover's verso, thus sometimes the script is attributed to him.

Gaelic type

Gaelic type (sometimes called Irish character, Irish type, or Gaelic script) is a family of Insular script typefaces devised for printing Classical Gaelic. It was widely used from the 16th until the mid-18th century (Scotland) or the mid-20th century (Ireland) but is now rarely used. Sometimes, all Gaelic typefaces are called Celtic or uncial although most Gaelic types are not uncials. The "Anglo-Saxon" types of the 17th century are included in this category because both the Anglo-Saxon types and the Gaelic/Irish types derive from the Insular manuscript hand.

The terms Gaelic type, Gaelic script and Irish character translate the Irish phrase cló Gaelach (pronounced [kl̪ˠoː ˈɡˠeːl̪ˠəx]). In Ireland, the term cló Gaelach is used in opposition to the term cló Rómhánach, Roman type.

The Scottish Gaelic term is corra-litir (pronounced [kʰɔrˠə ˈliʰtʲɪɾʲ]). Alasdair mac Mhaighstir Alasdair was one of the last Scottish writers with the ability to write in this script, but his main work, Ais-Eiridh na Sean Chánoin Albannaich, was published in the Roman script.

Gondi writing

Gondi has typically been written in Devanagari script or Telugu script, but native scripts are in existence. A Gond by the name of Munshi Mangal Singh Masaram designed a Brahmi-based script in 1918, and in 2006, a native script that dates up to 1750 has been discovered by a group of researchers from the University of Hyderabad.

Nonetheless, most Gonds are illiterate and do not use any script. The Gunjala Gondi Lipi has witnessed a surge in prominence, and well-supported efforts are being undertaken in villages of northern Andhra Pradesh to widen its usage.

Hanifi Rohingya script

The Hanifi Rohingya script is a unified script for the Rohingya language. Rohingya was first written in the 19th century with a version of the Perso-Arabic script. In 1975, an orthographic Arabic script was developed, based on the Urdu alphabet.

In the 1980s, (Maolana) Mohammad Hanif and his colleagues created the suitable phonetic script based on Arabic letters; it has been compared to the N’ko script. The script also includes a set of decimal numbers.

Hiragana

Hiragana (平仮名, ひらがな, Japanese pronunciation: [çiɾaɡana]) is a Japanese syllabary, one component of the Japanese writing system, along with katakana, kanji, and in some cases rōmaji (Latin script). It is a phonetic lettering system. The word hiragana literally means "ordinary" or "simple" kana ("simple" originally as contrasted with kanji).Hiragana and katakana are both kana systems. With one or two minor exceptions, each sound in the Japanese language (strictly, each mora) is represented by one character (or one digraph) in each system. This may be either a vowel such as "a" (hiragana あ); a consonant followed by a vowel such as "ka" (か); or "n" (ん), a nasal sonorant which, depending on the context, sounds either like English m, n, or ng ([ŋ]), or like the nasal vowels of French. Because the characters of the kana do not represent single consonants (except in the case of ん "n"), the kana are referred to as syllabaries and not alphabets.Hiragana is used to write okurigana (kana suffixes following a kanji root, for example to inflect verbs and adjectives), various grammatical and function words including particles, as well as miscellaneous other native words for which there are no kanji or whose kanji form is obscure or too formal for the writing purpose. Words that do have common kanji renditions may also sometimes be written instead in hiragana, according to an individual author's preference, for example to impart an informal feel. Hiragana is also used to write furigana, a reading aid that shows the pronunciation of kanji characters.

There are two main systems of ordering hiragana: the old-fashioned iroha ordering and the more prevalent gojūon ordering.

Kayah Li alphabet

The Kayah Li alphabet (Kayah Li: ꤊꤢ꤬ꤛꤢ꤭ ꤜꤟꤤ꤬) is used to write the Kayah languages Eastern Kayah Li and Western Kayah Li, which are members of Karenic branch of the Sino-Tibetan language family. They are also known as Red Karen and Karenni. Eastern Kayah Li is spoken by about 26,000 people, and Western Kayah Li by about 100,000 people, mostly in the Kayah and Karen states of Myanmar, but also by people living in Thailand.

Kpelle syllabary

The Kpelle syllabary was invented c. 1935 by Chief Gbili of Sanoyie, Liberia. It was intended for writing the Kpelle language, a member of the Mande group of Niger-Congo languages spoken by about 490,000 people in Liberia and around 300,000 people in Guinea at that time.The syllabary consists of 88 graphemes and is written from left to right in horizontal rows. Many of the glyphs have more than one form.

It was used to some extent by speakers of Kpelle in Liberia and Guinea during the 1930s and early 1940s but never achieved popular acceptance. It has been classed as a failed script.Today Kpelle is written with a version of the Latin alphabet.

Leke script

The Leke script, previously known as Karen Chicken Scratch script, is an abugida used to write the Pwo Karen language and Sgaw language in Burma (Myanmar) and Thailand. It has 25 consonants, 17 vowels and 3 tones. The script also has a unique set of numerals and punctuation, such as a full stop (period) and a comma.

In the traditional reading style most of the words the pronunciation came first then spelling came later. For example: (pronoun first School the “s.c.h.o.o.l” then repeat the same pronoun school again). In the modern Leke script, consonants come first, then the vowels follow. In writing system consonant always came first then following by vowels and tones. But just only two vowels have to apply first then consonant have to write down latter (just only for if you writing with hand). In the case of Leke script consonants are written horizontally from left to right, with vowels arranged below, above, to the left or to the right or combination of two vowels positions below of the consonants.

Mahajani

Mahajani is a Laṇḍā mercantile script that was historically used in northern India for writing accounts and financial records in Marwari, Hindi and Punjabi.

It is a Brahmic script and is written left-to-right. Mahajani refers to the Hindi word for 'bankers', also known as 'sarrafi' or 'kothival' (merchant).

Mende Kikakui script

The Mende Kikakui script is a syllabary used for writing the Mende language of Sierra Leone.

Nabataean alphabet

The Nabataean alphabet is an abjad (consonantal alphabet) that was used by the Nabataeans in the second century BC. Important inscriptions are found in Petra (now in Jordan), the Sinai Peninsula (now part of Egypt), and other archaeological sites including Avdat (now in Israel).

Old Permic script

The Old Permic script (Komi: Важ Перым гижӧм), sometimes called Abur or Anbur, is a "highly idiosyncratic adaptation" of the Cyrillic script once used to write medieval Komi (Permic).

Pracalit script

Prachalit Nepal script is a type of Abugida script developed from the Mol script derivatives of Brahmi script. It is used to write Nepal Bhasa, Sanskrit and Pali. Various publications are still published in this script including the Sikkim Herald the bulletin of the Sikkim government (Newari edition).

Traditional Chinese characters

Traditional Chinese characters (traditional Chinese: 正體字/繁體字; simplified Chinese: 正体字/繁体字; Pinyin: Zhèngtǐzì/Fántǐzì) are Chinese characters in any character set that does not contain newly created characters or character substitutions performed after 1946. They are most commonly the characters in the standardized character sets of Taiwan, of Hong Kong and Macau, and in the Kangxi Dictionary. The modern shapes of traditional Chinese characters first appeared with the emergence of the clerical script during the Han Dynasty, and have been more or less stable since the 5th century (during the Southern and Northern Dynasties).

The retronym "traditional Chinese" is used to contrast traditional characters with Simplified Chinese characters, a standardized character set introduced by the government of the People's Republic of China on Mainland China in the 1950s.

Traditional Chinese characters are currently used in Taiwan, Hong Kong, and Macau; as well as in Overseas Chinese communities outside Southeast Asia. In contrast, Simplified Chinese characters are used in mainland China, Singapore and Malaysia in official publications. However, several countries – such as Australia, the US and Canada – are increasing their number of printed materials in Simplified Chinese, to better accommodate citizens from mainland China.

The debate on traditional and simplified Chinese characters has been a long-running issue among Chinese communities. Currently, a large number of overseas Chinese online newspapers allow users to switch between both character sets.

Zhang-Zhung language

Zhang-Zhung (Tibetan: ཞང་ཞུང་, Wylie: zhang zhung) is an extinct Sino-Tibetan language that was spoken in what is now western Tibet. It is attested in a bilingual text called A Cavern of Treasures (mDzod phug) and several shorter texts.

A small number of documents preserved in Dunhuang contain an undeciphered language that has been called Old Zhangzhung, but the identification is controversial.

ISO 15924 script codes[a][b] and Unicode[c][d]
ISO 15924 Script in Unicode[e]
Code No. Name Alias[f] Direc­tion Ver­sion Char­acters Remark
Adlm 166 Adlam Adlam R-to-L 9.0 88
Afak 439 Afaka Varies Not in Unicode, proposal under review by the Unicode Technical Committee[8][9]
Aghb 239 Caucasian Albanian Caucasian Albanian L-to-R 7.0 53 Ancient/historic
Ahom 338 Ahom, Tai Ahom Ahom L-to-R 8.0 58 Ancient/historic
Arab 160 Arabic Arabic R-to-L 1.0 1,281
Aran 161 Arabic (Nastaliq variant) R-to-L Typographic variant of Arabic
Armi 124 Imperial Aramaic Imperial Aramaic R-to-L 5.2 31 Ancient/historic
Armn 230 Armenian Armenian L-to-R 1.0 95
Avst 134 Avestan Avestan R-to-L 5.2 61 Ancient/historic
Bali 360 Balinese Balinese L-to-R 5.0 121
Bamu 435 Bamum Bamum L-to-R 5.2 657
Bass 259 Bassa Vah Bassa Vah L-to-R 7.0 36 Ancient/historic
Batk 365 Batak Batak L-to-R 6.0 56
Beng 325 Bengali (Bangla) Bengali L-to-R 1.0 96
Bhks 334 Bhaiksuki Bhaiksuki L-to-R 9.0 97 Ancient/historic
Blis 550 Blissymbols Varies Not in Unicode, proposal in initial/exploratory stage[8]
Bopo 285 Bopomofo Bopomofo L-to-R 1.0 72
Brah 300 Brahmi Brahmi L-to-R 6.0 109 Ancient/historic
Brai 570 Braille Braille L-to-R 3.0 256
Bugi 367 Buginese Buginese L-to-R 4.1 30
Buhd 372 Buhid Buhid L-to-R 3.2 20
Cakm 349 Chakma Chakma L-to-R 6.1 70
Cans 440 Unified Canadian Aboriginal Syllabics Canadian Aboriginal L-to-R 3.0 710
Cari 201 Carian Carian L-to-R 5.1 49 Ancient/historic
Cham 358 Cham Cham L-to-R 5.1 83
Cher 445 Cherokee Cherokee L-to-R 3.0 172
Cirt 291 Cirth Varies Not in Unicode
Copt 204 Coptic Coptic L-to-R 1.0 137 Ancient/historic, Disunified from Greek in 4.1
Cpmn 402 Cypro-Minoan L-to-R Not in Unicode
Cprt 403 Cypriot syllabary Cypriot R-to-L 4.0 55 Ancient/historic
Cyrl 220 Cyrillic Cyrillic L-to-R 1.0 443
Cyrs 221 Cyrillic (Old Church Slavonic variant) Varies Ancient/historic, typographic variant of Cyrillic
Deva 315 Devanagari (Nagari) Devanagari L-to-R 1.0 154
Dogr 328 Dogra Dogra L-to-R 11.0 60 Ancient/historic
Dsrt 250 Deseret (Mormon) Deseret L-to-R 3.1 80
Dupl 755 Duployan shorthand, Duployan stenography Duployan L-to-R 7.0 143
Egyd 070 Egyptian demotic R-to-L Not in Unicode
Egyh 060 Egyptian hieratic R-to-L Not in Unicode
Egyp 050 Egyptian hieroglyphs Egyptian Hieroglyphs L-to-R 5.2 1,080 Ancient/historic
Elba 226 Elbasan Elbasan L-to-R 7.0 40 Ancient/historic
Elym 128 Elymaic Elymaic R-to-L 12.0 23 Ancient/historic
Ethi 430 Ethiopic (Geʻez) Ethiopic L-to-R 3.0 495
Geok 241 Khutsuri (Asomtavruli and Nuskhuri) Georgian Varies Unicode groups Geok and Geor together as "Georgian"
Geor 240 Georgian (Mkhedruli and Mtavruli) Georgian L-to-R 1.0 173 For Unicode, see also Geok
Glag 225 Glagolitic Glagolitic L-to-R 4.1 132 Ancient/historic
Gong 312 Gunjala Gondi Gunjala Gondi L-to-R 11.0 63
Gonm 313 Masaram Gondi Masaram Gondi L-to-R 10.0 75
Goth 206 Gothic Gothic L-to-R 3.1 27 Ancient/historic
Gran 343 Grantha Grantha L-to-R 7.0 85 Ancient/historic
Grek 200 Greek Greek L-to-R 1.0 518 Sometimes expressed as boustrophedon (mirroring of alternate lines rather than purely left-to-right)
Gujr 320 Gujarati Gujarati L-to-R 1.0 91
Guru 310 Gurmukhi Gurmukhi L-to-R 1.0 80
Hanb 503 Han with Bopomofo (alias for Han + Bopomofo) Varies See Hani, Bopo
Hang 286 Hangul (Hangŭl, Hangeul) Hangul L-to-R 1.0 11,739 Hangul syllables relocated in 2.0
Hani 500 Han (Hanzi, Kanji, Hanja) Han L-to-R 1.0 89,233
Hano 371 Hanunoo (Hanunóo) Hanunoo L-to-R 3.2 21
Hans 501 Han (Simplified variant) Varies Subset Hani
Hant 502 Han (Traditional variant) Varies Subset Hani
Hatr 127 Hatran Hatran R-to-L 8.0 26 Ancient/historic
Hebr 125 Hebrew Hebrew R-to-L 1.0 134
Hira 410 Hiragana Hiragana L-to-R 1.0 379
Hluw 080 Anatolian Hieroglyphs (Luwian Hieroglyphs, Hittite Hieroglyphs) Anatolian Hieroglyphs L-to-R 8.0 583 Ancient/historic
Hmng 450 Pahawh Hmong Pahawh Hmong L-to-R 7.0 127
Hmnp 451 Nyiakeng Puachue Hmong Nyiakeng Puachue Hmong L-to-R 12.0 71
Hrkt 412 Japanese syllabaries (alias for Hiragana + Katakana) Katakana or Hiragana Varies See Hira, Kana
Hung 176 Old Hungarian (Hungarian Runic) Old Hungarian R-to-L 8.0 108 Ancient/historic
Inds 610 Indus (Harappan) R-to-L Not in Unicode, proposal in initial/exploratory stage[8]
Ital 210 Old Italic (Etruscan, Oscan, etc.) Old Italic L-to-R 3.1 39 Ancient/historic
Jamo 284 Jamo (alias for Jamo subset of Hangul) Varies Subset Hang
Java 361 Javanese Javanese L-to-R 5.2 90
Jpan 413 Japanese (alias for Han + Hiragana + Katakana) Varies See Hani, Hira and Kana
Jurc 510 Jurchen L-to-R Not in Unicode
Kali 357 Kayah Li Kayah Li L-to-R 5.1 47
Kana 411 Katakana Katakana L-to-R 1.0 304
Khar 305 Kharoshthi Kharoshthi R-to-L 4.1 68 Ancient/historic
Khmr 355 Khmer Khmer L-to-R 3.0 146
Khoj 322 Khojki Khojki L-to-R 7.0 62 Ancient/historic
Kitl 505 Khitan large script L-to-R Not in Unicode
Kits 288 Khitan small script T-to-B Not in Unicode
Knda 345 Kannada Kannada L-to-R 1.0 89
Kore 287 Korean (alias for Hangul + Han) L-to-R See Hani and Hang
Kpel 436 Kpelle L-to-R Not in Unicode, proposal in initial/exploratory stage[8]
Kthi 317 Kaithi Kaithi L-to-R 5.2 67 Ancient/historic
Lana 351 Tai Tham (Lanna) Tai Tham L-to-R 5.2 127
Laoo 356 Lao Lao L-to-R 1.0 82
Latf 217 Latin (Fraktur variant) Varies Typographic variant of Latin
Latg 216 Latin (Gaelic variant) L-to-R Typographic variant of Latin
Latn 215 Latin Latin L-to-R 1.0 1,366 See Latin script in Unicode
Leke 364 Leke L-to-R Not in Unicode
Lepc 335 Lepcha (Róng) Lepcha L-to-R 5.1 74
Limb 336 Limbu Limbu L-to-R 4.0 68
Lina 400 Linear A Linear A L-to-R 7.0 341 Ancient/historic
Linb 401 Linear B Linear B L-to-R 4.0 211 Ancient/historic
Lisu 399 Lisu (Fraser) Lisu L-to-R 5.2 48
Loma 437 Loma L-to-R Not in Unicode, proposal in initial/exploratory stage[8]
Lyci 202 Lycian Lycian L-to-R 5.1 29 Ancient/historic
Lydi 116 Lydian Lydian R-to-L 5.1 27 Ancient/historic
Mahj 314 Mahajani Mahajani L-to-R 7.0 39 Ancient/historic
Maka 366 Makasar Makasar L-to-R 11.0 25 Ancient/historic
Mand 140 Mandaic, Mandaean Mandaic R-to-L 6.0 29
Mani 139 Manichaean Manichaean R-to-L 7.0 51 Ancient/historic
Marc 332 Marchen Marchen L-to-R 9.0 68 Ancient/historic
Maya 090 Mayan hieroglyphs Not in Unicode
Medf 265 Medefaidrin (Oberi Okaime, Oberi Ɔkaimɛ) Medefaidrin L-to-R 11.0 91
Mend 438 Mende Kikakui Mende Kikakui R-to-L 7.0 213
Merc 101 Meroitic Cursive Meroitic Cursive R-to-L 6.1 90 Ancient/historic
Mero 100 Meroitic Hieroglyphs Meroitic Hieroglyphs R-to-L 6.1 32 Ancient/historic
Mlym 347 Malayalam Malayalam L-to-R 1.0 117
Modi 324 Modi, Moḍī Modi L-to-R 7.0 79 Ancient/historic
Mong 145 Mongolian Mongolian T-to-B 3.0 167 Includes Clear, Manchu scripts
Moon 218 Moon (Moon code, Moon script, Moon type) Not in Unicode, proposal in initial/exploratory stage[8]
Mroo 264 Mro, Mru Mro L-to-R 7.0 43
Mtei 337 Meitei Mayek (Meithei, Meetei) Meetei Mayek L-to-R 5.2 79
Mult 323 Multani Multani L-to-R 8.0 38 Ancient/historic
Mymr 350 Myanmar (Burmese) Myanmar L-to-R 3.0 223
Nand 311 Nandinagari Nandinagari L-to-R 12.0 65 Ancient/historic
Narb 106 Old North Arabian (Ancient North Arabian) Old North Arabian R-to-L 7.0 32 Ancient/historic
Nbat 159 Nabataean Nabataean R-to-L 7.0 40 Ancient/historic
Newa 333 Newa, Newar, Newari, Nepāla lipi Newa L-to-R 9.0 94
Nkdb 085 Naxi Dongba (na²¹ɕi³³ to³³ba²¹, Nakhi Tomba) L-to-R Not in Unicode
Nkgb 420 Nakhi Geba (na²¹ɕi³³ gʌ²¹ba²¹, 'Na-'Khi ²Ggŏ-¹baw, Nakhi Geba) L-to-R Not in Unicode, proposal in initial/exploratory stage[8]
Nkoo 165 N’Ko NKo R-to-L 5.0 62
Nshu 499 Nüshu Nushu L-to-R 10.0 397
Ogam 212 Ogham Ogham 3.0 29 Ancient/historic
Olck 261 Ol Chiki (Ol Cemet’, Ol, Santali) Ol Chiki L-to-R 5.1 48
Orkh 175 Old Turkic, Orkhon Runic Old Turkic R-to-L 5.2 73 Ancient/historic
Orya 327 Oriya (Odia) Oriya L-to-R 1.0 90
Osge 219 Osage Osage L-to-R 9.0 72
Osma 260 Osmanya Osmanya L-to-R 4.0 40
Palm 126 Palmyrene Palmyrene R-to-L 7.0 32 Ancient/historic
Pauc 263 Pau Cin Hau Pau Cin Hau L-to-R 7.0 57
Perm 227 Old Permic Old Permic L-to-R 7.0 43 Ancient/historic
Phag 331 Phags-pa Phags-pa T-to-B 5.0 56 Ancient/historic
Phli 131 Inscriptional Pahlavi Inscriptional Pahlavi R-to-L 5.2 27 Ancient/historic
Phlp 132 Psalter Pahlavi Psalter Pahlavi R-to-L 7.0 29 Ancient/historic
Phlv 133 Book Pahlavi R-to-L Not in Unicode
Phnx 115 Phoenician Phoenician R-to-L 5.0 29 Ancient/historic
Piqd 293 Klingon (KLI pIqaD) L-to-R Rejected for inclusion in the Unicode Standard[10][11]
Plrd 282 Miao (Pollard) Miao L-to-R 6.1 149
Prti 130 Inscriptional Parthian Inscriptional Parthian R-to-L 5.2 30 Ancient/historic
Qaaa 900 Reserved for private use (start) Not in Unicode
Qaai 908 (Private use) Not in Unicode (Before version 5.2, this was used instead of Zinh)
Qabx 949 Reserved for private use (end) Not in Unicode
Rjng 363 Rejang (Redjang, Kaganga) Rejang L-to-R 5.1 37
Rohg 167 Hanifi Rohingya Hanifi Rohingya R-to-L 11.0 50
Roro 620 Rongorongo Not in Unicode, proposal in initial/exploratory stage[8]
Runr 211 Runic Runic L-to-R 3.0 86 Ancient/historic
Samr 123 Samaritan Samaritan R-to-L 5.2 61
Sara 292 Sarati Not in Unicode
Sarb 105 Old South Arabian Old South Arabian R-to-L 5.2 32 Ancient/historic
Saur 344 Saurashtra Saurashtra L-to-R 5.1 82
Sgnw 095 SignWriting SignWriting T-to-B 8.0 672
Shaw 281 Shavian (Shaw) Shavian L-to-R 4.0 48
Shrd 319 Sharada, Śāradā Sharada L-to-R 6.1 94
Shui 530 Shuishu L-to-R Not in Unicode
Sidd 302 Siddham, Siddhaṃ, Siddhamātṛkā Siddham L-to-R 7.0 92 Ancient/historic
Sind 318 Khudawadi, Sindhi Khudawadi L-to-R 7.0 69
Sinh 348 Sinhala Sinhala L-to-R 3.0 110
Sogd 141 Sogdian Sogdian R-to-L 11.0 42 Ancient/historic
Sogo 142 Old Sogdian Old Sogdian R-to-L 11.0 40 Ancient/historic
Sora 398 Sora Sompeng Sora Sompeng L-to-R 6.1 35
Soyo 329 Soyombo Soyombo L-to-R 10.0 83 Ancient/historic
Sund 362 Sundanese Sundanese L-to-R 5.1 72
Sylo 316 Syloti Nagri Syloti Nagri L-to-R 4.1 44
Syrc 135 Syriac Syriac R-to-L 3.0 88
Syre 138 Syriac (Estrangelo variant) R-to-L Typographic variant of Syriac
Syrj 137 Syriac (Western variant) R-to-L Typographic variant of Syriac
Syrn 136 Syriac (Eastern variant) R-to-L Typographic variant of Syriac
Tagb 373 Tagbanwa Tagbanwa L-to-R 3.2 18
Takr 321 Takri, Ṭākrī, Ṭāṅkrī Takri L-to-R 6.1 67
Tale 353 Tai Le Tai Le L-to-R 4.0 35
Talu 354 New Tai Lue New Tai Lue L-to-R 4.1 83
Taml 346 Tamil Tamil L-to-R 1.0 123
Tang 520 Tangut Tangut L-to-R 9.0 6,892 Ancient/historic
Tavt 359 Tai Viet Tai Viet L-to-R 5.2 72
Telu 340 Telugu Telugu L-to-R 1.0 98
Teng 290 Tengwar L-to-R Not in Unicode
Tfng 120 Tifinagh (Berber) Tifinagh L-to-R 4.1 59
Tglg 370 Tagalog (Baybayin, Alibata) Tagalog L-to-R 3.2 20
Thaa 170 Thaana Thaana R-to-L 3.0 50
Thai 352 Thai Thai L-to-R 1.0 86
Tibt 330 Tibetan Tibetan L-to-R 2.0 207 Added in 1.0, removed in 1.1 and reintroduced in 2.0
Tirh 326 Tirhuta Tirhuta L-to-R 7.0 82
Ugar 040 Ugaritic Ugaritic L-to-R 4.0 31 Ancient/historic
Vaii 470 Vai Vai L-to-R 5.1 300
Visp 280 Visible Speech L-to-R Not in Unicode
Wara 262 Warang Citi (Varang Kshiti) Warang Citi L-to-R 7.0 84
Wcho 283 Wancho Wancho L-to-R 12.0 59
Wole 480 Woleai R-to-L Not in Unicode, proposal in initial/exploratory stage[8]
Xpeo 030 Old Persian Old Persian L-to-R 4.1 50 Ancient/historic
Xsux 020 Cuneiform, Sumero-Akkadian Cuneiform L-to-R 5.0 1,234 Ancient/historic
Yiii 460 Yi Yi L-to-R 3.0 1,220
Zanb 339 Zanabazar Square (Zanabazarin Dörböljin Useg, Xewtee Dörböljin Bicig, Horizontal Square Script) Zanabazar Square L-to-R 10.0 72 Ancient/historic
Zinh 994 Code for inherited script Inherited Inherited 571
Zmth 995 Mathematical notation L-to-R Not a 'script' in Unicode
Zsym 996 Symbols Not a 'script' in Unicode
Zsye 993 Symbols (emoji variant) Not a 'script' in Unicode
Zxxx 997 Code for unwritten documents Not a 'script' in Unicode
Zyyy 998 Code for undetermined script Common 7,804
Zzzz 999 Code for uncoded script Unknown 976,119 All other code points
Notes
  1. ^ ISO 15924 publications As of 26 August 2018
  2. ^ ISO 15924 Normative text file As of 26 August 2018
  3. ^ ISO 15924 Changes (including Aliases for Unicode; as of 26 August 2018)
  4. ^ Unicode version 12.0
  5. ^ Unicode charts
  6. ^ Unicode uses the "Property Value Alias" (Alias) as the script-name. These Alias names are part of Unicode and are published informatively next to ISO 15924
ISO standards by standard number
1–9999
10000–19999
20000+
Unicode
Code points
Characters
Processing
On pairs of
code points
Usage
Related standards
Related topics

This page is based on a Wikipedia article written by authors (here).
Text is available under the CC BY-SA 3.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.