ISO/IEC JTC1/SC2/WG2 N4142
L2/11-412
2011-10-25

Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
Source: Script Encoding Initiative (SEI)
Author: Anshuman Pandey (pandey@umich.edu)
Status: Liaison Contribution
Action: For consideration by UTC and WG2
Date: 2011-10-25

1 Introduction

This is a proposal to encode the Soyombo script in the Universal Character Set (ISO/IEC 10646). The request to include Soyombo in the UCS was made by the Mongolia and Japan national bodies in September 1998 (WG2 N1855 L2/98-358). An update on the status of the request was provided in January 2000 by Takayuki K. Sato (Japan), who stated that the project for encoding Soyombo was stalled due to funding issues (WG2 N2163 L2/00-055). The script was allocated to the Roadmap to the Supplementary Multilingual Plane (SMP) in WG2 Meeting 38 in March 2000 (WG2 N2203 L2/00-234). There was no further action.

The present proposal aims to fulfill the original request. It builds upon the following documents:

• N3949 L2/10-399: Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646
• N3986 L2/11-054: Determining the Encoding Model for Soyombo Vowels
• N4026 L2/11-125: Revised Preliminary Proposal to Encode Soyombo in the UCS

This document is a revision of N4026 and replaces it. Major changes to the preliminary proposal include a revision of the encoding model for vowels. Independent vowel letters have been replaced with a vowel-carrier letter and a set of dependent vowel signs. The encoding model for conjuncts has also changed. Previously, consonant ligatures found in script charts were included as independent characters. Additional research has shown the use of other conjuncts in manuscripts. As a result, the independent ligatures have been removed and a full set of subjoined letters has been proposed for encoding. This subjoined-letter model will enable the encoding of a broader range of conjuncts.

The Soyombo font used here is based upon the font developed by Oliver Corff in August 1996 for his “Soyombo for LaTeX” package. Modifications have been made to Corff’s font and several new glyphs added by the present author.

2 Background

Soyombo (Mongolian: Соёмбо бичиг ᠰᠣᠶᠣᠮᠪᠣ ᠪᠢᠴᠢᠭ soyombo bicig) is a script used for writing Mongolian, Sanskrit, and Tibetan. It was used mainly for producing ornamental Buddhist texts.¹ The script was designed in 1686 by Zanabazar (1635–1723), the first spiritual leader of Tibetan Buddhism in Mongolia, who also developed the Xäwtää Dörböljin (Horizontal Square) script.² The name soyombo (<U+1181B, U+11855, U+11816, U+11840, U+11835, U+11802>) is derived from Sanskrit स्वयंभु svayaṃbhu ‘self-existing’.

¹ Atwood 2004: 518.
² See N3956 L2/10-411.


Proposal to Encode the Soyombo Script in ISO/IEC 10646 (Anshuman Pandey)

There are several records in Soyombo, most of which are manuscripts, such as that shown in Figure 28. Soyombo is described in many secondary sources, such as studies of Mongolian scripts, e.g. Boldsaikhan et al. (2005), Shagdarsürüng (2001), etc. Soyombo is currently in use, as attested by a poem that was recently typeset using a digitized font (see Figure 30).

Names for characters are based upon those given in secondary sources, such as Shagdarsürüng (2001). Names for certain consonants follow those given in N1855 L2/98-358.

3 Writing System

3.1 Structure

Soyombo is an alphasyllabic script that is influenced by Tibetan and Devanagari. Vowels are represented using dependent vowel signs. Independent vowels are written by combining these signs with a vowel-carrier letter. Consonant letters have an inherent vowel, but in some language-specific contexts are purely alphabetic. Consonants are written using signs when syllable-final. Clusters of consonants are represented as conjuncts, which are written using subjoined forms of letters.

It is written left to right. Some records contain text written vertically from top to bottom (see Section 5.1).

The graphical structure of Soyombo letters is ‘two-dimensional, not linear’ (Corff 1996; see figures 4 and 5). Each letter is modelled upon the frame template. To this template are added distinct character primitives that represent the base letter. Vowel signs are written above the frame (U+11805), within it (U+11803), attached to a terminal (U+11809), or to the right of the frame (U+11807). Final-consonant signs are written inside the frame (U+11827). Consonant conjuncts are represented using subjoined forms of non-initial letters, which are written within the frame (U+11853). These features are described in further detail below.

3.2 Vowels

There are 21 letters shown in traditional charts of Soyombo for writing vowels:

a <U+11800>; ā <U+11800, U+11809>; i <U+11800, U+11801>; ī <U+11800, U+11801, U+11809>
ü <U+11800, U+11802>; ǖ <U+11800, U+11802, U+11809>; u <U+11800, U+11803>; ū <U+11800, U+11803, U+11809>
e <U+11800, U+11804>; ē <U+11800, U+11804, U+11809>; o <U+11800, U+11805>; ō <U+11800, U+11805, U+11809>
ö <U+11800, U+11806>; ȫ <U+11800, U+11806, U+11809>; au <U+11800, U+11807>; ai <U+11800, U+11808>
ṛ <U+11800, U+1183E>; ṝ <U+11800, U+1183E, U+11809>; ḷ <U+11800, U+1183F>; ḹ <U+11800, U+1183F, U+11809>; -a <U+1183C>

Apart from a and -a, these letters may be considered composite characters written using a ‘vowel-carrier’ letter and a distinct sign for each vowel:

a U+11800; ā U+11809; i U+11801; ī <U+11801, U+11809>
ü U+11802; ǖ <U+11802, U+11809>; u U+11803; ū <U+11803, U+11809>
e U+11804; ē <U+11804, U+11809>; o U+11805; ō <U+11805, U+11809>
ö U+11806; ȫ <U+11806, U+11809>; au U+11807; ai U+11808
ṛ U+1183E; ṝ <U+1183E, U+11809>; ḷ U+1183F; ḹ <U+1183F, U+11809>; -a U+1183C


The representation of vowels in Soyombo is based upon the Tibetan system, in which ཨ U+0F68 TIBETAN LETTER A is a vowel carrier to which dependent vowel signs are attached in order to write independent vowels. The letter U+11800 A is the Soyombo vowel-carrier. Long vowels are written using a combination of a basic vowel sign and the U+11809 VOWEL LENGTH MARK.

Given this structure, it is practical to encode a vowel-carrier letter and dependent vowel signs instead of a full set of independent vowel letters and their associated dependent forms. All independent and dependent vowel forms may be written using the following set of 13 characters:

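U+11800 (a, the vowel carrier)
U+11801 (i)
U+11802 (ü)
U+11803 (u)
U+11804 (e)
U+11805 (o)
U+11806 (ö)
U+11807 (au)
U+11808 (ai)
U+1183E (ṛ)
U+1183F (ḷ)
U+11809 (vowel length mark)
U+1183C (-a)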
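The carrier-plus-sign model can be illustrated with a short Python sketch. It assumes the code points proposed in this document (which may differ from any final allocation); the function names `independent_vowel` and `show` and the transliteration keys are illustrative only.

```python
# Code points as proposed in this document; treated here as assumptions.
CARRIER = 0x11800       # SOYOMBO LETTER A, the vowel carrier
LENGTH_MARK = 0x11809   # SOYOMBO VOWEL LENGTH MARK
VOWEL_SIGNS = {
    "i": 0x11801, "ü": 0x11802, "u": 0x11803, "e": 0x11804,
    "o": 0x11805, "ö": 0x11806, "au": 0x11807, "ai": 0x11808,
    "ṛ": 0x1183E, "ḷ": 0x1183F,
}


def independent_vowel(sign=None, long=False):
    """Return the code point sequence for an independent vowel.

    sign=None means the bare carrier: a, or ā when long=True.
    """
    seq = [CARRIER]
    if sign is not None:
        seq.append(VOWEL_SIGNS[sign])
    if long:
        # The length mark is written after the accompanying vowel sign.
        seq.append(LENGTH_MARK)
    return seq


def show(seq):
    """Format a sequence of code points in U+XXXX notation."""
    return " ".join(f"U+{cp:04X}" for cp in seq)


print(show(independent_vowel()))                # a
print(show(independent_vowel(long=True)))       # ā
print(show(independent_vowel("i", long=True)))  # ī
```

The sketch covers only the regular forms; the variant spellings and the letter -a are not modelled.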

3.2.1 Notes on vowels

1. Ordering: The U+11809 VOWEL LENGTH MARK is written after the accompanying vowel sign:

   <U+11800 A, U+11801 VOWEL SIGN I, U+11809 VOWEL LENGTH MARK> → ī

2. Long vowel ā: When the VOWEL LENGTH MARK is written with a bare vowel-carrier, it represents the long vowel ā: <U+11800, U+11809>.

3. Sanskrit vocalic letters: The signs ṛ, ṝ, ḷ, ḹ are used for representing Sanskrit.

4. Tibetan ‘a chung’: The vowel -a is used for writing Tibetan and corresponds to འ U+0F60 TIBETAN LETTER -A. It has a subjoined form (see Section 3.6.2).

5. Variant forms: The vowels <U+11800, U+11801> and <U+11800, U+11803> and their corresponding long forms are written by attaching the diphthong signs U+11807 (-au) and U+11808 (-ai) to the vowel letter <U+11800, U+11801>, i.e. <U+11801, U+11807> u, <U+11801, U+11809, U+11807> ū, <U+11801, U+11808> i, <U+11801, U+11809, U+11808> ī. The VOWEL LENGTH MARK is written before the sign for the diphthong.

3.3 Consonant Letters

The basic set of consonant letters is:

U+1180A..U+1181C: GA, KA, NGA, JA, CA, NYA, DA, TA, NA, BA, PA, MA, YA, RA, VA, LA, SHA, SA, HA

There are 14 additional consonant letters for writing Sanskrit. The names for these characters contain the descriptor ‘GALIG’, a Mongolian term indicating letters for non-Mongolian sounds. It is used here for distinguishing between distinct characters that are given identical names in script charts, e.g. U+1180A GA (Mongolian) and U+11829 GALIG GA (foreign), etc.

U+11829..U+11836: GALIG GA, GALIG GHA, GALIG JA, GALIG JHA, GALIG TTA, GALIG TTHA, GALIG DDA, GALIG DDHA, GALIG NNA, GALIG DA, GALIG DHA, GALIG BA, GALIG BHA, GALIG SSA

There are 5 additional letters for writing Tibetan. The descriptor ‘GALIG’ is also used in the names of these characters.

U+11837..U+1183B: GALIG TSA, GALIG TSHA, GALIG DZA, GALIG ZHA, GALIG ZA

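3.4 Consonant Signs

Syllable-final Mongolian consonants are written using contextual forms of letters. The following characters are shown in script charts as representing such consonants:

ag <U+11800, U+1181D>; ak <U+11800, U+1181E>; aṅ <U+11800, U+1181F>; ad <U+11800, U+11820>
an <U+11800, U+11821>; ab <U+11800, U+11822>; am <U+11800, U+11823>; ar <U+11800, U+11824>
al <U+11800, U+11825>; aś <U+11800, U+11826>; as <U+11800, U+11827>; āṅ <U+11800, U+11828>

These characters are combinations of U+11800 A + combining sign. The use of A is illustrative:

-g U+1181D; -k U+1181E; -ṅ U+1181F; -d U+11820; -n U+11821; -b U+11822
-m U+11823; -r U+11824; -l U+11825; -ś U+11826; -s U+11827; -āṅ U+11828

These 12 signs are proposed for encoding as the following characters:

U+1181D SOYOMBO CONSONANT SIGN G
U+1181E SOYOMBO CONSONANT SIGN K
U+1181F SOYOMBO CONSONANT SIGN NG
U+11820 SOYOMBO CONSONANT SIGN D
U+11821 SOYOMBO CONSONANT SIGN N
U+11822 SOYOMBO CONSONANT SIGN B
U+11823 SOYOMBO CONSONANT SIGN M
U+11824 SOYOMBO CONSONANT SIGN R
U+11825 SOYOMBO CONSONANT SIGN L
U+11826 SOYOMBO CONSONANT SIGN SH
U+11827 SOYOMBO CONSONANT SIGN S
U+11828 SOYOMBO CONSONANT SIGN ANG

The final-consonant sign is always written after any vowel sign and VOWEL LENGTH MARK:

<U+1180B KA, U+11801 VOWEL SIGN I, U+11809 VOWEL LENGTH MARK, U+1181F CONSONANT SIGN NG> → kīng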
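The ordering rule (base letter, then vowel sign, then length mark, then final-consonant sign) can be sketched in Python as follows. The code points are the proposed values from this document; `build_syllable` and `is_well_ordered` are hypothetical helper names, and the check deliberately ignores the variant vowel forms of Section 3.2.1, where the length mark precedes a diphthong sign.

```python
# Proposed code points from this document (assumptions of this sketch).
LENGTH_MARK = 0x11809
VOWEL_SIGNS = set(range(0x11801, 0x11809)) | {0x1183E, 0x1183F}
FINAL_SIGNS = set(range(0x1181D, 0x11829))  # CONSONANT SIGN G..ANG


def build_syllable(base, vowel=None, long=False, final=None):
    """Assemble a syllable in logical order: base, vowel, length, final."""
    if vowel is not None and vowel not in VOWEL_SIGNS:
        raise ValueError("not a vowel sign")
    if final is not None and final not in FINAL_SIGNS:
        raise ValueError("not a final-consonant sign")
    seq = [base]
    if vowel is not None:
        seq.append(vowel)
    if long:
        seq.append(LENGTH_MARK)
    if final is not None:
        seq.append(final)
    return seq


def is_well_ordered(seq):
    """True if the marks after the base follow vowel < length < final."""
    ranks = []
    for cp in seq[1:]:
        if cp in VOWEL_SIGNS:
            ranks.append(0)
        elif cp == LENGTH_MARK:
            ranks.append(1)
        elif cp in FINAL_SIGNS:
            ranks.append(2)
        else:
            return False
    return ranks == sorted(ranks)


# kīng: KA + VOWEL SIGN I + VOWEL LENGTH MARK + CONSONANT SIGN NG
king = build_syllable(0x1180B, vowel=0x11801, long=True, final=0x1181F)
print([f"U+{cp:04X}" for cp in king], is_well_ordered(king))
```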

3.5 Geminated Consonants

Doubled consonants are written using the U+11842 GEMINATION SIGN, which is stacked above the triangle of a letter: <U+1180A GA, U+11842 GEMINATION SIGN> → kka


3.6 Consonant Conjuncts

Consonant clusters in Sanskrit are represented as conjuncts. The initial consonant in a conjunct is the base letter. Non-initial consonants are written as subjoined forms, which are stacked within the frame of the base letter. Certain letters have prefixed forms when they occur in conjunct-initial position. These prefixed forms are attached to the following consonant. There is one conjunct that is written using an atomic ligature.

3.6.1 Prefixed forms

When LA, SHA, SA, and RA occur as C1, they are written using prefixed forms that are attached to the regular letter for C2. In such conjuncts, C2 functions as the base letter, but C1 is parsed first in the logical order.

• U+11843 PREFIXED LETTER LA: Conjunct-initial form of U+11819 LA. It is used for writing Tibetan ལ་མགོ la-mgo letters: <U+11843, U+1180A> → lka

• U+11844 PREFIXED LETTER SHA: Conjunct-initial form of U+1181A SHA: <U+11844, U+1180A> → śka

• U+11845 PREFIXED LETTER SA: Conjunct-initial form of U+1181B SA. It is used for writing Tibetan ས་མགོ sa-mgo letters: <U+11845, U+1180A> → ska

• U+11846 PREFIXED LETTER RA: Conjunct-initial form of U+11817 RA. It is used for writing Tibetan ར་མགོ ra-mgo letters: <U+11846, U+1180A> → rka
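A minimal Python sketch of the prefixed-letter model follows, assuming the proposed code points from this document. The mapping table and the function name `encode_prefixed_cluster` are illustrative; the sketch handles two-consonant clusters only.

```python
# Regular letter -> prefixed (conjunct-initial) form, proposed code points.
PREFIXED = {
    0x11819: 0x11843,  # LA  -> PREFIXED LETTER LA
    0x1181A: 0x11844,  # SHA -> PREFIXED LETTER SHA
    0x1181B: 0x11845,  # SA  -> PREFIXED LETTER SA
    0x11817: 0x11846,  # RA  -> PREFIXED LETTER RA
}


def encode_prefixed_cluster(c1, c2):
    """Encode C1 + C2 where C1 has a prefixed form.

    C1 comes first in logical order even though C2 is the base letter
    that the prefixed form attaches to.
    """
    if c1 not in PREFIXED:
        raise ValueError("C1 has no prefixed form")
    return [PREFIXED[c1], c2]


# rka: RA as C1, GA as C2
print([f"U+{cp:04X}" for cp in encode_prefixed_cluster(0x11817, 0x1180A)])
```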

3.6.2 Subjoined forms

Non-initial consonants of a cluster are written as subjoined forms attached to C1. Each letter has a subjoined form, paired in the same order:

Letters U+1180A..U+1181C (GA..HA) → subjoined forms U+11847..U+11859
Letters U+11829..U+1183C (GALIG GA..GALIG SMALL A) → subjoined forms U+1185A..U+1186D


The subjoined form is the base element of a letter. The glyphic representation of subjoined letters is discussed further in Section 5.5.
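Because the proposed subjoined letters appear in the same order as the regular letters, the pairing can be computed by offset. The Python sketch below assumes the proposed code points from Section 6; `subjoined` and `conjunct` are hypothetical helpers, and prefixed forms (Section 3.6.1) are not handled.

```python
def subjoined(cp):
    """Map a regular letter to its subjoined form (proposed code points)."""
    if 0x1180A <= cp <= 0x1181C:    # GA..HA
        return cp - 0x1180A + 0x11847
    if 0x11829 <= cp <= 0x1183C:    # GALIG GA..GALIG SMALL A
        return cp - 0x11829 + 0x1185A
    raise ValueError(f"U+{cp:04X} has no subjoined form in this sketch")


def conjunct(letters):
    """Encode a cluster: the first letter is the base, the rest subjoined."""
    first, *rest = letters
    return [first] + [subjoined(cp) for cp in rest]


# kya: GA as base letter followed by SUBJOINED LETTER YA
print([f"U+{cp:04X}" for cp in conjunct([0x1180A, 0x11816])])
```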

3.6.3 Independent conjunct

The conjunct U+1183D kṣa is used for writing the Sanskrit cluster kṣa <क U+0915, ् U+094D, ष U+0937>. It is proposed for independent encoding as SOYOMBO LETTER KSSA. It is different from other conjuncts because it cannot be produced using a subjoined letter: the shape of U+11867 SUBJOINED LETTER GALIG SSA does not correspond to any element in the graphical structure of KSSA.

3.6.4 Atomic ligatures in script charts

The following conjuncts are enumerated in charts of Soyombo:

kṣa (U+1183D), kya, kra, kla, kva, kka, ṅka, ñca, ṇṭa, nta, mpa, lka, śka, ska, rka

They are not atomic characters, but conjuncts. Moreover, they are not the only conjuncts used in Soyombo; others are shown in manuscripts. These conjuncts are likely shown in charts for purposes of illustrating the types of conjuncts that may be written.

3.7 Various Signs

• U+11840 SIGN ANUSVARA: Indicates nasalization in words of Sanskrit origin, e.g. oṃ <U+11800, U+11805, U+11840>. It corresponds to ཾ U+0F7E TIBETAN SIGN RJES SU NGA RO.

• U+11841 SIGN VISARGA: Represents an allophone of r or s at word-final position in Devanagari orthography for Sanskrit. It is used in Mongolian Square for writing Sanskrit and it corresponds to ཿ U+0F7F TIBETAN SIGN RNAM BCAD.

3.8 Punctuation

• U+1186E DANDA and U+1186F DOUBLE DANDA: These characters are used for marking the ends of sentences and longer sections. They correspond to । U+0964 DEVANAGARI DANDA and to ॥ U+0965 DEVANAGARI DOUBLE DANDA.

• U+11870 TSHEG: This character is used for marking the end of a syllable. It corresponds to ་ U+0F0B TIBETAN MARK INTERSYLLABIC TSHEG.

3.9 Head Marks

There are two ‘head marks’: U+11871 HEAD MARK and U+11872 TERMINAL MARK. These characters are used for indicating the beginning and end of a section of text.


3.10 Symbols

The U+11873 SYMBOL SVAYAMBHU was designed by Zanabazar, the creator of the script. It is an officially recognized symbol in Mongolia and appears on the flag and coat of arms of the country (see figures 34 and 35). The symbol is included as a character of the script, but it is not part of the orthography of written text.

4 Language-Specific Orthography

4.1 Tibetan

The representation of consonant clusters and syllable-final consonants in Tibetan words follows the orthography for Tibetan. For example, a word-initial bare consonant may be written using the regular letter instead of forming a conjunct with the following consonant. Also, a syllable-final consonant may be represented by the regular letter instead of the final-consonant sign. The word མཆོག mchog ‘supreme’ is written as <U+11815 MA, U+1180E CA, U+11805 VOWEL SIGN O, U+1180A GA, U+11870 TSHEG>, using the regular U+1180A GA instead of U+1181D CONSONANT SIGN G. The TSHEG is added after GA to mark the end of the syllable. The expected representation of mchog is <U+11815 MA, U+1184B SUBJOINED LETTER CA, U+11805 VOWEL SIGN O, U+1181D CONSONANT SIGN G>.
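The relationship between the two spellings of mchog can be sketched as a transformation from the expected conjunct spelling to the Tibetan-style spelling. This Python sketch assumes the proposed code points of this document and a sign-to-letter pairing inferred from the character names; the names `FINAL_TO_LETTER`, `unsubjoin`, and `tibetan_style` are illustrative, and the rule is modelled only as far as this single example shows it.

```python
TSHEG = 0x11870

# Final-consonant sign -> regular letter, inferred from the character
# names (an assumption of this sketch; CONSONANT SIGN ANG is omitted).
FINAL_TO_LETTER = {
    0x1181D: 0x1180A,  # G  -> GA
    0x1181E: 0x1180B,  # K  -> KA
    0x1181F: 0x1180C,  # NG -> NGA
    0x11820: 0x11810,  # D  -> DA
    0x11821: 0x11812,  # N  -> NA
    0x11822: 0x11813,  # B  -> BA
    0x11823: 0x11815,  # M  -> MA
    0x11824: 0x11817,  # R  -> RA
    0x11825: 0x11819,  # L  -> LA
    0x11826: 0x1181A,  # SH -> SHA
    0x11827: 0x1181B,  # S  -> SA
}


def unsubjoin(cp):
    """Map a subjoined letter back to its regular letter, else unchanged."""
    if 0x11847 <= cp <= 0x11859:
        return cp - 0x11847 + 0x1180A
    if 0x1185A <= cp <= 0x1186D:
        return cp - 0x1185A + 0x11829
    return cp


def tibetan_style(seq):
    """Rewrite a conjunct-style syllable with regular letters and a tsheg."""
    out = []
    for cp in seq:
        if cp in FINAL_TO_LETTER:
            out += [FINAL_TO_LETTER[cp], TSHEG]
        else:
            out.append(unsubjoin(cp))
    return out


expected = [0x11815, 0x1184B, 0x11805, 0x1181D]  # MA, SUBJOINED CA, O, SIGN G
print([f"U+{cp:04X}" for cp in tibetan_style(expected)])
```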

5 Rendering and Shaping Behaviors

5.1 Vertical Text

Soyombo may be written vertically from top to bottom and left to right. For example, the text segment dzürka, as shown in the seal in Figure 27:

dzür <U+11839 GALIG DZA, U+11802 VOWEL SIGN UE, U+11824 CONSONANT SIGN R>
ka <U+1180B KA>

In a vertical environment, Soyombo orthography departs from general rules for writing syllables. The syllables of a word may be written using subjoined letters, as if forming a conjunct. The example below shows the word temedeg ‘symbol’ <U+11811 TA, U+11804 VOWEL SIGN E, U+11815 MA, U+11804 VOWEL SIGN E, U+11810 DA, U+11804 VOWEL SIGN E, U+1181D CONSONANT SIGN G> written vertically. Its three syllables have the same vowel, which is represented using a single vowel sign above the base letter; the consonants of non-initial syllables are written using subjoined letters, while the final consonant is written using a final-consonant sign:

<U+11811 TA, U+11804 VOWEL SIGN E, U+11852 SUBJOINED LETTER MA, U+1184D SUBJOINED LETTER DA, U+1181D CONSONANT SIGN G>

A similar practice is observed in the vertical form of words comprised of syllables that contain different vowels. The example below shows the word yenu <U+11816 YA, U+11804 VOWEL SIGN E, U+11812 NA, U+11803 VOWEL SIGN U>. Its two syllables have different vowels, which are written above and below the base; the consonant of the second syllable is written as a subjoined form:

<U+11816 YA, U+11804 VOWEL SIGN E, U+1184F SUBJOINED LETTER NA, U+11803 VOWEL SIGN U>


It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if the vowels were different and written in the same position, in order to prevent clashing.
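The conjecture above can be expressed as a small predicate. The Python sketch below is only an illustration of the stated tendency, not a rule of the script: the `POSITION` table lists just the vowel signs whose placement this document mentions (proposed code points), the position labels are assumptions, and `can_stack` is a hypothetical name.

```python
# Placement of vowel signs as far as this document describes them.
POSITION = {
    0x11804: "above",  # VOWEL SIGN E  (temedeg example)
    0x11805: "above",  # VOWEL SIGN O  (Section 3.1)
    0x11802: "base",   # VOWEL SIGN UE (Section 5.2)
    0x11803: "base",   # VOWEL SIGN U  (Section 5.2)
    0x11807: "right",  # VOWEL SIGN AU (Section 3.1)
}


def can_stack(vowels):
    """Guess whether the syllables of a word may share one vertical stack.

    vowels holds the vowel sign of each syllable, in order. Stacking is
    taken to be plausible when all vowels are the same, or when the
    distinct vowels occupy different positions and so cannot clash.
    """
    distinct = set(vowels)
    if len(distinct) <= 1:
        return True
    positions = [POSITION[v] for v in distinct]  # KeyError if unlisted
    return len(set(positions)) == len(positions)


print(can_stack([0x11804, 0x11804, 0x11804]))  # temedeg: same vowel
print(can_stack([0x11804, 0x11803]))           # yenu: above and base
print(can_stack([0x11804, 0x11805]))           # e and o would clash above
```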

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs, U+11802 VOWEL SIGN UE and U+11803 VOWEL SIGN U:

• Shaping and positioning of VOWEL SIGN UE: <U+1180B KA, U+11802 VOWEL SIGN UE, U+1181F CONSONANT SIGN NG> → küng

• Shaping and positioning of VOWEL SIGN U: <U+1180B KA, U+11803 VOWEL SIGN U, U+1181F CONSONANT SIGN NG> → kung
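A renderer would need to detect the context in which this adjustment applies. The following Python sketch, which assumes the proposed code points of this document, finds base-position vowel signs that are followed by a final-consonant sign; the function name `compact_vowel_positions` is hypothetical, and how a font then substitutes or repositions the glyph is left open.

```python
BASE_VOWELS = {0x11802, 0x11803}            # VOWEL SIGN UE, VOWEL SIGN U
LENGTH_MARK = 0x11809
FINAL_SIGNS = set(range(0x1181D, 0x11829))  # CONSONANT SIGN G..ANG


def compact_vowel_positions(seq):
    """Return indexes of base-position vowel signs needing adjustment.

    A vowel sign qualifies when a final-consonant sign follows it,
    optionally with a vowel length mark in between.
    """
    hits = []
    for i, cp in enumerate(seq):
        if cp not in BASE_VOWELS:
            continue
        j = i + 1
        if j < len(seq) and seq[j] == LENGTH_MARK:
            j += 1
        if j < len(seq) and seq[j] in FINAL_SIGNS:
            hits.append(i)
    return hits


kueng = [0x1180B, 0x11802, 0x1181F]  # küng
print(compact_vowel_positions(kueng))
```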

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the U+11842 GEMINATION SIGN within the normal x-height. [Glyph comparison not reproduced.]

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <U+1180A GA, U+11853 SUBJOINED LETTER YA>, the regular shape of GA is altered so that the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C NGA is written with U+1180A GA, U+1180B KA, U+11829 GALIG GA, or U+1182A GALIG GHA. For instance, in writing nga, the regular shape of NGA changes before the subjoined letter is attached. A further modification is the break in the frame that occurs in certain conjuncts. [Glyphs not reproduced.]

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters. The position of the second subjoined letter would straddle the baseline or fall below it. In such cases, the stem of the base-letter frame would be extended:

grva = <U+11829 GALIG GA, U+11854 SUBJOINED LETTER RA, U+11855 SUBJOINED LETTER VA>

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. a stem-bearing variant glyph for U+11855 SUBJOINED LETTER VA, etc.

6 Character Data

61 Character Properties

11800SOYOMBO LETTER ALo0LN11801SOYOMBO VOWEL SIGN IMn0NSMN11802SOYOMBO VOWEL SIGN UMn0NSMN11803SOYOMBO VOWEL SIGN UEMn0NSMN11804SOYOMBO VOWEL SIGN EMn0NSMN11805SOYOMBO VOWEL SIGN OMn0NSMN11806SOYOMBO VOWEL SIGN OEMn0NSMN11807SOYOMBO VOWEL SIGN AUMn0NSMN11808SOYOMBO VOWEL SIGN AIMn0NSMN11809SOYOMBO VOWEL LENGTH MARKMn0NSMN1180ASOYOMBO LETTER GALo0LN1180BSOYOMBO LETTER KALo0LN1180CSOYOMBO LETTER NGALo0LN1180DSOYOMBO LETTER JALo0LN1180ESOYOMBO LETTER CALo0LN1180FSOYOMBO LETTER NYALo0LN11810SOYOMBO LETTER DALo0LN11811SOYOMBO LETTER TALo0LN11812SOYOMBO LETTER NALo0LN11813SOYOMBO LETTER BALo0LN11814SOYOMBO LETTER PALo0LN11815SOYOMBO LETTER MALo0LN11816SOYOMBO LETTER YALo0LN11817SOYOMBO LETTER RALo0LN11818SOYOMBO LETTER VALo0LN11819SOYOMBO LETTER LALo0LN1181ASOYOMBO LETTER SHALo0LN1181BSOYOMBO LETTER SALo0LN1181CSOYOMBO LETTER HALo0LN1181DSOYOMBO CONSONANT SIGN GMn0NSMN1181ESOYOMBO CONSONANT SIGN KMn0NSMN1181FSOYOMBO CONSONANT SIGN NGMn0NSMN11820SOYOMBO CONSONANT SIGN DMn0NSMN11821SOYOMBO CONSONANT SIGN NMn0NSMN11822SOYOMBO CONSONANT SIGN BMn0NSMN11823SOYOMBO CONSONANT SIGN MMn0NSMN11824SOYOMBO CONSONANT SIGN RMn0NSMN11825SOYOMBO CONSONANT SIGN LMn0NSMN11826SOYOMBO CONSONANT SIGN SHMn0NSMN11827SOYOMBO CONSONANT SIGN SMn0NSMN11828SOYOMBO CONSONANT SIGN ANGMn0NSMN11829SOYOMBO LETTER GALIG GALo0LN1182ASOYOMBO LETTER GALIG GHALo0LN1182BSOYOMBO LETTER GALIG JALo0LN1182CSOYOMBO LETTER GALIG JHALo0LN1182DSOYOMBO LETTER GALIG TTALo0LN1182ESOYOMBO LETTER GALIG TTHALo0LN1182FSOYOMBO LETTER GALIG DDALo0LN

9

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

11830SOYOMBO LETTER GALIG DDHALo0LN11831SOYOMBO LETTER GALIG NNALo0LN11832SOYOMBO LETTER GALIG DALo0LN11833SOYOMBO LETTER GALIG DHALo0LN11834SOYOMBO LETTER GALIG BALo0LN11835SOYOMBO LETTER GALIG BHALo0LN11836SOYOMBO LETTER GALIG SSALo0LN11837SOYOMBO LETTER GALIG TSALo0LN11838SOYOMBO LETTER GALIG TSHALo0LN11839SOYOMBO LETTER GALIG DZALo0LN1183ASOYOMBO LETTER GALIG ZHALo0LN1183BSOYOMBO LETTER GALIG ZALo0LN1183CSOYOMBO LETTER GALIG SMALL ALo0LN1183DSOYOMBO LETTER KSSALo0LN1183ESOYOMBO VOWEL SIGN VOCALIC RMn0NSMN1183FSOYOMBO VOWEL SIGN VOCALIC LMn0NSMN11840SOYOMBO SIGN ANUSVARAMn0NSMN11841SOYOMBO SIGN VISARGAMc0LN11842SOYOMBO GEMINATION SIGNMn0NSMN11843SOYOMBO PREFIXED LETTER LAMn0NSMN11844SOYOMBO PREFIXED LETTER SHAMn0NSMN11845SOYOMBO PREFIXED LETTER SAMn0NSMN11846SOYOMBO PREFIXED LETTER RAMn0NSMN11847SOYOMBO SUBJOINED LETTER GAMn0NSMN11848SOYOMBO SUBJOINED LETTER KAMn0NSMN11849SOYOMBO SUBJOINED LETTER NGAMn0NSMN1184ASOYOMBO SUBJOINED LETTER JAMn0NSMN1184BSOYOMBO SUBJOINED LETTER CAMn0NSMN1184CSOYOMBO SUBJOINED LETTER NYAMn0NSMN1184DSOYOMBO SUBJOINED LETTER DAMn0NSMN1184ESOYOMBO SUBJOINED LETTER TAMn0NSMN1184FSOYOMBO SUBJOINED LETTER NAMn0NSMN11850SOYOMBO SUBJOINED LETTER BAMn0NSMN11851SOYOMBO SUBJOINED LETTER PAMn0NSMN11852SOYOMBO SUBJOINED LETTER MAMn0NSMN11853SOYOMBO SUBJOINED LETTER YAMn0NSMN11854SOYOMBO SUBJOINED LETTER RAMn0NSMN11855SOYOMBO SUBJOINED LETTER VAMn0NSMN11856SOYOMBO SUBJOINED LETTER LAMn0NSMN11857SOYOMBO SUBJOINED LETTER SHAMn0NSMN11858SOYOMBO SUBJOINED LETTER SAMn0NSMN11859SOYOMBO SUBJOINED LETTER HAMn0NSMN1185ASOYOMBO SUBJOINED LETTER GALIG GAMn0NSMN1185BSOYOMBO SUBJOINED LETTER GALIG GHAMn0NSMN1185CSOYOMBO SUBJOINED LETTER GALIG JAMn0NSMN1185DSOYOMBO SUBJOINED LETTER GALIG JHAMn0NSMN1185ESOYOMBO SUBJOINED LETTER GALIG TTAMn0NSMN1185FSOYOMBO SUBJOINED LETTER GALIG TTHAMn0NSMN11860SOYOMBO SUBJOINED LETTER GALIG DDAMn0NSMN11861SOYOMBO SUBJOINED LETTER GALIG DDHAMn0NSMN11862SOYOMBO SUBJOINED LETTER GALIG NNAMn0NSMN11863SOYOMBO SUBJOINED LETTER GALIG 
DAMn0NSMN11864SOYOMBO SUBJOINED LETTER GALIG DHAMn0NSMN11865SOYOMBO SUBJOINED LETTER GALIG BAMn0NSMN11866SOYOMBO SUBJOINED LETTER GALIG BHAMn0NSMN11867SOYOMBO SUBJOINED LETTER GALIG SSAMn0NSMN11868SOYOMBO SUBJOINED LETTER GALIG TSAMn0NSMN11869SOYOMBO SUBJOINED LETTER GALIG TSHAMn0NSMN1186ASOYOMBO SUBJOINED LETTER GALIG DZAMn0NSMN1186BSOYOMBO SUBJOINED LETTER GALIG ZHAMn0NSMN1186CSOYOMBO SUBJOINED LETTER GALIG ZAMn0NSMN

10

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

1186DSOYOMBO SUBJOINED LETTER GALIG SMALL AMn0NSMN1186ESOYOMBO DANDAPo0LN1186FSOYOMBO DOUBLE DANDAPo0LN11870SOYOMBO TSHEGPo0LN11871SOYOMBO HEAD MARKPo0ONN11872SOYOMBO TERMINAL MARKPo0ONN11873SOYOMBO SYMBOL SVAYAMBHUSo0LN
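Records of this kind are straightforward to process once they are laid out in the semicolon-separated style of UnicodeData.txt. The Python sketch below assumes that layout (code point; name; general category; combining class; bidi class; further fields) and parses three sample records taken from the data above. `CharRecord` and `parse_record` are hypothetical names.

```python
from typing import NamedTuple


class CharRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_record(line):
    """Parse the first five fields of a UnicodeData.txt-style record."""
    fields = line.strip().split(";")
    return CharRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


SAMPLE = """\
11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
"""

records = [parse_record(line) for line in SAMPLE.splitlines()]
# Marks are the records whose general category starts with "M".
marks = [r.name for r in records if r.general_category.startswith("M")]
print(marks)
```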

6.2 Linebreaking Properties

Linebreaking properties, given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
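As an illustration of how such data is consumed, the following Python sketch parses range records in the LineBreak.txt style (`start..end;class # comment`) and looks up the class of a code point. The ranges are the proposed values from this section; `parse_ranges` and `line_break_class` are hypothetical names, and a linear scan is used for brevity.

```python
LINE_BREAK_DATA = """\
11800;AL
11801..11809;CM
1180A..1181C;AL
1181D..11828;CM
11829..1183D;AL
1183E..11842;CM
11843..1186D;CM
1186E..1186F;BA
11870;BA
11871;BB
11872;BA
11873;AL
"""


def parse_ranges(text):
    """Parse 'start..end;class' or 'cp;class' lines into tuples."""
    ranges = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        cps, cls = line.split(";")
        start, _, end = cps.partition("..")
        ranges.append((int(start, 16), int(end or start, 16), cls.strip()))
    return ranges


def line_break_class(cp, ranges):
    """Return the line break class for cp, or None if it is not listed."""
    for start, end, cls in ranges:
        if start <= cp <= end:
            return cls
    return None


RANGES = parse_ranges(LINE_BREAK_DATA)
print(line_break_class(0x11805, RANGES), line_break_class(0x11871, RANGES))
```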

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав). [Soyombo-script title not reproduced.] Ed. by T. Bulgan. Ulaanbaatar: Shambala Association; System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”. v. 0.9. September 2, 1996. httpuserpagefu-berlinde~corffimSoyombooverviewSoyombohtml

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie” (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. httpwwwviahistoriacomSilverHordemainhtmlresearchMongolScriptshtml

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

———. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

———. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

———. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

———. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

———. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

———. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

[Figure 1: Proposed code chart for Soyombo, block range 11800..1187F, with characters at 11800..11873. Chart printed using UniBook™ (http://www.unicode.org/unibook/), dated 28-Oct-2011.]

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 2

11866Soyombo11800

11830 71728 SOYOMBO LETTER GALIG DDHA11831 71729 SOYOMBO LETTER GALIG NNA11832 71730 SOYOMBO LETTER GALIG DA11833 71731 SOYOMBO LETTER GALIG DHA11834 71732 SOYOMBO LETTER GALIG BA11835 71733 SOYOMBO LETTER GALIG BHA11836 71734 SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837 SOYOMBO LETTER GALIG TSA
11838 SOYOMBO LETTER GALIG TSHA
11839 SOYOMBO LETTER GALIG DZA
1183A SOYOMBO LETTER GALIG ZHA
1183B SOYOMBO LETTER GALIG ZA
1183C SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E SOYOMBO SIGN VOCALIC R
1183F SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840 SOYOMBO SIGN ANUSVARA
11841 SOYOMBO SIGN VISARGA

Gemination mark
11842 SOYOMBO GEMINATION MARK

Prefixed letters
11843 SOYOMBO PREFIXED LETTER LA
11844 SOYOMBO PREFIXED LETTER SHA
11845 SOYOMBO PREFIXED LETTER SA
11846 SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847 SOYOMBO SUBJOINED LETTER GA
11848 SOYOMBO SUBJOINED LETTER KA
11849 SOYOMBO SUBJOINED LETTER NGA
1184A SOYOMBO SUBJOINED LETTER JA
1184B SOYOMBO SUBJOINED LETTER CA
1184C SOYOMBO SUBJOINED LETTER NYA
1184D SOYOMBO SUBJOINED LETTER DA
1184E SOYOMBO SUBJOINED LETTER TA
1184F SOYOMBO SUBJOINED LETTER NA
11850 SOYOMBO SUBJOINED LETTER BA
11851 SOYOMBO SUBJOINED LETTER PA
11852 SOYOMBO SUBJOINED LETTER MA
11853 SOYOMBO SUBJOINED LETTER YA
11854 SOYOMBO SUBJOINED LETTER RA
11855 SOYOMBO SUBJOINED LETTER VA
11856 SOYOMBO SUBJOINED LETTER LA
11857 SOYOMBO SUBJOINED LETTER SHA
11858 SOYOMBO SUBJOINED LETTER SA
11859 SOYOMBO SUBJOINED LETTER HA
1185A SOYOMBO SUBJOINED LETTER GALIG GA
1185B SOYOMBO SUBJOINED LETTER GALIG GHA
1185C SOYOMBO SUBJOINED LETTER GALIG JA
1185D SOYOMBO SUBJOINED LETTER GALIG JHA
1185E SOYOMBO SUBJOINED LETTER GALIG TTA
1185F SOYOMBO SUBJOINED LETTER GALIG TTHA
11860 SOYOMBO SUBJOINED LETTER GALIG DDA
11861 SOYOMBO SUBJOINED LETTER GALIG DDHA
11862 SOYOMBO SUBJOINED LETTER GALIG NNA
11863 SOYOMBO SUBJOINED LETTER GALIG DA
11864 SOYOMBO SUBJOINED LETTER GALIG DHA
11865 SOYOMBO SUBJOINED LETTER GALIG BA
11866 SOYOMBO SUBJOINED LETTER GALIG BHA

Vowel carrier
11800 SOYOMBO LETTER A

Vowel signs
11801 SOYOMBO VOWEL SIGN I
11802 SOYOMBO VOWEL SIGN UE
11803 SOYOMBO VOWEL SIGN U
11804 SOYOMBO VOWEL SIGN E
11805 SOYOMBO VOWEL SIGN O
11806 SOYOMBO VOWEL SIGN OE
11807 SOYOMBO VOWEL SIGN AU
11808 SOYOMBO VOWEL SIGN AI

Vowel length mark
11809 SOYOMBO VOWEL LENGTH MARK

Consonants
1180A SOYOMBO LETTER GA
    • Used for Sanskrit ka
1180B SOYOMBO LETTER KA
    • Used for Sanskrit kha
1180C SOYOMBO LETTER NGA
1180D SOYOMBO LETTER JA
    • Used for Sanskrit ca
1180E SOYOMBO LETTER CA
    • Used for Sanskrit cha
1180F SOYOMBO LETTER NYA
11810 SOYOMBO LETTER DA
    • Used for Sanskrit ta
11811 SOYOMBO LETTER TA
    • Used for Sanskrit tha
11812 SOYOMBO LETTER NA
11813 SOYOMBO LETTER BA
    • Used for Sanskrit pa
11814 SOYOMBO LETTER PA
    • Used for Sanskrit pha
11815 SOYOMBO LETTER MA
11816 SOYOMBO LETTER YA
11817 SOYOMBO LETTER RA
11818 SOYOMBO LETTER VA
11819 SOYOMBO LETTER LA
1181A SOYOMBO LETTER SHA
1181B SOYOMBO LETTER SA
1181C SOYOMBO LETTER HA

Final consonant signs
1181D SOYOMBO CONSONANT SIGN G
1181E SOYOMBO CONSONANT SIGN K
1181F SOYOMBO CONSONANT SIGN NG
11820 SOYOMBO CONSONANT SIGN D
11821 SOYOMBO CONSONANT SIGN N
11822 SOYOMBO CONSONANT SIGN B
11823 SOYOMBO CONSONANT SIGN M
11824 SOYOMBO CONSONANT SIGN R
11825 SOYOMBO CONSONANT SIGN L
11826 SOYOMBO CONSONANT SIGN SH
11827 SOYOMBO CONSONANT SIGN S
11828 SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829 SOYOMBO LETTER GALIG GA
1182A SOYOMBO LETTER GALIG GHA
1182B SOYOMBO LETTER GALIG JA
1182C SOYOMBO LETTER GALIG JHA
1182D SOYOMBO LETTER GALIG TTA
1182E SOYOMBO LETTER GALIG TTHA
1182F SOYOMBO LETTER GALIG DDA

Figure 2: Proposed names list for Soyombo (continued in Figure 3).

Proposal to Encode the Soyombo Script in ISO/IEC 10646 Anshuman Pandey

Printed using UniBook™ (http://www.unicode.org/unibook/)

Date: 28-Oct-2011

11867 Soyombo 11873

11867 SOYOMBO SUBJOINED LETTER GALIG SSA
11868 SOYOMBO SUBJOINED LETTER GALIG TSA
11869 SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A SOYOMBO SUBJOINED LETTER GALIG DZA
1186B SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C SOYOMBO SUBJOINED LETTER GALIG ZA
1186D SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E SOYOMBO DANDA
1186F SOYOMBO DOUBLE DANDA
11870 SOYOMBO TSHEG
    → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871 SOYOMBO HEAD MARK
11872 SOYOMBO TERMINAL MARK

Symbol
11873 SOYOMBO SYMBOL SVAYAMBHU
    • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2).

[Diagram with positions labelled: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996).

[Diagram with positions labelled: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo.

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes; (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block: Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below; see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical – Justification

1. Has this proposal for addition of character(s) been submitted before? No. If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? György Kara <gkara@indiana.edu>. If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes. Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes. If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: Combining vowel and final-consonant signs. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No. If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)


There are several records in Soyombo, most of which are manuscripts, such as that shown in Figure 28. Soyombo is described in many secondary sources, such as studies of Mongolian scripts, e.g. Boldsaikhan et al. (2005), Shagdarsürüng (2001), etc. Soyombo is currently in use, as attested by a poem that was recently typeset using a digitized font (see Figure 30).

Names for characters are based upon those given in secondary sources, such as Shagdarsürüng (2001). Names for certain consonants follow those given in N1855 L2/98-358.

3 Writing System

3.1 Structure

Soyombo is an alphasyllabic script that is influenced by Tibetan and Devanagari. Vowels are represented using dependent vowel signs. Independent vowels are written by combining these signs with a vowel-carrier letter. Consonant letters have an inherent vowel, but in some language-specific contexts are purely alphabetic. Consonants are written using signs when syllable-final. Clusters of consonants are represented as conjuncts, which are written using subjoined forms of letters.

It is written left to right. Some records contain text written vertically from top to bottom (see Section 5.1).

The graphical structure of Soyombo letters is ‘two-dimensional, not linear’ (Corff 1996; see figures 4 and 5). Each letter is modelled upon a common frame template. To this template are added distinct character primitives that represent the base letter. Vowel signs are written above the frame (e.g. U+11805 VOWEL SIGN O), within it (e.g. U+11803 VOWEL SIGN U), attached to a terminal (e.g. U+11809 VOWEL LENGTH MARK), or to the right of the frame (e.g. U+11807 VOWEL SIGN AU). Final-consonant signs are written inside the frame (e.g. U+11827 CONSONANT SIGN S). Consonant conjuncts are represented using subjoined forms of non-initial letters, which are written within the frame (e.g. U+11853 SUBJOINED LETTER YA). These features are described in further detail below.

3.2 Vowels

There are 21 letters shown in traditional charts of Soyombo for writing vowels. Apart from a and -a, these letters may be considered composite characters written using a ‘vowel-carrier’ letter and a distinct sign for each vowel. Each letter is listed here as it appears in the charts, followed by the sign or signs on their own (code points are the proposed values):

Vowel   Letter in charts           Sign(s) alone
a       <11800>                    <11800>
ā       <11800, 11809>             <11809>
i       <11800, 11801>             <11801>
ī       <11800, 11801, 11809>      <11801, 11809>
ü       <11800, 11802>             <11802>
ǖ       <11800, 11802, 11809>      <11802, 11809>
u       <11800, 11803>             <11803>
ū       <11800, 11803, 11809>      <11803, 11809>
e       <11800, 11804>             <11804>
ē       <11800, 11804, 11809>      <11804, 11809>
o       <11800, 11805>             <11805>
ō       <11800, 11805, 11809>      <11805, 11809>
ö       <11800, 11806>             <11806>
ȫ       <11800, 11806, 11809>      <11806, 11809>
au      <11800, 11807>             <11807>
ai      <11800, 11808>             <11808>
ṛ       <11800, 1183E>             <1183E>
ṝ       <11800, 1183E, 11809>      <1183E, 11809>
ḷ       <11800, 1183F>             <1183F>
ḹ       <11800, 1183F, 11809>      <1183F, 11809>
-a      <1183C>                    <1183C>

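The representation of vowels in Soyombo is based upon the Tibetan system, in which ཨ U+0F68 TIBETAN LETTER A is a vowel carrier to which dependent vowel signs are attached in order to write independent vowels. The letter U+11800 SOYOMBO LETTER A is the Soyombo vowel-carrier. Long vowels are written using a combination of a basic vowel sign and U+11809 SOYOMBO VOWEL LENGTH MARK.

Given this structure, it is practical to encode a vowel-carrier letter and dependent vowel signs instead of a full set of independent vowel letters and their associated dependent forms. All independent and dependent vowel forms may be written using the following set of 13 characters:

11800 SOYOMBO LETTER A
11801 SOYOMBO VOWEL SIGN I
11802 SOYOMBO VOWEL SIGN UE
11803 SOYOMBO VOWEL SIGN U
11804 SOYOMBO VOWEL SIGN E
11805 SOYOMBO VOWEL SIGN O
11806 SOYOMBO VOWEL SIGN OE
11807 SOYOMBO VOWEL SIGN AU
11808 SOYOMBO VOWEL SIGN AI
1183E SOYOMBO SIGN VOCALIC R
1183F SOYOMBO SIGN VOCALIC L
11809 SOYOMBO VOWEL LENGTH MARK
1183C SOYOMBO LETTER GALIG SMALL A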

3.2.1 Notes on vowels

1. Ordering: The VOWEL LENGTH MARK (U+11809) is written after the accompanying vowel sign:

   <11800 LETTER A + 11801 VOWEL SIGN I + 11809 VOWEL LENGTH MARK> → ī

2. Long vowel ā: When the VOWEL LENGTH MARK is written with a bare vowel-carrier, it represents the long vowel ā: <11800, 11809>.

3. Sanskrit vocalic letters: The signs for ṛ, ṝ, ḷ, ḹ are used for representing Sanskrit.

4. Tibetan ‘a chung’: The vowel -a (U+1183C) is used for writing Tibetan and corresponds to འ U+0F60 TIBETAN LETTER -A. It has a subjoined form (see Section 3.6.2).

5. Variant forms: Variant forms of the vowels i <11800, 11801> and u <11800, 11803>, and of their corresponding long forms, are written by attaching the diphthong signs U+11807 VOWEL SIGN AU and U+11808 VOWEL SIGN AI to the vowel letter i <11800, 11801>, i.e. <11801, 11807> u, <11801, 11809, 11807> ū, <11801, 11808> i, <11801, 11809, 11808> ī. The VOWEL LENGTH MARK is written before the sign for the diphthong.
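The vowel model in this section can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the code points are the values proposed in this document (they should not be assumed to match any published standard), and the names `independent_vowel` and `format_sequence` are hypothetical helpers introduced here.

```python
# Illustrative sketch only. Code points are the values PROPOSED in this
# document; helper names are hypothetical and not part of the proposal.
LETTER_A = 0x11800           # vowel carrier
VOWEL_LENGTH_MARK = 0x11809

# Dependent vowel signs keyed by transliteration (from the names list).
VOWEL_SIGNS = {
    "i": 0x11801, "ü": 0x11802, "u": 0x11803, "e": 0x11804,
    "o": 0x11805, "ö": 0x11806, "au": 0x11807, "ai": 0x11808,
    "ṛ": 0x1183E, "ḷ": 0x1183F,
}


def independent_vowel(vowel: str, long: bool = False) -> list[int]:
    """Return the code point sequence for an independent vowel.

    The carrier comes first, then the vowel sign (none for 'a'),
    then the length mark, which always follows the vowel sign.
    """
    sequence = [LETTER_A]
    if vowel != "a":
        sequence.append(VOWEL_SIGNS[vowel])
    if long:
        sequence.append(VOWEL_LENGTH_MARK)
    return sequence


def format_sequence(sequence: list[int]) -> str:
    """Render a sequence in the <XXXXX, XXXXX> notation used in the text."""
    return "<" + ", ".join(f"{cp:04X}" for cp in sequence) + ">"


print(format_sequence(independent_vowel("i", long=True)))
print(format_sequence(independent_vowel("a", long=True)))
```

The sketch covers only the carrier-plus-sign composition of notes 1 and 2; the variant forms of note 5 and the Tibetan -a letter are deliberately left out.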

3.3 Consonant Letters

The basic set of consonant letters is:

U+1180A to U+1181C: GA, KA, NGA, JA, CA, NYA, DA, TA, NA, BA, PA, MA, YA, RA, VA, LA, SHA, SA, HA

There are 14 additional consonant letters for writing Sanskrit. The names for these characters contain the descriptor ‘GALIG’, a Mongolian term indicating letters for non-Mongolian sounds. It is used here for distinguishing between distinct characters that are given identical names in script charts, e.g. U+1180A GA (Mongolian) and U+11829 GALIG GA (foreign), etc.

U+11829 to U+11836: GALIG GA, GALIG GHA, GALIG JA, GALIG JHA, GALIG TTA, GALIG TTHA, GALIG DDA, GALIG DDHA, GALIG NNA, GALIG DA, GALIG DHA, GALIG BA, GALIG BHA, GALIG SSA

There are 5 additional letters for writing Tibetan. The descriptor ‘GALIG’ is also used in the names of these characters.

U+11837 to U+1183B: GALIG TSA, GALIG TSHA, GALIG DZA, GALIG ZHA, GALIG ZA

3.4 Consonant Signs

Syllable-final Mongolian consonants are written using contextual forms of letters. Script charts show these consonants as combinations of U+11800 LETTER A and a combining sign; the use of LETTER A is illustrative. The 12 signs are proposed for encoding as the following characters:

Code point   Name                           With LETTER A   Sign alone
1181D        SOYOMBO CONSONANT SIGN G       ag              -g
1181E        SOYOMBO CONSONANT SIGN K       ak              -k
1181F        SOYOMBO CONSONANT SIGN NG      aṅ              -ṅ
11820        SOYOMBO CONSONANT SIGN D       ad              -d
11821        SOYOMBO CONSONANT SIGN N       an              -n
11822        SOYOMBO CONSONANT SIGN B       ab              -b
11823        SOYOMBO CONSONANT SIGN M       am              -m
11824        SOYOMBO CONSONANT SIGN R       ar              -r
11825        SOYOMBO CONSONANT SIGN L       al              -l
11826        SOYOMBO CONSONANT SIGN SH      aś              -ś
11827        SOYOMBO CONSONANT SIGN S       as              -s
11828        SOYOMBO CONSONANT SIGN ANG     āṅ              -āṅ

The final-consonant sign is always written after any vowel sign and VOWEL LENGTH MARK:

<1180B LETTER KA + 11801 VOWEL SIGN I + 11809 VOWEL LENGTH MARK + 1181F CONSONANT SIGN NG> → kīng
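The ordering rule above (base letter, then vowel sign, then length mark, then final-consonant sign) can be checked mechanically. The following Python sketch is a hedged illustration: it uses the code points proposed in this document, the function names are hypothetical, and it deliberately ignores subjoined letters, prefixed letters, and the Sanskrit signs, which this section does not order.

```python
# Illustrative sketch only; code points are the PROPOSED values and the
# function names are hypothetical. Conjuncts and Sanskrit signs are ignored.
VOWEL_SIGNS = set(range(0x11801, 0x11809)) | {0x1183E, 0x1183F}
VOWEL_LENGTH_MARK = 0x11809
FINAL_CONSONANT_SIGNS = set(range(0x1181D, 0x11829))  # SIGN G .. SIGN ANG


def rank(code_point: int) -> int:
    """Position class of a character within a simple syllable."""
    if code_point in VOWEL_SIGNS:
        return 1
    if code_point == VOWEL_LENGTH_MARK:
        return 2
    if code_point in FINAL_CONSONANT_SIGNS:
        return 3
    return 0  # anything else is treated as the base letter


def is_well_ordered(syllable: list[int]) -> bool:
    """True if the syllable is base + [vowel sign] + [length] + [final]."""
    if not syllable:
        return False
    ranks = [rank(cp) for cp in syllable]
    return ranks[0] == 0 and all(a < b for a, b in zip(ranks, ranks[1:]))


king = [0x1180B, 0x11801, 0x11809, 0x1181F]   # KA, I, LENGTH MARK, NG
print(is_well_ordered(king))                   # True
print(is_well_ordered([0x1180B, 0x1181F, 0x11801]))  # False
```

Requiring strictly increasing ranks after the base also rejects repeated marks of the same class, which is an assumption of this sketch rather than a rule stated in the proposal.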

3.5 Geminated Consonants

Doubled consonants are written using the GEMINATION MARK (U+11842), which is stacked above the triangle of a letter: <1180A + 11842> → kka.

3.6 Consonant Conjuncts

Consonant clusters in Sanskrit are represented as conjuncts. The initial consonant in a conjunct is the base letter. Non-initial consonants are written as subjoined forms, which are stacked within the frame of the base letter. Certain letters have prefixed forms when they occur in conjunct-initial position. These prefixed forms are attached to the following consonant. There is one conjunct that is written using an atomic ligature.

3.6.1 Prefixed forms

When LA, SHA, SA, and RA occur as C1, they are written using prefixed forms that are attached to the regular letter for C2. In such conjuncts, C2 functions as the base letter, but C1 is parsed first in the logical order.

• U+11843 PREFIXED LETTER LA: Conjunct-initial form of U+11819 LA. It is used for writing Tibetan ལ་མགོ la-mgo letters: <11843, 1180A> → lka

• U+11844 PREFIXED LETTER SHA: Conjunct-initial form of U+1181A SHA: <11844, 1180A> → śka

• U+11845 PREFIXED LETTER SA: Conjunct-initial form of U+1181B SA. It is used for writing Tibetan ས་མགོ sa-mgo letters: <11845, 1180A> → ska

• U+11846 PREFIXED LETTER RA: Conjunct-initial form of U+11817 RA. It is used for writing Tibetan ར་མགོ ra-mgo letters: <11846, 1180A> → rka

3.6.2 Subjoined forms

Non-initial consonants of a cluster are written as subjoined forms attached to C1. Each regular letter has a subjoined counterpart, assigned in the same order:

Regular letters                                      Subjoined forms
U+1180A to U+1181C (GA to HA)                        U+11847 to U+11859
U+11829 to U+1183C (GALIG GA to GALIG SMALL A)       U+1185A to U+1186D

The subjoined form is the base element of a letter. The glyphic representation of subjoined letters is discussed further in Section 5.5.
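Because the subjoined letters are proposed in the same order as their regular counterparts, the mapping can be expressed as two fixed offsets. The Python sketch below illustrates this under the assumption that the proposed code points are used as listed in the names list; `subjoined_form` and `conjunct` are hypothetical names, and prefixed forms and the atomic KSSA letter are not handled.

```python
# Illustrative sketch only; code points are the PROPOSED values and the
# function names are hypothetical. Prefixed forms and KSSA are not handled.
def subjoined_form(code_point: int) -> int:
    """Map a regular consonant letter to its subjoined counterpart."""
    if 0x1180A <= code_point <= 0x1181C:      # GA .. HA
        return code_point + 0x3D              # -> 11847 .. 11859
    if 0x11829 <= code_point <= 0x1183C:      # GALIG GA .. GALIG SMALL A
        return code_point + 0x31              # -> 1185A .. 1186D
    raise ValueError(f"no subjoined form for {code_point:04X}")


def conjunct(letters: list[int]) -> list[int]:
    """Encode a cluster: C1 stays a base letter, the rest are subjoined."""
    first, *rest = letters
    return [first] + [subjoined_form(cp) for cp in rest]


# kya: GA (used for Sanskrit ka) + YA
print([f"{cp:04X}" for cp in conjunct([0x1180A, 0x11816])])
# grva: GALIG GA + RA + VA
print([f"{cp:04X}" for cp in conjunct([0x11829, 0x11817, 0x11818])])
```

An input method or transliteration tool could use such a mapping to let users type regular letters and derive the subjoined forms automatically; that use is a suggestion of this sketch, not something the proposal specifies.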

3.6.3 Independent conjunct

The conjunct U+1183D kṣa is used for writing the Sanskrit cluster kṣa <क U+0915, ् U+094D, ष U+0937>. It is proposed for independent encoding as SOYOMBO LETTER KSSA. It is different from other conjuncts because it cannot be produced using a subjoined letter: the shape of U+11867 SUBJOINED LETTER GALIG SSA does not correspond to any element in the graphical structure of KSSA.

3.6.4 Atomic ligatures in script charts

The following conjuncts are enumerated in charts of Soyombo:

kṣa (U+1183D), kya, kra, kla, kva, kka, ṅka, ñca, ṇṭa, nta, mpa, lka, śka, ska, rka

They are not atomic characters, but conjuncts. Moreover, they are not the only conjuncts used in Soyombo; others are shown in manuscripts. These conjuncts are likely shown in charts for purposes of illustrating the types of conjuncts that may be written.

3.7 Various Signs

• U+11840 SIGN ANUSVARA: Indicates nasalization in words of Sanskrit origin, e.g. oṃ <11800, 11805, 11840>. It corresponds to U+0F7E TIBETAN SIGN RJES SU NGA RO.

• U+11841 SIGN VISARGA: Represents an allophone of r or s at word-final position in Devanagari orthography for Sanskrit. It is used in Mongolian Square for writing Sanskrit, and it corresponds to ཿ U+0F7F TIBETAN SIGN RNAM BCAD.

3.8 Punctuation

• U+1186E DANDA and U+1186F DOUBLE DANDA: These characters are used for marking the ends of sentences and longer sections. They correspond to । U+0964 DEVANAGARI DANDA and to ॥ U+0965 DEVANAGARI DOUBLE DANDA.

• U+11870 TSHEG: This character is used for marking the end of a syllable. It corresponds to ་ U+0F0B TIBETAN MARK INTERSYLLABIC TSHEG.

3.9 Head Marks

There are two ‘head marks’: U+11871 HEAD MARK and U+11872 TERMINAL MARK. These characters are used for indicating the beginning and end of a section of text.

3.10 Symbols

The U+11873 SYMBOL SVAYAMBHU was designed by Zanabazar, the creator of the script. It is an officially recognized symbol in Mongolia and appears on the flag and coat of arms of the country (see figures 34 and 35). The symbol is included as a character of the script, but it is not part of the orthography of written text.

4 Language-Specific Orthography

4.1 Tibetan

The representation of consonant clusters and syllable-final consonants in Tibetan words follows the orthography for Tibetan. For example, a word-initial bare consonant may be written using the regular letter instead of forming a conjunct with the following consonant. Also, a syllable-final consonant may be represented by the regular letter instead of the final-consonant sign. The word མཆོག mchog ‘supreme’ is written as <11815 MA, 1180E CA, 11805 VOWEL SIGN O, 1180A GA, 11870 TSHEG>, using the regular U+1180A GA instead of U+1181D CONSONANT SIGN G. The TSHEG is added after GA to mark the end of the syllable. The expected representation of mchog is <11815 MA, 1184B SUBJOINED LETTER CA, 11805 VOWEL SIGN O, 1181D CONSONANT SIGN G>.

5 Rendering and Shaping Behaviors

5.1 Vertical Text

Soyombo may be written vertically from top to bottom and left to right. For example, the text segment dzürka, as shown in the seal in Figure 27, is stacked as:

dzür: <11839 GALIG DZA, 11802 VOWEL SIGN UE, 11824 CONSONANT SIGN R>
ka: <1180B KA>

In a vertical environment, Soyombo orthography departs from general rules for writing syllables. The syllables of a word may be written using subjoined letters, as if forming a conjunct. The example below shows the word temedeg ‘symbol’ <11811 TA, 11804 E, 11815 MA, 11804 E, 11810 DA, 11804 E, 1181D SIGN G> written vertically. Its three syllables have the same vowel, which is represented using a single vowel sign above the base letter; the consonants of non-initial syllables are written using subjoined letters, while the final consonant is written using a final-consonant sign.

Vertical form: <11811 TA, 11804 E, 11852 SUBJOINED MA, 1184D SUBJOINED DA, 1181D SIGN G>
Syllables: <11811, 11804> te, <11815, 11804> me, <11810, 11804, 1181D> deg

A similar practice is observed in the vertical form of words comprised of syllables that contain different vowels. The example below shows the word yenu <11816 YA, 11804 E, 11812 NA, 11803 U>. Its two syllables have different vowels, which are written above and below the base; the consonant of the second syllable is written as a subjoined form.

Vertical form: <11816 YA, 11804 E, 1184F SUBJOINED NA, 11803 U>
Syllables: <11816, 11804> ye, <11812, 11803> nu

It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if the vowels were different and written in the same position, in order to prevent clashing.

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs: U+11802 VOWEL SIGN UE and U+11803 VOWEL SIGN U.

• Shaping and positioning of -ü (U+11802): <1180B KA + 11802 VOWEL SIGN UE + 1181F CONSONANT SIGN NG> → küng, with the vowel sign resized and repositioned.

• Shaping and positioning of -u (U+11803): <1180B KA + 11803 VOWEL SIGN U + 1181F CONSONANT SIGN NG> → kung, with the vowel sign resized and repositioned.

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the GEMINATION MARK within the normal x-height: compare [glyph] with [glyph].

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <1180A, 11853>, the regular shape of U+1180A is altered to [glyph], where the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C NGA is written with U+1180A, U+1180B, U+11829, or U+1182A. For instance, in writing ṅga the regular shape of U+1180C changes to [glyph] before [glyph] is attached. A further modification is the break in the frame that occurs in conjuncts involving [glyph].

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases, it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters. The position of the second subjoined letter would straddle the baseline or fall below it. In such cases, the stem of the base-letter frame would be extended:

grva = <11829 GALIG GA, 11854 SUBJOINED LETTER RA, 11855 SUBJOINED LETTER VA>

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. an extended glyph [glyph] for U+11855, etc.

6 Character Data

61 Character Properties

11800SOYOMBO LETTER ALo0LN11801SOYOMBO VOWEL SIGN IMn0NSMN11802SOYOMBO VOWEL SIGN UMn0NSMN11803SOYOMBO VOWEL SIGN UEMn0NSMN11804SOYOMBO VOWEL SIGN EMn0NSMN11805SOYOMBO VOWEL SIGN OMn0NSMN11806SOYOMBO VOWEL SIGN OEMn0NSMN11807SOYOMBO VOWEL SIGN AUMn0NSMN11808SOYOMBO VOWEL SIGN AIMn0NSMN11809SOYOMBO VOWEL LENGTH MARKMn0NSMN1180ASOYOMBO LETTER GALo0LN1180BSOYOMBO LETTER KALo0LN1180CSOYOMBO LETTER NGALo0LN1180DSOYOMBO LETTER JALo0LN1180ESOYOMBO LETTER CALo0LN1180FSOYOMBO LETTER NYALo0LN11810SOYOMBO LETTER DALo0LN11811SOYOMBO LETTER TALo0LN11812SOYOMBO LETTER NALo0LN11813SOYOMBO LETTER BALo0LN11814SOYOMBO LETTER PALo0LN11815SOYOMBO LETTER MALo0LN11816SOYOMBO LETTER YALo0LN11817SOYOMBO LETTER RALo0LN11818SOYOMBO LETTER VALo0LN11819SOYOMBO LETTER LALo0LN1181ASOYOMBO LETTER SHALo0LN1181BSOYOMBO LETTER SALo0LN1181CSOYOMBO LETTER HALo0LN1181DSOYOMBO CONSONANT SIGN GMn0NSMN1181ESOYOMBO CONSONANT SIGN KMn0NSMN1181FSOYOMBO CONSONANT SIGN NGMn0NSMN11820SOYOMBO CONSONANT SIGN DMn0NSMN11821SOYOMBO CONSONANT SIGN NMn0NSMN11822SOYOMBO CONSONANT SIGN BMn0NSMN11823SOYOMBO CONSONANT SIGN MMn0NSMN11824SOYOMBO CONSONANT SIGN RMn0NSMN11825SOYOMBO CONSONANT SIGN LMn0NSMN11826SOYOMBO CONSONANT SIGN SHMn0NSMN11827SOYOMBO CONSONANT SIGN SMn0NSMN11828SOYOMBO CONSONANT SIGN ANGMn0NSMN11829SOYOMBO LETTER GALIG GALo0LN1182ASOYOMBO LETTER GALIG GHALo0LN1182BSOYOMBO LETTER GALIG JALo0LN1182CSOYOMBO LETTER GALIG JHALo0LN1182DSOYOMBO LETTER GALIG TTALo0LN1182ESOYOMBO LETTER GALIG TTHALo0LN1182FSOYOMBO LETTER GALIG DDALo0LN

9

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

11830;SOYOMBO LETTER GALIG DDHA;Lo;0;L;;;;;N;;;;;
11831;SOYOMBO LETTER GALIG NNA;Lo;0;L;;;;;N;;;;;
11832;SOYOMBO LETTER GALIG DA;Lo;0;L;;;;;N;;;;;
11833;SOYOMBO LETTER GALIG DHA;Lo;0;L;;;;;N;;;;;
11834;SOYOMBO LETTER GALIG BA;Lo;0;L;;;;;N;;;;;
11835;SOYOMBO LETTER GALIG BHA;Lo;0;L;;;;;N;;;;;
11836;SOYOMBO LETTER GALIG SSA;Lo;0;L;;;;;N;;;;;
11837;SOYOMBO LETTER GALIG TSA;Lo;0;L;;;;;N;;;;;
11838;SOYOMBO LETTER GALIG TSHA;Lo;0;L;;;;;N;;;;;
11839;SOYOMBO LETTER GALIG DZA;Lo;0;L;;;;;N;;;;;
1183A;SOYOMBO LETTER GALIG ZHA;Lo;0;L;;;;;N;;;;;
1183B;SOYOMBO LETTER GALIG ZA;Lo;0;L;;;;;N;;;;;
1183C;SOYOMBO LETTER GALIG SMALL A;Lo;0;L;;;;;N;;;;;
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
1183E;SOYOMBO VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
1183F;SOYOMBO VOWEL SIGN VOCALIC L;Mn;0;NSM;;;;;N;;;;;
11840;SOYOMBO SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11842;SOYOMBO GEMINATION SIGN;Mn;0;NSM;;;;;N;;;;;
11843;SOYOMBO PREFIXED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11844;SOYOMBO PREFIXED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11845;SOYOMBO PREFIXED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11846;SOYOMBO PREFIXED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11847;SOYOMBO SUBJOINED LETTER GA;Mn;0;NSM;;;;;N;;;;;
11848;SOYOMBO SUBJOINED LETTER KA;Mn;0;NSM;;;;;N;;;;;
11849;SOYOMBO SUBJOINED LETTER NGA;Mn;0;NSM;;;;;N;;;;;
1184A;SOYOMBO SUBJOINED LETTER JA;Mn;0;NSM;;;;;N;;;;;
1184B;SOYOMBO SUBJOINED LETTER CA;Mn;0;NSM;;;;;N;;;;;
1184C;SOYOMBO SUBJOINED LETTER NYA;Mn;0;NSM;;;;;N;;;;;
1184D;SOYOMBO SUBJOINED LETTER DA;Mn;0;NSM;;;;;N;;;;;
1184E;SOYOMBO SUBJOINED LETTER TA;Mn;0;NSM;;;;;N;;;;;
1184F;SOYOMBO SUBJOINED LETTER NA;Mn;0;NSM;;;;;N;;;;;
11850;SOYOMBO SUBJOINED LETTER BA;Mn;0;NSM;;;;;N;;;;;
11851;SOYOMBO SUBJOINED LETTER PA;Mn;0;NSM;;;;;N;;;;;
11852;SOYOMBO SUBJOINED LETTER MA;Mn;0;NSM;;;;;N;;;;;
11853;SOYOMBO SUBJOINED LETTER YA;Mn;0;NSM;;;;;N;;;;;
11854;SOYOMBO SUBJOINED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11855;SOYOMBO SUBJOINED LETTER VA;Mn;0;NSM;;;;;N;;;;;
11856;SOYOMBO SUBJOINED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11857;SOYOMBO SUBJOINED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11858;SOYOMBO SUBJOINED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11859;SOYOMBO SUBJOINED LETTER HA;Mn;0;NSM;;;;;N;;;;;
1185A;SOYOMBO SUBJOINED LETTER GALIG GA;Mn;0;NSM;;;;;N;;;;;
1185B;SOYOMBO SUBJOINED LETTER GALIG GHA;Mn;0;NSM;;;;;N;;;;;
1185C;SOYOMBO SUBJOINED LETTER GALIG JA;Mn;0;NSM;;;;;N;;;;;
1185D;SOYOMBO SUBJOINED LETTER GALIG JHA;Mn;0;NSM;;;;;N;;;;;
1185E;SOYOMBO SUBJOINED LETTER GALIG TTA;Mn;0;NSM;;;;;N;;;;;
1185F;SOYOMBO SUBJOINED LETTER GALIG TTHA;Mn;0;NSM;;;;;N;;;;;
11860;SOYOMBO SUBJOINED LETTER GALIG DDA;Mn;0;NSM;;;;;N;;;;;
11861;SOYOMBO SUBJOINED LETTER GALIG DDHA;Mn;0;NSM;;;;;N;;;;;
11862;SOYOMBO SUBJOINED LETTER GALIG NNA;Mn;0;NSM;;;;;N;;;;;
11863;SOYOMBO SUBJOINED LETTER GALIG DA;Mn;0;NSM;;;;;N;;;;;
11864;SOYOMBO SUBJOINED LETTER GALIG DHA;Mn;0;NSM;;;;;N;;;;;
11865;SOYOMBO SUBJOINED LETTER GALIG BA;Mn;0;NSM;;;;;N;;;;;
11866;SOYOMBO SUBJOINED LETTER GALIG BHA;Mn;0;NSM;;;;;N;;;;;
11867;SOYOMBO SUBJOINED LETTER GALIG SSA;Mn;0;NSM;;;;;N;;;;;
11868;SOYOMBO SUBJOINED LETTER GALIG TSA;Mn;0;NSM;;;;;N;;;;;
11869;SOYOMBO SUBJOINED LETTER GALIG TSHA;Mn;0;NSM;;;;;N;;;;;
1186A;SOYOMBO SUBJOINED LETTER GALIG DZA;Mn;0;NSM;;;;;N;;;;;
1186B;SOYOMBO SUBJOINED LETTER GALIG ZHA;Mn;0;NSM;;;;;N;;;;;
1186C;SOYOMBO SUBJOINED LETTER GALIG ZA;Mn;0;NSM;;;;;N;;;;;
1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
1186F;SOYOMBO DOUBLE DANDA;Po;0;L;;;;;N;;;;;
11870;SOYOMBO TSHEG;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11872;SOYOMBO TERMINAL MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
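Character properties of this kind are conventionally exchanged as semicolon-delimited records in the 15-field layout of UnicodeData.txt. The following is a minimal sketch, not part of the proposal, of how such records could be read and tallied by general category. The function name `parse_record`, the dictionary keys, and the five-line sample are illustrative assumptions; the sample values are taken from the proposed properties.

```python
from collections import Counter

# Five sample records in the UnicodeData.txt layout (15 fields, ';'-separated).
SAMPLE = """\
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
1183E;SOYOMBO VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
"""


def parse_record(line):
    """Split one record and keep the fields used in this proposal."""
    fields = line.split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}: {line!r}")
    return {
        "code_point": int(fields[0], 16),
        "name": fields[1],
        "general_category": fields[2],
        "combining_class": int(fields[3]),
        "bidi_class": fields[4],
        "mirrored": fields[9] == "Y",
    }


records = [parse_record(line) for line in SAMPLE.splitlines() if line]
by_category = Counter(r["general_category"] for r in records)

for r in records:
    print(f"U+{r['code_point']:05X} {r['general_category']} {r['bidi_class']} {r['name']}")
```

Running the same parser over the full set of records would give a quick consistency check between the property data and the names list.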

6.2 Linebreaking Properties

Linebreaking properties are given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
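As an illustration of how the linebreaking assignments could be consumed, the sketch below parses range records in the LineBreak.txt style and looks up the class of a code point. The helper names `parse_ranges` and `line_break_class` are hypothetical, and the fallback class XX for unlisted code points is an assumption made for this example.

```python
# Proposed linebreaking data, in the LineBreak.txt style (range;class # comment).
LINEBREAK = """\
11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
"""


def parse_ranges(text):
    """Return a list of (first, last, line_break_class) tuples."""
    ranges = []
    for line in text.splitlines():
        data = line.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue
        span, cls = (part.strip() for part in data.split(";"))
        first, _, last = span.partition("..")
        ranges.append((int(first, 16), int(last or first, 16), cls))
    return ranges


def line_break_class(code_point, ranges, default="XX"):
    """Linear lookup; 'XX' (unknown) is an assumed fallback for this sketch."""
    for first, last, cls in ranges:
        if first <= code_point <= last:
            return cls
    return default


RANGES = parse_ranges(LINEBREAK)
print(line_break_class(0x11870, RANGES))  # class assigned to TSHEG
```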

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав). [Title also given in Soyombo script.] Ed. by T. Bulgan. Ulaanbaatar: Shambala Association, System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”. v. 0.9. September 2, 1996. http://userpage.fu-berlin.de/~corff/im/Soyombo/overview.Soyombo.html

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie”. (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. http://www.viahistoria.com/SilverHorde/main.html?research/MongolScripts.html

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf


Pandey, Anshuman. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

Figure 1: Proposed code chart for Soyombo, covering the range 11800..1187F with characters assigned at 11800..11873 (chart printed using UniBook™, http://www.unicode.org/unibook/; dated 28-Oct-2011).

Vowel carrier
11800  SOYOMBO LETTER A

Vowel signs
11801  SOYOMBO VOWEL SIGN I
11802  SOYOMBO VOWEL SIGN UE
11803  SOYOMBO VOWEL SIGN U
11804  SOYOMBO VOWEL SIGN E
11805  SOYOMBO VOWEL SIGN O
11806  SOYOMBO VOWEL SIGN OE
11807  SOYOMBO VOWEL SIGN AU
11808  SOYOMBO VOWEL SIGN AI

Vowel length mark
11809  SOYOMBO VOWEL LENGTH MARK

Consonants
1180A  SOYOMBO LETTER GA
       • Used for Sanskrit ka
1180B  SOYOMBO LETTER KA
       • Used for Sanskrit kha
1180C  SOYOMBO LETTER NGA
1180D  SOYOMBO LETTER JA
       • Used for Sanskrit ca
1180E  SOYOMBO LETTER CA
       • Used for Sanskrit cha
1180F  SOYOMBO LETTER NYA
11810  SOYOMBO LETTER DA
       • Used for Sanskrit ta
11811  SOYOMBO LETTER TA
       • Used for Sanskrit tha
11812  SOYOMBO LETTER NA
11813  SOYOMBO LETTER BA
       • Used for Sanskrit pa
11814  SOYOMBO LETTER PA
       • Used for Sanskrit pha
11815  SOYOMBO LETTER MA
11816  SOYOMBO LETTER YA
11817  SOYOMBO LETTER RA
11818  SOYOMBO LETTER VA
11819  SOYOMBO LETTER LA
1181A  SOYOMBO LETTER SHA
1181B  SOYOMBO LETTER SA
1181C  SOYOMBO LETTER HA

Final consonant signs
1181D  SOYOMBO CONSONANT SIGN G
1181E  SOYOMBO CONSONANT SIGN K
1181F  SOYOMBO CONSONANT SIGN NG
11820  SOYOMBO CONSONANT SIGN D
11821  SOYOMBO CONSONANT SIGN N
11822  SOYOMBO CONSONANT SIGN B
11823  SOYOMBO CONSONANT SIGN M
11824  SOYOMBO CONSONANT SIGN R
11825  SOYOMBO CONSONANT SIGN L
11826  SOYOMBO CONSONANT SIGN SH
11827  SOYOMBO CONSONANT SIGN S
11828  SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829  SOYOMBO LETTER GALIG GA
1182A  SOYOMBO LETTER GALIG GHA
1182B  SOYOMBO LETTER GALIG JA
1182C  SOYOMBO LETTER GALIG JHA
1182D  SOYOMBO LETTER GALIG TTA
1182E  SOYOMBO LETTER GALIG TTHA
1182F  SOYOMBO LETTER GALIG DDA
11830  SOYOMBO LETTER GALIG DDHA
11831  SOYOMBO LETTER GALIG NNA
11832  SOYOMBO LETTER GALIG DA
11833  SOYOMBO LETTER GALIG DHA
11834  SOYOMBO LETTER GALIG BA
11835  SOYOMBO LETTER GALIG BHA
11836  SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837  SOYOMBO LETTER GALIG TSA
11838  SOYOMBO LETTER GALIG TSHA
11839  SOYOMBO LETTER GALIG DZA
1183A  SOYOMBO LETTER GALIG ZHA
1183B  SOYOMBO LETTER GALIG ZA
1183C  SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D  SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E  SOYOMBO SIGN VOCALIC R
1183F  SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840  SOYOMBO SIGN ANUSVARA
11841  SOYOMBO SIGN VISARGA

Gemination mark
11842  SOYOMBO GEMINATION MARK

Prefixed letters
11843  SOYOMBO PREFIXED LETTER LA
11844  SOYOMBO PREFIXED LETTER SHA
11845  SOYOMBO PREFIXED LETTER SA
11846  SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847  SOYOMBO SUBJOINED LETTER GA
11848  SOYOMBO SUBJOINED LETTER KA
11849  SOYOMBO SUBJOINED LETTER NGA
1184A  SOYOMBO SUBJOINED LETTER JA
1184B  SOYOMBO SUBJOINED LETTER CA
1184C  SOYOMBO SUBJOINED LETTER NYA
1184D  SOYOMBO SUBJOINED LETTER DA
1184E  SOYOMBO SUBJOINED LETTER TA
1184F  SOYOMBO SUBJOINED LETTER NA
11850  SOYOMBO SUBJOINED LETTER BA
11851  SOYOMBO SUBJOINED LETTER PA
11852  SOYOMBO SUBJOINED LETTER MA
11853  SOYOMBO SUBJOINED LETTER YA
11854  SOYOMBO SUBJOINED LETTER RA
11855  SOYOMBO SUBJOINED LETTER VA
11856  SOYOMBO SUBJOINED LETTER LA
11857  SOYOMBO SUBJOINED LETTER SHA
11858  SOYOMBO SUBJOINED LETTER SA
11859  SOYOMBO SUBJOINED LETTER HA
1185A  SOYOMBO SUBJOINED LETTER GALIG GA
1185B  SOYOMBO SUBJOINED LETTER GALIG GHA
1185C  SOYOMBO SUBJOINED LETTER GALIG JA
1185D  SOYOMBO SUBJOINED LETTER GALIG JHA
1185E  SOYOMBO SUBJOINED LETTER GALIG TTA
1185F  SOYOMBO SUBJOINED LETTER GALIG TTHA
11860  SOYOMBO SUBJOINED LETTER GALIG DDA
11861  SOYOMBO SUBJOINED LETTER GALIG DDHA
11862  SOYOMBO SUBJOINED LETTER GALIG NNA
11863  SOYOMBO SUBJOINED LETTER GALIG DA
11864  SOYOMBO SUBJOINED LETTER GALIG DHA
11865  SOYOMBO SUBJOINED LETTER GALIG BA
11866  SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3).

Subjoined letters (continued)
11867  SOYOMBO SUBJOINED LETTER GALIG SSA
11868  SOYOMBO SUBJOINED LETTER GALIG TSA
11869  SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A  SOYOMBO SUBJOINED LETTER GALIG DZA
1186B  SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C  SOYOMBO SUBJOINED LETTER GALIG ZA
1186D  SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E  SOYOMBO DANDA
1186F  SOYOMBO DOUBLE DANDA
11870  SOYOMBO TSHEG
       → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871  SOYOMBO HEAD MARK
11872  SOYOMBO TERMINAL MARK

Symbol
11873  SOYOMBO SYMBOL SVAYAMBHU
       • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2).

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996). Elements labelled in the diagram: Base, (Vd), (Vab), (Vbb), (Cf), (VL).

Figure 5: Structure of a Sanskrit syllable in Soyombo. Elements labelled in the diagram: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl).

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary  B.1-Specialized (small collection)  B.2-Specialized (large collection)  X  C-Major extinct  D-Attested extinct  E-Minor extinct  F-Archaic Hieroglyphic or Ideographic  G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

[1] Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: Combining vowel and final-consonant signs.
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
   If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:


The representation of vowels in Soyombo is based upon the Tibetan system, in which ཨ U+0F68 TIBETAN LETTER A is a vowel carrier to which dependent vowel signs are attached in order to write independent vowels. The letter U+11800 SOYOMBO LETTER A is the Soyombo vowel-carrier. Long vowels are written using a combination of a basic vowel sign and U+11809 SOYOMBO VOWEL LENGTH MARK.

Given this structure, it is practical to encode a vowel-carrier letter and dependent vowel signs instead of a full set of independent vowel letters and their associated dependent forms. All independent and dependent vowel forms may be written using the following set of 13 characters:

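U+11800  SOYOMBO LETTER A
U+11801  SOYOMBO VOWEL SIGN I
U+11802  SOYOMBO VOWEL SIGN UE
U+11803  SOYOMBO VOWEL SIGN U
U+11804  SOYOMBO VOWEL SIGN E
U+11805  SOYOMBO VOWEL SIGN O
U+11806  SOYOMBO VOWEL SIGN OE
U+11807  SOYOMBO VOWEL SIGN AU
U+11808  SOYOMBO VOWEL SIGN AI
U+1183E  SOYOMBO VOWEL SIGN VOCALIC R
U+1183F  SOYOMBO VOWEL SIGN VOCALIC L
U+11809  SOYOMBO VOWEL LENGTH MARK
U+1183C  SOYOMBO LETTER GALIG SMALL A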

3.2.1 Notes on vowels

1. Ordering: U+11809 SOYOMBO VOWEL LENGTH MARK is written after the accompanying vowel sign:

   <U+11800 LETTER A, U+11801 VOWEL SIGN I, U+11809 VOWEL LENGTH MARK> → ī

2. Long vowel ā: When the VOWEL LENGTH MARK is written with a bare vowel-carrier, it represents the long vowel ā: <U+11800 LETTER A, U+11809 VOWEL LENGTH MARK>.

3. Sanskrit vocalic letters: The vocalic letters ṛ, ṝ, ḷ, ḹ are used for representing Sanskrit.

4. Tibetan ‘a chung’: The vowel -a, U+1183C SOYOMBO LETTER GALIG SMALL A, is used for writing Tibetan and corresponds to འ U+0F60 TIBETAN LETTER -A. It has a subjoined form (see Section 3.6.2).

5. Variant forms: The vowels <U+11800, U+11801> and <U+11800, U+11803> and their corresponding long forms are written by attaching the diphthong signs U+11807 VOWEL SIGN AU and U+11808 VOWEL SIGN AI to the vowel letter <U+11800, U+11801>, i.e. <U+11801, U+11807> u, <U+11801, U+11809, U+11807> ū, <U+11801, U+11808> ī, <U+11801, U+11809, U+11808> ī. The VOWEL LENGTH MARK is written before the sign for the diphthong.
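The vowel-carrier model described above can be sketched in a few lines of code. The example below is illustrative only: the function name `independent_vowel` and the lowercase vowel labels are assumptions derived from the proposed character names, and the code points are those proposed in this document.

```python
# Proposed code points (this document's allocation at 11800..).
LETTER_A = 0x11800            # vowel carrier
VOWEL_LENGTH_MARK = 0x11809
VOWEL_SIGNS = {               # labels are hypothetical, taken from the names
    "i": 0x11801, "ue": 0x11802, "u": 0x11803, "e": 0x11804,
    "o": 0x11805, "oe": 0x11806, "au": 0x11807, "ai": 0x11808,
}


def independent_vowel(vowel, long=False):
    """Return the code point sequence for an independent vowel.

    The bare carrier stands for 'a'; other vowels add one dependent sign.
    The length mark, if any, follows the vowel sign.
    """
    sequence = [LETTER_A]
    if vowel != "a":
        sequence.append(VOWEL_SIGNS[vowel])
    if long:
        sequence.append(VOWEL_LENGTH_MARK)
    return sequence


def show(sequence):
    return " ".join(f"U+{cp:05X}" for cp in sequence)


print(show(independent_vowel("i", long=True)))  # carrier, sign I, length mark
print(show(independent_vowel("a", long=True)))  # carrier, length mark
```

The same nine code points plus the length mark cover every short and long vowel, which is the economy the carrier model is meant to provide.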

3.3 Consonant Letters

The basic set of consonant letters is U+1180A..U+1181C:

GA, KA, NGA, JA, CA, NYA, DA, TA, NA, BA, PA, MA, YA, RA, VA, LA, SHA, SA, HA

There are 14 additional consonant letters for writing Sanskrit. The names for these characters contain the descriptor ‘GALIG’, a Mongolian term indicating letters for non-Mongolian sounds. It is used here for distinguishing between distinct characters that are given identical names in script charts, e.g. U+1180A SOYOMBO LETTER GA (Mongolian) and U+11829 SOYOMBO LETTER GALIG GA (foreign), etc. These letters are U+11829..U+11836:

GALIG GA, GALIG GHA, GALIG JA, GALIG JHA, GALIG TTA, GALIG TTHA, GALIG DDA, GALIG DDHA, GALIG NNA, GALIG DA, GALIG DHA, GALIG BA, GALIG BHA, GALIG SSA

There are 5 additional letters for writing Tibetan. The descriptor ‘GALIG’ is also used in the names of these characters, which are U+11837..U+1183B:

GALIG TSA, GALIG TSHA, GALIG DZA, GALIG ZHA, GALIG ZA

3.4 Consonant Signs

Syllable-final Mongolian consonants are written using contextual forms of letters. Script charts show such consonants as combinations of U+11800 SOYOMBO LETTER A + combining sign; the use of LETTER A is illustrative. These 12 signs are proposed for encoding as the following characters:

Code point  Character name              With LETTER A  Sign alone
U+1181D     SOYOMBO CONSONANT SIGN G    ag             -g
U+1181E     SOYOMBO CONSONANT SIGN K    ak             -k
U+1181F     SOYOMBO CONSONANT SIGN NG   aṅ             -ṅ
U+11820     SOYOMBO CONSONANT SIGN D    ad             -d
U+11821     SOYOMBO CONSONANT SIGN N    an             -n
U+11822     SOYOMBO CONSONANT SIGN B    ab             -b
U+11823     SOYOMBO CONSONANT SIGN M    am             -m
U+11824     SOYOMBO CONSONANT SIGN R    ar             -r
U+11825     SOYOMBO CONSONANT SIGN L    al             -l
U+11826     SOYOMBO CONSONANT SIGN SH   aś             -ś
U+11827     SOYOMBO CONSONANT SIGN S    as             -s
U+11828     SOYOMBO CONSONANT SIGN ANG  āṅ             -āṅ

The final-consonant sign is always written after any vowel sign and VOWEL LENGTH MARK:

<U+1180B LETTER KA, U+11801 VOWEL SIGN I, U+11809 VOWEL LENGTH MARK, U+1181F CONSONANT SIGN NG> → kīng
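The ordering rule (base letter, then vowel sign, then length mark, then final-consonant sign) lends itself to a simple check. The sketch below is a simplified assumption-laden illustration: it covers only a single base letter and ignores conjuncts, gemination and the Sanskrit signs; the names `kind` and `is_well_ordered` are hypothetical.

```python
def kind(cp):
    """Classify a proposed Soyombo code point for this simplified check."""
    if cp == 0x11800 or 0x1180A <= cp <= 0x1181C or 0x11829 <= cp <= 0x1183D:
        return "base"      # vowel carrier or consonant letter
    if 0x11801 <= cp <= 0x11808:
        return "vowel"     # dependent vowel sign
    if cp == 0x11809:
        return "length"    # vowel length mark
    if 0x1181D <= cp <= 0x11828:
        return "final"     # final-consonant sign
    return "other"


ORDER = ["base", "vowel", "length", "final"]


def is_well_ordered(sequence):
    """True if the sequence is base [vowel] [length] [final], in that order."""
    kinds = [kind(cp) for cp in sequence]
    if not kinds or kinds[0] != "base" or "other" in kinds:
        return False
    ranks = [ORDER.index(k) for k in kinds]
    # Strictly increasing ranks: each slot is used at most once, in order.
    return all(a < b for a, b in zip(ranks, ranks[1:]))


KING = [0x1180B, 0x11801, 0x11809, 0x1181F]   # kīng, as in the example above
print(is_well_ordered(KING))
print(is_well_ordered([0x1180B, 0x1181F, 0x11801]))  # final sign before vowel
```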

3.5 Geminated Consonants

Doubled consonants are written using U+11842 SOYOMBO GEMINATION SIGN, which is stacked above the triangle of a letter: <U+1180A LETTER GA, U+11842 GEMINATION SIGN> → kka.

3.6 Consonant Conjuncts

Consonant clusters in Sanskrit are represented as conjuncts. The initial consonant in a conjunct is the base letter. Non-initial consonants are written as subjoined forms, which are stacked within the frame of the base letter. Certain letters have prefixed forms when they occur in conjunct-initial position. These prefixed forms are attached to the following consonant. There is one conjunct that is written using an atomic ligature.

3.6.1 Prefixed forms

When LA, SHA, SA, and RA occur as C1, they are written using prefixed forms that are attached to the regular letter for C2. In such conjuncts, C2 functions as the base letter, but C1 is parsed first in the logical order.

• U+11843 SOYOMBO PREFIXED LETTER LA: Conjunct-initial form of U+11819 SOYOMBO LETTER LA. It is used for writing Tibetan ལ་མགོ la-mgo letters: lka = <U+11843, U+1180A> → lka

• U+11844 SOYOMBO PREFIXED LETTER SHA: Conjunct-initial form of U+1181A SOYOMBO LETTER SHA: <U+11844, U+1180A> → śka

• U+11845 SOYOMBO PREFIXED LETTER SA: Conjunct-initial form of U+1181B SOYOMBO LETTER SA. It is used for writing Tibetan ས་མགོ sa-mgo letters: ska = <U+11845, U+1180A> → ska

• U+11846 SOYOMBO PREFIXED LETTER RA: Conjunct-initial form of U+11817 SOYOMBO LETTER RA. It is used for writing Tibetan ར་མགོ ra-mgo letters: rka = <U+11846, U+1180A> → rka

3.6.2 Subjoined forms

Non-initial consonants of a cluster are written as subjoined forms attached to C1. Each regular letter has a corresponding subjoined letter, in the same order:

Regular letters                                   Subjoined letters
U+1180A..U+1181C  LETTER GA..LETTER HA            U+11847..U+11859  SUBJOINED LETTER GA..SUBJOINED LETTER HA
U+11829..U+1183C  LETTER GALIG GA..GALIG SMALL A  U+1185A..U+1186D  SUBJOINED LETTER GALIG GA..GALIG SMALL A

The subjoined form is the base element of a letter. The glyphic representation of subjoined letters is discussed further in Section 5.5.
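The relationship between regular, prefixed and subjoined letters can be illustrated with a short sketch. It assumes the proposed code points, in which each subjoined letter sits at a fixed offset from its regular letter, and it assumes that a cluster beginning with LA, SHA, SA or RA always takes the prefixed form; language-specific orthography may differ. The names `subjoined` and `conjunct` are hypothetical.

```python
# Regular letter -> prefixed (conjunct-initial) form, per the proposed chart.
PREFIXED = {
    0x11819: 0x11843,  # LA
    0x1181A: 0x11844,  # SHA
    0x1181B: 0x11845,  # SA
    0x11817: 0x11846,  # RA
}


def subjoined(letter):
    """Map a regular letter to its subjoined letter using block offsets."""
    if 0x1180A <= letter <= 0x1181C:      # GA..HA -> 11847..11859
        return letter + 0x3D
    if 0x11829 <= letter <= 0x1183C:      # GALIG GA..GALIG SMALL A -> 1185A..1186D
        return letter + 0x31
    raise ValueError(f"no subjoined form for U+{letter:05X}")


def conjunct(letters):
    """Encode a consonant cluster given as regular letters, C1 first."""
    if len(letters) < 2:
        raise ValueError("a conjunct needs at least two consonants")
    first, rest = letters[0], letters[1:]
    if first in PREFIXED:
        # Prefixed C1 comes first in logical order; C2 is the base letter.
        return [PREFIXED[first], rest[0]] + [subjoined(c) for c in rest[1:]]
    return [first] + [subjoined(c) for c in rest]


def show(sequence):
    return " ".join(f"U+{cp:05X}" for cp in sequence)


print(show(conjunct([0x11819, 0x1180A])))           # lka
print(show(conjunct([0x1180A, 0x11816])))           # kya
print(show(conjunct([0x11829, 0x11817, 0x11818])))  # grva
```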

3.6.3 Independent conjunct

The conjunct kṣa is used for writing the Sanskrit cluster kṣa <क U+0915, ् U+094D, ष U+0937>. It is proposed for independent encoding as U+1183D SOYOMBO LETTER KSSA. It is different from other conjuncts because it cannot be produced using a subjoined letter: the shape of U+11867 SOYOMBO SUBJOINED LETTER GALIG SSA does not correspond to any element in the graphical structure of KSSA.

3.6.4 Atomic ligatures in script charts

The following conjuncts are enumerated in charts of Soyombo:

kṣa, kya, kra, kla, kva, kka, ṅka, ñca, ṇṭa, nta, mpa, lka, śka, ska, rka

They are not atomic characters, but conjuncts. Moreover, they are not the only conjuncts used in Soyombo; others are shown in manuscripts. These conjuncts are likely shown in charts for purposes of illustrating the types of conjuncts that may be written.

3.7 Various Signs

• U+11840 SOYOMBO SIGN ANUSVARA: Indicates nasalization in words of Sanskrit origin, e.g. oṃ <U+11800, U+11805, U+11840>. It corresponds to U+0F7E TIBETAN SIGN RJES SU NGA RO.

• U+11841 SOYOMBO SIGN VISARGA: Represents an allophone of r or s at word-final position in Devanagari orthography for Sanskrit. It is used in Mongolian Square for writing Sanskrit, and it corresponds to ཿ U+0F7F TIBETAN SIGN RNAM BCAD.

3.8 Punctuation

• U+1186E SOYOMBO DANDA and U+1186F SOYOMBO DOUBLE DANDA: These characters are used for marking the ends of sentences and longer sections. They correspond to । U+0964 DEVANAGARI DANDA and to ॥ U+0965 DEVANAGARI DOUBLE DANDA.

• U+11870 SOYOMBO TSHEG: This character is used for marking the end of a syllable. It corresponds to U+0F0B TIBETAN MARK INTERSYLLABIC TSHEG.

3.9 Head Marks

There are two ‘head marks’: U+11871 SOYOMBO HEAD MARK and U+11872 SOYOMBO TERMINAL MARK. These characters are used for indicating the beginning and end of a section of text.

3.10 Symbols

U+11873 SOYOMBO SYMBOL SVAYAMBHU was designed by Zanabazar, the creator of the script. It is an officially recognized symbol in Mongolia and appears on the flag and coat of arms of the country (see Figures 34 and 35). The symbol is included as a character of the script, but it is not part of the orthography of written text.

4 Language-Specific Orthography

4.1 Tibetan

The representation of consonant clusters and syllable-final consonants in Tibetan words follows the orthography for Tibetan. For example, a word-initial bare consonant may be written using the regular letter instead of forming a conjunct with the following consonant. Also, a syllable-final consonant may be represented by the regular letter instead of the final-consonant sign. The word མཆོག mchog ‘supreme’ is written as <U+11815 LETTER MA, U+1180E LETTER CA, U+11805 VOWEL SIGN O, U+1180A LETTER GA, U+11870 TSHEG>, using the regular LETTER GA instead of U+1181D CONSONANT SIGN G. The TSHEG is added after LETTER GA to mark the end of the syllable. The expected representation of mchog is <U+11815 LETTER MA, U+1184B SUBJOINED LETTER CA, U+11805 VOWEL SIGN O, U+1181D CONSONANT SIGN G>.

5 Rendering and Shaping Behaviors

5.1 Vertical Text

Soyombo may be written vertically, from top to bottom and left to right. For example, the text segment dzürka, as shown in the seal in Figure 27, is written as two vertically arranged units:

<U+11839 LETTER GALIG DZA, U+11802 VOWEL SIGN UE, U+11824 CONSONANT SIGN R>
<U+1180B LETTER KA>

In a vertical environment, Soyombo orthography departs from the general rules for writing syllables. The syllables of a word may be written using subjoined letters, as if forming a conjunct. The example below shows the word temedeg ‘symbol’ written vertically. Its three syllables have the same vowel, which is represented using a single vowel sign above the base letter; the consonants of non-initial syllables are written using subjoined letters, while the final consonant is written using a final-consonant sign.

Horizontal: <U+11811 LETTER TA, U+11804 VOWEL SIGN E, U+11815 LETTER MA, U+11804 VOWEL SIGN E, U+11810 LETTER DA, U+11804 VOWEL SIGN E, U+1181D CONSONANT SIGN G>
Vertical: <U+11811 LETTER TA, U+11804 VOWEL SIGN E, U+11852 SUBJOINED LETTER MA, U+1184D SUBJOINED LETTER DA, U+1181D CONSONANT SIGN G>

A similar practice is observed in the vertical form of words composed of syllables that contain different vowels. The example below shows the word yenu. Its two syllables have different vowels, which are written above and below the base; the consonant of the second syllable is written as a subjoined form.

Horizontal: <U+11816 LETTER YA, U+11804 VOWEL SIGN E, U+11812 LETTER NA, U+11803 VOWEL SIGN U>
Vertical: <U+11816 LETTER YA, U+11804 VOWEL SIGN E, U+1184F SUBJOINED LETTER NA, U+11803 VOWEL SIGN U>

It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if the vowels were different and written in the same position, in order to prevent clashing.
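The vertical practice described above can be sketched as a transformation from a list of syllables to a single stacked sequence. This is an illustrative assumption about how the two attested examples generalize, not a rule stated by the proposal: the function `vertical_form`, its input layout, and the decision to reject non-final syllables with a final consonant are all hypothetical. It does not check whether two different vowel signs would clash in the same position.

```python
def subjoined(letter):
    """Map a regular letter to its subjoined letter (proposed code points)."""
    if 0x1180A <= letter <= 0x1181C:
        return letter + 0x3D
    if 0x11829 <= letter <= 0x1183C:
        return letter + 0x31
    raise ValueError(f"no subjoined form for U+{letter:05X}")


def vertical_form(syllables):
    """Collapse syllables into one stacked sequence.

    Each syllable is (consonant, vowel_sign_or_None, final_sign_or_None).
    A vowel sign shared by several syllables is written once; the
    consonants of non-initial syllables become subjoined letters.
    """
    if not syllables:
        raise ValueError("at least one syllable is required")
    output, seen_vowels = [], set()
    last = len(syllables) - 1
    for index, (consonant, vowel, final) in enumerate(syllables):
        if final is not None and index != last:
            raise ValueError("only the last syllable may carry a final sign here")
        output.append(consonant if index == 0 else subjoined(consonant))
        if vowel is not None and vowel not in seen_vowels:
            output.append(vowel)
            seen_vowels.add(vowel)
    if syllables[last][2] is not None:
        output.append(syllables[last][2])
    return output


E, U, SIGN_G = 0x11804, 0x11803, 0x1181D
TEMEDEG = [(0x11811, E, None), (0x11815, E, None), (0x11810, E, SIGN_G)]
YENU = [(0x11816, E, None), (0x11812, U, None)]

print([f"U+{cp:05X}" for cp in vertical_form(TEMEDEG)])
print([f"U+{cp:05X}" for cp in vertical_form(YENU)])
```

Both outputs match the vertical sequences given for temedeg and yenu above, which is the only evidence this sketch has been checked against.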

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs, U+11802 and U+11803:

• Shaping and positioning of U+11802 (ü): <U+1180B KA, U+11802, U+1181F CONSONANT SIGN NG> → küng

• Shaping and positioning of U+11803 (u): <U+1180B KA, U+11803, U+1181F CONSONANT SIGN NG> → kung
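A renderer needs to recognize the context in which this adjustment applies. The following is a minimal sketch of such a check, assuming the proposed code points from this document. The function name `compact_vowel_positions` is hypothetical, and allowing an intervening vowel length mark is an assumption based on the ordering rule for final-consonant signs.

```python
# Sketch only: code points are the ones proposed in this document.
BASE_VOWEL_SIGNS = {0x11802, 0x11803}            # the two signs named in 5.2
VOWEL_LENGTH_MARK = 0x11809
FINAL_CONSONANT_SIGNS = range(0x1181D, 0x11829)  # CONSONANT SIGN G..ANG


def compact_vowel_positions(sequence):
    """Return indexes of base-positioned vowel signs followed by a final sign."""
    hits = []
    for i, cp in enumerate(sequence):
        if cp not in BASE_VOWEL_SIGNS:
            continue
        j = i + 1
        # Assumption: a vowel length mark may sit between vowel and final sign.
        if j < len(sequence) and sequence[j] == VOWEL_LENGTH_MARK:
            j += 1
        if j < len(sequence) and sequence[j] in FINAL_CONSONANT_SIGNS:
            hits.append(i)
    return hits


KUNG = [0x1180B, 0x11803, 0x1181F]  # KA + vowel sign (u) + CONSONANT SIGN NG
KU = [0x1180B, 0x11803]             # no final sign: regular vowel glyph
print(compact_vowel_positions(KUNG), compact_vowel_positions(KU))
```

In a font, the same condition would typically be expressed as a contextual substitution or positioning rule rather than in application code.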

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the gemination sign within the normal x-height.

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <U+1180A GA, U+11853 SUBJOINED LETTER YA>, the regular shape of U+1180A GA is altered so that the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C NGA is written with U+1180A GA, U+1180B KA, U+11829 GALIG GA, or U+1182A GALIG GHA. For instance, in writing nga, the regular shape of U+1180C NGA changes before the subjoined letter is attached. A further modification is the break in the frame that occurs in some conjuncts.

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters: <U+11829 GALIG GA, U+11854 SUBJOINED LETTER RA, U+11855 SUBJOINED LETTER VA>. The position of the second subjoined letter would straddle the baseline or fall below it. In such cases, the stem of the base-letter frame would be extended.

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. a variant glyph for U+11855 SUBJOINED LETTER VA.

6 Character Data

6.1 Character Properties

11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11802;SOYOMBO VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11803;SOYOMBO VOWEL SIGN UE;Mn;0;NSM;;;;;N;;;;;
11804;SOYOMBO VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11805;SOYOMBO VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11806;SOYOMBO VOWEL SIGN OE;Mn;0;NSM;;;;;N;;;;;
11807;SOYOMBO VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11808;SOYOMBO VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11809;SOYOMBO VOWEL LENGTH MARK;Mn;0;NSM;;;;;N;;;;;
1180A;SOYOMBO LETTER GA;Lo;0;L;;;;;N;;;;;
1180B;SOYOMBO LETTER KA;Lo;0;L;;;;;N;;;;;
1180C;SOYOMBO LETTER NGA;Lo;0;L;;;;;N;;;;;
1180D;SOYOMBO LETTER JA;Lo;0;L;;;;;N;;;;;
1180E;SOYOMBO LETTER CA;Lo;0;L;;;;;N;;;;;
1180F;SOYOMBO LETTER NYA;Lo;0;L;;;;;N;;;;;
11810;SOYOMBO LETTER DA;Lo;0;L;;;;;N;;;;;
11811;SOYOMBO LETTER TA;Lo;0;L;;;;;N;;;;;
11812;SOYOMBO LETTER NA;Lo;0;L;;;;;N;;;;;
11813;SOYOMBO LETTER BA;Lo;0;L;;;;;N;;;;;
11814;SOYOMBO LETTER PA;Lo;0;L;;;;;N;;;;;
11815;SOYOMBO LETTER MA;Lo;0;L;;;;;N;;;;;
11816;SOYOMBO LETTER YA;Lo;0;L;;;;;N;;;;;
11817;SOYOMBO LETTER RA;Lo;0;L;;;;;N;;;;;
11818;SOYOMBO LETTER VA;Lo;0;L;;;;;N;;;;;
11819;SOYOMBO LETTER LA;Lo;0;L;;;;;N;;;;;
1181A;SOYOMBO LETTER SHA;Lo;0;L;;;;;N;;;;;
1181B;SOYOMBO LETTER SA;Lo;0;L;;;;;N;;;;;
1181C;SOYOMBO LETTER HA;Lo;0;L;;;;;N;;;;;
1181D;SOYOMBO CONSONANT SIGN G;Mn;0;NSM;;;;;N;;;;;
1181E;SOYOMBO CONSONANT SIGN K;Mn;0;NSM;;;;;N;;;;;
1181F;SOYOMBO CONSONANT SIGN NG;Mn;0;NSM;;;;;N;;;;;
11820;SOYOMBO CONSONANT SIGN D;Mn;0;NSM;;;;;N;;;;;
11821;SOYOMBO CONSONANT SIGN N;Mn;0;NSM;;;;;N;;;;;
11822;SOYOMBO CONSONANT SIGN B;Mn;0;NSM;;;;;N;;;;;
11823;SOYOMBO CONSONANT SIGN M;Mn;0;NSM;;;;;N;;;;;
11824;SOYOMBO CONSONANT SIGN R;Mn;0;NSM;;;;;N;;;;;
11825;SOYOMBO CONSONANT SIGN L;Mn;0;NSM;;;;;N;;;;;
11826;SOYOMBO CONSONANT SIGN SH;Mn;0;NSM;;;;;N;;;;;
11827;SOYOMBO CONSONANT SIGN S;Mn;0;NSM;;;;;N;;;;;
11828;SOYOMBO CONSONANT SIGN ANG;Mn;0;NSM;;;;;N;;;;;
11829;SOYOMBO LETTER GALIG GA;Lo;0;L;;;;;N;;;;;
1182A;SOYOMBO LETTER GALIG GHA;Lo;0;L;;;;;N;;;;;
1182B;SOYOMBO LETTER GALIG JA;Lo;0;L;;;;;N;;;;;
1182C;SOYOMBO LETTER GALIG JHA;Lo;0;L;;;;;N;;;;;
1182D;SOYOMBO LETTER GALIG TTA;Lo;0;L;;;;;N;;;;;
1182E;SOYOMBO LETTER GALIG TTHA;Lo;0;L;;;;;N;;;;;
1182F;SOYOMBO LETTER GALIG DDA;Lo;0;L;;;;;N;;;;;
11830;SOYOMBO LETTER GALIG DDHA;Lo;0;L;;;;;N;;;;;
11831;SOYOMBO LETTER GALIG NNA;Lo;0;L;;;;;N;;;;;
11832;SOYOMBO LETTER GALIG DA;Lo;0;L;;;;;N;;;;;
11833;SOYOMBO LETTER GALIG DHA;Lo;0;L;;;;;N;;;;;
11834;SOYOMBO LETTER GALIG BA;Lo;0;L;;;;;N;;;;;
11835;SOYOMBO LETTER GALIG BHA;Lo;0;L;;;;;N;;;;;
11836;SOYOMBO LETTER GALIG SSA;Lo;0;L;;;;;N;;;;;
11837;SOYOMBO LETTER GALIG TSA;Lo;0;L;;;;;N;;;;;
11838;SOYOMBO LETTER GALIG TSHA;Lo;0;L;;;;;N;;;;;
11839;SOYOMBO LETTER GALIG DZA;Lo;0;L;;;;;N;;;;;
1183A;SOYOMBO LETTER GALIG ZHA;Lo;0;L;;;;;N;;;;;
1183B;SOYOMBO LETTER GALIG ZA;Lo;0;L;;;;;N;;;;;
1183C;SOYOMBO LETTER GALIG SMALL A;Lo;0;L;;;;;N;;;;;
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
1183E;SOYOMBO VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
1183F;SOYOMBO VOWEL SIGN VOCALIC L;Mn;0;NSM;;;;;N;;;;;
11840;SOYOMBO SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11842;SOYOMBO GEMINATION SIGN;Mn;0;NSM;;;;;N;;;;;
11843;SOYOMBO PREFIXED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11844;SOYOMBO PREFIXED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11845;SOYOMBO PREFIXED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11846;SOYOMBO PREFIXED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11847;SOYOMBO SUBJOINED LETTER GA;Mn;0;NSM;;;;;N;;;;;
11848;SOYOMBO SUBJOINED LETTER KA;Mn;0;NSM;;;;;N;;;;;
11849;SOYOMBO SUBJOINED LETTER NGA;Mn;0;NSM;;;;;N;;;;;
1184A;SOYOMBO SUBJOINED LETTER JA;Mn;0;NSM;;;;;N;;;;;
1184B;SOYOMBO SUBJOINED LETTER CA;Mn;0;NSM;;;;;N;;;;;
1184C;SOYOMBO SUBJOINED LETTER NYA;Mn;0;NSM;;;;;N;;;;;
1184D;SOYOMBO SUBJOINED LETTER DA;Mn;0;NSM;;;;;N;;;;;
1184E;SOYOMBO SUBJOINED LETTER TA;Mn;0;NSM;;;;;N;;;;;
1184F;SOYOMBO SUBJOINED LETTER NA;Mn;0;NSM;;;;;N;;;;;
11850;SOYOMBO SUBJOINED LETTER BA;Mn;0;NSM;;;;;N;;;;;
11851;SOYOMBO SUBJOINED LETTER PA;Mn;0;NSM;;;;;N;;;;;
11852;SOYOMBO SUBJOINED LETTER MA;Mn;0;NSM;;;;;N;;;;;
11853;SOYOMBO SUBJOINED LETTER YA;Mn;0;NSM;;;;;N;;;;;
11854;SOYOMBO SUBJOINED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11855;SOYOMBO SUBJOINED LETTER VA;Mn;0;NSM;;;;;N;;;;;
11856;SOYOMBO SUBJOINED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11857;SOYOMBO SUBJOINED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11858;SOYOMBO SUBJOINED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11859;SOYOMBO SUBJOINED LETTER HA;Mn;0;NSM;;;;;N;;;;;
1185A;SOYOMBO SUBJOINED LETTER GALIG GA;Mn;0;NSM;;;;;N;;;;;
1185B;SOYOMBO SUBJOINED LETTER GALIG GHA;Mn;0;NSM;;;;;N;;;;;
1185C;SOYOMBO SUBJOINED LETTER GALIG JA;Mn;0;NSM;;;;;N;;;;;
1185D;SOYOMBO SUBJOINED LETTER GALIG JHA;Mn;0;NSM;;;;;N;;;;;
1185E;SOYOMBO SUBJOINED LETTER GALIG TTA;Mn;0;NSM;;;;;N;;;;;
1185F;SOYOMBO SUBJOINED LETTER GALIG TTHA;Mn;0;NSM;;;;;N;;;;;
11860;SOYOMBO SUBJOINED LETTER GALIG DDA;Mn;0;NSM;;;;;N;;;;;
11861;SOYOMBO SUBJOINED LETTER GALIG DDHA;Mn;0;NSM;;;;;N;;;;;
11862;SOYOMBO SUBJOINED LETTER GALIG NNA;Mn;0;NSM;;;;;N;;;;;
11863;SOYOMBO SUBJOINED LETTER GALIG DA;Mn;0;NSM;;;;;N;;;;;
11864;SOYOMBO SUBJOINED LETTER GALIG DHA;Mn;0;NSM;;;;;N;;;;;
11865;SOYOMBO SUBJOINED LETTER GALIG BA;Mn;0;NSM;;;;;N;;;;;
11866;SOYOMBO SUBJOINED LETTER GALIG BHA;Mn;0;NSM;;;;;N;;;;;
11867;SOYOMBO SUBJOINED LETTER GALIG SSA;Mn;0;NSM;;;;;N;;;;;
11868;SOYOMBO SUBJOINED LETTER GALIG TSA;Mn;0;NSM;;;;;N;;;;;
11869;SOYOMBO SUBJOINED LETTER GALIG TSHA;Mn;0;NSM;;;;;N;;;;;
1186A;SOYOMBO SUBJOINED LETTER GALIG DZA;Mn;0;NSM;;;;;N;;;;;
1186B;SOYOMBO SUBJOINED LETTER GALIG ZHA;Mn;0;NSM;;;;;N;;;;;
1186C;SOYOMBO SUBJOINED LETTER GALIG ZA;Mn;0;NSM;;;;;N;;;;;
1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
1186F;SOYOMBO DOUBLE DANDA;Po;0;L;;;;;N;;;;;
11870;SOYOMBO TSHEG;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11872;SOYOMBO TERMINAL MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
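Property rows of this kind are usually consumed programmatically. The following is a minimal sketch of reading them, assuming the rows follow the semicolon-separated field layout of UnicodeData.txt (code point, name, General_Category, Canonical_Combining_Class, Bidi_Class, and Bidi_Mirrored in the tenth field). The function name `parse_unicode_data` and the sample subset are illustrative only.

```python
from collections import Counter

# A few rows in UnicodeData.txt layout, using the proposed code points.
SAMPLE = """\
11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
"""


def parse_unicode_data(text):
    """Parse UnicodeData-style rows into dictionaries of selected fields."""
    records = []
    for line in text.splitlines():
        if not line.strip():
            continue
        fields = line.split(";")
        records.append({
            "code": int(fields[0], 16),
            "name": fields[1],
            "gc": fields[2],              # General_Category
            "ccc": int(fields[3]),        # Canonical_Combining_Class
            "bidi": fields[4],            # Bidi_Class
            "mirrored": fields[9] == "Y", # Bidi_Mirrored
        })
    return records


records = parse_unicode_data(SAMPLE)
by_category = Counter(r["gc"] for r in records)
print(by_category)
```

Running the same parser over the complete list gives a quick consistency check, for example that every combining sign has bidirectional class NSM.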

6.2 Linebreaking Properties

Linebreaking properties are given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
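As an illustration of how the line-breaking assignments might be used, the sketch below transcribes the ranges listed above into a table and looks up the class of a code point by binary search. The names `LINE_BREAK_RANGES` and `line_break_class` are hypothetical, and the fallback value XX (Unknown) for code points outside the listed ranges is an assumption of this sketch.

```python
import bisect

# (first, last, class) rows transcribed from the proposed assignments.
LINE_BREAK_RANGES = [
    (0x11800, 0x11800, "AL"),
    (0x11801, 0x11809, "CM"),
    (0x1180A, 0x1181C, "AL"),
    (0x1181D, 0x11828, "CM"),
    (0x11829, 0x1183D, "AL"),
    (0x1183E, 0x11842, "CM"),
    (0x11843, 0x1186D, "CM"),
    (0x1186E, 0x1186F, "BA"),
    (0x11870, 0x11870, "BA"),
    (0x11871, 0x11871, "BB"),
    (0x11872, 0x11872, "BA"),
    (0x11873, 0x11873, "AL"),
]
STARTS = [first for first, _, _ in LINE_BREAK_RANGES]


def line_break_class(cp, default="XX"):
    """Return the proposed line-break class for cp, or default if unlisted."""
    i = bisect.bisect_right(STARTS, cp) - 1
    if i >= 0:
        first, last, cls = LINE_BREAK_RANGES[i]
        if first <= cp <= last:
            return cls
    return default


print(line_break_class(0x11870), line_break_class(0x11871))
```

With these classes, a break opportunity follows the TSHEG and the dandas (BA) and precedes the HEAD MARK (BB), while combining signs (CM) stay attached to their base letter.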

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic]. (Түвэд монгол бичгийн эхийг орчуулан хавсаргав). [Soyombo-script title]. Ed. by T. Bulgan. Ulaanbaatar: Shambala Association; System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”. v. 0.9, September 2, 1996. http://userpage.fu-berlin.de/~corff/im/Soyombo/overview/Soyombo.html

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie”. (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. http://www.viahistoria.com/SilverHorde/main.html?research/MongolScripts.html

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

Figure 1: Proposed code chart for Soyombo (11800..1187F). Printed using UniBook™ (http://www.unicode.org/unibook/), 28-Oct-2011.

Vowel carrier
11800  SOYOMBO LETTER A

Vowel signs
11801  SOYOMBO VOWEL SIGN I
11802  SOYOMBO VOWEL SIGN UE
11803  SOYOMBO VOWEL SIGN U
11804  SOYOMBO VOWEL SIGN E
11805  SOYOMBO VOWEL SIGN O
11806  SOYOMBO VOWEL SIGN OE
11807  SOYOMBO VOWEL SIGN AU
11808  SOYOMBO VOWEL SIGN AI

Vowel length mark
11809  SOYOMBO VOWEL LENGTH MARK

Consonants
1180A  SOYOMBO LETTER GA
       • Used for Sanskrit ka
1180B  SOYOMBO LETTER KA
       • Used for Sanskrit kha
1180C  SOYOMBO LETTER NGA
1180D  SOYOMBO LETTER JA
       • Used for Sanskrit ca
1180E  SOYOMBO LETTER CA
       • Used for Sanskrit cha
1180F  SOYOMBO LETTER NYA
11810  SOYOMBO LETTER DA
       • Used for Sanskrit ta
11811  SOYOMBO LETTER TA
       • Used for Sanskrit tha
11812  SOYOMBO LETTER NA
11813  SOYOMBO LETTER BA
       • Used for Sanskrit pa
11814  SOYOMBO LETTER PA
       • Used for Sanskrit pha
11815  SOYOMBO LETTER MA
11816  SOYOMBO LETTER YA
11817  SOYOMBO LETTER RA
11818  SOYOMBO LETTER VA
11819  SOYOMBO LETTER LA
1181A  SOYOMBO LETTER SHA
1181B  SOYOMBO LETTER SA
1181C  SOYOMBO LETTER HA

Final consonant signs
1181D  SOYOMBO CONSONANT SIGN G
1181E  SOYOMBO CONSONANT SIGN K
1181F  SOYOMBO CONSONANT SIGN NG
11820  SOYOMBO CONSONANT SIGN D
11821  SOYOMBO CONSONANT SIGN N
11822  SOYOMBO CONSONANT SIGN B
11823  SOYOMBO CONSONANT SIGN M
11824  SOYOMBO CONSONANT SIGN R
11825  SOYOMBO CONSONANT SIGN L
11826  SOYOMBO CONSONANT SIGN SH
11827  SOYOMBO CONSONANT SIGN S
11828  SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829  SOYOMBO LETTER GALIG GA
1182A  SOYOMBO LETTER GALIG GHA
1182B  SOYOMBO LETTER GALIG JA
1182C  SOYOMBO LETTER GALIG JHA
1182D  SOYOMBO LETTER GALIG TTA
1182E  SOYOMBO LETTER GALIG TTHA
1182F  SOYOMBO LETTER GALIG DDA
11830  SOYOMBO LETTER GALIG DDHA
11831  SOYOMBO LETTER GALIG NNA
11832  SOYOMBO LETTER GALIG DA
11833  SOYOMBO LETTER GALIG DHA
11834  SOYOMBO LETTER GALIG BA
11835  SOYOMBO LETTER GALIG BHA
11836  SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837  SOYOMBO LETTER GALIG TSA
11838  SOYOMBO LETTER GALIG TSHA
11839  SOYOMBO LETTER GALIG DZA
1183A  SOYOMBO LETTER GALIG ZHA
1183B  SOYOMBO LETTER GALIG ZA
1183C  SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D  SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E  SOYOMBO SIGN VOCALIC R
1183F  SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840  SOYOMBO SIGN ANUSVARA
11841  SOYOMBO SIGN VISARGA

Gemination mark
11842  SOYOMBO GEMINATION MARK

Prefixed letters
11843  SOYOMBO PREFIXED LETTER LA
11844  SOYOMBO PREFIXED LETTER SHA
11845  SOYOMBO PREFIXED LETTER SA
11846  SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847  SOYOMBO SUBJOINED LETTER GA
11848  SOYOMBO SUBJOINED LETTER KA
11849  SOYOMBO SUBJOINED LETTER NGA
1184A  SOYOMBO SUBJOINED LETTER JA
1184B  SOYOMBO SUBJOINED LETTER CA
1184C  SOYOMBO SUBJOINED LETTER NYA
1184D  SOYOMBO SUBJOINED LETTER DA
1184E  SOYOMBO SUBJOINED LETTER TA
1184F  SOYOMBO SUBJOINED LETTER NA
11850  SOYOMBO SUBJOINED LETTER BA
11851  SOYOMBO SUBJOINED LETTER PA
11852  SOYOMBO SUBJOINED LETTER MA
11853  SOYOMBO SUBJOINED LETTER YA
11854  SOYOMBO SUBJOINED LETTER RA
11855  SOYOMBO SUBJOINED LETTER VA
11856  SOYOMBO SUBJOINED LETTER LA
11857  SOYOMBO SUBJOINED LETTER SHA
11858  SOYOMBO SUBJOINED LETTER SA
11859  SOYOMBO SUBJOINED LETTER HA
1185A  SOYOMBO SUBJOINED LETTER GALIG GA
1185B  SOYOMBO SUBJOINED LETTER GALIG GHA
1185C  SOYOMBO SUBJOINED LETTER GALIG JA
1185D  SOYOMBO SUBJOINED LETTER GALIG JHA
1185E  SOYOMBO SUBJOINED LETTER GALIG TTA
1185F  SOYOMBO SUBJOINED LETTER GALIG TTHA
11860  SOYOMBO SUBJOINED LETTER GALIG DDA
11861  SOYOMBO SUBJOINED LETTER GALIG DDHA
11862  SOYOMBO SUBJOINED LETTER GALIG NNA
11863  SOYOMBO SUBJOINED LETTER GALIG DA
11864  SOYOMBO SUBJOINED LETTER GALIG DHA
11865  SOYOMBO SUBJOINED LETTER GALIG BA
11866  SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3)

Subjoined letters (continued)
11867  SOYOMBO SUBJOINED LETTER GALIG SSA
11868  SOYOMBO SUBJOINED LETTER GALIG TSA
11869  SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A  SOYOMBO SUBJOINED LETTER GALIG DZA
1186B  SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C  SOYOMBO SUBJOINED LETTER GALIG ZA
1186D  SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E  SOYOMBO DANDA
1186F  SOYOMBO DOUBLE DANDA
11870  SOYOMBO TSHEG
       → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871  SOYOMBO HEAD MARK
11872  SOYOMBO TERMINAL MARK

Symbol
11873  SOYOMBO SYMBOL SVAYAMBHU
       • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2)

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996). Labeled elements: Base, (Vd), (Vab), (Vbb), (Cf), (VL).

Figure 5: Structure of a Sanskrit syllable in Soyombo. Labeled elements: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl).

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative
1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary  B.1-Specialized (small collection)  B.2-Specialized (large collection)  X  C-Major extinct  D-Attested extinct  E-Minor extinct  F-Archaic Hieroglyphic or Ideographic  G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:


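U+11829..U+11836: GALIG GA, GALIG GHA, GALIG JA, GALIG JHA, GALIG TTA, GALIG TTHA, GALIG DDA, GALIG DDHA, GALIG NNA, GALIG DA, GALIG DHA, GALIG BA, GALIG BHA, GALIG SSA

There are 5 additional letters for writing Tibetan. The descriptor ‘GALIG’ is also used in the names of these characters:

U+11837..U+1183B: GALIG TSA, GALIG TSHA, GALIG DZA, GALIG ZHA, GALIG ZA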

3.4 Consonant Signs

Syllable-final Mongolian consonants are written using contextual forms of letters. The following characters are shown in script charts as representing such consonants:

ag, ak, an, ad, an, ab, am, ar, al, aś, as, ān

These characters are combinations of U+11800 LETTER A + combining sign. The use of LETTER A is illustrative. The signs themselves represent:

-g, -k, -n, -d, -n, -b, -m, -r, -l, -ś, -s, -ān

These 12 signs are proposed for encoding as the following characters:

U+1181D SOYOMBO CONSONANT SIGN G
U+1181E SOYOMBO CONSONANT SIGN K
U+1181F SOYOMBO CONSONANT SIGN NG
U+11820 SOYOMBO CONSONANT SIGN D
U+11821 SOYOMBO CONSONANT SIGN N
U+11822 SOYOMBO CONSONANT SIGN B
U+11823 SOYOMBO CONSONANT SIGN M
U+11824 SOYOMBO CONSONANT SIGN R
U+11825 SOYOMBO CONSONANT SIGN L
U+11826 SOYOMBO CONSONANT SIGN SH
U+11827 SOYOMBO CONSONANT SIGN S
U+11828 SOYOMBO CONSONANT SIGN ANG

The final-consonant sign is always written after any vowel sign and VOWEL LENGTH MARK:

<U+1180B KA + U+11801 VOWEL SIGN I + U+11809 VOWEL LENGTH MARK + U+1181F CONSONANT SIGN NG> → kīng
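The ordering rule for final-consonant signs lends itself to a simple validity check. The following is a minimal sketch assuming the proposed code points from this document. The function name `final_sign_order_ok` is hypothetical, and the check covers only this one rule rather than full syllable structure.

```python
# Sketch only: code points are the ones proposed in this document.
VOWEL_SIGNS = range(0x11801, 0x11809)            # VOWEL SIGN I..AI
VOWEL_LENGTH_MARK = 0x11809
FINAL_CONSONANT_SIGNS = range(0x1181D, 0x11829)  # CONSONANT SIGN G..ANG


def final_sign_order_ok(cluster):
    """True if no vowel sign or length mark follows a final-consonant sign."""
    seen_final = False
    for cp in cluster:
        if cp in FINAL_CONSONANT_SIGNS:
            seen_final = True
        elif seen_final and (cp in VOWEL_SIGNS or cp == VOWEL_LENGTH_MARK):
            return False
    return True


KING = [0x1180B, 0x11801, 0x11809, 0x1181F]      # KA + I + LENGTH MARK + NG
MISORDERED = [0x1180B, 0x1181F, 0x11801]         # final sign before vowel sign
print(final_sign_order_ok(KING), final_sign_order_ok(MISORDERED))
```

The function expects a single orthographic cluster; splitting running text into clusters is outside the scope of this sketch.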

3.5 Geminated Consonants

Doubled consonants are written using the U+11842 GEMINATION SIGN, which is stacked above the triangle of a letter: <U+1180A GA + U+11842 GEMINATION SIGN> → kka.


3.6 Consonant Conjuncts

Consonant clusters in Sanskrit are represented as conjuncts. The initial consonant in a conjunct is the base letter. Non-initial consonants are written as subjoined forms, which are stacked within the frame of the base letter. Certain letters have prefixed forms when they occur in conjunct-initial position. These prefixed forms are attached to the following consonant. There is one conjunct that is written using an atomic ligature.

3.6.1 Prefixed forms

When LA, SHA, SA, and RA occur as C1, they are written using prefixed forms that are attached to the regular letter for C2. In such conjuncts, C2 functions as the base letter, but C1 is parsed first in the logical order.

• U+11843 PREFIXED LETTER LA: Conjunct-initial form of U+11819 LA. It is used for writing Tibetan ལ་མགོ la-mgo letters: <U+11843 + U+1180A GA> → lka

• U+11844 PREFIXED LETTER SHA: Conjunct-initial form of U+1181A SHA: <U+11844 + U+1180A GA> → śka

• U+11845 PREFIXED LETTER SA: Conjunct-initial form of U+1181B SA. It is used for writing Tibetan ས་མགོ sa-mgo letters: <U+11845 + U+1180A GA> → ska

• U+11846 PREFIXED LETTER RA: Conjunct-initial form of U+11817 RA. It is used for writing Tibetan ར་མགོ ra-mgo letters: <U+11846 + U+1180A GA> → rka

3.6.2 Subjoined forms

Non-initial consonants of a cluster are written as subjoined forms attached to C1. Each regular letter has a corresponding subjoined letter:

Regular letters                                   Subjoined letters
U+1180A..U+1181C (GA..HA)                         U+11847..U+11859
U+11829..U+1183C (GALIG GA..GALIG SMALL A)        U+1185A..U+1186D

The subjoined form is the base element of a letter. The glyphic representation of subjoined letters is discussed further in Section 5.5.
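Because the subjoined letters are laid out in the same order as the regular letters, the correspondence can be computed by offset. The sketch below illustrates this for the proposed code points. The function names `subjoined` and `conjunct` are hypothetical, and the two ranges are read off the proposed code chart rather than taken from any published data file.

```python
# Sketch only: code points are the ones proposed in this document.
def subjoined(letter):
    """Return the subjoined counterpart of a regular consonant letter."""
    if 0x1180A <= letter <= 0x1181C:        # GA..HA
        return letter - 0x1180A + 0x11847
    if 0x11829 <= letter <= 0x1183C:        # GALIG GA..GALIG SMALL A
        return letter - 0x11829 + 0x1185A
    raise ValueError(f"U+{letter:04X} has no subjoined form in this sketch")


def conjunct(first, *rest):
    """Encode a cluster: the first letter is the base, the rest are subjoined."""
    return [first] + [subjoined(cp) for cp in rest]


GA, YA, RA, VA, GALIG_GA = 0x1180A, 0x11816, 0x11817, 0x11818, 0x11829

kya = conjunct(GA, YA)              # base GA + SUBJOINED LETTER YA
grva = conjunct(GALIG_GA, RA, VA)   # base GALIG GA + SUBJOINED RA + SUBJOINED VA
print([f"U+{cp:04X}" for cp in kya])
print([f"U+{cp:04X}" for cp in grva])
```

Prefixed forms and the independent KSSA conjunct are not covered by this mapping and would need separate handling.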

3.6.3 Independent conjunct

The conjunct U+1183D KSSA is used for writing the Sanskrit cluster kṣa (Devanagari क्ष <U+0915, U+094D, U+0937>). It is proposed for independent encoding as SOYOMBO LETTER KSSA. It is different from other conjuncts because it cannot be produced using a subjoined letter: the shape of U+11867 SUBJOINED LETTER GALIG SSA does not correspond to any element in the graphical structure of KSSA.

3.6.4 Atomic ligatures in script charts

The following conjuncts are enumerated in charts of Soyombo:

kṣa (U+1183D), kya, kra, kla, kva, kka, ṅka, ñca, ṇṭa, nta, mpa, lka, śka, ska, rka

They are not atomic characters, but conjuncts. Moreover, they are not the only conjuncts used in Soyombo; others are shown in manuscripts. These conjuncts are likely shown in charts to illustrate the types of conjuncts that may be written.

3.7 Various Signs

• U+11840 SOYOMBO SIGN ANUSVARA: Indicates nasalization in words of Sanskrit origin, e.g. oṃ = <U+11800, U+11805, U+11840>. It corresponds to U+0F7E TIBETAN SIGN RJES SU NGA RO.

• U+11841 SOYOMBO SIGN VISARGA: Represents an allophone of r or s at word-final position in Devanagari orthography for Sanskrit. It is used in Mongolian Square for writing Sanskrit, and it corresponds to ཿ U+0F7F TIBETAN SIGN RNAM BCAD.

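3.8 Punctuation

• U+1186E SOYOMBO DANDA and U+1186F SOYOMBO DOUBLE DANDA: These characters are used for marking the ends of sentences and longer sections. They correspond to । U+0964 DEVANAGARI DANDA and ॥ U+0965 DEVANAGARI DOUBLE DANDA.

• U+11870 SOYOMBO TSHEG: This character is used for marking the end of a syllable. It corresponds to ་ U+0F0B TIBETAN MARK INTERSYLLABIC TSHEG.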

3.9 Head Marks

There are two ‘head marks’: U+11871 SOYOMBO HEAD MARK and U+11872 SOYOMBO TERMINAL MARK. These characters are used for indicating the beginning and end of a section of text.


3.10 Symbols

The U+11873 SOYOMBO SYMBOL SVAYAMBHU was designed by Zanabazar, the creator of the script. It is an officially recognized symbol in Mongolia and appears on the flag and coat of arms of the country (see Figures 34 and 35). The symbol is included as a character of the script, but it is not part of the orthography of written text.

4 Language-Specific Orthography

4.1 Tibetan

The representation of consonant clusters and syllable-final consonants in Tibetan words follows the orthography for Tibetan. For example, a word-initial bare consonant may be written using the regular letter instead of forming a conjunct with the following consonant. Also, a syllable-final consonant may be represented by the regular letter instead of the final-consonant sign. The word མཆོག mchog ‘supreme’ is written as <U+11815, U+1180E, U+11805, U+1180A, U+11870>, using the regular U+1180A SOYOMBO LETTER GA instead of U+1181D SOYOMBO CONSONANT SIGN G. The U+11870 SOYOMBO TSHEG is added afterwards to mark the end of the syllable. The expected representation of mchog is <U+11815, U+1184B, U+11805, U+1181D>.
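The relationship between the two spellings of mchog can be sketched in Python. This is an illustration under stated assumptions, not a rule from the proposal: the pairing of final-consonant signs with regular letters is inferred from the character names, the function `tibetan_style` is hypothetical, and the text above says only that such spellings may occur.

```python
# Proposed code points from this document (not final Unicode assignments).
TSHEG = 0x11870

# Each entry: (first subjoined form, last subjoined form, first regular letter)
SUBJOINED_RANGES = [
    (0x11847, 0x11859, 0x1180A),  # SUBJOINED GA..HA
    (0x1185A, 0x1186D, 0x11829),  # SUBJOINED GALIG GA..GALIG SMALL A
]

# Final-consonant sign -> regular letter, paired by name (assumption).
FINAL_SIGN_TO_LETTER = {
    0x1181D: 0x1180A,  # G  -> GA
    0x1181E: 0x1180B,  # K  -> KA
    0x1181F: 0x1180C,  # NG -> NGA
    0x11820: 0x11810,  # D  -> DA
    0x11821: 0x11812,  # N  -> NA
    0x11822: 0x11813,  # B  -> BA
    0x11823: 0x11815,  # M  -> MA
    0x11824: 0x11817,  # R  -> RA
    0x11825: 0x11819,  # L  -> LA
    0x11826: 0x1181A,  # SH -> SHA
    0x11827: 0x1181B,  # S  -> SA
}


def tibetan_style(expected: list[int]) -> list[int]:
    """Respell one syllable with regular letters and a closing tsheg."""
    out = []
    for cp in expected:
        # Subjoined letter -> regular letter.
        for first, last, regular_first in SUBJOINED_RANGES:
            if first <= cp <= last:
                cp = regular_first + (cp - first)
                break
        # Final-consonant sign -> regular letter; other characters unchanged.
        out.append(FINAL_SIGN_TO_LETTER.get(cp, cp))
    out.append(TSHEG)
    return out


mchog_expected = [0x11815, 0x1184B, 0x11805, 0x1181D]
print(" ".join(f"U+{cp:04X}" for cp in tibetan_style(mchog_expected)))
```

The sketch handles a single syllable and replaces every subjoined letter, so it would flatten genuine Tibetan stacks as well; a real converter would need to know which clusters are written as conjuncts.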

5 Rendering and Shaping Behaviors

5.1 Vertical Text

Soyombo may be written vertically, from top to bottom and left to right. For example, the text segment dzürka appears in the seal in Figure 27:

dzürka = <U+11839 GALIG DZA, U+11802 VOWEL SIGN UE, U+11824 CONSONANT SIGN R, U+1180B LETTER KA>

In a vertical environment, Soyombo orthography departs from the general rules for writing syllables. The syllables of a word may be written using subjoined letters, as if forming a conjunct. The example below shows the word temedeg ‘symbol’ written vertically. Its three syllables have the same vowel, which is represented using a single vowel sign above the base letter; the consonants of non-initial syllables are written using subjoined letters, while the final consonant is written using a final-consonant sign.

temedeg (horizontal) = <U+11811 TA, U+11804 E, U+11815 MA, U+11804 E, U+11810 DA, U+11804 E, U+1181D CONSONANT SIGN G>
temedeg (vertical) = <U+11811 TA, U+11804 E, U+11852 SUBJOINED MA, U+1184D SUBJOINED DA, U+1181D CONSONANT SIGN G>

A similar practice is observed in the vertical form of words composed of syllables that contain different vowels. The example below shows the word yenu. Its two syllables have different vowels, which are written above and below the base; the consonant of the second syllable is written as a subjoined form.

yenu (horizontal) = <U+11816 YA, U+11804 E, U+11812 NA, U+11803 U>
yenu (vertical) = <U+11816 YA, U+11804 E, U+1184F SUBJOINED NA, U+11803 U>


It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if the vowels were different and written in the same position, because the vowel signs would clash.
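The stacking behavior described for vertical text can be sketched in Python. This is a tentative model only: the code points are those proposed in this document, the position table covers just the three vowel signs whose placement is stated in the text (E above; UE and U below), and `vertical_form` and the other names are hypothetical.

```python
# Proposed code points from this document (not final Unicode assignments).
VOWEL_POSITION = {
    0x11804: "above",  # VOWEL SIGN E
    0x11802: "below",  # VOWEL SIGN UE
    0x11803: "below",  # VOWEL SIGN U
}

# Each entry: (first regular letter, last regular letter, first subjoined form)
SUBJOINED_RANGES = [
    (0x1180A, 0x1181C, 0x11847),
    (0x11829, 0x1183C, 0x1185A),
]


def to_subjoined(cp):
    for first, last, sub_first in SUBJOINED_RANGES:
        if first <= cp <= last:
            return sub_first + (cp - first)
    raise ValueError(f"U+{cp:04X} has no subjoined form")


def vertical_form(syllables, final_sign=None):
    """Stack (consonant, vowel sign) syllables on the first consonant.

    Raises ValueError when different vowels share a position, and KeyError
    for vowel signs whose position is not covered by VOWEL_POSITION.
    """
    vowels = [vowel for _, vowel in syllables]
    same_vowel = len(set(vowels)) == 1
    positions = [VOWEL_POSITION[vowel] for vowel in vowels]
    distinct_positions = len(set(positions)) == len(positions)
    if not (same_vowel or distinct_positions):
        raise ValueError("vowel signs would clash")

    base, first_vowel = syllables[0]
    out = [base, first_vowel]
    for consonant, vowel in syllables[1:]:
        out.append(to_subjoined(consonant))
        if not same_vowel:
            out.append(vowel)  # a differing vowel keeps its own sign
    if final_sign is not None:
        out.append(final_sign)
    return out


temedeg = vertical_form(
    [(0x11811, 0x11804), (0x11815, 0x11804), (0x11810, 0x11804)],
    final_sign=0x1181D,
)
yenu = vertical_form([(0x11816, 0x11804), (0x11812, 0x11803)])
print([f"U+{cp:04X}" for cp in temedeg])
print([f"U+{cp:04X}" for cp in yenu])
```

The clash check mirrors the conjecture in the paragraph above; since the text presents that condition as likely rather than certain, the check should be read the same way.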

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs: U+11802 SOYOMBO VOWEL SIGN UE and U+11803 SOYOMBO VOWEL SIGN U.

• Shaping and positioning of U+11802 VOWEL SIGN UE: küng = <U+1180B LETTER KA, U+11802 VOWEL SIGN UE, U+1181F CONSONANT SIGN NG>

• Shaping and positioning of U+11803 VOWEL SIGN U: kung = <U+1180B LETTER KA, U+11803 VOWEL SIGN U, U+1181F CONSONANT SIGN NG>
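A renderer needs to detect the context in which this adjustment applies. The Python sketch below shows one way to test for it, assuming the proposed code points from this document; `needs_vowel_adjustment` is a hypothetical helper, and real shaping would be done in font tables rather than in application code.

```python
# Proposed code points from this document (not final Unicode assignments).
BELOW_BASE_VOWELS = {0x11802, 0x11803}    # VOWEL SIGN UE, VOWEL SIGN U
FINAL_SIGNS = range(0x1181D, 0x11829)     # CONSONANT SIGN G..CONSONANT SIGN ANG


def needs_vowel_adjustment(sequence):
    """True if a below-base vowel sign is directly followed by a final-consonant sign."""
    return any(
        current in BELOW_BASE_VOWELS and following in FINAL_SIGNS
        for current, following in zip(sequence, sequence[1:])
    )


kung = [0x1180B, 0x11803, 0x1181F]   # KA + U + NG
keng = [0x1180B, 0x11804, 0x1181F]   # KA + E + NG (E is written above the base)
print(needs_vowel_adjustment(kung), needs_vowel_adjustment(keng))
```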

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the U+11842 SOYOMBO GEMINATION SIGN within the normal x-height.

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <U+1180A, U+11853>, the regular shape of U+1180A SOYOMBO LETTER GA is altered: the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C SOYOMBO LETTER NGA is written with U+1180A GA, U+1180B KA, U+11829 GALIG GA, or U+1182A GALIG GHA. For instance, in writing ṅga, the regular shape of U+1180C changes before the following letter is attached. A further modification is the break in the frame that occurs in certain conjuncts.

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases, it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters. The position of the second subjoined letter would straddle the baseline or fall below it. In such cases, the stem of the base-letter frame would be extended.


grva = <U+11829 GALIG GA, U+11854 SUBJOINED RA, U+11855 SUBJOINED VA>

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. a stemmed variant glyph for U+11855 SOYOMBO SUBJOINED LETTER VA.

6 Character Data

6.1 Character Properties

Character properties, given in the data format of UnicodeData.txt:

11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11802;SOYOMBO VOWEL SIGN UE;Mn;0;NSM;;;;;N;;;;;
11803;SOYOMBO VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11804;SOYOMBO VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11805;SOYOMBO VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11806;SOYOMBO VOWEL SIGN OE;Mn;0;NSM;;;;;N;;;;;
11807;SOYOMBO VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11808;SOYOMBO VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11809;SOYOMBO VOWEL LENGTH MARK;Mn;0;NSM;;;;;N;;;;;
1180A;SOYOMBO LETTER GA;Lo;0;L;;;;;N;;;;;
1180B;SOYOMBO LETTER KA;Lo;0;L;;;;;N;;;;;
1180C;SOYOMBO LETTER NGA;Lo;0;L;;;;;N;;;;;
1180D;SOYOMBO LETTER JA;Lo;0;L;;;;;N;;;;;
1180E;SOYOMBO LETTER CA;Lo;0;L;;;;;N;;;;;
1180F;SOYOMBO LETTER NYA;Lo;0;L;;;;;N;;;;;
11810;SOYOMBO LETTER DA;Lo;0;L;;;;;N;;;;;
11811;SOYOMBO LETTER TA;Lo;0;L;;;;;N;;;;;
11812;SOYOMBO LETTER NA;Lo;0;L;;;;;N;;;;;
11813;SOYOMBO LETTER BA;Lo;0;L;;;;;N;;;;;
11814;SOYOMBO LETTER PA;Lo;0;L;;;;;N;;;;;
11815;SOYOMBO LETTER MA;Lo;0;L;;;;;N;;;;;
11816;SOYOMBO LETTER YA;Lo;0;L;;;;;N;;;;;
11817;SOYOMBO LETTER RA;Lo;0;L;;;;;N;;;;;
11818;SOYOMBO LETTER VA;Lo;0;L;;;;;N;;;;;
11819;SOYOMBO LETTER LA;Lo;0;L;;;;;N;;;;;
1181A;SOYOMBO LETTER SHA;Lo;0;L;;;;;N;;;;;
1181B;SOYOMBO LETTER SA;Lo;0;L;;;;;N;;;;;
1181C;SOYOMBO LETTER HA;Lo;0;L;;;;;N;;;;;
1181D;SOYOMBO CONSONANT SIGN G;Mn;0;NSM;;;;;N;;;;;
1181E;SOYOMBO CONSONANT SIGN K;Mn;0;NSM;;;;;N;;;;;
1181F;SOYOMBO CONSONANT SIGN NG;Mn;0;NSM;;;;;N;;;;;
11820;SOYOMBO CONSONANT SIGN D;Mn;0;NSM;;;;;N;;;;;
11821;SOYOMBO CONSONANT SIGN N;Mn;0;NSM;;;;;N;;;;;
11822;SOYOMBO CONSONANT SIGN B;Mn;0;NSM;;;;;N;;;;;
11823;SOYOMBO CONSONANT SIGN M;Mn;0;NSM;;;;;N;;;;;
11824;SOYOMBO CONSONANT SIGN R;Mn;0;NSM;;;;;N;;;;;
11825;SOYOMBO CONSONANT SIGN L;Mn;0;NSM;;;;;N;;;;;
11826;SOYOMBO CONSONANT SIGN SH;Mn;0;NSM;;;;;N;;;;;
11827;SOYOMBO CONSONANT SIGN S;Mn;0;NSM;;;;;N;;;;;
11828;SOYOMBO CONSONANT SIGN ANG;Mn;0;NSM;;;;;N;;;;;
11829;SOYOMBO LETTER GALIG GA;Lo;0;L;;;;;N;;;;;
1182A;SOYOMBO LETTER GALIG GHA;Lo;0;L;;;;;N;;;;;
1182B;SOYOMBO LETTER GALIG JA;Lo;0;L;;;;;N;;;;;
1182C;SOYOMBO LETTER GALIG JHA;Lo;0;L;;;;;N;;;;;
1182D;SOYOMBO LETTER GALIG TTA;Lo;0;L;;;;;N;;;;;
1182E;SOYOMBO LETTER GALIG TTHA;Lo;0;L;;;;;N;;;;;
1182F;SOYOMBO LETTER GALIG DDA;Lo;0;L;;;;;N;;;;;
11830;SOYOMBO LETTER GALIG DDHA;Lo;0;L;;;;;N;;;;;
11831;SOYOMBO LETTER GALIG NNA;Lo;0;L;;;;;N;;;;;
11832;SOYOMBO LETTER GALIG DA;Lo;0;L;;;;;N;;;;;
11833;SOYOMBO LETTER GALIG DHA;Lo;0;L;;;;;N;;;;;
11834;SOYOMBO LETTER GALIG BA;Lo;0;L;;;;;N;;;;;
11835;SOYOMBO LETTER GALIG BHA;Lo;0;L;;;;;N;;;;;
11836;SOYOMBO LETTER GALIG SSA;Lo;0;L;;;;;N;;;;;
11837;SOYOMBO LETTER GALIG TSA;Lo;0;L;;;;;N;;;;;
11838;SOYOMBO LETTER GALIG TSHA;Lo;0;L;;;;;N;;;;;
11839;SOYOMBO LETTER GALIG DZA;Lo;0;L;;;;;N;;;;;
1183A;SOYOMBO LETTER GALIG ZHA;Lo;0;L;;;;;N;;;;;
1183B;SOYOMBO LETTER GALIG ZA;Lo;0;L;;;;;N;;;;;
1183C;SOYOMBO LETTER GALIG SMALL A;Lo;0;L;;;;;N;;;;;
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
1183E;SOYOMBO VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
1183F;SOYOMBO VOWEL SIGN VOCALIC L;Mn;0;NSM;;;;;N;;;;;
11840;SOYOMBO SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11842;SOYOMBO GEMINATION SIGN;Mn;0;NSM;;;;;N;;;;;
11843;SOYOMBO PREFIXED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11844;SOYOMBO PREFIXED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11845;SOYOMBO PREFIXED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11846;SOYOMBO PREFIXED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11847;SOYOMBO SUBJOINED LETTER GA;Mn;0;NSM;;;;;N;;;;;
11848;SOYOMBO SUBJOINED LETTER KA;Mn;0;NSM;;;;;N;;;;;
11849;SOYOMBO SUBJOINED LETTER NGA;Mn;0;NSM;;;;;N;;;;;
1184A;SOYOMBO SUBJOINED LETTER JA;Mn;0;NSM;;;;;N;;;;;
1184B;SOYOMBO SUBJOINED LETTER CA;Mn;0;NSM;;;;;N;;;;;
1184C;SOYOMBO SUBJOINED LETTER NYA;Mn;0;NSM;;;;;N;;;;;
1184D;SOYOMBO SUBJOINED LETTER DA;Mn;0;NSM;;;;;N;;;;;
1184E;SOYOMBO SUBJOINED LETTER TA;Mn;0;NSM;;;;;N;;;;;
1184F;SOYOMBO SUBJOINED LETTER NA;Mn;0;NSM;;;;;N;;;;;
11850;SOYOMBO SUBJOINED LETTER BA;Mn;0;NSM;;;;;N;;;;;
11851;SOYOMBO SUBJOINED LETTER PA;Mn;0;NSM;;;;;N;;;;;
11852;SOYOMBO SUBJOINED LETTER MA;Mn;0;NSM;;;;;N;;;;;
11853;SOYOMBO SUBJOINED LETTER YA;Mn;0;NSM;;;;;N;;;;;
11854;SOYOMBO SUBJOINED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11855;SOYOMBO SUBJOINED LETTER VA;Mn;0;NSM;;;;;N;;;;;
11856;SOYOMBO SUBJOINED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11857;SOYOMBO SUBJOINED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11858;SOYOMBO SUBJOINED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11859;SOYOMBO SUBJOINED LETTER HA;Mn;0;NSM;;;;;N;;;;;
1185A;SOYOMBO SUBJOINED LETTER GALIG GA;Mn;0;NSM;;;;;N;;;;;
1185B;SOYOMBO SUBJOINED LETTER GALIG GHA;Mn;0;NSM;;;;;N;;;;;
1185C;SOYOMBO SUBJOINED LETTER GALIG JA;Mn;0;NSM;;;;;N;;;;;
1185D;SOYOMBO SUBJOINED LETTER GALIG JHA;Mn;0;NSM;;;;;N;;;;;
1185E;SOYOMBO SUBJOINED LETTER GALIG TTA;Mn;0;NSM;;;;;N;;;;;
1185F;SOYOMBO SUBJOINED LETTER GALIG TTHA;Mn;0;NSM;;;;;N;;;;;
11860;SOYOMBO SUBJOINED LETTER GALIG DDA;Mn;0;NSM;;;;;N;;;;;
11861;SOYOMBO SUBJOINED LETTER GALIG DDHA;Mn;0;NSM;;;;;N;;;;;
11862;SOYOMBO SUBJOINED LETTER GALIG NNA;Mn;0;NSM;;;;;N;;;;;
11863;SOYOMBO SUBJOINED LETTER GALIG DA;Mn;0;NSM;;;;;N;;;;;
11864;SOYOMBO SUBJOINED LETTER GALIG DHA;Mn;0;NSM;;;;;N;;;;;
11865;SOYOMBO SUBJOINED LETTER GALIG BA;Mn;0;NSM;;;;;N;;;;;
11866;SOYOMBO SUBJOINED LETTER GALIG BHA;Mn;0;NSM;;;;;N;;;;;
11867;SOYOMBO SUBJOINED LETTER GALIG SSA;Mn;0;NSM;;;;;N;;;;;
11868;SOYOMBO SUBJOINED LETTER GALIG TSA;Mn;0;NSM;;;;;N;;;;;
11869;SOYOMBO SUBJOINED LETTER GALIG TSHA;Mn;0;NSM;;;;;N;;;;;
1186A;SOYOMBO SUBJOINED LETTER GALIG DZA;Mn;0;NSM;;;;;N;;;;;
1186B;SOYOMBO SUBJOINED LETTER GALIG ZHA;Mn;0;NSM;;;;;N;;;;;
1186C;SOYOMBO SUBJOINED LETTER GALIG ZA;Mn;0;NSM;;;;;N;;;;;
1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
1186F;SOYOMBO DOUBLE DANDA;Po;0;L;;;;;N;;;;;
11870;SOYOMBO TSHEG;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11872;SOYOMBO TERMINAL MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
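Records in UnicodeData.txt format are semicolon-separated, with the code point, name, general category, canonical combining class, and bidirectional class in the first five fields and the mirrored flag in the tenth. The Python sketch below reads those fields from a few sample records; the sample lines follow the proposed data above, and `parse_unicode_data` is a hypothetical helper, not part of any Unicode tooling.

```python
SAMPLE = """\
11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
"""


def parse_unicode_data(text):
    """Return {code point: properties} for UnicodeData.txt-style records."""
    records = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        fields = line.split(";")
        records[int(fields[0], 16)] = {
            "name": fields[1],
            "general_category": fields[2],
            "combining_class": int(fields[3]),
            "bidi_class": fields[4],
            "mirrored": fields[9] == "Y",
        }
    return records


records = parse_unicode_data(SAMPLE)
# Nonspacing marks (Mn) are the characters that attach to a base letter.
marks = [f"U+{cp:04X}" for cp, props in records.items()
         if props["general_category"] == "Mn"]
print(marks)
```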

6.2 Linebreaking Properties

Linebreaking properties, given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
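The range data can be turned into a simple lookup. The Python sketch below hard-codes the proposed ranges from this section; `LINE_BREAK` and `line_break_class` are hypothetical names, and the fallback value XX (the class Unicode uses for unknown characters) is applied here to anything outside the proposed block.

```python
# Proposed line-break classes from this document (not final Unicode data).
LINE_BREAK = [
    (0x11800, 0x11800, "AL"),
    (0x11801, 0x11809, "CM"),
    (0x1180A, 0x1181C, "AL"),
    (0x1181D, 0x11828, "CM"),
    (0x11829, 0x1183D, "AL"),
    (0x1183E, 0x11842, "CM"),
    (0x11843, 0x1186D, "CM"),
    (0x1186E, 0x1186F, "BA"),
    (0x11870, 0x11870, "BA"),
    (0x11871, 0x11871, "BB"),
    (0x11872, 0x11872, "BA"),
    (0x11873, 0x11873, "AL"),
]


def line_break_class(cp):
    """Return the proposed line-break class for a code point, or XX if none."""
    for first, last, cls in LINE_BREAK:
        if first <= cp <= last:
            return cls
    return "XX"


# BB allows a break before the head mark; BA allows one after the tsheg.
print(line_break_class(0x11871), line_break_class(0x11870))
```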

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав). Ed. by T. Bulgan. Ulaanbaatar: Shambala Association; System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”. v. 0.9, September 2, 1996. http://userpage.fu-berlin.de/~corff/im/Soyombo/overview.Soyombo.html

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie” (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. http://www.viahistoria.com/SilverHorde/main.html?research=MongolScripts.html

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xäwtää Dörböljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.


Figure 1: Proposed code chart for Soyombo (U+11800..U+1187F).

Vowel carrier
11800  SOYOMBO LETTER A

Vowel signs
11801  SOYOMBO VOWEL SIGN I
11802  SOYOMBO VOWEL SIGN UE
11803  SOYOMBO VOWEL SIGN U
11804  SOYOMBO VOWEL SIGN E
11805  SOYOMBO VOWEL SIGN O
11806  SOYOMBO VOWEL SIGN OE
11807  SOYOMBO VOWEL SIGN AU
11808  SOYOMBO VOWEL SIGN AI

Vowel length mark
11809  SOYOMBO VOWEL LENGTH MARK

Consonants
1180A  SOYOMBO LETTER GA
       • Used for Sanskrit ka
1180B  SOYOMBO LETTER KA
       • Used for Sanskrit kha
1180C  SOYOMBO LETTER NGA
1180D  SOYOMBO LETTER JA
       • Used for Sanskrit ca
1180E  SOYOMBO LETTER CA
       • Used for Sanskrit cha
1180F  SOYOMBO LETTER NYA
11810  SOYOMBO LETTER DA
       • Used for Sanskrit ta
11811  SOYOMBO LETTER TA
       • Used for Sanskrit tha
11812  SOYOMBO LETTER NA
11813  SOYOMBO LETTER BA
       • Used for Sanskrit pa
11814  SOYOMBO LETTER PA
       • Used for Sanskrit pha
11815  SOYOMBO LETTER MA
11816  SOYOMBO LETTER YA
11817  SOYOMBO LETTER RA
11818  SOYOMBO LETTER VA
11819  SOYOMBO LETTER LA
1181A  SOYOMBO LETTER SHA
1181B  SOYOMBO LETTER SA
1181C  SOYOMBO LETTER HA

Final consonant signs
1181D  SOYOMBO CONSONANT SIGN G
1181E  SOYOMBO CONSONANT SIGN K
1181F  SOYOMBO CONSONANT SIGN NG
11820  SOYOMBO CONSONANT SIGN D
11821  SOYOMBO CONSONANT SIGN N
11822  SOYOMBO CONSONANT SIGN B
11823  SOYOMBO CONSONANT SIGN M
11824  SOYOMBO CONSONANT SIGN R
11825  SOYOMBO CONSONANT SIGN L
11826  SOYOMBO CONSONANT SIGN SH
11827  SOYOMBO CONSONANT SIGN S
11828  SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829  SOYOMBO LETTER GALIG GA
1182A  SOYOMBO LETTER GALIG GHA
1182B  SOYOMBO LETTER GALIG JA
1182C  SOYOMBO LETTER GALIG JHA
1182D  SOYOMBO LETTER GALIG TTA
1182E  SOYOMBO LETTER GALIG TTHA
1182F  SOYOMBO LETTER GALIG DDA
11830  SOYOMBO LETTER GALIG DDHA
11831  SOYOMBO LETTER GALIG NNA
11832  SOYOMBO LETTER GALIG DA
11833  SOYOMBO LETTER GALIG DHA
11834  SOYOMBO LETTER GALIG BA
11835  SOYOMBO LETTER GALIG BHA
11836  SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837  SOYOMBO LETTER GALIG TSA
11838  SOYOMBO LETTER GALIG TSHA
11839  SOYOMBO LETTER GALIG DZA
1183A  SOYOMBO LETTER GALIG ZHA
1183B  SOYOMBO LETTER GALIG ZA
1183C  SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D  SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E  SOYOMBO SIGN VOCALIC R
1183F  SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840  SOYOMBO SIGN ANUSVARA
11841  SOYOMBO SIGN VISARGA

Gemination mark
11842  SOYOMBO GEMINATION MARK

Prefixed letters
11843  SOYOMBO PREFIXED LETTER LA
11844  SOYOMBO PREFIXED LETTER SHA
11845  SOYOMBO PREFIXED LETTER SA
11846  SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847  SOYOMBO SUBJOINED LETTER GA
11848  SOYOMBO SUBJOINED LETTER KA
11849  SOYOMBO SUBJOINED LETTER NGA
1184A  SOYOMBO SUBJOINED LETTER JA
1184B  SOYOMBO SUBJOINED LETTER CA
1184C  SOYOMBO SUBJOINED LETTER NYA
1184D  SOYOMBO SUBJOINED LETTER DA
1184E  SOYOMBO SUBJOINED LETTER TA
1184F  SOYOMBO SUBJOINED LETTER NA
11850  SOYOMBO SUBJOINED LETTER BA
11851  SOYOMBO SUBJOINED LETTER PA
11852  SOYOMBO SUBJOINED LETTER MA
11853  SOYOMBO SUBJOINED LETTER YA
11854  SOYOMBO SUBJOINED LETTER RA
11855  SOYOMBO SUBJOINED LETTER VA
11856  SOYOMBO SUBJOINED LETTER LA
11857  SOYOMBO SUBJOINED LETTER SHA
11858  SOYOMBO SUBJOINED LETTER SA
11859  SOYOMBO SUBJOINED LETTER HA
1185A  SOYOMBO SUBJOINED LETTER GALIG GA
1185B  SOYOMBO SUBJOINED LETTER GALIG GHA
1185C  SOYOMBO SUBJOINED LETTER GALIG JA
1185D  SOYOMBO SUBJOINED LETTER GALIG JHA
1185E  SOYOMBO SUBJOINED LETTER GALIG TTA
1185F  SOYOMBO SUBJOINED LETTER GALIG TTHA
11860  SOYOMBO SUBJOINED LETTER GALIG DDA
11861  SOYOMBO SUBJOINED LETTER GALIG DDHA
11862  SOYOMBO SUBJOINED LETTER GALIG NNA
11863  SOYOMBO SUBJOINED LETTER GALIG DA
11864  SOYOMBO SUBJOINED LETTER GALIG DHA
11865  SOYOMBO SUBJOINED LETTER GALIG BA
11866  SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3).

11867  SOYOMBO SUBJOINED LETTER GALIG SSA
11868  SOYOMBO SUBJOINED LETTER GALIG TSA
11869  SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A  SOYOMBO SUBJOINED LETTER GALIG DZA
1186B  SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C  SOYOMBO SUBJOINED LETTER GALIG ZA
1186D  SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E  SOYOMBO DANDA
1186F  SOYOMBO DOUBLE DANDA
11870  SOYOMBO TSHEG
       → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871  SOYOMBO HEAD MARK
11872  SOYOMBO TERMINAL MARK

Symbol
11873  SOYOMBO SYMBOL SVAYAMBHU
       • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2).

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996). Elements shown: Base, (Vd), (Vab), (Vbb), (Cf), (VL).

Figure 5: Structure of a Sanskrit syllable in Soyombo. Elements shown: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl).

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 (1)

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes; (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block: Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document): A-Contemporary  B.1-Specialized (small collection)  B.2-Specialized (large collection) X  C-Major extinct  D-Attested extinct  E-Minor extinct  F-Archaic Hieroglyphic or Ideographic  G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

(1) Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No. If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? György Kara <gkara@indiana.edu>. If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes. Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes. If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: Combining vowel and final-consonant signs. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No. If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:


Proposal to Encode the Soyombo Script in ISO/IEC 10646 (Anshuman Pandey)

3.6 Consonant Conjuncts

Consonant clusters in Sanskrit are represented as conjuncts. The initial consonant in a conjunct is the base letter. Non-initial consonants are written as subjoined forms, which are stacked within the frame of the base letter. Certain letters have prefixed forms when they occur in conjunct-initial position. These prefixed forms are attached to the following consonant. There is one conjunct that is written using an atomic ligature.

3.6.1 Prefixed forms

When U+11819 LETTER LA, U+1181A LETTER SHA, U+1181B LETTER SA, and U+11817 LETTER RA occur as C1, they are written using prefixed forms that are attached to the regular letter for C2. In such conjuncts, C2 functions as the base letter, but C1 is parsed first in the logical order.

• U+11843 PREFIXED LETTER LA: Conjunct-initial form of U+11819 LETTER LA. It is used for writing Tibetan ལ་མགོ la-mgo letters: lka = <U+11843, U+1180A>.

• U+11844 PREFIXED LETTER SHA: Conjunct-initial form of U+1181A LETTER SHA: śka = <U+11844, U+1180A>.

• U+11845 PREFIXED LETTER SA: Conjunct-initial form of U+1181B LETTER SA. It is used for writing Tibetan ས་མགོ sa-mgo letters: ska = <U+11845, U+1180A>.

• U+11846 PREFIXED LETTER RA: Conjunct-initial form of U+11817 LETTER RA. It is used for writing Tibetan ར་མགོ ra-mgo letters: rka = <U+11846, U+1180A>.
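
The prefixed-form model above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the code points are the ones proposed in this document, and the names `PREFIXED_FORM`, `prefixed_conjunct`, and `fmt` are hypothetical helpers, not part of any existing library.

```python
# Proposed code points from this document (illustrative only).
LETTER_GA = 0x1180A  # regular letter GA, used for Sanskrit ka

# Regular letter -> prefixed (conjunct-initial) form.
PREFIXED_FORM = {
    0x11819: 0x11843,  # LETTER LA  -> PREFIXED LETTER LA
    0x1181A: 0x11844,  # LETTER SHA -> PREFIXED LETTER SHA
    0x1181B: 0x11845,  # LETTER SA  -> PREFIXED LETTER SA
    0x11817: 0x11846,  # LETTER RA  -> PREFIXED LETTER RA
}


def prefixed_conjunct(c1: int, c2: int) -> list[int]:
    """Return the logical sequence <prefixed C1, regular C2>.

    C1 comes first in logical order even though C2 acts as the base letter.
    """
    if c1 not in PREFIXED_FORM:
        raise ValueError(f"{c1:04X} has no prefixed form")
    return [PREFIXED_FORM[c1], c2]


def fmt(seq: list[int]) -> str:
    """Format a sequence of code points as <XXXXX, XXXXX>."""
    return "<" + ", ".join(f"{cp:04X}" for cp in seq) + ">"


if __name__ == "__main__":
    print("lka =", fmt(prefixed_conjunct(0x11819, LETTER_GA)))
    print("rka =", fmt(prefixed_conjunct(0x11817, LETTER_GA)))
```

A lookup table is used here rather than arithmetic because the four regular letters are not contiguous in the proposed block.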

3.6.2 Subjoined forms

Non-initial consonants of a cluster are written as subjoined forms attached to C1.

Regular letters                              Subjoined forms
1180A–11813 (GA to BA)                       11847–11850
11814–1181C (PA to HA)                       11851–11859
11829–11832 (GALIG GA to GALIG DA)           1185A–11863
11833–1183C (GALIG DHA to GALIG SMALL A)     11864–1186D

The subjoined form is the base element of a letter. The glyphic representation of subjoined letters is discussed further in Section 5.5.
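
As a minimal sketch, and assuming the code point layout proposed in this document, the correspondence between regular letters and subjoined letters reduces to two constant offsets, because the two runs of consonant letters are separated by the final-consonant signs. The function names `subjoined_form` and `encode_cluster` are hypothetical.

```python
def subjoined_form(cp: int) -> int:
    """Map a regular consonant letter to its proposed subjoined form."""
    if 0x1180A <= cp <= 0x1181C:   # LETTER GA .. LETTER HA
        return cp + 0x3D           # -> 11847 .. 11859
    if 0x11829 <= cp <= 0x1183C:   # LETTER GALIG GA .. LETTER GALIG SMALL A
        return cp + 0x31           # -> 1185A .. 1186D
    raise ValueError(f"{cp:04X} is not a consonant letter with a subjoined form")


def encode_cluster(consonants: list[int]) -> list[int]:
    """Encode a cluster: C1 stays a regular letter, C2..Cn become subjoined."""
    if not consonants:
        raise ValueError("empty cluster")
    head, *rest = consonants
    return [head] + [subjoined_form(c) for c in rest]


if __name__ == "__main__":
    # kya: LETTER GA followed by SUBJOINED LETTER YA
    print([f"{cp:04X}" for cp in encode_cluster([0x1180A, 0x11816])])
```

Prefixed forms (Section 3.6.1) are deliberately left out of this sketch; a fuller encoder would check for them before falling back to subjoined letters.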

3.6.3 Independent conjunct

The conjunct U+1183D LETTER KSSA is used for writing the Sanskrit cluster kṣa (क्ष <U+0915, U+094D, U+0937>). It is proposed for independent encoding as U+1183D LETTER KSSA. It is different from other conjuncts because it cannot be produced using a subjoined letter: the shape of U+11867 SUBJOINED LETTER GALIG SSA does not correspond to any element in the graphical structure of U+1183D.

3.6.4 Atomic ligatures in script charts

The following conjuncts are enumerated in charts of Soyombo:

kṣa, kya, kra, kla, kva, kka, ṅka, ñca, ṇṭa, nta, mpa, lka, śka, ska, rka

They are not atomic characters, but conjuncts. Moreover, they are not the only conjuncts used in Soyombo; others are shown in manuscripts. These conjuncts are likely shown in charts for purposes of illustrating the types of conjuncts that may be written.

3.7 Various Signs

• U+11840 SIGN ANUSVARA: Indicates nasalization in words of Sanskrit origin, e.g. oṃ <U+11800, U+11805, U+11840>. It corresponds to Tibetan U+0F7E.

• U+11841 SIGN VISARGA: Represents an allophone of r or s at word-final position in Devanagari orthography for Sanskrit. It is used in Mongolian Square for writing Sanskrit, and it corresponds to Tibetan ཿ U+0F7F.

3.8 Punctuation

• U+1186E DANDA and U+1186F DOUBLE DANDA: These characters are used for marking the ends of sentences and longer sections. They correspond to Devanagari U+0964 and U+0965, respectively.

• U+11870 TSHEG: This character is used for marking the end of a syllable. It corresponds to Tibetan U+0F0B.

3.9 Head Marks

There are two ‘head marks’: U+11871 HEAD MARK and U+11872 TERMINAL MARK. These characters are used for indicating the beginning and end of a section of text.


3.10 Symbols

The symbol U+11873 SYMBOL SVAYAMBHU was designed by Zanabazar, the creator of the script. It is an officially recognized symbol in Mongolia and appears on the flag and coat of arms of the country (see Figures 34 and 35). The symbol is included as a character of the script, but it is not part of the orthography of written text.

4 Language-Specific Orthography

4.1 Tibetan

The representation of consonant clusters and syllable-final consonants in Tibetan words follows the orthography for Tibetan. For example, a word-initial bare consonant may be written using the regular letter instead of forming a conjunct with the following consonant. Also, a syllable-final consonant may be represented by the regular letter instead of the final-consonant sign. The word མཆོག mchog ‘supreme’ is written as <U+11815 LETTER MA, U+1180E LETTER CA, U+11805 VOWEL SIGN O, U+1180A LETTER GA, U+11870 TSHEG>, using the regular U+1180A LETTER GA instead of U+1181D CONSONANT SIGN G. The U+11870 TSHEG is added afterwards to mark the end of the syllable. The expected representation of mchog is <U+11815 LETTER MA, U+1184B SUBJOINED LETTER CA, U+11805 VOWEL SIGN O, U+1181D CONSONANT SIGN G>.
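
The two spellings of mchog can be laid side by side as data. The following Python fragment is a small sketch using the code points and character names proposed in this document; `NAMES`, `TIBETAN_STYLE`, `EXPECTED_STYLE`, and `describe` are hypothetical names chosen for illustration.

```python
# Names taken from the proposed names list (with the SOYOMBO prefix omitted).
NAMES = {
    0x11815: "LETTER MA",
    0x1180E: "LETTER CA",
    0x1184B: "SUBJOINED LETTER CA",
    0x11805: "VOWEL SIGN O",
    0x1180A: "LETTER GA",
    0x1181D: "CONSONANT SIGN G",
    0x11870: "TSHEG",
}

# Spelling that follows Tibetan orthography: regular letters plus a tsheg.
TIBETAN_STYLE = [0x11815, 0x1180E, 0x11805, 0x1180A, 0x11870]

# Spelling expected under the general Soyombo rules: conjunct + final sign.
EXPECTED_STYLE = [0x11815, 0x1184B, 0x11805, 0x1181D]


def describe(seq: list[int]) -> str:
    """Render a sequence as 'XXXXX NAME, XXXXX NAME, ...'."""
    return ", ".join(f"{cp:04X} {NAMES[cp]}" for cp in seq)


if __name__ == "__main__":
    print("Tibetan style: ", describe(TIBETAN_STYLE))
    print("Expected style:", describe(EXPECTED_STYLE))
```

Both sequences stand for the same word, so a search or collation routine that wants to treat them alike would need to account for this orthographic variation.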

5 Rendering and Shaping Behaviors

5.1 Vertical Text

Soyombo may be written vertically, from top to bottom and left to right. For example, the text segment dzürka, as shown in the seal in Figure 27, is written as two stacked units:

dzür = <U+11839 LETTER GALIG DZA, U+11802 VOWEL SIGN UE, U+11824 CONSONANT SIGN R>
ka = <U+1180B LETTER KA>

In a vertical environment, Soyombo orthography departs from general rules for writing syllables. The syllables of a word may be written using subjoined letters, as if forming a conjunct. The example below shows the word temedeg ‘symbol’, <U+11811 LETTER TA, U+11804 VOWEL SIGN E, U+11815 LETTER MA, U+11804 VOWEL SIGN E, U+11810 LETTER DA, U+11804 VOWEL SIGN E, U+1181D CONSONANT SIGN G>, written vertically. Its three syllables have the same vowel, which is represented using a single vowel sign above the base letter; the consonants of non-initial syllables are written using subjoined letters, while the final consonant is written using a final-consonant sign.

Vertical form: <U+11811, U+11804, U+11852 SUBJOINED LETTER MA, U+1184D SUBJOINED LETTER DA, U+1181D>
Syllables: <U+11811, U+11804> - <U+11815, U+11804> - <U+11810, U+11804> - <U+1181D>

A similar practice is observed in the vertical form of words comprised of syllables that contain different vowels. The example below shows the word yenu, <U+11816 LETTER YA, U+11804 VOWEL SIGN E, U+11812 LETTER NA, U+11803 VOWEL SIGN U>. Its two syllables have different vowels, which are written above and below the base; the consonant of the second syllable is written as a subjoined form.

Vertical form: <U+11816, U+11804, U+1184F SUBJOINED LETTER NA, U+11803>
Syllables: <U+11816, U+11804> - <U+11812, U+11803>

It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if the vowels were different and written in the same position, in order to prevent clashing.
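
The vertical-stacking behavior described in this section can be approximated with a short Python sketch. It is an illustration only, not a specification: the function `stack_vertically` and its helpers are hypothetical, the code points are those proposed in this document, and the sketch does not test for the clashing condition noted above (different vowels written in the same position), so a caller would have to rule that case out first.

```python
def is_letter(cp: int) -> bool:
    """Regular consonant letters: GA..HA and GALIG GA..GALIG SMALL A."""
    return 0x1180A <= cp <= 0x1181C or 0x11829 <= cp <= 0x1183C


def is_vowel_sign(cp: int) -> bool:
    """Vowel signs I..AI in the proposed block."""
    return 0x11801 <= cp <= 0x11808


def subjoined_form(cp: int) -> int:
    """Offset from a regular letter to its proposed subjoined form."""
    return cp + (0x3D if cp <= 0x1181C else 0x31)


def stack_vertically(seq: list[int]) -> list[int]:
    """Rewrite a word so that non-initial syllables stack on the first base.

    The first consonant stays a regular letter, later consonants become
    subjoined letters, and a vowel sign is emitted only the first time it
    occurs. All other characters (e.g. final-consonant signs) pass through.
    """
    out: list[int] = []
    seen_vowels: set[int] = set()
    have_base = False
    for cp in seq:
        if is_letter(cp):
            out.append(subjoined_form(cp) if have_base else cp)
            have_base = True
        elif is_vowel_sign(cp):
            if cp not in seen_vowels:
                seen_vowels.add(cp)
                out.append(cp)
        else:
            out.append(cp)
    return out


if __name__ == "__main__":
    temedeg = [0x11811, 0x11804, 0x11815, 0x11804, 0x11810, 0x11804, 0x1181D]
    print([f"{cp:04X}" for cp in stack_vertically(temedeg)])
```

Applied to the horizontal spellings of temedeg and yenu, the sketch yields the same sequences as the vertical forms shown in this section.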

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs: U+11802 VOWEL SIGN UE and U+11803 VOWEL SIGN U.

• Shaping and positioning of U+11802 VOWEL SIGN UE: <U+1180B LETTER KA, U+11802 VOWEL SIGN UE, U+1181F CONSONANT SIGN NG> → küng

• Shaping and positioning of U+11803 VOWEL SIGN U: <U+1180B LETTER KA, U+11803 VOWEL SIGN U, U+1181F CONSONANT SIGN NG> → kung
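
A shaping engine or font would need to detect the contexts in which this adjustment applies. The following Python sketch shows one way to find them, assuming the proposed code points and assuming that the vowel sign is immediately followed by the final-consonant sign in the encoded text; the function name `vowel_signs_to_adjust` is hypothetical.

```python
# Vowel signs written at the base of the letter frame (proposed code points).
BASE_VOWEL_SIGNS = {0x11802, 0x11803}  # VOWEL SIGN UE, VOWEL SIGN U


def is_final_consonant_sign(cp: int) -> bool:
    """CONSONANT SIGN G .. CONSONANT SIGN ANG in the proposed block."""
    return 0x1181D <= cp <= 0x11828


def vowel_signs_to_adjust(seq: list[int]) -> list[int]:
    """Return the indexes of vowel signs that need the adjusted glyph.

    A base-position vowel sign qualifies when the next character is a
    final-consonant sign (an assumption of this sketch).
    """
    return [
        i
        for i in range(len(seq) - 1)
        if seq[i] in BASE_VOWEL_SIGNS and is_final_consonant_sign(seq[i + 1])
    ]


if __name__ == "__main__":
    kueng = [0x1180B, 0x11802, 0x1181F]  # küng
    ku = [0x1180B, 0x11803]              # no final-consonant sign
    print(vowel_signs_to_adjust(kueng), vowel_signs_to_adjust(ku))
```

In a real font this contextual test would more likely be expressed as a substitution or positioning rule; the list of indexes here only makes the condition explicit.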

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the gemination sign (U+11842) within the normal x-height. [Comparison glyphs not recoverable.]

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <U+1180A, U+11853>, the regular shape of U+1180A LETTER GA is altered so that the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C LETTER NGA is written with U+1180A LETTER GA, U+1180B LETTER KA, U+11829 LETTER GALIG GA, or U+1182A LETTER GALIG GHA. For instance, in writing nga, the regular shape of U+1180C LETTER NGA changes before the subjoined letter is attached. A further modification is the break in the frame that occurs in conjuncts involving [glyph not recoverable].

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases, it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters. The position of the second subjoined letter would straddle the baseline or fall below it. In such cases, the stem of the base-letter frame would be extended:

grva = <U+11829 LETTER GALIG GA, U+11854 SUBJOINED LETTER RA, U+11855 SUBJOINED LETTER VA>

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. a stemmed variant glyph for U+11855 SUBJOINED LETTER VA, etc.

6 Character Data

6.1 Character Properties

11800SOYOMBO LETTER ALo0LN11801SOYOMBO VOWEL SIGN IMn0NSMN11802SOYOMBO VOWEL SIGN UMn0NSMN11803SOYOMBO VOWEL SIGN UEMn0NSMN11804SOYOMBO VOWEL SIGN EMn0NSMN11805SOYOMBO VOWEL SIGN OMn0NSMN11806SOYOMBO VOWEL SIGN OEMn0NSMN11807SOYOMBO VOWEL SIGN AUMn0NSMN11808SOYOMBO VOWEL SIGN AIMn0NSMN11809SOYOMBO VOWEL LENGTH MARKMn0NSMN1180ASOYOMBO LETTER GALo0LN1180BSOYOMBO LETTER KALo0LN1180CSOYOMBO LETTER NGALo0LN1180DSOYOMBO LETTER JALo0LN1180ESOYOMBO LETTER CALo0LN1180FSOYOMBO LETTER NYALo0LN11810SOYOMBO LETTER DALo0LN11811SOYOMBO LETTER TALo0LN11812SOYOMBO LETTER NALo0LN11813SOYOMBO LETTER BALo0LN11814SOYOMBO LETTER PALo0LN11815SOYOMBO LETTER MALo0LN11816SOYOMBO LETTER YALo0LN11817SOYOMBO LETTER RALo0LN11818SOYOMBO LETTER VALo0LN11819SOYOMBO LETTER LALo0LN1181ASOYOMBO LETTER SHALo0LN1181BSOYOMBO LETTER SALo0LN1181CSOYOMBO LETTER HALo0LN1181DSOYOMBO CONSONANT SIGN GMn0NSMN1181ESOYOMBO CONSONANT SIGN KMn0NSMN1181FSOYOMBO CONSONANT SIGN NGMn0NSMN11820SOYOMBO CONSONANT SIGN DMn0NSMN11821SOYOMBO CONSONANT SIGN NMn0NSMN11822SOYOMBO CONSONANT SIGN BMn0NSMN11823SOYOMBO CONSONANT SIGN MMn0NSMN11824SOYOMBO CONSONANT SIGN RMn0NSMN11825SOYOMBO CONSONANT SIGN LMn0NSMN11826SOYOMBO CONSONANT SIGN SHMn0NSMN11827SOYOMBO CONSONANT SIGN SMn0NSMN11828SOYOMBO CONSONANT SIGN ANGMn0NSMN11829SOYOMBO LETTER GALIG GALo0LN1182ASOYOMBO LETTER GALIG GHALo0LN1182BSOYOMBO LETTER GALIG JALo0LN1182CSOYOMBO LETTER GALIG JHALo0LN1182DSOYOMBO LETTER GALIG TTALo0LN1182ESOYOMBO LETTER GALIG TTHALo0LN1182FSOYOMBO LETTER GALIG DDALo0LN

9

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

11830SOYOMBO LETTER GALIG DDHALo0LN11831SOYOMBO LETTER GALIG NNALo0LN11832SOYOMBO LETTER GALIG DALo0LN11833SOYOMBO LETTER GALIG DHALo0LN11834SOYOMBO LETTER GALIG BALo0LN11835SOYOMBO LETTER GALIG BHALo0LN11836SOYOMBO LETTER GALIG SSALo0LN11837SOYOMBO LETTER GALIG TSALo0LN11838SOYOMBO LETTER GALIG TSHALo0LN11839SOYOMBO LETTER GALIG DZALo0LN1183ASOYOMBO LETTER GALIG ZHALo0LN1183BSOYOMBO LETTER GALIG ZALo0LN1183CSOYOMBO LETTER GALIG SMALL ALo0LN1183DSOYOMBO LETTER KSSALo0LN1183ESOYOMBO VOWEL SIGN VOCALIC RMn0NSMN1183FSOYOMBO VOWEL SIGN VOCALIC LMn0NSMN11840SOYOMBO SIGN ANUSVARAMn0NSMN11841SOYOMBO SIGN VISARGAMc0LN11842SOYOMBO GEMINATION SIGNMn0NSMN11843SOYOMBO PREFIXED LETTER LAMn0NSMN11844SOYOMBO PREFIXED LETTER SHAMn0NSMN11845SOYOMBO PREFIXED LETTER SAMn0NSMN11846SOYOMBO PREFIXED LETTER RAMn0NSMN11847SOYOMBO SUBJOINED LETTER GAMn0NSMN11848SOYOMBO SUBJOINED LETTER KAMn0NSMN11849SOYOMBO SUBJOINED LETTER NGAMn0NSMN1184ASOYOMBO SUBJOINED LETTER JAMn0NSMN1184BSOYOMBO SUBJOINED LETTER CAMn0NSMN1184CSOYOMBO SUBJOINED LETTER NYAMn0NSMN1184DSOYOMBO SUBJOINED LETTER DAMn0NSMN1184ESOYOMBO SUBJOINED LETTER TAMn0NSMN1184FSOYOMBO SUBJOINED LETTER NAMn0NSMN11850SOYOMBO SUBJOINED LETTER BAMn0NSMN11851SOYOMBO SUBJOINED LETTER PAMn0NSMN11852SOYOMBO SUBJOINED LETTER MAMn0NSMN11853SOYOMBO SUBJOINED LETTER YAMn0NSMN11854SOYOMBO SUBJOINED LETTER RAMn0NSMN11855SOYOMBO SUBJOINED LETTER VAMn0NSMN11856SOYOMBO SUBJOINED LETTER LAMn0NSMN11857SOYOMBO SUBJOINED LETTER SHAMn0NSMN11858SOYOMBO SUBJOINED LETTER SAMn0NSMN11859SOYOMBO SUBJOINED LETTER HAMn0NSMN1185ASOYOMBO SUBJOINED LETTER GALIG GAMn0NSMN1185BSOYOMBO SUBJOINED LETTER GALIG GHAMn0NSMN1185CSOYOMBO SUBJOINED LETTER GALIG JAMn0NSMN1185DSOYOMBO SUBJOINED LETTER GALIG JHAMn0NSMN1185ESOYOMBO SUBJOINED LETTER GALIG TTAMn0NSMN1185FSOYOMBO SUBJOINED LETTER GALIG TTHAMn0NSMN11860SOYOMBO SUBJOINED LETTER GALIG DDAMn0NSMN11861SOYOMBO SUBJOINED LETTER GALIG DDHAMn0NSMN11862SOYOMBO SUBJOINED LETTER GALIG NNAMn0NSMN11863SOYOMBO SUBJOINED LETTER GALIG 
DAMn0NSMN11864SOYOMBO SUBJOINED LETTER GALIG DHAMn0NSMN11865SOYOMBO SUBJOINED LETTER GALIG BAMn0NSMN11866SOYOMBO SUBJOINED LETTER GALIG BHAMn0NSMN11867SOYOMBO SUBJOINED LETTER GALIG SSAMn0NSMN11868SOYOMBO SUBJOINED LETTER GALIG TSAMn0NSMN11869SOYOMBO SUBJOINED LETTER GALIG TSHAMn0NSMN1186ASOYOMBO SUBJOINED LETTER GALIG DZAMn0NSMN1186BSOYOMBO SUBJOINED LETTER GALIG ZHAMn0NSMN1186CSOYOMBO SUBJOINED LETTER GALIG ZAMn0NSMN

10

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

1186DSOYOMBO SUBJOINED LETTER GALIG SMALL AMn0NSMN1186ESOYOMBO DANDAPo0LN1186FSOYOMBO DOUBLE DANDAPo0LN11870SOYOMBO TSHEGPo0LN11871SOYOMBO HEAD MARKPo0ONN11872SOYOMBO TERMINAL MARKPo0ONN11873SOYOMBO SYMBOL SVAYAMBHUSo0LN
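
The property records can be consumed programmatically. The sketch below assumes that each record is serialized as a semicolon-separated line in the field order of UnicodeData.txt (code point, name, general category, combining class, bidirectional class, four unused fields, mirrored flag, and so on); `CharRecord` and `parse_record` are hypothetical names.

```python
from typing import NamedTuple


class CharRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: str


def parse_record(line: str) -> CharRecord:
    """Parse one UnicodeData.txt-style line into the fields used here."""
    fields = line.strip().split(";")
    return CharRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
        mirrored=fields[9],  # tenth field holds the mirrored flag
    )


if __name__ == "__main__":
    rec = parse_record("11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;")
    print(f"{rec.code_point:04X}", rec.name, rec.general_category)
```

Only the fields that carry values in this proposal are extracted; the empty decomposition, numeric, and case-mapping fields are ignored.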

6.2 Linebreaking Properties

Linebreaking properties, given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
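
As a rough illustration of how this data might be used, the following Python sketch parses LineBreak.txt-style lines into ranges and looks up the class of a code point. The ranges are copied from the listing in this section; the helper names `parse_ranges` and `line_break_class`, and the choice of `XX` as the fallback for code points outside the listed ranges, are assumptions of the sketch.

```python
LINE_BREAK_DATA = """\
11800;AL         # LETTER A
11801..11809;CM  # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL  # GA..HA
1181D..11828;CM  # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL  # GALIG GA..KSSA
1183E..11842;CM  # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM  # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA  # DANDA..DOUBLE DANDA
11870;BA         # TSHEG
11871;BB         # HEAD MARK
11872;BA         # TERMINAL MARK
11873;AL         # SYMBOL SVAYAMBHU
"""


def parse_ranges(text: str) -> list[tuple[int, int, str]]:
    """Parse 'XXXX;CL' and 'XXXX..YYYY;CL' lines, ignoring '#' comments."""
    ranges = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        span, cls = line.split(";")
        start, _, end = span.partition("..")
        ranges.append((int(start, 16), int(end or start, 16), cls.strip()))
    return ranges


def line_break_class(cp: int, ranges: list[tuple[int, int, str]],
                     default: str = "XX") -> str:
    """Return the line-break class for cp, or the default if not listed."""
    for lo, hi, cls in ranges:
        if lo <= cp <= hi:
            return cls
    return default


RANGES = parse_ranges(LINE_BREAK_DATA)

if __name__ == "__main__":
    for cp in (0x11800, 0x11805, 0x11870, 0x11871):
        print(f"{cp:04X}", line_break_class(cp, RANGES))
```

A linear scan is sufficient for a dozen ranges; a larger table would normally be searched with bisection.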

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав) [title also given in Soyombo script]. Ed. by T. Bulgan. Ulaanbaatar: Shambala Association; System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”. v. 0.9, September 2, 1996. httpuserpagefu-berlinde~corffimSoyombooverviewSoyombohtml

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie” (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. http://www.viahistoria.com/SilverHorde/main.html?research=MongolScripts.html

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

[Code chart: Soyombo, 11800–1187F. Columns 1180–1187, rows 0–F; code points 11800–11873 are assigned. Glyph images are not reproduced. Printed using UniBook™ (http://www.unicode.org/unibook/), 28-Oct-2011.]

Figure 1: Proposed code chart for Soyombo.

Soyombo, 11800–11866

Vowel carrier
11800 SOYOMBO LETTER A

Vowel signs
11801 SOYOMBO VOWEL SIGN I
11802 SOYOMBO VOWEL SIGN UE
11803 SOYOMBO VOWEL SIGN U
11804 SOYOMBO VOWEL SIGN E
11805 SOYOMBO VOWEL SIGN O
11806 SOYOMBO VOWEL SIGN OE
11807 SOYOMBO VOWEL SIGN AU
11808 SOYOMBO VOWEL SIGN AI

Vowel length mark
11809 SOYOMBO VOWEL LENGTH MARK

Consonants
1180A SOYOMBO LETTER GA
  • Used for Sanskrit ka
1180B SOYOMBO LETTER KA
  • Used for Sanskrit kha
1180C SOYOMBO LETTER NGA
1180D SOYOMBO LETTER JA
  • Used for Sanskrit ca
1180E SOYOMBO LETTER CA
  • Used for Sanskrit cha
1180F SOYOMBO LETTER NYA
11810 SOYOMBO LETTER DA
  • Used for Sanskrit ta
11811 SOYOMBO LETTER TA
  • Used for Sanskrit tha
11812 SOYOMBO LETTER NA
11813 SOYOMBO LETTER BA
  • Used for Sanskrit pa
11814 SOYOMBO LETTER PA
  • Used for Sanskrit pha
11815 SOYOMBO LETTER MA
11816 SOYOMBO LETTER YA
11817 SOYOMBO LETTER RA
11818 SOYOMBO LETTER VA
11819 SOYOMBO LETTER LA
1181A SOYOMBO LETTER SHA
1181B SOYOMBO LETTER SA
1181C SOYOMBO LETTER HA

Final consonant signs
1181D SOYOMBO CONSONANT SIGN G
1181E SOYOMBO CONSONANT SIGN K
1181F SOYOMBO CONSONANT SIGN NG
11820 SOYOMBO CONSONANT SIGN D
11821 SOYOMBO CONSONANT SIGN N
11822 SOYOMBO CONSONANT SIGN B
11823 SOYOMBO CONSONANT SIGN M
11824 SOYOMBO CONSONANT SIGN R
11825 SOYOMBO CONSONANT SIGN L
11826 SOYOMBO CONSONANT SIGN SH
11827 SOYOMBO CONSONANT SIGN S
11828 SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829 SOYOMBO LETTER GALIG GA
1182A SOYOMBO LETTER GALIG GHA
1182B SOYOMBO LETTER GALIG JA
1182C SOYOMBO LETTER GALIG JHA
1182D SOYOMBO LETTER GALIG TTA
1182E SOYOMBO LETTER GALIG TTHA
1182F SOYOMBO LETTER GALIG DDA
11830 SOYOMBO LETTER GALIG DDHA
11831 SOYOMBO LETTER GALIG NNA
11832 SOYOMBO LETTER GALIG DA
11833 SOYOMBO LETTER GALIG DHA
11834 SOYOMBO LETTER GALIG BA
11835 SOYOMBO LETTER GALIG BHA
11836 SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837 SOYOMBO LETTER GALIG TSA
11838 SOYOMBO LETTER GALIG TSHA
11839 SOYOMBO LETTER GALIG DZA
1183A SOYOMBO LETTER GALIG ZHA
1183B SOYOMBO LETTER GALIG ZA
1183C SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E SOYOMBO SIGN VOCALIC R
1183F SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840 SOYOMBO SIGN ANUSVARA
11841 SOYOMBO SIGN VISARGA

Gemination mark
11842 SOYOMBO GEMINATION MARK

Prefixed letters
11843 SOYOMBO PREFIXED LETTER LA
11844 SOYOMBO PREFIXED LETTER SHA
11845 SOYOMBO PREFIXED LETTER SA
11846 SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847 SOYOMBO SUBJOINED LETTER GA
11848 SOYOMBO SUBJOINED LETTER KA
11849 SOYOMBO SUBJOINED LETTER NGA
1184A SOYOMBO SUBJOINED LETTER JA
1184B SOYOMBO SUBJOINED LETTER CA
1184C SOYOMBO SUBJOINED LETTER NYA
1184D SOYOMBO SUBJOINED LETTER DA
1184E SOYOMBO SUBJOINED LETTER TA
1184F SOYOMBO SUBJOINED LETTER NA
11850 SOYOMBO SUBJOINED LETTER BA
11851 SOYOMBO SUBJOINED LETTER PA
11852 SOYOMBO SUBJOINED LETTER MA
11853 SOYOMBO SUBJOINED LETTER YA
11854 SOYOMBO SUBJOINED LETTER RA
11855 SOYOMBO SUBJOINED LETTER VA
11856 SOYOMBO SUBJOINED LETTER LA
11857 SOYOMBO SUBJOINED LETTER SHA
11858 SOYOMBO SUBJOINED LETTER SA
11859 SOYOMBO SUBJOINED LETTER HA
1185A SOYOMBO SUBJOINED LETTER GALIG GA
1185B SOYOMBO SUBJOINED LETTER GALIG GHA
1185C SOYOMBO SUBJOINED LETTER GALIG JA
1185D SOYOMBO SUBJOINED LETTER GALIG JHA
1185E SOYOMBO SUBJOINED LETTER GALIG TTA
1185F SOYOMBO SUBJOINED LETTER GALIG TTHA
11860 SOYOMBO SUBJOINED LETTER GALIG DDA
11861 SOYOMBO SUBJOINED LETTER GALIG DDHA
11862 SOYOMBO SUBJOINED LETTER GALIG NNA
11863 SOYOMBO SUBJOINED LETTER GALIG DA
11864 SOYOMBO SUBJOINED LETTER GALIG DHA
11865 SOYOMBO SUBJOINED LETTER GALIG BA
11866 SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3).

Soyombo, 11867–11873

Subjoined letters (continued)
11867 SOYOMBO SUBJOINED LETTER GALIG SSA
11868 SOYOMBO SUBJOINED LETTER GALIG TSA
11869 SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A SOYOMBO SUBJOINED LETTER GALIG DZA
1186B SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C SOYOMBO SUBJOINED LETTER GALIG ZA
1186D SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E SOYOMBO DANDA
1186F SOYOMBO DOUBLE DANDA
11870 SOYOMBO TSHEG
  → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871 SOYOMBO HEAD MARK
11872 SOYOMBO TERMINAL MARK

Symbol
11873 SOYOMBO SYMBOL SVAYAMBHU
  • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2).

[Diagram elements: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996).

[Diagram elements: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo.

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: Combining vowel and final-consonant signs
   Is a list
of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference 12 Does the proposal contain characters with any special properties such as control function or similar semantics No If YES describe in detail (include attachment if necessary) 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference


Proposal to Encode the Soyombo Script in ISO/IEC 10646    Anshuman Pandey

The subjoined form is the base element of a letter. The glyphic representation of subjoined letters is discussed further in Section 5.5.

3.6.3 Independent conjunct

The conjunct U+1183D SOYOMBO LETTER KSSA (kṣa) is used for writing the Sanskrit cluster kṣa <क्ष, U+0915 U+094D U+0937>. It is proposed for independent encoding as SOYOMBO LETTER KSSA. It is different from other conjuncts because it cannot be produced using a subjoined letter: the shape of U+11867 SOYOMBO SUBJOINED LETTER GALIG SSA does not correspond to any element in the graphical structure of KSSA.

3.6.4 Atomic ligatures in script charts

The following conjuncts are enumerated in charts of Soyombo:

kṣa (U+1183D), kya, kra, kla, kva, kka, nka, ñca, nta, nta, mpa, lka, śka, ska, rka

They are not atomic characters but conjuncts. Moreover, they are not the only conjuncts used in Soyombo; others are shown in manuscripts. These conjuncts are likely shown in charts for purposes of illustrating the types of conjuncts that may be written.

3.7 Various Signs

• U+11840 SOYOMBO SIGN ANUSVARA: Indicates nasalization in words of Sanskrit origin, e.g. om <U+11800, U+11805, U+11840>. It corresponds to U+0F7E.

• U+11841 SOYOMBO SIGN VISARGA: Represents an allophone of r or s at word-final position in Devanagari orthography for Sanskrit. It is used in Soyombo for writing Sanskrit, and it corresponds to ཿ U+0F7F.

3.8 Punctuation

• U+1186E SOYOMBO DANDA and U+1186F SOYOMBO DOUBLE DANDA: These characters are used for marking the ends of sentences and longer sections. They correspond to । U+0964 and ॥ U+0965, respectively.

• U+11870 SOYOMBO TSHEG: This character is used for marking the end of a syllable. It corresponds to ་ U+0F0B.

3.9 Head Marks

There are two 'head marks': U+11871 SOYOMBO HEAD MARK and U+11872 SOYOMBO TERMINAL MARK. These characters are used for indicating the beginning and end of a section of text.

3.10 Symbols

The U+11873 SOYOMBO SYMBOL SVAYAMBHU was designed by Zanabazar, the creator of the script. It is an officially recognized symbol in Mongolia and appears on the flag and coat of arms of the country (see Figures 34 and 35). The symbol is included as a character of the script, but it is not part of the orthography of written text.

4 Language-Specific Orthography

4.1 Tibetan

The representation of consonant clusters and syllable-final consonants in Tibetan words follows the orthography for Tibetan. For example, a word-initial bare consonant may be written using the regular letter instead of forming a conjunct with the following consonant. Also, a syllable-final consonant may be represented by the regular letter instead of the final-consonant sign. The word མཆོག mchog 'supreme' is written as <U+11815 MA, U+1180E CA, U+11805 VOWEL SIGN O, U+1180A GA, U+11870 TSHEG>, using the regular U+1180A LETTER GA instead of U+1181D CONSONANT SIGN G. The U+11870 TSHEG is added after it to mark the end of the syllable. The expected representation of mchog is <U+11815 MA, U+1184B SUBJOINED CA, U+11805 VOWEL SIGN O, U+1181D CONSONANT SIGN G>.
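The two spellings of mchog can be set side by side as code point sequences. The following is only an illustrative sketch: it assumes the provisional code points given in this proposal, and the `NAMES` table and the `describe` helper are hypothetical names introduced for the example.

```python
# Provisional code points as proposed in this document (assumption: a final
# assignment may differ), so plain integers are used instead of chr().
NAMES = {
    0x11815: "LETTER MA",
    0x1180E: "LETTER CA",
    0x1184B: "SUBJOINED LETTER CA",
    0x11805: "VOWEL SIGN O",
    0x1180A: "LETTER GA",
    0x1181D: "CONSONANT SIGN G",
    0x11870: "TSHEG",
}

# Spelling that follows Tibetan orthography: regular letters plus a tsheg.
TIBETAN_STYLE = [0x11815, 0x1180E, 0x11805, 0x1180A, 0x11870]

# Expected Soyombo spelling: conjunct plus final-consonant sign.
EXPECTED = [0x11815, 0x1184B, 0x11805, 0x1181D]


def describe(seq):
    """Return 'HEX NAME' labels for a sequence of code points."""
    return [f"{cp:04X} {NAMES[cp]}" for cp in seq]
```

Calling `describe(TIBETAN_STYLE)` and `describe(EXPECTED)` makes the difference visible: the regular CA and GA in the first sequence are replaced by SUBJOINED LETTER CA and CONSONANT SIGN G in the second, and the tsheg is dropped.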

5 Rendering and Shaping Behaviors

5.1 Vertical Text

Soyombo may be written vertically, from top to bottom and left to right. For example, the text segment dzürka, as shown in the seal in Figure 27, is written as dzür <U+11839 GALIG DZA, U+11802 VOWEL SIGN UE, U+11824 CONSONANT SIGN R> followed by ka <U+1180B KA>.

In a vertical environment, Soyombo orthography departs from the general rules for writing syllables. The syllables of a word may be written using subjoined letters, as if forming a conjunct. The example below shows the word temedeg 'symbol' written vertically. Its three syllables have the same vowel, which is represented using a single vowel sign above the base letter; the consonants of non-initial syllables are written using subjoined letters, while the final consonant is written using a final-consonant sign.

Regular form: <U+11811 TA, U+11804 VOWEL SIGN E, U+11815 MA, U+11804 VOWEL SIGN E, U+11810 DA, U+11804 VOWEL SIGN E, U+1181D CONSONANT SIGN G>
Vertical form: <U+11811 TA, U+11804 VOWEL SIGN E, U+11852 SUBJOINED MA, U+1184D SUBJOINED DA, U+1181D CONSONANT SIGN G>

A similar practice is observed in the vertical form of words comprised of syllables that contain different vowels. The example below shows the word yenu. Its two syllables have different vowels, which are written above and below the base; the consonant of the second syllable is written as a subjoined form.

Regular form: <U+11816 YA, U+11804 VOWEL SIGN E, U+11812 NA, U+11803 VOWEL SIGN U>
Vertical form: <U+11816 YA, U+11804 VOWEL SIGN E, U+1184F SUBJOINED NA, U+11803 VOWEL SIGN U>

It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if the vowels were different and written in the same position, in order to prevent clashing.
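The pattern seen in the temedeg and yenu examples can be sketched as a small transformation over code points. This is an assumption-laden illustration, not a rule stated by the proposal: the function names are hypothetical, the ranges are read off the proposed names list, and the sketch does not check for vowel signs that would clash in the same position.

```python
# Provisional code points from this proposal; integers are used instead of
# chr() because the final assignment may differ.
LETTERS = range(0x1180A, 0x1181D)        # LETTER GA .. LETTER HA
GALIG_LETTERS = range(0x11829, 0x1183D)  # GALIG GA .. GALIG SMALL A
VOWEL_SIGNS = range(0x11801, 0x11809)    # VOWEL SIGN I .. VOWEL SIGN AI


def to_subjoined(cp):
    """Map a regular letter to the subjoined letter at the matching offset."""
    if cp in LETTERS:
        return cp + 0x3D  # 1180A GA -> 11847 SUBJOINED GA
    if cp in GALIG_LETTERS:
        return cp + 0x31  # 11829 GALIG GA -> 1185A SUBJOINED GALIG GA
    raise ValueError(f"no subjoined form for {cp:04X}")


def vertical_form(seq):
    """Stack a word on its first letter: later consonant letters become
    subjoined letters and a repeated vowel sign is written only once."""
    out = []
    seen_vowels = set()
    have_base = False
    for cp in seq:
        if cp in LETTERS or cp in GALIG_LETTERS:
            out.append(to_subjoined(cp) if have_base else cp)
            have_base = True
        elif cp in VOWEL_SIGNS:
            if cp not in seen_vowels:
                seen_vowels.add(cp)
                out.append(cp)
        else:
            out.append(cp)  # final-consonant signs and other marks pass through
    return out
```

Applied to temedeg, `vertical_form([0x11811, 0x11804, 0x11815, 0x11804, 0x11810, 0x11804, 0x1181D])` yields the sequence with SUBJOINED MA and SUBJOINED DA and a single VOWEL SIGN E, matching the vertical form described above.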

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs: U+11802 VOWEL SIGN UE and U+11803 VOWEL SIGN U.

• Shaping and positioning of -ü: U+1180B KA + U+11802 VOWEL SIGN UE + U+1181F CONSONANT SIGN NG → küng, with the vowel sign resized and repositioned

• Shaping and positioning of -u: U+1180B KA + U+11803 VOWEL SIGN U + U+1181F CONSONANT SIGN NG → kung, with the vowel sign resized and repositioned
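A renderer would need to detect where this adjustment applies. The sketch below shows one possible check, assuming the provisional code points of this proposal; the set and function names are hypothetical, and real shaping would be done in font tables rather than in application code.

```python
# Provisional code points from this proposal (assumption).
BASE_POSITION_VOWELS = {0x11802, 0x11803}        # VOWEL SIGN UE, VOWEL SIGN U
FINAL_CONSONANT_SIGNS = range(0x1181D, 0x11829)  # CONSONANT SIGN G .. ANG


def adjusted_vowel_positions(seq):
    """Return indexes of base-position vowel signs that are directly
    followed by a final-consonant sign and so need the modified glyph."""
    return [
        i
        for i in range(len(seq) - 1)
        if seq[i] in BASE_POSITION_VOWELS and seq[i + 1] in FINAL_CONSONANT_SIGNS
    ]
```

For küng, `adjusted_vowel_positions([0x1180B, 0x11802, 0x1181F])` reports index 1, while a syllable without a final-consonant sign reports nothing.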

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the gemination sign (U+11842) within the normal x-height; compare [glyph] with [glyph].

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <U+1180A GA, U+11853 SUBJOINED YA>, the regular shape of U+1180A GA is altered to [glyph], where the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C NGA is written with U+1180A GA, U+1180B KA, U+11829 GALIG GA, or U+1182A GALIG GHA. For instance, in writing nga, the regular shape of U+1180C NGA changes to [glyph] before [glyph] is attached. A further modification is the break in the frame that occurs in conjuncts involving [glyph].

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters: <U+11829 GALIG GA, U+11854 SUBJOINED RA, U+11855 SUBJOINED VA>. The position of the second subjoined letter would straddle the baseline or fall below it. In such cases the stem of the base-letter frame would be extended.

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. a stem-bearing variant glyph for U+11855 SUBJOINED VA, etc.
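One way a shaping engine might decide when to select such stem-bearing glyphs is to count the subjoined letters stacked on a base. The sketch below is hypothetical throughout: the function names are invented, the code points are the provisional ones from this proposal, and the threshold of two subjoined letters is an assumption drawn only from the grva example. Prefixed letters are not handled.

```python
# Provisional code points from this proposal (assumption).
BASE_LETTERS = set(range(0x1180A, 0x1181D)) | set(range(0x11829, 0x1183E))
SUBJOINED_LETTERS = set(range(0x11847, 0x1186E))  # SUBJOINED GA .. GALIG SMALL A


def split_conjunct(seq):
    """Split a leading conjunct into (base, subjoined letters).

    Returns (None, []) if the sequence does not start with a base letter.
    """
    if not seq or seq[0] not in BASE_LETTERS:
        return None, []
    subjoined = []
    for cp in seq[1:]:
        if cp not in SUBJOINED_LETTERS:
            break
        subjoined.append(cp)
    return seq[0], subjoined


def wants_extended_stem(seq):
    """Assumed heuristic: two or more subjoined letters need a longer stem."""
    _, subjoined = split_conjunct(seq)
    return len(subjoined) >= 2
```

With this heuristic, grva `[0x11829, 0x11854, 0x11855]` asks for the extended stem and kya `[0x1180A, 0x11853]` does not.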

6 Character Data

61 Character Properties

11800SOYOMBO LETTER ALo0LN11801SOYOMBO VOWEL SIGN IMn0NSMN11802SOYOMBO VOWEL SIGN UMn0NSMN11803SOYOMBO VOWEL SIGN UEMn0NSMN11804SOYOMBO VOWEL SIGN EMn0NSMN11805SOYOMBO VOWEL SIGN OMn0NSMN11806SOYOMBO VOWEL SIGN OEMn0NSMN11807SOYOMBO VOWEL SIGN AUMn0NSMN11808SOYOMBO VOWEL SIGN AIMn0NSMN11809SOYOMBO VOWEL LENGTH MARKMn0NSMN1180ASOYOMBO LETTER GALo0LN1180BSOYOMBO LETTER KALo0LN1180CSOYOMBO LETTER NGALo0LN1180DSOYOMBO LETTER JALo0LN1180ESOYOMBO LETTER CALo0LN1180FSOYOMBO LETTER NYALo0LN11810SOYOMBO LETTER DALo0LN11811SOYOMBO LETTER TALo0LN11812SOYOMBO LETTER NALo0LN11813SOYOMBO LETTER BALo0LN11814SOYOMBO LETTER PALo0LN11815SOYOMBO LETTER MALo0LN11816SOYOMBO LETTER YALo0LN11817SOYOMBO LETTER RALo0LN11818SOYOMBO LETTER VALo0LN11819SOYOMBO LETTER LALo0LN1181ASOYOMBO LETTER SHALo0LN1181BSOYOMBO LETTER SALo0LN1181CSOYOMBO LETTER HALo0LN1181DSOYOMBO CONSONANT SIGN GMn0NSMN1181ESOYOMBO CONSONANT SIGN KMn0NSMN1181FSOYOMBO CONSONANT SIGN NGMn0NSMN11820SOYOMBO CONSONANT SIGN DMn0NSMN11821SOYOMBO CONSONANT SIGN NMn0NSMN11822SOYOMBO CONSONANT SIGN BMn0NSMN11823SOYOMBO CONSONANT SIGN MMn0NSMN11824SOYOMBO CONSONANT SIGN RMn0NSMN11825SOYOMBO CONSONANT SIGN LMn0NSMN11826SOYOMBO CONSONANT SIGN SHMn0NSMN11827SOYOMBO CONSONANT SIGN SMn0NSMN11828SOYOMBO CONSONANT SIGN ANGMn0NSMN11829SOYOMBO LETTER GALIG GALo0LN1182ASOYOMBO LETTER GALIG GHALo0LN1182BSOYOMBO LETTER GALIG JALo0LN1182CSOYOMBO LETTER GALIG JHALo0LN1182DSOYOMBO LETTER GALIG TTALo0LN1182ESOYOMBO LETTER GALIG TTHALo0LN1182FSOYOMBO LETTER GALIG DDALo0LN

9

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

11830SOYOMBO LETTER GALIG DDHALo0LN11831SOYOMBO LETTER GALIG NNALo0LN11832SOYOMBO LETTER GALIG DALo0LN11833SOYOMBO LETTER GALIG DHALo0LN11834SOYOMBO LETTER GALIG BALo0LN11835SOYOMBO LETTER GALIG BHALo0LN11836SOYOMBO LETTER GALIG SSALo0LN11837SOYOMBO LETTER GALIG TSALo0LN11838SOYOMBO LETTER GALIG TSHALo0LN11839SOYOMBO LETTER GALIG DZALo0LN1183ASOYOMBO LETTER GALIG ZHALo0LN1183BSOYOMBO LETTER GALIG ZALo0LN1183CSOYOMBO LETTER GALIG SMALL ALo0LN1183DSOYOMBO LETTER KSSALo0LN1183ESOYOMBO VOWEL SIGN VOCALIC RMn0NSMN1183FSOYOMBO VOWEL SIGN VOCALIC LMn0NSMN11840SOYOMBO SIGN ANUSVARAMn0NSMN11841SOYOMBO SIGN VISARGAMc0LN11842SOYOMBO GEMINATION SIGNMn0NSMN11843SOYOMBO PREFIXED LETTER LAMn0NSMN11844SOYOMBO PREFIXED LETTER SHAMn0NSMN11845SOYOMBO PREFIXED LETTER SAMn0NSMN11846SOYOMBO PREFIXED LETTER RAMn0NSMN11847SOYOMBO SUBJOINED LETTER GAMn0NSMN11848SOYOMBO SUBJOINED LETTER KAMn0NSMN11849SOYOMBO SUBJOINED LETTER NGAMn0NSMN1184ASOYOMBO SUBJOINED LETTER JAMn0NSMN1184BSOYOMBO SUBJOINED LETTER CAMn0NSMN1184CSOYOMBO SUBJOINED LETTER NYAMn0NSMN1184DSOYOMBO SUBJOINED LETTER DAMn0NSMN1184ESOYOMBO SUBJOINED LETTER TAMn0NSMN1184FSOYOMBO SUBJOINED LETTER NAMn0NSMN11850SOYOMBO SUBJOINED LETTER BAMn0NSMN11851SOYOMBO SUBJOINED LETTER PAMn0NSMN11852SOYOMBO SUBJOINED LETTER MAMn0NSMN11853SOYOMBO SUBJOINED LETTER YAMn0NSMN11854SOYOMBO SUBJOINED LETTER RAMn0NSMN11855SOYOMBO SUBJOINED LETTER VAMn0NSMN11856SOYOMBO SUBJOINED LETTER LAMn0NSMN11857SOYOMBO SUBJOINED LETTER SHAMn0NSMN11858SOYOMBO SUBJOINED LETTER SAMn0NSMN11859SOYOMBO SUBJOINED LETTER HAMn0NSMN1185ASOYOMBO SUBJOINED LETTER GALIG GAMn0NSMN1185BSOYOMBO SUBJOINED LETTER GALIG GHAMn0NSMN1185CSOYOMBO SUBJOINED LETTER GALIG JAMn0NSMN1185DSOYOMBO SUBJOINED LETTER GALIG JHAMn0NSMN1185ESOYOMBO SUBJOINED LETTER GALIG TTAMn0NSMN1185FSOYOMBO SUBJOINED LETTER GALIG TTHAMn0NSMN11860SOYOMBO SUBJOINED LETTER GALIG DDAMn0NSMN11861SOYOMBO SUBJOINED LETTER GALIG DDHAMn0NSMN11862SOYOMBO SUBJOINED LETTER GALIG NNAMn0NSMN11863SOYOMBO SUBJOINED LETTER GALIG 
DAMn0NSMN11864SOYOMBO SUBJOINED LETTER GALIG DHAMn0NSMN11865SOYOMBO SUBJOINED LETTER GALIG BAMn0NSMN11866SOYOMBO SUBJOINED LETTER GALIG BHAMn0NSMN11867SOYOMBO SUBJOINED LETTER GALIG SSAMn0NSMN11868SOYOMBO SUBJOINED LETTER GALIG TSAMn0NSMN11869SOYOMBO SUBJOINED LETTER GALIG TSHAMn0NSMN1186ASOYOMBO SUBJOINED LETTER GALIG DZAMn0NSMN1186BSOYOMBO SUBJOINED LETTER GALIG ZHAMn0NSMN1186CSOYOMBO SUBJOINED LETTER GALIG ZAMn0NSMN

10

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

1186DSOYOMBO SUBJOINED LETTER GALIG SMALL AMn0NSMN1186ESOYOMBO DANDAPo0LN1186FSOYOMBO DOUBLE DANDAPo0LN11870SOYOMBO TSHEGPo0LN11871SOYOMBO HEAD MARKPo0ONN11872SOYOMBO TERMINAL MARKPo0ONN11873SOYOMBO SYMBOL SVAYAMBHUSo0LN
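Property records of this kind are usually consumed by a small parser. The following sketch assumes the records are in the semicolon-delimited field layout of UnicodeData.txt (code point, name, general category, combining class, bidi class, and so on); the `parse_records` function and the four-line `SAMPLE` are illustrative names and data chosen for the example.

```python
# A few records in the assumed UnicodeData.txt layout (15 fields each).
SAMPLE = """\
11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
"""


def parse_records(text):
    """Parse UnicodeData-style lines into a dict keyed by code point."""
    records = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        fields = line.split(";")
        records[int(fields[0], 16)] = {
            "name": fields[1],
            "gc": fields[2],            # general category
            "ccc": int(fields[3]),      # canonical combining class
            "bidi": fields[4],          # bidirectional class
            "mirrored": fields[9] == "Y",
        }
    return records


RECORDS = parse_records(SAMPLE)
# Code points whose general category is a mark (Mn or Mc).
MARKS = sorted(cp for cp, rec in RECORDS.items() if rec["gc"].startswith("M"))
```

The same function can be fed the full property list to separate base letters (Lo) from combining marks, which is the distinction the shaping rules in Section 5 rely on.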

6.2 Linebreaking Properties

Linebreaking properties, given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
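To show how these assignments would be looked up, the sketch below parses range lines in the LineBreak.txt style (`start..end;CLASS # comment`) and returns the class for a code point. The function names are hypothetical, the code points are the provisional ones from this proposal, and `XX` (the Unicode "unknown" class) is used as an assumed default for unlisted code points.

```python
LINE_BREAK_DATA = """\
11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
"""


def parse_line_break(text):
    """Parse LineBreak.txt-style lines into (start, end, class) tuples."""
    table = []
    for line in text.splitlines():
        data = line.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue
        code_points, cls = data.split(";")
        first, _, last = code_points.partition("..")
        start = int(first, 16)
        end = int(last, 16) if last else start
        table.append((start, end, cls.strip()))
    return table


def line_break_class(cp, table, default="XX"):
    """Return the line break class for cp, or the default if unlisted."""
    for start, end, cls in table:
        if start <= cp <= end:
            return cls
    return default


TABLE = parse_line_break(LINE_BREAK_DATA)
```

Under these assignments a break opportunity follows the danda, double danda, tsheg, and terminal mark (BA), and precedes the head mark (BB), while combining signs (CM) stay attached to their base.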

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав). Soyombo title: <U+11871 U+1181B U+11855 U+11816 U+11840 U+11835 U+11802, U+11800 U+11801 U+11809 U+11821, U+1180A U+11801 U+1180A U+11803 U+1180E, U+11813 U+11800 U+11801 U+11812 U+11804 U+11824 U+1180B U+11804 U+11811 U+11801 U+1181E U+11872>. Ed. by T. Bulgan. Ulaanbaatar: Shambala Association; System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. "Soyombo for LaTeX", v. 0.9. September 2, 1996. httpuserpagefu-berlinde ~corffimSoyombooverviewSoyombohtml

"Histoire du livre". 2010. "Histoire du livre et histoire de l'écriture: l'écriture, entre pratique, symbole et économie" (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. "Mongol Scripts". http://www.viahistoria.com/SilverHorde/main.html?research=MongolScripts.html

Mongolwiki. 2008. "Сайн чанарын зураг байхгүй байна". http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. "Addition of Soyombo Script". N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. "Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646". N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. "Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646". N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. "Determining the Encoding Model for Soyombo Vowels". N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. "Revised Preliminary Proposal to Encode Soyombo in the UCS". N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. "Soyombo and Pagba (old Mongol scripts)". N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. "1000 Tugriks - Recto". http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. "Flag of Mongolia". http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. "Coat of Arms of Mongolia". http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. "Green Tara poem typed in Soyombo script". http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

[Code chart: Soyombo, 11800–1187F; columns 1180–1187, rows 0–F; code points 11800–11873 assigned. Printed using UniBook™ (http://www.unicode.org/unibook/). Date: 28-Oct-2011.]

Figure 1: Proposed code chart for Soyombo

Soyombo names list, 11800–11866

Vowel carrier
11800 SOYOMBO LETTER A

Vowel signs
11801 SOYOMBO VOWEL SIGN I
11802 SOYOMBO VOWEL SIGN UE
11803 SOYOMBO VOWEL SIGN U
11804 SOYOMBO VOWEL SIGN E
11805 SOYOMBO VOWEL SIGN O
11806 SOYOMBO VOWEL SIGN OE
11807 SOYOMBO VOWEL SIGN AU
11808 SOYOMBO VOWEL SIGN AI

Vowel length mark
11809 SOYOMBO VOWEL LENGTH MARK

Consonants
1180A SOYOMBO LETTER GA
    • Used for Sanskrit ka
1180B SOYOMBO LETTER KA
    • Used for Sanskrit kha
1180C SOYOMBO LETTER NGA
1180D SOYOMBO LETTER JA
    • Used for Sanskrit ca
1180E SOYOMBO LETTER CA
    • Used for Sanskrit cha
1180F SOYOMBO LETTER NYA
11810 SOYOMBO LETTER DA
    • Used for Sanskrit ta
11811 SOYOMBO LETTER TA
    • Used for Sanskrit tha
11812 SOYOMBO LETTER NA
11813 SOYOMBO LETTER BA
    • Used for Sanskrit pa
11814 SOYOMBO LETTER PA
    • Used for Sanskrit pha
11815 SOYOMBO LETTER MA
11816 SOYOMBO LETTER YA
11817 SOYOMBO LETTER RA
11818 SOYOMBO LETTER VA
11819 SOYOMBO LETTER LA
1181A SOYOMBO LETTER SHA
1181B SOYOMBO LETTER SA
1181C SOYOMBO LETTER HA

Final consonant signs
1181D SOYOMBO CONSONANT SIGN G
1181E SOYOMBO CONSONANT SIGN K
1181F SOYOMBO CONSONANT SIGN NG
11820 SOYOMBO CONSONANT SIGN D
11821 SOYOMBO CONSONANT SIGN N
11822 SOYOMBO CONSONANT SIGN B
11823 SOYOMBO CONSONANT SIGN M
11824 SOYOMBO CONSONANT SIGN R
11825 SOYOMBO CONSONANT SIGN L
11826 SOYOMBO CONSONANT SIGN SH
11827 SOYOMBO CONSONANT SIGN S
11828 SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829 SOYOMBO LETTER GALIG GA
1182A SOYOMBO LETTER GALIG GHA
1182B SOYOMBO LETTER GALIG JA
1182C SOYOMBO LETTER GALIG JHA
1182D SOYOMBO LETTER GALIG TTA
1182E SOYOMBO LETTER GALIG TTHA
1182F SOYOMBO LETTER GALIG DDA
11830 SOYOMBO LETTER GALIG DDHA
11831 SOYOMBO LETTER GALIG NNA
11832 SOYOMBO LETTER GALIG DA
11833 SOYOMBO LETTER GALIG DHA
11834 SOYOMBO LETTER GALIG BA
11835 SOYOMBO LETTER GALIG BHA
11836 SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837 SOYOMBO LETTER GALIG TSA
11838 SOYOMBO LETTER GALIG TSHA
11839 SOYOMBO LETTER GALIG DZA
1183A SOYOMBO LETTER GALIG ZHA
1183B SOYOMBO LETTER GALIG ZA
1183C SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E SOYOMBO SIGN VOCALIC R
1183F SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840 SOYOMBO SIGN ANUSVARA
11841 SOYOMBO SIGN VISARGA

Gemination mark
11842 SOYOMBO GEMINATION MARK

Prefixed letters
11843 SOYOMBO PREFIXED LETTER LA
11844 SOYOMBO PREFIXED LETTER SHA
11845 SOYOMBO PREFIXED LETTER SA
11846 SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847 SOYOMBO SUBJOINED LETTER GA
11848 SOYOMBO SUBJOINED LETTER KA
11849 SOYOMBO SUBJOINED LETTER NGA
1184A SOYOMBO SUBJOINED LETTER JA
1184B SOYOMBO SUBJOINED LETTER CA
1184C SOYOMBO SUBJOINED LETTER NYA
1184D SOYOMBO SUBJOINED LETTER DA
1184E SOYOMBO SUBJOINED LETTER TA
1184F SOYOMBO SUBJOINED LETTER NA
11850 SOYOMBO SUBJOINED LETTER BA
11851 SOYOMBO SUBJOINED LETTER PA
11852 SOYOMBO SUBJOINED LETTER MA
11853 SOYOMBO SUBJOINED LETTER YA
11854 SOYOMBO SUBJOINED LETTER RA
11855 SOYOMBO SUBJOINED LETTER VA
11856 SOYOMBO SUBJOINED LETTER LA
11857 SOYOMBO SUBJOINED LETTER SHA
11858 SOYOMBO SUBJOINED LETTER SA
11859 SOYOMBO SUBJOINED LETTER HA
1185A SOYOMBO SUBJOINED LETTER GALIG GA
1185B SOYOMBO SUBJOINED LETTER GALIG GHA
1185C SOYOMBO SUBJOINED LETTER GALIG JA
1185D SOYOMBO SUBJOINED LETTER GALIG JHA
1185E SOYOMBO SUBJOINED LETTER GALIG TTA
1185F SOYOMBO SUBJOINED LETTER GALIG TTHA
11860 SOYOMBO SUBJOINED LETTER GALIG DDA
11861 SOYOMBO SUBJOINED LETTER GALIG DDHA
11862 SOYOMBO SUBJOINED LETTER GALIG NNA
11863 SOYOMBO SUBJOINED LETTER GALIG DA
11864 SOYOMBO SUBJOINED LETTER GALIG DHA
11865 SOYOMBO SUBJOINED LETTER GALIG BA
11866 SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3)

Soyombo names list, 11867–11873

Subjoined letters (continued)
11867 SOYOMBO SUBJOINED LETTER GALIG SSA
11868 SOYOMBO SUBJOINED LETTER GALIG TSA
11869 SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A SOYOMBO SUBJOINED LETTER GALIG DZA
1186B SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C SOYOMBO SUBJOINED LETTER GALIG ZA
1186D SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E SOYOMBO DANDA
1186F SOYOMBO DOUBLE DANDA
11870 SOYOMBO TSHEG
    → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871 SOYOMBO HEAD MARK
11872 SOYOMBO TERMINAL MARK

Symbol
11873 SOYOMBO SYMBOL SVAYAMBHU
    • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2)

[Diagram elements: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996)

[Diagram elements: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo

Figure 6: Photograph of a chart of the Soyombo script (from "Histoire du livre" 2010)

Figure 7: Characters of the Soyombo script (from Kapaj 2002)

Figure 8: Characters of the Soyombo script (from Kapaj 2002)

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152)

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133)

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134)

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135)

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136)

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137)

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138)

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139)

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140)

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141)

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142)

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143)

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144)

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145)

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146)

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147)

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148)

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153)

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008)

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173)

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b)

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378)

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378)

Figure 34: The flag of Mongolia showing the SVAYAMBHU symbol (U+11873) (from Wikimedia 2008)

Figure 35: The coat of arms of Mongolia showing the SVAYAMBHU symbol (U+11873) (from Wikimedia 2009a)

Figure 36: An imprint of the SVAYAMBHU symbol (U+11873) in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006)

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following: This is a complete proposal: Yes; (or) More information will be provided later:

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes. Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block: . Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below; see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

[1] Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No. If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes. If YES, with whom? György Kara <gkara@indiana.edu>. If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes. Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common. Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes. If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? No. If YES, is a rationale provided? If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No. If YES, is a rationale for its inclusion provided? If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No. If YES, is a rationale for its inclusion provided? If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No. If YES, is a rationale for its inclusion provided? If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes. If YES, is a rationale for such use provided? Yes. If YES, reference: Combining vowel and final-consonant signs. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided? If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No. If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No. If YES, are the equivalent corresponding unified ideographic characters identified? If YES, reference:
of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference 12 Does the proposal contain characters with any special properties such as control function or similar semantics No If YES describe in detail (include attachment if necessary) 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference


Proposal to Encode the Soyombo Script in ISO/IEC 10646 Anshuman Pandey

3.10 Symbols

The Soyombo symbol (U+11873) was designed by Zanabazar, the creator of the script. It is an officially recognized symbol in Mongolia and appears on the flag and coat of arms of the country (see figures 34 and 35). The symbol is included as a character of the script, but it is not part of the orthography of written text.

4 Language-Specific Orthography

4.1 Tibetan

The representation of consonant clusters and syllable-final consonants in Tibetan words follows the orthography for Tibetan. For example, a word-initial bare consonant may be written using the regular letter instead of forming a conjunct with the following consonant. Also, a syllable-final consonant may be represented by the regular letter instead of the final-consonant sign. The word མཆོག mchog ‘supreme’ is written as <U+11815 MA, U+1180E CA, U+11805 VOWEL SIGN O, U+1180A GA, U+11870 TSHEG>, using the regular letter U+1180A GA instead of the final-consonant sign U+1181D CONSONANT SIGN G. The tsheg U+11870 is added afterward to mark the end of the syllable. The expected representation of mchog is <U+11815 MA, U+1184B SUBJOINED LETTER CA, U+11805 VOWEL SIGN O, U+1181D CONSONANT SIGN G>.
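
The two spellings of mchog can be compared as code point sequences. The following Python sketch is illustrative only: it assumes the code points proposed in this document (which need not match those of any published standard), and the names ATTESTED, EXPECTED, describe, and to_text are hypothetical helpers rather than part of the proposal.

```python
# Code points as proposed in this document (block starting at U+11800).
# They are assumptions tied to this proposal, not a published standard.
NAMES = {
    0x11815: "LETTER MA",
    0x1180E: "LETTER CA",
    0x1184B: "SUBJOINED LETTER CA",
    0x11805: "VOWEL SIGN O",
    0x1180A: "LETTER GA",
    0x1181D: "CONSONANT SIGN G",
    0x11870: "TSHEG",
}

# Tibetan-style spelling: regular letters, closed by a tsheg.
ATTESTED = [0x11815, 0x1180E, 0x11805, 0x1180A, 0x11870]
# Spelling expected under the general conjunct and final-sign rules.
EXPECTED = [0x11815, 0x1184B, 0x11805, 0x1181D]


def describe(seq):
    """Return a readable 'U+XXXXX NAME' listing for a code point sequence."""
    return [f"U+{cp:05X} {NAMES[cp]}" for cp in seq]


def to_text(seq):
    """Build the encoded string for a code point sequence."""
    return "".join(chr(cp) for cp in seq)


if __name__ == "__main__":
    for label, seq in (("attested", ATTESTED), ("expected", EXPECTED)):
        print(label, describe(seq))
```

The two sequences share only the base letter and the vowel sign; the Tibetan-style spelling swaps the subjoined letter and the final-consonant sign for regular letters and adds the tsheg.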

5 Rendering and Shaping Behaviors

5.1 Vertical Text

Soyombo may be written vertically, from top to bottom and left to right. For example, the text segment dzürka, as shown in the seal in Figure 27:

dzür: U+11839 GALIG DZA + U+11802 VOWEL SIGN UE + U+11824 CONSONANT SIGN R
ka: U+1180B KA

In a vertical environment, Soyombo orthography departs from the general rules for writing syllables. The syllables of a word may be written using subjoined letters, as if forming a conjunct. The example below shows the word temedeg ‘symbol’ written vertically. Its three syllables have the same vowel, which is represented using a single vowel sign above the base letter; the consonants of non-initial syllables are written using subjoined letters, while the final consonant is written using a final-consonant sign.

Syllable by syllable: U+11811 TA + U+11804 VOWEL SIGN E, U+11815 MA + U+11804 VOWEL SIGN E, U+11810 DA + U+11804 VOWEL SIGN E, U+1181D CONSONANT SIGN G
Vertical form: U+11811 TA, U+11804 VOWEL SIGN E, U+11852 SUBJOINED LETTER MA, U+1184D SUBJOINED LETTER DA, U+1181D CONSONANT SIGN G

A similar practice is observed in the vertical form of words comprised of syllables that contain different vowels. The example below shows the word yenu. Its two syllables have different vowels, which are written above and below the base; the consonant of the second syllable is written as a subjoined form.

Syllable by syllable: U+11816 YA + U+11804 VOWEL SIGN E, U+11812 NA + U+11803 VOWEL SIGN U
Vertical form: U+11816 YA, U+11804 VOWEL SIGN E, U+1184F SUBJOINED LETTER NA, U+11803 VOWEL SIGN U


It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if the vowels were different and written in the same position, in order to prevent clashing.

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs, U+11802 and U+11803.

• Shaping and positioning of U+11802 VOWEL SIGN UE: U+1180B KA + U+11802 + U+1181F CONSONANT SIGN NG → küng (k + -ü + -ng), rendered with the vowel sign resized and repositioned.

• Shaping and positioning of U+11803 VOWEL SIGN U: U+1180B KA + U+11803 + U+1181F CONSONANT SIGN NG → kung (k + -u + -ng), rendered with the vowel sign resized and repositioned.
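
The context in which this rule applies can be detected from the code point sequence alone. The sketch below is a hypothetical illustration, not a shaping implementation: the actual resizing would be done by the font, the code points are the ones proposed in this document, and the function name needs_vowel_adjustment is an assumption introduced here.

```python
# Vowel signs written at the base of the letter frame (UE and U),
# using the code points proposed in this document.
BASE_POSITION_VOWELS = {0x11802, 0x11803}

# Final-consonant signs: CONSONANT SIGN G through CONSONANT SIGN ANG.
FINAL_CONSONANT_SIGNS = range(0x1181D, 0x11829)


def needs_vowel_adjustment(seq):
    """Return True when a base-position vowel sign shares a syllable
    with a final-consonant sign, the case in which the vowel sign is
    resized and repositioned."""
    has_vowel = any(cp in BASE_POSITION_VOWELS for cp in seq)
    has_final = any(cp in FINAL_CONSONANT_SIGNS for cp in seq)
    return has_vowel and has_final


if __name__ == "__main__":
    kueng = [0x1180B, 0x11802, 0x1181F]  # KA + UE + NG
    ku = [0x1180B, 0x11803]              # KA + U, no final sign
    print(needs_vowel_adjustment(kueng), needs_vowel_adjustment(ku))
```

A rendering engine would more plausibly express the same condition as a contextual glyph substitution; the predicate only makes the triggering context explicit.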

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the gemination sign (U+11842) within the normal x-height; compare [glyph] with [glyph].

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <U+1180A GA, U+11853 SUBJOINED LETTER YA>, the regular shape of U+1180A is altered to [glyph], where the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C NGA is written with U+1180A GA, U+1180B KA, U+11829 GALIG GA, or U+1182A GALIG GHA. For instance, in writing nga, the regular shape of U+1180C changes to [glyph] before [glyph] is attached. A further modification is the break in the frame that occurs in conjuncts involving [glyph].

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases, it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters. The position of the second subjoined letter would straddle the baseline or fall below it. In such cases, the stem of the base-letter frame would be extended:

grva: U+11829 GALIG GA + U+11854 SUBJOINED LETTER RA + U+11855 SUBJOINED LETTER VA

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. a stemmed variant glyph for U+11855, etc.

6 Character Data

6.1 Character Properties

11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11802;SOYOMBO VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11803;SOYOMBO VOWEL SIGN UE;Mn;0;NSM;;;;;N;;;;;
11804;SOYOMBO VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11805;SOYOMBO VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11806;SOYOMBO VOWEL SIGN OE;Mn;0;NSM;;;;;N;;;;;
11807;SOYOMBO VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11808;SOYOMBO VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11809;SOYOMBO VOWEL LENGTH MARK;Mn;0;NSM;;;;;N;;;;;
1180A;SOYOMBO LETTER GA;Lo;0;L;;;;;N;;;;;
1180B;SOYOMBO LETTER KA;Lo;0;L;;;;;N;;;;;
1180C;SOYOMBO LETTER NGA;Lo;0;L;;;;;N;;;;;
1180D;SOYOMBO LETTER JA;Lo;0;L;;;;;N;;;;;
1180E;SOYOMBO LETTER CA;Lo;0;L;;;;;N;;;;;
1180F;SOYOMBO LETTER NYA;Lo;0;L;;;;;N;;;;;
11810;SOYOMBO LETTER DA;Lo;0;L;;;;;N;;;;;
11811;SOYOMBO LETTER TA;Lo;0;L;;;;;N;;;;;
11812;SOYOMBO LETTER NA;Lo;0;L;;;;;N;;;;;
11813;SOYOMBO LETTER BA;Lo;0;L;;;;;N;;;;;
11814;SOYOMBO LETTER PA;Lo;0;L;;;;;N;;;;;
11815;SOYOMBO LETTER MA;Lo;0;L;;;;;N;;;;;
11816;SOYOMBO LETTER YA;Lo;0;L;;;;;N;;;;;
11817;SOYOMBO LETTER RA;Lo;0;L;;;;;N;;;;;
11818;SOYOMBO LETTER VA;Lo;0;L;;;;;N;;;;;
11819;SOYOMBO LETTER LA;Lo;0;L;;;;;N;;;;;
1181A;SOYOMBO LETTER SHA;Lo;0;L;;;;;N;;;;;
1181B;SOYOMBO LETTER SA;Lo;0;L;;;;;N;;;;;
1181C;SOYOMBO LETTER HA;Lo;0;L;;;;;N;;;;;
1181D;SOYOMBO CONSONANT SIGN G;Mn;0;NSM;;;;;N;;;;;
1181E;SOYOMBO CONSONANT SIGN K;Mn;0;NSM;;;;;N;;;;;
1181F;SOYOMBO CONSONANT SIGN NG;Mn;0;NSM;;;;;N;;;;;
11820;SOYOMBO CONSONANT SIGN D;Mn;0;NSM;;;;;N;;;;;
11821;SOYOMBO CONSONANT SIGN N;Mn;0;NSM;;;;;N;;;;;
11822;SOYOMBO CONSONANT SIGN B;Mn;0;NSM;;;;;N;;;;;
11823;SOYOMBO CONSONANT SIGN M;Mn;0;NSM;;;;;N;;;;;
11824;SOYOMBO CONSONANT SIGN R;Mn;0;NSM;;;;;N;;;;;
11825;SOYOMBO CONSONANT SIGN L;Mn;0;NSM;;;;;N;;;;;
11826;SOYOMBO CONSONANT SIGN SH;Mn;0;NSM;;;;;N;;;;;
11827;SOYOMBO CONSONANT SIGN S;Mn;0;NSM;;;;;N;;;;;
11828;SOYOMBO CONSONANT SIGN ANG;Mn;0;NSM;;;;;N;;;;;
11829;SOYOMBO LETTER GALIG GA;Lo;0;L;;;;;N;;;;;
1182A;SOYOMBO LETTER GALIG GHA;Lo;0;L;;;;;N;;;;;
1182B;SOYOMBO LETTER GALIG JA;Lo;0;L;;;;;N;;;;;
1182C;SOYOMBO LETTER GALIG JHA;Lo;0;L;;;;;N;;;;;
1182D;SOYOMBO LETTER GALIG TTA;Lo;0;L;;;;;N;;;;;
1182E;SOYOMBO LETTER GALIG TTHA;Lo;0;L;;;;;N;;;;;
1182F;SOYOMBO LETTER GALIG DDA;Lo;0;L;;;;;N;;;;;
11830;SOYOMBO LETTER GALIG DDHA;Lo;0;L;;;;;N;;;;;
11831;SOYOMBO LETTER GALIG NNA;Lo;0;L;;;;;N;;;;;
11832;SOYOMBO LETTER GALIG DA;Lo;0;L;;;;;N;;;;;
11833;SOYOMBO LETTER GALIG DHA;Lo;0;L;;;;;N;;;;;
11834;SOYOMBO LETTER GALIG BA;Lo;0;L;;;;;N;;;;;
11835;SOYOMBO LETTER GALIG BHA;Lo;0;L;;;;;N;;;;;
11836;SOYOMBO LETTER GALIG SSA;Lo;0;L;;;;;N;;;;;
11837;SOYOMBO LETTER GALIG TSA;Lo;0;L;;;;;N;;;;;
11838;SOYOMBO LETTER GALIG TSHA;Lo;0;L;;;;;N;;;;;
11839;SOYOMBO LETTER GALIG DZA;Lo;0;L;;;;;N;;;;;
1183A;SOYOMBO LETTER GALIG ZHA;Lo;0;L;;;;;N;;;;;
1183B;SOYOMBO LETTER GALIG ZA;Lo;0;L;;;;;N;;;;;
1183C;SOYOMBO LETTER GALIG SMALL A;Lo;0;L;;;;;N;;;;;
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
1183E;SOYOMBO VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
1183F;SOYOMBO VOWEL SIGN VOCALIC L;Mn;0;NSM;;;;;N;;;;;
11840;SOYOMBO SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11842;SOYOMBO GEMINATION SIGN;Mn;0;NSM;;;;;N;;;;;
11843;SOYOMBO PREFIXED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11844;SOYOMBO PREFIXED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11845;SOYOMBO PREFIXED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11846;SOYOMBO PREFIXED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11847;SOYOMBO SUBJOINED LETTER GA;Mn;0;NSM;;;;;N;;;;;
11848;SOYOMBO SUBJOINED LETTER KA;Mn;0;NSM;;;;;N;;;;;
11849;SOYOMBO SUBJOINED LETTER NGA;Mn;0;NSM;;;;;N;;;;;
1184A;SOYOMBO SUBJOINED LETTER JA;Mn;0;NSM;;;;;N;;;;;
1184B;SOYOMBO SUBJOINED LETTER CA;Mn;0;NSM;;;;;N;;;;;
1184C;SOYOMBO SUBJOINED LETTER NYA;Mn;0;NSM;;;;;N;;;;;
1184D;SOYOMBO SUBJOINED LETTER DA;Mn;0;NSM;;;;;N;;;;;
1184E;SOYOMBO SUBJOINED LETTER TA;Mn;0;NSM;;;;;N;;;;;
1184F;SOYOMBO SUBJOINED LETTER NA;Mn;0;NSM;;;;;N;;;;;
11850;SOYOMBO SUBJOINED LETTER BA;Mn;0;NSM;;;;;N;;;;;
11851;SOYOMBO SUBJOINED LETTER PA;Mn;0;NSM;;;;;N;;;;;
11852;SOYOMBO SUBJOINED LETTER MA;Mn;0;NSM;;;;;N;;;;;
11853;SOYOMBO SUBJOINED LETTER YA;Mn;0;NSM;;;;;N;;;;;
11854;SOYOMBO SUBJOINED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11855;SOYOMBO SUBJOINED LETTER VA;Mn;0;NSM;;;;;N;;;;;
11856;SOYOMBO SUBJOINED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11857;SOYOMBO SUBJOINED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11858;SOYOMBO SUBJOINED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11859;SOYOMBO SUBJOINED LETTER HA;Mn;0;NSM;;;;;N;;;;;
1185A;SOYOMBO SUBJOINED LETTER GALIG GA;Mn;0;NSM;;;;;N;;;;;
1185B;SOYOMBO SUBJOINED LETTER GALIG GHA;Mn;0;NSM;;;;;N;;;;;
1185C;SOYOMBO SUBJOINED LETTER GALIG JA;Mn;0;NSM;;;;;N;;;;;
1185D;SOYOMBO SUBJOINED LETTER GALIG JHA;Mn;0;NSM;;;;;N;;;;;
1185E;SOYOMBO SUBJOINED LETTER GALIG TTA;Mn;0;NSM;;;;;N;;;;;
1185F;SOYOMBO SUBJOINED LETTER GALIG TTHA;Mn;0;NSM;;;;;N;;;;;
11860;SOYOMBO SUBJOINED LETTER GALIG DDA;Mn;0;NSM;;;;;N;;;;;
11861;SOYOMBO SUBJOINED LETTER GALIG DDHA;Mn;0;NSM;;;;;N;;;;;
11862;SOYOMBO SUBJOINED LETTER GALIG NNA;Mn;0;NSM;;;;;N;;;;;
11863;SOYOMBO SUBJOINED LETTER GALIG DA;Mn;0;NSM;;;;;N;;;;;
11864;SOYOMBO SUBJOINED LETTER GALIG DHA;Mn;0;NSM;;;;;N;;;;;
11865;SOYOMBO SUBJOINED LETTER GALIG BA;Mn;0;NSM;;;;;N;;;;;
11866;SOYOMBO SUBJOINED LETTER GALIG BHA;Mn;0;NSM;;;;;N;;;;;
11867;SOYOMBO SUBJOINED LETTER GALIG SSA;Mn;0;NSM;;;;;N;;;;;
11868;SOYOMBO SUBJOINED LETTER GALIG TSA;Mn;0;NSM;;;;;N;;;;;
11869;SOYOMBO SUBJOINED LETTER GALIG TSHA;Mn;0;NSM;;;;;N;;;;;
1186A;SOYOMBO SUBJOINED LETTER GALIG DZA;Mn;0;NSM;;;;;N;;;;;
1186B;SOYOMBO SUBJOINED LETTER GALIG ZHA;Mn;0;NSM;;;;;N;;;;;
1186C;SOYOMBO SUBJOINED LETTER GALIG ZA;Mn;0;NSM;;;;;N;;;;;
1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
1186F;SOYOMBO DOUBLE DANDA;Po;0;L;;;;;N;;;;;
11870;SOYOMBO TSHEG;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11872;SOYOMBO TERMINAL MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
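
Property records of this kind are straightforward to process mechanically. The following Python sketch assumes the records are in the semicolon-separated field layout of UnicodeData.txt (code point, name, general category, combining class, bidirectional class, and so on); the SAMPLE lines are a small excerpt chosen for illustration, and parse_records and count_by_category are hypothetical helper names.

```python
from collections import Counter

# A few records in the assumed UnicodeData.txt layout, using the
# code points and names proposed in this document.
SAMPLE = """\
11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
"""


def parse_records(text):
    """Parse semicolon-separated property lines into dictionaries."""
    records = []
    for line in text.splitlines():
        if not line.strip():
            continue
        fields = line.split(";")
        records.append({
            "cp": int(fields[0], 16),   # code point (hexadecimal)
            "name": fields[1],          # character name
            "gc": fields[2],            # general category
            "ccc": int(fields[3]),      # canonical combining class
            "bidi": fields[4],          # bidirectional class
        })
    return records


def count_by_category(records):
    """Count records per general category."""
    return Counter(r["gc"] for r in records)


if __name__ == "__main__":
    recs = parse_records(SAMPLE)
    print(count_by_category(recs))
```

Run over the full table, the same counter would give the split between letters (Lo), combining marks (Mn, Mc), punctuation (Po), and the symbol (So).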

6.2 Linebreaking Properties

Linebreaking properties, given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
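
A line-break class lookup over these assignments can be sketched as follows. This is a minimal illustration in Python, assuming the "range;class # comment" layout of LineBreak.txt; the code points are the ones proposed in this document, parse_linebreak and line_break_class are hypothetical names, and XX (the Unknown class) is used as the fallback.

```python
# Line-break assignments from this section, in the assumed
# LineBreak.txt layout (comments omitted for brevity).
LINEBREAK_DATA = """\
11800;AL
11801..11809;CM
1180A..1181C;AL
1181D..11828;CM
11829..1183D;AL
1183E..11842;CM
11843..1186D;CM
1186E..1186F;BA
11870;BA
11871;BB
11872;BA
11873;AL
"""


def parse_linebreak(text):
    """Parse 'first..last;class' lines into (first, last, class) tuples."""
    ranges = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        span, cls = line.split(";")
        first, _, last = span.partition("..")
        ranges.append((int(first, 16), int(last or first, 16), cls.strip()))
    return ranges


def line_break_class(cp, ranges, default="XX"):
    """Return the line-break class for a code point, or the default."""
    for first, last, cls in ranges:
        if first <= cp <= last:
            return cls
    return default


if __name__ == "__main__":
    ranges = parse_linebreak(LINEBREAK_DATA)
    for cp in (0x11805, 0x11870, 0x11871):
        print(f"U+{cp:05X}", line_break_class(cp, ranges))
```

With these assignments, a break opportunity follows the tsheg and dandas (BA) and precedes the head mark (BB), while combining signs (CM) stay attached to their base letter.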

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав). Ed. by T. Bulgan. Ulaanbaatar: Shambala Association, System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”, v. 0.9. September 2, 1996. http://userpage.fu-berlin.de/~corff/im/Soyombo/overview.Soyombo.html

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie” (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. http://www.viahistoria.com/SilverHorde/main.html?research=MongolScripts.html

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

[Code chart: Soyombo, U+11800..U+1187F, columns 1180 to 1187 by rows 0 to F, with characters assigned at U+11800..U+11873. Printed using UniBook™ (http://www.unicode.org/unibook/), 28-Oct-2011.]

Figure 1: Proposed code chart for Soyombo.

Soyombo (U+11800..U+11866)

Vowel carrier
11800 SOYOMBO LETTER A

Vowel signs
11801 SOYOMBO VOWEL SIGN I
11802 SOYOMBO VOWEL SIGN UE
11803 SOYOMBO VOWEL SIGN U
11804 SOYOMBO VOWEL SIGN E
11805 SOYOMBO VOWEL SIGN O
11806 SOYOMBO VOWEL SIGN OE
11807 SOYOMBO VOWEL SIGN AU
11808 SOYOMBO VOWEL SIGN AI

Vowel length mark
11809 SOYOMBO VOWEL LENGTH MARK

Consonants
1180A SOYOMBO LETTER GA
      • Used for Sanskrit ka
1180B SOYOMBO LETTER KA
      • Used for Sanskrit kha
1180C SOYOMBO LETTER NGA
1180D SOYOMBO LETTER JA
      • Used for Sanskrit ca
1180E SOYOMBO LETTER CA
      • Used for Sanskrit cha
1180F SOYOMBO LETTER NYA
11810 SOYOMBO LETTER DA
      • Used for Sanskrit ta
11811 SOYOMBO LETTER TA
      • Used for Sanskrit tha
11812 SOYOMBO LETTER NA
11813 SOYOMBO LETTER BA
      • Used for Sanskrit pa
11814 SOYOMBO LETTER PA
      • Used for Sanskrit pha
11815 SOYOMBO LETTER MA
11816 SOYOMBO LETTER YA
11817 SOYOMBO LETTER RA
11818 SOYOMBO LETTER VA
11819 SOYOMBO LETTER LA
1181A SOYOMBO LETTER SHA
1181B SOYOMBO LETTER SA
1181C SOYOMBO LETTER HA

Final consonant signs
1181D SOYOMBO CONSONANT SIGN G
1181E SOYOMBO CONSONANT SIGN K
1181F SOYOMBO CONSONANT SIGN NG
11820 SOYOMBO CONSONANT SIGN D
11821 SOYOMBO CONSONANT SIGN N
11822 SOYOMBO CONSONANT SIGN B
11823 SOYOMBO CONSONANT SIGN M
11824 SOYOMBO CONSONANT SIGN R
11825 SOYOMBO CONSONANT SIGN L
11826 SOYOMBO CONSONANT SIGN SH
11827 SOYOMBO CONSONANT SIGN S
11828 SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829 SOYOMBO LETTER GALIG GA
1182A SOYOMBO LETTER GALIG GHA
1182B SOYOMBO LETTER GALIG JA
1182C SOYOMBO LETTER GALIG JHA
1182D SOYOMBO LETTER GALIG TTA
1182E SOYOMBO LETTER GALIG TTHA
1182F SOYOMBO LETTER GALIG DDA
11830 SOYOMBO LETTER GALIG DDHA
11831 SOYOMBO LETTER GALIG NNA
11832 SOYOMBO LETTER GALIG DA
11833 SOYOMBO LETTER GALIG DHA
11834 SOYOMBO LETTER GALIG BA
11835 SOYOMBO LETTER GALIG BHA
11836 SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837 SOYOMBO LETTER GALIG TSA
11838 SOYOMBO LETTER GALIG TSHA
11839 SOYOMBO LETTER GALIG DZA
1183A SOYOMBO LETTER GALIG ZHA
1183B SOYOMBO LETTER GALIG ZA
1183C SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E SOYOMBO SIGN VOCALIC R
1183F SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840 SOYOMBO SIGN ANUSVARA
11841 SOYOMBO SIGN VISARGA

Gemination mark
11842 SOYOMBO GEMINATION MARK

Prefixed letters
11843 SOYOMBO PREFIXED LETTER LA
11844 SOYOMBO PREFIXED LETTER SHA
11845 SOYOMBO PREFIXED LETTER SA
11846 SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847 SOYOMBO SUBJOINED LETTER GA
11848 SOYOMBO SUBJOINED LETTER KA
11849 SOYOMBO SUBJOINED LETTER NGA
1184A SOYOMBO SUBJOINED LETTER JA
1184B SOYOMBO SUBJOINED LETTER CA
1184C SOYOMBO SUBJOINED LETTER NYA
1184D SOYOMBO SUBJOINED LETTER DA
1184E SOYOMBO SUBJOINED LETTER TA
1184F SOYOMBO SUBJOINED LETTER NA
11850 SOYOMBO SUBJOINED LETTER BA
11851 SOYOMBO SUBJOINED LETTER PA
11852 SOYOMBO SUBJOINED LETTER MA
11853 SOYOMBO SUBJOINED LETTER YA
11854 SOYOMBO SUBJOINED LETTER RA
11855 SOYOMBO SUBJOINED LETTER VA
11856 SOYOMBO SUBJOINED LETTER LA
11857 SOYOMBO SUBJOINED LETTER SHA
11858 SOYOMBO SUBJOINED LETTER SA
11859 SOYOMBO SUBJOINED LETTER HA
1185A SOYOMBO SUBJOINED LETTER GALIG GA
1185B SOYOMBO SUBJOINED LETTER GALIG GHA
1185C SOYOMBO SUBJOINED LETTER GALIG JA
1185D SOYOMBO SUBJOINED LETTER GALIG JHA
1185E SOYOMBO SUBJOINED LETTER GALIG TTA
1185F SOYOMBO SUBJOINED LETTER GALIG TTHA
11860 SOYOMBO SUBJOINED LETTER GALIG DDA
11861 SOYOMBO SUBJOINED LETTER GALIG DDHA
11862 SOYOMBO SUBJOINED LETTER GALIG NNA
11863 SOYOMBO SUBJOINED LETTER GALIG DA
11864 SOYOMBO SUBJOINED LETTER GALIG DHA
11865 SOYOMBO SUBJOINED LETTER GALIG BA
11866 SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3).

Soyombo (U+11867..U+11873)

11867 SOYOMBO SUBJOINED LETTER GALIG SSA
11868 SOYOMBO SUBJOINED LETTER GALIG TSA
11869 SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A SOYOMBO SUBJOINED LETTER GALIG DZA
1186B SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C SOYOMBO SUBJOINED LETTER GALIG ZA
1186D SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E SOYOMBO DANDA
1186F SOYOMBO DOUBLE DANDA
11870 SOYOMBO TSHEG
      → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871 SOYOMBO HEAD MARK
11872 SOYOMBO TERMINAL MARK

Symbol
11873 SOYOMBO SYMBOL SVAYAMBHU
      • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2).

[Diagram: letter frame with the slots Base, (Vd), (Vab), (Vbb), (Cf), (VL).]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996).

[Diagram: letter frame with the slots Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl).]

Figure 5: Structure of a Sanskrit syllable in Soyombo.

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 [1]

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

[1] Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: Combining vowel and final-consonant signs
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
   If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:

Page 8: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISO/IEC 10646 (Anshuman Pandey)

It is likely that the above behavior occurs only when the syllables of a word possess the same vowel, or when the syllables of a word have vowels whose signs are written in different positions. It is unlikely that syllables would be written as shown above if their vowels were different but written in the same position, because the signs would clash.

5.2 Vowel Signs

When vowel signs that appear at the base of the letter frame are written with final-consonant signs, the size and position of the vowel signs are modified in order to accommodate the placement of both signs. This rule affects two vowel signs, U+11802 and U+11803:

• Shaping and positioning of U+11802: <U+1180B, U+11802, U+1181F> → k + ü + ng → küng [glyph images not reproduced]

• Shaping and positioning of U+11803: <U+1180B, U+11803, U+1181F> → k + u + ng → kung [glyph images not reproduced]

5.3 Gemination Sign

The height of the base letter may be lowered to accommodate the gemination sign within the normal x-height; compare [glyph] with [glyph].

5.4 Conjuncts

The glyph of the base letter in a conjunct is generally modified in order to accommodate the placement of subjoined letters within the letter frame. For example, in writing kya <U+1180A, U+11853>, the regular shape of U+1180A is altered to [glyph], where the base element is slightly truncated and compressed.

Another such adjustment is made when U+1180C is written with U+1180A, U+1180B, U+11829, or U+1182A. For instance, in writing nga, the regular shape of U+1180C changes to [glyph] before [glyph] is attached. A further modification is the break in the frame that occurs in conjuncts involving [glyph].

5.5 Glyphic Representation of Subjoined Letters

The glyphs for subjoined letters represent the base element of the regular letter. In some cases it may be necessary to include part of the frame stem in the glyphs for subjoined letters in order to maintain the Soyombo graphical structure. For example, the word grva consists of a base letter and two subjoined letters. The second subjoined letter would straddle the baseline or fall below it. In such cases the stem of the base-letter frame would be extended:

grva: <U+11829, U+11854, U+11855> [glyph images not reproduced]

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem, e.g. [glyph with stem segment] for U+11855, etc.
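The example sequences in Section 5 can be written down as lists of code points. The following Python sketch is an illustration and not part of the proposal: it records küng, kung, kya, and grva using the code points proposed in this document and checks that each sequence is one base letter followed only by combining marks. The names `PROPOSED_MARK_RANGES`, `is_mark`, `is_well_formed`, and `show` are hypothetical, and the "base followed by marks" check is a simplifying assumption rather than a rule stated in the text. The code works on integers only, because these positions are proposed values, not standardized ones.

```python
# Sketch only: all code points are the values PROPOSED in this document.
# Ranges of combining marks (general category Mn or Mc) as listed in Section 6.1.
PROPOSED_MARK_RANGES = [
    (0x11801, 0x11809),  # vowel signs and vowel length mark
    (0x1181D, 0x11828),  # final-consonant signs
    (0x1183E, 0x11842),  # vocalic signs, anusvara, visarga, gemination sign
    (0x11843, 0x1186D),  # prefixed and subjoined letters
]

# Example sequences taken from Sections 5.2, 5.4 and 5.5.
EXAMPLES = {
    "küng": [0x1180B, 0x11802, 0x1181F],
    "kung": [0x1180B, 0x11803, 0x1181F],
    "kya": [0x1180A, 0x11853],
    "grva": [0x11829, 0x11854, 0x11855],
}


def is_mark(cp):
    """Return True if cp falls in one of the proposed combining-mark ranges."""
    return any(lo <= cp <= hi for lo, hi in PROPOSED_MARK_RANGES)


def is_well_formed(seq):
    """Assumed check: one non-mark base letter followed only by marks."""
    return bool(seq) and not is_mark(seq[0]) and all(is_mark(cp) for cp in seq[1:])


def show(seq):
    """Format a sequence in the angle-bracket notation used in the text."""
    return "<" + ", ".join(f"{cp:05X}" for cp in seq) + ">"


for word, seq in EXAMPLES.items():
    print(word, show(seq), is_well_formed(seq))
```

A sequence that starts with a subjoined letter, such as `[0x11853]`, is rejected by this check because it has no base letter.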

6 Character Data

6.1 Character Properties

11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11802;SOYOMBO VOWEL SIGN U;Mn;0;NSM;;;;;N;;;;;
11803;SOYOMBO VOWEL SIGN UE;Mn;0;NSM;;;;;N;;;;;
11804;SOYOMBO VOWEL SIGN E;Mn;0;NSM;;;;;N;;;;;
11805;SOYOMBO VOWEL SIGN O;Mn;0;NSM;;;;;N;;;;;
11806;SOYOMBO VOWEL SIGN OE;Mn;0;NSM;;;;;N;;;;;
11807;SOYOMBO VOWEL SIGN AU;Mn;0;NSM;;;;;N;;;;;
11808;SOYOMBO VOWEL SIGN AI;Mn;0;NSM;;;;;N;;;;;
11809;SOYOMBO VOWEL LENGTH MARK;Mn;0;NSM;;;;;N;;;;;
1180A;SOYOMBO LETTER GA;Lo;0;L;;;;;N;;;;;
1180B;SOYOMBO LETTER KA;Lo;0;L;;;;;N;;;;;
1180C;SOYOMBO LETTER NGA;Lo;0;L;;;;;N;;;;;
1180D;SOYOMBO LETTER JA;Lo;0;L;;;;;N;;;;;
1180E;SOYOMBO LETTER CA;Lo;0;L;;;;;N;;;;;
1180F;SOYOMBO LETTER NYA;Lo;0;L;;;;;N;;;;;
11810;SOYOMBO LETTER DA;Lo;0;L;;;;;N;;;;;
11811;SOYOMBO LETTER TA;Lo;0;L;;;;;N;;;;;
11812;SOYOMBO LETTER NA;Lo;0;L;;;;;N;;;;;
11813;SOYOMBO LETTER BA;Lo;0;L;;;;;N;;;;;
11814;SOYOMBO LETTER PA;Lo;0;L;;;;;N;;;;;
11815;SOYOMBO LETTER MA;Lo;0;L;;;;;N;;;;;
11816;SOYOMBO LETTER YA;Lo;0;L;;;;;N;;;;;
11817;SOYOMBO LETTER RA;Lo;0;L;;;;;N;;;;;
11818;SOYOMBO LETTER VA;Lo;0;L;;;;;N;;;;;
11819;SOYOMBO LETTER LA;Lo;0;L;;;;;N;;;;;
1181A;SOYOMBO LETTER SHA;Lo;0;L;;;;;N;;;;;
1181B;SOYOMBO LETTER SA;Lo;0;L;;;;;N;;;;;
1181C;SOYOMBO LETTER HA;Lo;0;L;;;;;N;;;;;
1181D;SOYOMBO CONSONANT SIGN G;Mn;0;NSM;;;;;N;;;;;
1181E;SOYOMBO CONSONANT SIGN K;Mn;0;NSM;;;;;N;;;;;
1181F;SOYOMBO CONSONANT SIGN NG;Mn;0;NSM;;;;;N;;;;;
11820;SOYOMBO CONSONANT SIGN D;Mn;0;NSM;;;;;N;;;;;
11821;SOYOMBO CONSONANT SIGN N;Mn;0;NSM;;;;;N;;;;;
11822;SOYOMBO CONSONANT SIGN B;Mn;0;NSM;;;;;N;;;;;
11823;SOYOMBO CONSONANT SIGN M;Mn;0;NSM;;;;;N;;;;;
11824;SOYOMBO CONSONANT SIGN R;Mn;0;NSM;;;;;N;;;;;
11825;SOYOMBO CONSONANT SIGN L;Mn;0;NSM;;;;;N;;;;;
11826;SOYOMBO CONSONANT SIGN SH;Mn;0;NSM;;;;;N;;;;;
11827;SOYOMBO CONSONANT SIGN S;Mn;0;NSM;;;;;N;;;;;
11828;SOYOMBO CONSONANT SIGN ANG;Mn;0;NSM;;;;;N;;;;;
11829;SOYOMBO LETTER GALIG GA;Lo;0;L;;;;;N;;;;;
1182A;SOYOMBO LETTER GALIG GHA;Lo;0;L;;;;;N;;;;;
1182B;SOYOMBO LETTER GALIG JA;Lo;0;L;;;;;N;;;;;
1182C;SOYOMBO LETTER GALIG JHA;Lo;0;L;;;;;N;;;;;
1182D;SOYOMBO LETTER GALIG TTA;Lo;0;L;;;;;N;;;;;
1182E;SOYOMBO LETTER GALIG TTHA;Lo;0;L;;;;;N;;;;;
1182F;SOYOMBO LETTER GALIG DDA;Lo;0;L;;;;;N;;;;;
11830;SOYOMBO LETTER GALIG DDHA;Lo;0;L;;;;;N;;;;;
11831;SOYOMBO LETTER GALIG NNA;Lo;0;L;;;;;N;;;;;
11832;SOYOMBO LETTER GALIG DA;Lo;0;L;;;;;N;;;;;
11833;SOYOMBO LETTER GALIG DHA;Lo;0;L;;;;;N;;;;;
11834;SOYOMBO LETTER GALIG BA;Lo;0;L;;;;;N;;;;;
11835;SOYOMBO LETTER GALIG BHA;Lo;0;L;;;;;N;;;;;
11836;SOYOMBO LETTER GALIG SSA;Lo;0;L;;;;;N;;;;;
11837;SOYOMBO LETTER GALIG TSA;Lo;0;L;;;;;N;;;;;
11838;SOYOMBO LETTER GALIG TSHA;Lo;0;L;;;;;N;;;;;
11839;SOYOMBO LETTER GALIG DZA;Lo;0;L;;;;;N;;;;;
1183A;SOYOMBO LETTER GALIG ZHA;Lo;0;L;;;;;N;;;;;
1183B;SOYOMBO LETTER GALIG ZA;Lo;0;L;;;;;N;;;;;
1183C;SOYOMBO LETTER GALIG SMALL A;Lo;0;L;;;;;N;;;;;
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
1183E;SOYOMBO VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
1183F;SOYOMBO VOWEL SIGN VOCALIC L;Mn;0;NSM;;;;;N;;;;;
11840;SOYOMBO SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11842;SOYOMBO GEMINATION SIGN;Mn;0;NSM;;;;;N;;;;;
11843;SOYOMBO PREFIXED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11844;SOYOMBO PREFIXED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11845;SOYOMBO PREFIXED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11846;SOYOMBO PREFIXED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11847;SOYOMBO SUBJOINED LETTER GA;Mn;0;NSM;;;;;N;;;;;
11848;SOYOMBO SUBJOINED LETTER KA;Mn;0;NSM;;;;;N;;;;;
11849;SOYOMBO SUBJOINED LETTER NGA;Mn;0;NSM;;;;;N;;;;;
1184A;SOYOMBO SUBJOINED LETTER JA;Mn;0;NSM;;;;;N;;;;;
1184B;SOYOMBO SUBJOINED LETTER CA;Mn;0;NSM;;;;;N;;;;;
1184C;SOYOMBO SUBJOINED LETTER NYA;Mn;0;NSM;;;;;N;;;;;
1184D;SOYOMBO SUBJOINED LETTER DA;Mn;0;NSM;;;;;N;;;;;
1184E;SOYOMBO SUBJOINED LETTER TA;Mn;0;NSM;;;;;N;;;;;
1184F;SOYOMBO SUBJOINED LETTER NA;Mn;0;NSM;;;;;N;;;;;
11850;SOYOMBO SUBJOINED LETTER BA;Mn;0;NSM;;;;;N;;;;;
11851;SOYOMBO SUBJOINED LETTER PA;Mn;0;NSM;;;;;N;;;;;
11852;SOYOMBO SUBJOINED LETTER MA;Mn;0;NSM;;;;;N;;;;;
11853;SOYOMBO SUBJOINED LETTER YA;Mn;0;NSM;;;;;N;;;;;
11854;SOYOMBO SUBJOINED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11855;SOYOMBO SUBJOINED LETTER VA;Mn;0;NSM;;;;;N;;;;;
11856;SOYOMBO SUBJOINED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11857;SOYOMBO SUBJOINED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11858;SOYOMBO SUBJOINED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11859;SOYOMBO SUBJOINED LETTER HA;Mn;0;NSM;;;;;N;;;;;
1185A;SOYOMBO SUBJOINED LETTER GALIG GA;Mn;0;NSM;;;;;N;;;;;
1185B;SOYOMBO SUBJOINED LETTER GALIG GHA;Mn;0;NSM;;;;;N;;;;;
1185C;SOYOMBO SUBJOINED LETTER GALIG JA;Mn;0;NSM;;;;;N;;;;;
1185D;SOYOMBO SUBJOINED LETTER GALIG JHA;Mn;0;NSM;;;;;N;;;;;
1185E;SOYOMBO SUBJOINED LETTER GALIG TTA;Mn;0;NSM;;;;;N;;;;;
1185F;SOYOMBO SUBJOINED LETTER GALIG TTHA;Mn;0;NSM;;;;;N;;;;;
11860;SOYOMBO SUBJOINED LETTER GALIG DDA;Mn;0;NSM;;;;;N;;;;;
11861;SOYOMBO SUBJOINED LETTER GALIG DDHA;Mn;0;NSM;;;;;N;;;;;
11862;SOYOMBO SUBJOINED LETTER GALIG NNA;Mn;0;NSM;;;;;N;;;;;
11863;SOYOMBO SUBJOINED LETTER GALIG DA;Mn;0;NSM;;;;;N;;;;;
11864;SOYOMBO SUBJOINED LETTER GALIG DHA;Mn;0;NSM;;;;;N;;;;;
11865;SOYOMBO SUBJOINED LETTER GALIG BA;Mn;0;NSM;;;;;N;;;;;
11866;SOYOMBO SUBJOINED LETTER GALIG BHA;Mn;0;NSM;;;;;N;;;;;
11867;SOYOMBO SUBJOINED LETTER GALIG SSA;Mn;0;NSM;;;;;N;;;;;
11868;SOYOMBO SUBJOINED LETTER GALIG TSA;Mn;0;NSM;;;;;N;;;;;
11869;SOYOMBO SUBJOINED LETTER GALIG TSHA;Mn;0;NSM;;;;;N;;;;;
1186A;SOYOMBO SUBJOINED LETTER GALIG DZA;Mn;0;NSM;;;;;N;;;;;
1186B;SOYOMBO SUBJOINED LETTER GALIG ZHA;Mn;0;NSM;;;;;N;;;;;
1186C;SOYOMBO SUBJOINED LETTER GALIG ZA;Mn;0;NSM;;;;;N;;;;;
1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
1186F;SOYOMBO DOUBLE DANDA;Po;0;L;;;;;N;;;;;
11870;SOYOMBO TSHEG;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11872;SOYOMBO TERMINAL MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
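The character properties above can be read mechanically. The following Python sketch assumes the records follow the semicolon-separated field layout of UnicodeData.txt (code point, name, general category, canonical combining class, bidi class, and so on, with the mirrored flag in the tenth field); that layout is an assumption about the listing, and the helper name `parse_record` is hypothetical. Only four sample records are included.

```python
from collections import Counter

# Four sample records, assumed to be in UnicodeData.txt field order,
# using the property values listed in Section 6.1.
SAMPLE = """\
11800;SOYOMBO LETTER A;Lo;0;L;;;;;N;;;;;
11801;SOYOMBO VOWEL SIGN I;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
"""


def parse_record(line):
    """Split one record into the fields that the proposal actually fills in."""
    fields = line.split(";")
    return {
        "code": int(fields[0], 16),   # code point as an integer
        "name": fields[1],
        "gc": fields[2],              # general category
        "ccc": int(fields[3]),        # canonical combining class
        "bidi": fields[4],            # bidirectional class
        "mirrored": fields[9],        # "N" for every proposed character
    }


records = [parse_record(line) for line in SAMPLE.splitlines() if line]
by_gc = Counter(r["gc"] for r in records)

for r in records:
    print(f"{r['code']:05X} {r['gc']:<2} {r['bidi']:<3} {r['name']}")
```

Feeding the full listing through the same function would give a per-category count that can be compared against the number of characters stated in the proposal summary form.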

6.2 Linebreaking Properties

Linebreaking properties are given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
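As a rough illustration of how the linebreaking data could be consumed, the Python sketch below parses entries in the LineBreak.txt style (a single code point or a `first..last` range, a semicolon, the class, and an optional `#` comment) and looks up the class of a code point. The function names `parse_linebreak` and `line_break_class` are hypothetical, and returning `XX` for unlisted code points is an assumption of this sketch.

```python
# The proposed linebreaking assignments, written in LineBreak.txt style.
LINEBREAK = """\
11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
"""


def parse_linebreak(text):
    """Return a list of (first, last, line_break_class) tuples."""
    table = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop the trailing comment
        if not line:
            continue
        rng, cls = (part.strip() for part in line.split(";"))
        first, _, last = rng.partition("..")
        table.append((int(first, 16), int(last or first, 16), cls))
    return table


def line_break_class(cp, table, default="XX"):
    """Look up the class of cp; unlisted code points get the assumed default."""
    for lo, hi, cls in table:
        if lo <= cp <= hi:
            return cls
    return default


TABLE = parse_linebreak(LINEBREAK)
print(line_break_class(0x11805, TABLE))  # a vowel sign
print(line_break_class(0x11871, TABLE))  # the head mark
```

A linear scan is enough for twelve ranges; a sorted table with binary search would be the usual choice for the full Unicode data file.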

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав) [title in Soyombo script]. Ed. by T. Bulgan. Ulaanbaatar: Shambala Association; System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. "Soyombo for LaTeX", v. 0.9. September 2, 1996. httpuserpagefu-berlinde~corffimSoyombooverviewSoyombohtml

"Histoire du livre". 2010. "Histoire du livre et histoire de l'écriture: l'écriture entre pratique, symbole et économie" (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. "Mongol Scripts". httpwwwviahistoriacomSilverHordemainhtmlresearchMongolScriptshtml

Mongolwiki. 2008. "Сайн чанарын зураг байхгүй байна". http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. "Addition of Soyombo Script". N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. "Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646". N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. "Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646". N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. "Determining the Encoding Model for Soyombo Vowels". N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. "Revised Preliminary Proposal to Encode Soyombo in the UCS". N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. "Soyombo and Pagba (old Mongol scripts)". N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. "1000 Tugriks - Recto". http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. "Flag of Mongolia". http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. "Coat of Arms of Mongolia". http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. "Green Tara poem typed in Soyombo script". http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

Printed using UniBook™ (http://www.unicode.org/unibook/). Date: 28-Oct-2011.

[Code chart: Soyombo, 11800 to 1187F. Columns 1180 to 1187, rows 0 to F; code points 11800 to 11873 are assigned. Glyphs are not reproduced here.]

Figure 1: Proposed code chart for Soyombo

Vowel carrier
11800 SOYOMBO LETTER A

Vowel signs
11801 SOYOMBO VOWEL SIGN I
11802 SOYOMBO VOWEL SIGN UE
11803 SOYOMBO VOWEL SIGN U
11804 SOYOMBO VOWEL SIGN E
11805 SOYOMBO VOWEL SIGN O
11806 SOYOMBO VOWEL SIGN OE
11807 SOYOMBO VOWEL SIGN AU
11808 SOYOMBO VOWEL SIGN AI

Vowel length mark
11809 SOYOMBO VOWEL LENGTH MARK

Consonants
1180A SOYOMBO LETTER GA
  • Used for Sanskrit ka
1180B SOYOMBO LETTER KA
  • Used for Sanskrit kha
1180C SOYOMBO LETTER NGA
1180D SOYOMBO LETTER JA
  • Used for Sanskrit ca
1180E SOYOMBO LETTER CA
  • Used for Sanskrit cha
1180F SOYOMBO LETTER NYA
11810 SOYOMBO LETTER DA
  • Used for Sanskrit ta
11811 SOYOMBO LETTER TA
  • Used for Sanskrit tha
11812 SOYOMBO LETTER NA
11813 SOYOMBO LETTER BA
  • Used for Sanskrit pa
11814 SOYOMBO LETTER PA
  • Used for Sanskrit pha
11815 SOYOMBO LETTER MA
11816 SOYOMBO LETTER YA
11817 SOYOMBO LETTER RA
11818 SOYOMBO LETTER VA
11819 SOYOMBO LETTER LA
1181A SOYOMBO LETTER SHA
1181B SOYOMBO LETTER SA
1181C SOYOMBO LETTER HA

Final consonant signs
1181D SOYOMBO CONSONANT SIGN G
1181E SOYOMBO CONSONANT SIGN K
1181F SOYOMBO CONSONANT SIGN NG
11820 SOYOMBO CONSONANT SIGN D
11821 SOYOMBO CONSONANT SIGN N
11822 SOYOMBO CONSONANT SIGN B
11823 SOYOMBO CONSONANT SIGN M
11824 SOYOMBO CONSONANT SIGN R
11825 SOYOMBO CONSONANT SIGN L
11826 SOYOMBO CONSONANT SIGN SH
11827 SOYOMBO CONSONANT SIGN S
11828 SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829 SOYOMBO LETTER GALIG GA
1182A SOYOMBO LETTER GALIG GHA
1182B SOYOMBO LETTER GALIG JA
1182C SOYOMBO LETTER GALIG JHA
1182D SOYOMBO LETTER GALIG TTA
1182E SOYOMBO LETTER GALIG TTHA
1182F SOYOMBO LETTER GALIG DDA
11830 SOYOMBO LETTER GALIG DDHA
11831 SOYOMBO LETTER GALIG NNA
11832 SOYOMBO LETTER GALIG DA
11833 SOYOMBO LETTER GALIG DHA
11834 SOYOMBO LETTER GALIG BA
11835 SOYOMBO LETTER GALIG BHA
11836 SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837 SOYOMBO LETTER GALIG TSA
11838 SOYOMBO LETTER GALIG TSHA
11839 SOYOMBO LETTER GALIG DZA
1183A SOYOMBO LETTER GALIG ZHA
1183B SOYOMBO LETTER GALIG ZA
1183C SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E SOYOMBO SIGN VOCALIC R
1183F SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840 SOYOMBO SIGN ANUSVARA
11841 SOYOMBO SIGN VISARGA

Gemination mark
11842 SOYOMBO GEMINATION MARK

Prefixed letters
11843 SOYOMBO PREFIXED LETTER LA
11844 SOYOMBO PREFIXED LETTER SHA
11845 SOYOMBO PREFIXED LETTER SA
11846 SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847 SOYOMBO SUBJOINED LETTER GA
11848 SOYOMBO SUBJOINED LETTER KA
11849 SOYOMBO SUBJOINED LETTER NGA
1184A SOYOMBO SUBJOINED LETTER JA
1184B SOYOMBO SUBJOINED LETTER CA
1184C SOYOMBO SUBJOINED LETTER NYA
1184D SOYOMBO SUBJOINED LETTER DA
1184E SOYOMBO SUBJOINED LETTER TA
1184F SOYOMBO SUBJOINED LETTER NA
11850 SOYOMBO SUBJOINED LETTER BA
11851 SOYOMBO SUBJOINED LETTER PA
11852 SOYOMBO SUBJOINED LETTER MA
11853 SOYOMBO SUBJOINED LETTER YA
11854 SOYOMBO SUBJOINED LETTER RA
11855 SOYOMBO SUBJOINED LETTER VA
11856 SOYOMBO SUBJOINED LETTER LA
11857 SOYOMBO SUBJOINED LETTER SHA
11858 SOYOMBO SUBJOINED LETTER SA
11859 SOYOMBO SUBJOINED LETTER HA
1185A SOYOMBO SUBJOINED LETTER GALIG GA
1185B SOYOMBO SUBJOINED LETTER GALIG GHA
1185C SOYOMBO SUBJOINED LETTER GALIG JA
1185D SOYOMBO SUBJOINED LETTER GALIG JHA
1185E SOYOMBO SUBJOINED LETTER GALIG TTA
1185F SOYOMBO SUBJOINED LETTER GALIG TTHA
11860 SOYOMBO SUBJOINED LETTER GALIG DDA
11861 SOYOMBO SUBJOINED LETTER GALIG DDHA
11862 SOYOMBO SUBJOINED LETTER GALIG NNA
11863 SOYOMBO SUBJOINED LETTER GALIG DA
11864 SOYOMBO SUBJOINED LETTER GALIG DHA
11865 SOYOMBO SUBJOINED LETTER GALIG BA
11866 SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3)

11867 SOYOMBO SUBJOINED LETTER GALIG SSA
11868 SOYOMBO SUBJOINED LETTER GALIG TSA
11869 SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A SOYOMBO SUBJOINED LETTER GALIG DZA
1186B SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C SOYOMBO SUBJOINED LETTER GALIG ZA
1186D SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E SOYOMBO DANDA
1186F SOYOMBO DOUBLE DANDA
11870 SOYOMBO TSHEG
  → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871 SOYOMBO HEAD MARK
11872 SOYOMBO TERMINAL MARK

Symbol
11873 SOYOMBO SYMBOL SVAYAMBHU
  • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2)

[Diagram labels: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996).

[Diagram labels: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo.

Figure 6: Photograph of a chart of the Soyombo script (from "Histoire du livre" 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative
1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General
1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document): A-Contemporary B.1-Specialized (small collection) B.2-Specialized (large collection) X C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification
1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

Page 9: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

7176471765 rarr 7176471765 71721 71764 71765

This could be handled by creating glyphs for subjoined letters that contain a segment of the frame stem eg 71765 for 71765 etc

6 Character Data

61 Character Properties

11800SOYOMBO LETTER ALo0LN11801SOYOMBO VOWEL SIGN IMn0NSMN11802SOYOMBO VOWEL SIGN UMn0NSMN11803SOYOMBO VOWEL SIGN UEMn0NSMN11804SOYOMBO VOWEL SIGN EMn0NSMN11805SOYOMBO VOWEL SIGN OMn0NSMN11806SOYOMBO VOWEL SIGN OEMn0NSMN11807SOYOMBO VOWEL SIGN AUMn0NSMN11808SOYOMBO VOWEL SIGN AIMn0NSMN11809SOYOMBO VOWEL LENGTH MARKMn0NSMN1180ASOYOMBO LETTER GALo0LN1180BSOYOMBO LETTER KALo0LN1180CSOYOMBO LETTER NGALo0LN1180DSOYOMBO LETTER JALo0LN1180ESOYOMBO LETTER CALo0LN1180FSOYOMBO LETTER NYALo0LN11810SOYOMBO LETTER DALo0LN11811SOYOMBO LETTER TALo0LN11812SOYOMBO LETTER NALo0LN11813SOYOMBO LETTER BALo0LN11814SOYOMBO LETTER PALo0LN11815SOYOMBO LETTER MALo0LN11816SOYOMBO LETTER YALo0LN11817SOYOMBO LETTER RALo0LN11818SOYOMBO LETTER VALo0LN11819SOYOMBO LETTER LALo0LN1181ASOYOMBO LETTER SHALo0LN1181BSOYOMBO LETTER SALo0LN1181CSOYOMBO LETTER HALo0LN1181DSOYOMBO CONSONANT SIGN GMn0NSMN1181ESOYOMBO CONSONANT SIGN KMn0NSMN1181FSOYOMBO CONSONANT SIGN NGMn0NSMN11820SOYOMBO CONSONANT SIGN DMn0NSMN11821SOYOMBO CONSONANT SIGN NMn0NSMN11822SOYOMBO CONSONANT SIGN BMn0NSMN11823SOYOMBO CONSONANT SIGN MMn0NSMN11824SOYOMBO CONSONANT SIGN RMn0NSMN11825SOYOMBO CONSONANT SIGN LMn0NSMN11826SOYOMBO CONSONANT SIGN SHMn0NSMN11827SOYOMBO CONSONANT SIGN SMn0NSMN11828SOYOMBO CONSONANT SIGN ANGMn0NSMN11829SOYOMBO LETTER GALIG GALo0LN1182ASOYOMBO LETTER GALIG GHALo0LN1182BSOYOMBO LETTER GALIG JALo0LN1182CSOYOMBO LETTER GALIG JHALo0LN1182DSOYOMBO LETTER GALIG TTALo0LN1182ESOYOMBO LETTER GALIG TTHALo0LN1182FSOYOMBO LETTER GALIG DDALo0LN

9

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

11830;SOYOMBO LETTER GALIG DDHA;Lo;0;L;;;;;N;;;;;
11831;SOYOMBO LETTER GALIG NNA;Lo;0;L;;;;;N;;;;;
11832;SOYOMBO LETTER GALIG DA;Lo;0;L;;;;;N;;;;;
11833;SOYOMBO LETTER GALIG DHA;Lo;0;L;;;;;N;;;;;
11834;SOYOMBO LETTER GALIG BA;Lo;0;L;;;;;N;;;;;
11835;SOYOMBO LETTER GALIG BHA;Lo;0;L;;;;;N;;;;;
11836;SOYOMBO LETTER GALIG SSA;Lo;0;L;;;;;N;;;;;
11837;SOYOMBO LETTER GALIG TSA;Lo;0;L;;;;;N;;;;;
11838;SOYOMBO LETTER GALIG TSHA;Lo;0;L;;;;;N;;;;;
11839;SOYOMBO LETTER GALIG DZA;Lo;0;L;;;;;N;;;;;
1183A;SOYOMBO LETTER GALIG ZHA;Lo;0;L;;;;;N;;;;;
1183B;SOYOMBO LETTER GALIG ZA;Lo;0;L;;;;;N;;;;;
1183C;SOYOMBO LETTER GALIG SMALL A;Lo;0;L;;;;;N;;;;;
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
1183E;SOYOMBO VOWEL SIGN VOCALIC R;Mn;0;NSM;;;;;N;;;;;
1183F;SOYOMBO VOWEL SIGN VOCALIC L;Mn;0;NSM;;;;;N;;;;;
11840;SOYOMBO SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
11842;SOYOMBO GEMINATION SIGN;Mn;0;NSM;;;;;N;;;;;
11843;SOYOMBO PREFIXED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11844;SOYOMBO PREFIXED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11845;SOYOMBO PREFIXED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11846;SOYOMBO PREFIXED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11847;SOYOMBO SUBJOINED LETTER GA;Mn;0;NSM;;;;;N;;;;;
11848;SOYOMBO SUBJOINED LETTER KA;Mn;0;NSM;;;;;N;;;;;
11849;SOYOMBO SUBJOINED LETTER NGA;Mn;0;NSM;;;;;N;;;;;
1184A;SOYOMBO SUBJOINED LETTER JA;Mn;0;NSM;;;;;N;;;;;
1184B;SOYOMBO SUBJOINED LETTER CA;Mn;0;NSM;;;;;N;;;;;
1184C;SOYOMBO SUBJOINED LETTER NYA;Mn;0;NSM;;;;;N;;;;;
1184D;SOYOMBO SUBJOINED LETTER DA;Mn;0;NSM;;;;;N;;;;;
1184E;SOYOMBO SUBJOINED LETTER TA;Mn;0;NSM;;;;;N;;;;;
1184F;SOYOMBO SUBJOINED LETTER NA;Mn;0;NSM;;;;;N;;;;;
11850;SOYOMBO SUBJOINED LETTER BA;Mn;0;NSM;;;;;N;;;;;
11851;SOYOMBO SUBJOINED LETTER PA;Mn;0;NSM;;;;;N;;;;;
11852;SOYOMBO SUBJOINED LETTER MA;Mn;0;NSM;;;;;N;;;;;
11853;SOYOMBO SUBJOINED LETTER YA;Mn;0;NSM;;;;;N;;;;;
11854;SOYOMBO SUBJOINED LETTER RA;Mn;0;NSM;;;;;N;;;;;
11855;SOYOMBO SUBJOINED LETTER VA;Mn;0;NSM;;;;;N;;;;;
11856;SOYOMBO SUBJOINED LETTER LA;Mn;0;NSM;;;;;N;;;;;
11857;SOYOMBO SUBJOINED LETTER SHA;Mn;0;NSM;;;;;N;;;;;
11858;SOYOMBO SUBJOINED LETTER SA;Mn;0;NSM;;;;;N;;;;;
11859;SOYOMBO SUBJOINED LETTER HA;Mn;0;NSM;;;;;N;;;;;
1185A;SOYOMBO SUBJOINED LETTER GALIG GA;Mn;0;NSM;;;;;N;;;;;
1185B;SOYOMBO SUBJOINED LETTER GALIG GHA;Mn;0;NSM;;;;;N;;;;;
1185C;SOYOMBO SUBJOINED LETTER GALIG JA;Mn;0;NSM;;;;;N;;;;;
1185D;SOYOMBO SUBJOINED LETTER GALIG JHA;Mn;0;NSM;;;;;N;;;;;
1185E;SOYOMBO SUBJOINED LETTER GALIG TTA;Mn;0;NSM;;;;;N;;;;;
1185F;SOYOMBO SUBJOINED LETTER GALIG TTHA;Mn;0;NSM;;;;;N;;;;;
11860;SOYOMBO SUBJOINED LETTER GALIG DDA;Mn;0;NSM;;;;;N;;;;;
11861;SOYOMBO SUBJOINED LETTER GALIG DDHA;Mn;0;NSM;;;;;N;;;;;
11862;SOYOMBO SUBJOINED LETTER GALIG NNA;Mn;0;NSM;;;;;N;;;;;
11863;SOYOMBO SUBJOINED LETTER GALIG DA;Mn;0;NSM;;;;;N;;;;;
11864;SOYOMBO SUBJOINED LETTER GALIG DHA;Mn;0;NSM;;;;;N;;;;;
11865;SOYOMBO SUBJOINED LETTER GALIG BA;Mn;0;NSM;;;;;N;;;;;
11866;SOYOMBO SUBJOINED LETTER GALIG BHA;Mn;0;NSM;;;;;N;;;;;
11867;SOYOMBO SUBJOINED LETTER GALIG SSA;Mn;0;NSM;;;;;N;;;;;
11868;SOYOMBO SUBJOINED LETTER GALIG TSA;Mn;0;NSM;;;;;N;;;;;
11869;SOYOMBO SUBJOINED LETTER GALIG TSHA;Mn;0;NSM;;;;;N;;;;;
1186A;SOYOMBO SUBJOINED LETTER GALIG DZA;Mn;0;NSM;;;;;N;;;;;
1186B;SOYOMBO SUBJOINED LETTER GALIG ZHA;Mn;0;NSM;;;;;N;;;;;
1186C;SOYOMBO SUBJOINED LETTER GALIG ZA;Mn;0;NSM;;;;;N;;;;;
1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
1186F;SOYOMBO DOUBLE DANDA;Po;0;L;;;;;N;;;;;
11870;SOYOMBO TSHEG;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11872;SOYOMBO TERMINAL MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
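Records in the UnicodeData.txt format are semicolon-delimited with fifteen fields, of which the first five carry the code point, name, general category, canonical combining class, and bidirectional class. The following is a minimal sketch in Python of how such records could be read; it is not part of the proposal. The names `CharRecord`, `parse_record`, and `SAMPLE` are hypothetical, and the code points are the proposed values listed in this document.

```python
from typing import NamedTuple


class CharRecord(NamedTuple):
    """The first five fields of a UnicodeData.txt-style record (hypothetical helper type)."""
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str


def parse_record(line: str) -> CharRecord:
    """Parse one semicolon-delimited record; the remaining ten fields are ignored."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return CharRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


# A few records copied from the proposed property data (proposed code points).
SAMPLE = """\
1183D;SOYOMBO LETTER KSSA;Lo;0;L;;;;;N;;;;;
11840;SOYOMBO SIGN ANUSVARA;Mn;0;NSM;;;;;N;;;;;
11841;SOYOMBO SIGN VISARGA;Mc;0;L;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
"""

records = [parse_record(line) for line in SAMPLE.splitlines()]

# Names of the nonspacing marks (general category Mn) in the sample.
nonspacing = [r.name for r in records if r.general_category == "Mn"]
```

Filtering on `general_category` in this way separates the base letters (Lo) from the combining signs (Mn, Mc) without relying on character names.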

6.2 Linebreaking Properties

Linebreaking properties are given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
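Data in the LineBreak.txt format pairs a code point or a `first..last` range with a line-breaking class, followed by a `#` comment. As a rough illustration, the sketch below (Python, not part of the proposal) loads the proposed assignments and looks up the class of a code point. The function names `parse_line_break` and `line_break_class` are hypothetical, and returning `XX` (Unknown) for code points outside the listed ranges is an assumption of this sketch.

```python
def parse_line_break(text):
    """Parse LineBreak.txt-style lines into (first, last, class) tuples."""
    table = []
    for raw in text.splitlines():
        data = raw.split("#", 1)[0].strip()  # drop the trailing comment
        if not data:
            continue
        code_range, lb_class = (part.strip() for part in data.split(";"))
        first, _, last = code_range.partition("..")
        # A single code point is stored as a one-element range.
        table.append((int(first, 16), int(last or first, 16), lb_class))
    return table


def line_break_class(table, code_point, default="XX"):
    """Return the class of code_point, or the default if no range covers it."""
    for first, last, lb_class in table:
        if first <= code_point <= last:
            return lb_class
    return default


# Proposed assignments from this section (proposed code points).
SOYOMBO_LINE_BREAK = """\
11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION SIGN
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
"""

table = parse_line_break(SOYOMBO_LINE_BREAK)
```

With this table, `line_break_class(table, 0x11871)` yields `BB` for the head mark, which permits a break before it, while the danda, double danda, tsheg, and terminal mark yield `BA`, which permits a break after them.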

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic]. (Түвэд монгол бичгийн эхийг орчуулан хавсаргав) [title also given in Soyombo script]. Ed. by T. Bulgan. Ulaanbaatar: Shambala Association, System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”, v. 0.9. September 2, 1996. http://userpage.fu-berlin.de/~corff/im/Soyombo/overview.Soyombo.html

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie” (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. http://www.viahistoria.com/SilverHorde/main.html?research/MongolScripts.html

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.


Printed using UniBook™ (http://www.unicode.org/unibook/)
Date: 28-Oct-2011

[Code chart: Soyombo, 11800..1187F. Columns 1180 through 1187, rows 0 through F; characters are assigned at 11800 through 11873.]

Figure 1: Proposed code chart for Soyombo
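A code chart of this kind places each character in the column labelled by all but the last hexadecimal digit of its code point and in the row labelled by the last digit. The short Python sketch below illustrates that mapping; the function names and the two range constants are hypothetical, with the constants taken from the proposed code points in this document.

```python
FIRST_ASSIGNED = 0x11800  # first proposed code point in this document
LAST_ASSIGNED = 0x11873   # last proposed code point in this document


def chart_cell(code_point: int) -> tuple[str, str]:
    """Return the (column label, row label) of a code point in a 16-row chart."""
    column = f"{code_point >> 4:X}"  # all but the last hex digit
    row = f"{code_point & 0xF:X}"    # the last hex digit
    return column, row


def is_assigned(code_point: int) -> bool:
    """True if the code point falls inside the proposed assigned range."""
    return FIRST_ASSIGNED <= code_point <= LAST_ASSIGNED
```

For example, `chart_cell(0x1183D)` gives column `1183` and row `D`, the cell occupied by SOYOMBO LETTER KSSA.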


Vowel carrier
11800  SOYOMBO LETTER A

Vowel signs
11801  ◌ SOYOMBO VOWEL SIGN I
11802  ◌ SOYOMBO VOWEL SIGN UE
11803  ◌ SOYOMBO VOWEL SIGN U
11804  ◌ SOYOMBO VOWEL SIGN E
11805  ◌ SOYOMBO VOWEL SIGN O
11806  ◌ SOYOMBO VOWEL SIGN OE
11807  ◌ SOYOMBO VOWEL SIGN AU
11808  ◌ SOYOMBO VOWEL SIGN AI

Vowel length mark
11809  ◌ SOYOMBO VOWEL LENGTH MARK

Consonants
1180A  SOYOMBO LETTER GA
       • Used for Sanskrit ka
1180B  SOYOMBO LETTER KA
       • Used for Sanskrit kha
1180C  SOYOMBO LETTER NGA
1180D  SOYOMBO LETTER JA
       • Used for Sanskrit ca
1180E  SOYOMBO LETTER CA
       • Used for Sanskrit cha
1180F  SOYOMBO LETTER NYA
11810  SOYOMBO LETTER DA
       • Used for Sanskrit ta
11811  SOYOMBO LETTER TA
       • Used for Sanskrit tha
11812  SOYOMBO LETTER NA
11813  SOYOMBO LETTER BA
       • Used for Sanskrit pa
11814  SOYOMBO LETTER PA
       • Used for Sanskrit pha
11815  SOYOMBO LETTER MA
11816  SOYOMBO LETTER YA
11817  SOYOMBO LETTER RA
11818  SOYOMBO LETTER VA
11819  SOYOMBO LETTER LA
1181A  SOYOMBO LETTER SHA
1181B  SOYOMBO LETTER SA
1181C  SOYOMBO LETTER HA

Final consonant signs
1181D  ◌ SOYOMBO CONSONANT SIGN G
1181E  ◌ SOYOMBO CONSONANT SIGN K
1181F  ◌ SOYOMBO CONSONANT SIGN NG
11820  ◌ SOYOMBO CONSONANT SIGN D
11821  ◌ SOYOMBO CONSONANT SIGN N
11822  ◌ SOYOMBO CONSONANT SIGN B
11823  ◌ SOYOMBO CONSONANT SIGN M
11824  ◌ SOYOMBO CONSONANT SIGN R
11825  ◌ SOYOMBO CONSONANT SIGN L
11826  ◌ SOYOMBO CONSONANT SIGN SH
11827  ◌ SOYOMBO CONSONANT SIGN S
11828  ◌ SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829  SOYOMBO LETTER GALIG GA
1182A  SOYOMBO LETTER GALIG GHA
1182B  SOYOMBO LETTER GALIG JA
1182C  SOYOMBO LETTER GALIG JHA
1182D  SOYOMBO LETTER GALIG TTA
1182E  SOYOMBO LETTER GALIG TTHA
1182F  SOYOMBO LETTER GALIG DDA
11830  SOYOMBO LETTER GALIG DDHA
11831  SOYOMBO LETTER GALIG NNA
11832  SOYOMBO LETTER GALIG DA
11833  SOYOMBO LETTER GALIG DHA
11834  SOYOMBO LETTER GALIG BA
11835  SOYOMBO LETTER GALIG BHA
11836  SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837  SOYOMBO LETTER GALIG TSA
11838  SOYOMBO LETTER GALIG TSHA
11839  SOYOMBO LETTER GALIG DZA
1183A  SOYOMBO LETTER GALIG ZHA
1183B  SOYOMBO LETTER GALIG ZA
1183C  SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D  SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E  ◌ SOYOMBO SIGN VOCALIC R
1183F  ◌ SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840  ◌ SOYOMBO SIGN ANUSVARA
11841  ◌ SOYOMBO SIGN VISARGA

Gemination mark
11842  ◌ SOYOMBO GEMINATION MARK

Prefixed letters
11843  ◌ SOYOMBO PREFIXED LETTER LA
11844  ◌ SOYOMBO PREFIXED LETTER SHA
11845  ◌ SOYOMBO PREFIXED LETTER SA
11846  ◌ SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847  ◌ SOYOMBO SUBJOINED LETTER GA
11848  ◌ SOYOMBO SUBJOINED LETTER KA
11849  ◌ SOYOMBO SUBJOINED LETTER NGA
1184A  ◌ SOYOMBO SUBJOINED LETTER JA
1184B  ◌ SOYOMBO SUBJOINED LETTER CA
1184C  ◌ SOYOMBO SUBJOINED LETTER NYA
1184D  ◌ SOYOMBO SUBJOINED LETTER DA
1184E  ◌ SOYOMBO SUBJOINED LETTER TA
1184F  ◌ SOYOMBO SUBJOINED LETTER NA
11850  ◌ SOYOMBO SUBJOINED LETTER BA
11851  ◌ SOYOMBO SUBJOINED LETTER PA
11852  ◌ SOYOMBO SUBJOINED LETTER MA
11853  ◌ SOYOMBO SUBJOINED LETTER YA
11854  ◌ SOYOMBO SUBJOINED LETTER RA
11855  ◌ SOYOMBO SUBJOINED LETTER VA
11856  ◌ SOYOMBO SUBJOINED LETTER LA
11857  ◌ SOYOMBO SUBJOINED LETTER SHA
11858  ◌ SOYOMBO SUBJOINED LETTER SA
11859  ◌ SOYOMBO SUBJOINED LETTER HA
1185A  ◌ SOYOMBO SUBJOINED LETTER GALIG GA
1185B  ◌ SOYOMBO SUBJOINED LETTER GALIG GHA
1185C  ◌ SOYOMBO SUBJOINED LETTER GALIG JA
1185D  ◌ SOYOMBO SUBJOINED LETTER GALIG JHA
1185E  ◌ SOYOMBO SUBJOINED LETTER GALIG TTA
1185F  ◌ SOYOMBO SUBJOINED LETTER GALIG TTHA
11860  ◌ SOYOMBO SUBJOINED LETTER GALIG DDA
11861  ◌ SOYOMBO SUBJOINED LETTER GALIG DDHA
11862  ◌ SOYOMBO SUBJOINED LETTER GALIG NNA
11863  ◌ SOYOMBO SUBJOINED LETTER GALIG DA
11864  ◌ SOYOMBO SUBJOINED LETTER GALIG DHA
11865  ◌ SOYOMBO SUBJOINED LETTER GALIG BA
11866  ◌ SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3)


11867  ◌ SOYOMBO SUBJOINED LETTER GALIG SSA
11868  ◌ SOYOMBO SUBJOINED LETTER GALIG TSA
11869  ◌ SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A  ◌ SOYOMBO SUBJOINED LETTER GALIG DZA
1186B  ◌ SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C  ◌ SOYOMBO SUBJOINED LETTER GALIG ZA
1186D  ◌ SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E  SOYOMBO DANDA
1186F  SOYOMBO DOUBLE DANDA
11870  SOYOMBO TSHEG
       → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871  SOYOMBO HEAD MARK
11872  SOYOMBO TERMINAL MARK

Symbol
11873  SOYOMBO SYMBOL SVAYAMBHU
       • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2)

[Diagram labels: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996).

[Diagram labels: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo.

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 ¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey ([email protected])
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document): A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey ([email protected])
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <[email protected]>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs.
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

Page 10: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

11830SOYOMBO LETTER GALIG DDHALo0LN11831SOYOMBO LETTER GALIG NNALo0LN11832SOYOMBO LETTER GALIG DALo0LN11833SOYOMBO LETTER GALIG DHALo0LN11834SOYOMBO LETTER GALIG BALo0LN11835SOYOMBO LETTER GALIG BHALo0LN11836SOYOMBO LETTER GALIG SSALo0LN11837SOYOMBO LETTER GALIG TSALo0LN11838SOYOMBO LETTER GALIG TSHALo0LN11839SOYOMBO LETTER GALIG DZALo0LN1183ASOYOMBO LETTER GALIG ZHALo0LN1183BSOYOMBO LETTER GALIG ZALo0LN1183CSOYOMBO LETTER GALIG SMALL ALo0LN1183DSOYOMBO LETTER KSSALo0LN1183ESOYOMBO VOWEL SIGN VOCALIC RMn0NSMN1183FSOYOMBO VOWEL SIGN VOCALIC LMn0NSMN11840SOYOMBO SIGN ANUSVARAMn0NSMN11841SOYOMBO SIGN VISARGAMc0LN11842SOYOMBO GEMINATION SIGNMn0NSMN11843SOYOMBO PREFIXED LETTER LAMn0NSMN11844SOYOMBO PREFIXED LETTER SHAMn0NSMN11845SOYOMBO PREFIXED LETTER SAMn0NSMN11846SOYOMBO PREFIXED LETTER RAMn0NSMN11847SOYOMBO SUBJOINED LETTER GAMn0NSMN11848SOYOMBO SUBJOINED LETTER KAMn0NSMN11849SOYOMBO SUBJOINED LETTER NGAMn0NSMN1184ASOYOMBO SUBJOINED LETTER JAMn0NSMN1184BSOYOMBO SUBJOINED LETTER CAMn0NSMN1184CSOYOMBO SUBJOINED LETTER NYAMn0NSMN1184DSOYOMBO SUBJOINED LETTER DAMn0NSMN1184ESOYOMBO SUBJOINED LETTER TAMn0NSMN1184FSOYOMBO SUBJOINED LETTER NAMn0NSMN11850SOYOMBO SUBJOINED LETTER BAMn0NSMN11851SOYOMBO SUBJOINED LETTER PAMn0NSMN11852SOYOMBO SUBJOINED LETTER MAMn0NSMN11853SOYOMBO SUBJOINED LETTER YAMn0NSMN11854SOYOMBO SUBJOINED LETTER RAMn0NSMN11855SOYOMBO SUBJOINED LETTER VAMn0NSMN11856SOYOMBO SUBJOINED LETTER LAMn0NSMN11857SOYOMBO SUBJOINED LETTER SHAMn0NSMN11858SOYOMBO SUBJOINED LETTER SAMn0NSMN11859SOYOMBO SUBJOINED LETTER HAMn0NSMN1185ASOYOMBO SUBJOINED LETTER GALIG GAMn0NSMN1185BSOYOMBO SUBJOINED LETTER GALIG GHAMn0NSMN1185CSOYOMBO SUBJOINED LETTER GALIG JAMn0NSMN1185DSOYOMBO SUBJOINED LETTER GALIG JHAMn0NSMN1185ESOYOMBO SUBJOINED LETTER GALIG TTAMn0NSMN1185FSOYOMBO SUBJOINED LETTER GALIG TTHAMn0NSMN11860SOYOMBO SUBJOINED LETTER GALIG DDAMn0NSMN11861SOYOMBO SUBJOINED LETTER GALIG DDHAMn0NSMN11862SOYOMBO SUBJOINED LETTER GALIG NNAMn0NSMN11863SOYOMBO SUBJOINED LETTER GALIG 
DAMn0NSMN11864SOYOMBO SUBJOINED LETTER GALIG DHAMn0NSMN11865SOYOMBO SUBJOINED LETTER GALIG BAMn0NSMN11866SOYOMBO SUBJOINED LETTER GALIG BHAMn0NSMN11867SOYOMBO SUBJOINED LETTER GALIG SSAMn0NSMN11868SOYOMBO SUBJOINED LETTER GALIG TSAMn0NSMN11869SOYOMBO SUBJOINED LETTER GALIG TSHAMn0NSMN1186ASOYOMBO SUBJOINED LETTER GALIG DZAMn0NSMN1186BSOYOMBO SUBJOINED LETTER GALIG ZHAMn0NSMN1186CSOYOMBO SUBJOINED LETTER GALIG ZAMn0NSMN

10

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

1186DSOYOMBO SUBJOINED LETTER GALIG SMALL AMn0NSMN1186ESOYOMBO DANDAPo0LN1186FSOYOMBO DOUBLE DANDAPo0LN11870SOYOMBO TSHEGPo0LN11871SOYOMBO HEAD MARKPo0ONN11872SOYOMBO TERMINAL MARKPo0ONN11873SOYOMBO SYMBOL SVAYAMBHUSo0LN

62 Linebreaking Properties

Linebreaking properties given in the data format of LineBreaktxt

11800 AL LETTER A1180111809 CM VOWEL SIGN I VOWEL LENGTH MARK1180A1181C AL GA HA1181D11828 CM CONSONANT SIGN G CONSONANT SIGN ANG118291183D AL GALIG GA KSSA1183E11842 CM SIGN VOCALIC R GEMINATION SIGN118431186D CM PREFIXED LETTER LA SUBJOINED LETTER GALIG SMALL A1186E1186F BA DANDA DOUBLE DANDA11870 BA TSHEG11871 BB HEAD MARK11872 BA TERMINAL MARK11873 AL SYMBOL SVAYAMBHU

7 References

Atwood Christopher P 2004 Encyclopedia of Mongolia and the Mongol Empire New York Facts on File

Boldsaikhan B B Batsana and Ts Oyuntsetseg 2005 Соёмбо Нууц ба Синергетик [= Secret of

Soyombo and Synergetic] (Түвэд монгол бичгийн эхийг орчуулан хавсаргав) 71793717077176571702717447173371682 71680716817168971713 7169071681716907168371694 716997168071681716987168471716716917168471697716817171071794 Ed by T Bulgan Ulaanbaatar Shambala Association System Science Research InstituteMongolian University of Science and Technology

Corff Oliver ldquoSoyombo for LATEXrdquo v 09 September 2 1996 httpuserpagefu-berlinde

~corffimSoyombooverviewSoyombohtml

ldquoHistoire du livrerdquo 2010 ldquoHistoire du livre et histoire de lrsquoeacutecriture lrsquoeacutecriture entre pratique symboleet eacuteconomierdquo (26 August 2010) httphistoire-du-livreblogspotcom201008lecriture-entre-le-symbole-et-lahtml

Kapaj Luigi 2002 ldquoMongol Scriptsrdquo httpwwwviahistoriacomSilverHordemainhtmlresearchMongolScriptshtml

Mongolwiki 2008 ldquoСайн чанарын зураг байхгүй байнаrdquo httpwikiecm-outsourcingcomindexphptitle=ДүрсSoyombo_usgiin_bargif

Mongolia and Japan 1988 ldquoAddition of Soyombo Scriptrdquo N1855 L298-358 September 21 1998 httpstddkuugdkjtc1sc2wg2docsn1855pdf

Pandey Anshuman 2010a ldquoPreliminary Proposal to Encode the Xawtaa Dorboljin Script in ISOIEC10646rdquo N3956 L210-411 October 23 2010 httpstddkuugdkjtc1sc2wg2docsn3956pdf

11

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

mdashmdashmdash 2010b ldquoPreliminary Proposal to Encode the Soyombo Script in ISOIEC 10646rdquo N3949 L210-399 October 30 2010 httpstddkuugdkjtc1sc2wg2docsn3949pdf

mdashmdashmdash 2011a ldquoDetermining the Encoding Model for Soyombo Vowelsrdquo N3986 L211-054 February 52011 httpstddkuugdkjtc1sc2wg2docsn3986pdf

mdashmdashmdash 2011b ldquoRevised Preliminary Proposal to Encode Soyombo in the UCSrdquo N4026 L211-125 April25 2011 httpstddkuugdkjtc1sc2wg2docsn4026pdf

Sato Takayuki K ldquoSoyombo and Pagba (old Mongol scripts)rdquo N2163 L200-055 January 6 2000 httpstddkuugdkjtc1sc2wg2docsn2163doc

Shagdarsuumlruumlng Tseveliin 2001 Study of Mongolian Scripts (Graphic Study or Grammatology) Enl 2nded Bibliotheca Mongolica Monograph 1 Ed by Sharaviin Choimaa Ulaanbaatar Center for MongolStudies National University of Mongolia

Wikimedia 2006 ldquo1000 Tugriks - Rectordquo httpenwikipediaorgwikiFile1000_Tugriks_-_

Rectojpg

mdashmdashmdash 2008 ldquoFlag of Mongoliardquo httpenwikipediaorgwikiFileFlag_of_Mongoliasvg

mdashmdashmdash 2009a ldquoCoat of Arms of Mongoliardquo httpenwikipediaorgwikiFileCoat_of_Arms_

of_Mongoliasvg

mdashmdashmdash 2009b ldquoGreen Tara poem typed in Soyombo scriptrdquo httpenwikipediaorgwikiFile

Green_Tara_Poem_Soyombojpg

8 Acknowledgments

I would like to thank Gyoumlrgy Kara (Indiana University Bloomington) for reviewing a draft of this proposaland for providing additional information about Soyombo

This project was made possible in part by a grant from the United States National Endowment for the Hu-manities which funded the Universal Scripts Project (part of the Script Encoding Initiative at the Universityof California Berkeley) Any views findings conclusions or recommendations expressed in this publicationdo not necessarily reflect those of the National Endowment for the Humanities

12

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 1

1187FSoyombo11800

1180 1181 1182 1183 1184 1185 1186 1187

71680$71681$71682$71683$71684$71685$71686$71687$71688$71689716907169171692716937169471695

71696716977169871699717007170171702717037170471705717067170771708$71709$71710$71711

$71712$71713$71714$71715$71716$71717$71718$71719$7172071721717227172371724717257172671727

7172871729717307173171732717337173471735717367173771738717397174071741$71742$71743

$a$71745$71746$ 71747$ 71748$ 71749$ 71750$71751$71752$71753$71754$71755$71756$71757$71758$71759

$71760$71761$71762$71763$71764$71765$71766$71767$71768$71769$71770$71771$71772$71773$71774$71775

$71776$71777$71778$71779$71780$71781$71782$71783$71784$71785$71786$71787$71788$717897179071791

71792717937179471795

11800

11801

11802

11803

11804

11805

11806

11807

11808

11809

1180A

1180B

1180C

1180D

1180E

1180F

11810

11811

11812

11813

11814

11815

11816

11817

11818

11819

1181A

1181B

1181C

1181D

1181E

1181F

11820

11821

11822

11823

11824

11825

11826

11827

11828

11829

1182A

1182B

1182C

1182D

1182E

1182F

11830

11831

11832

11833

11834

11835

11836

11837

11838

11839

1183A

1183B

1183C

1183D

1183E

1183F

11840

11841

11842

11843

11844

11845

11846

11847

11848

11849

1184A

1184B

1184C

1184D

1184E

1184F

11850

11851

11852

11853

11854

11855

11856

11857

11858

11859

1185A

1185B

1185C

1185D

1185E

1185F

11860

11861

11862

11863

11864

11865

11866

11867

11868

11869

1186A

1186B

1186C

1186D

1186E

1186F

11870

11871

11872

11873

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Figure 1 Proposed code chart for Soyombo

13

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 2

11866Soyombo11800

Vowel carrier
11800  SOYOMBO LETTER A

Vowel signs
11801  SOYOMBO VOWEL SIGN I
11802  SOYOMBO VOWEL SIGN UE
11803  SOYOMBO VOWEL SIGN U
11804  SOYOMBO VOWEL SIGN E
11805  SOYOMBO VOWEL SIGN O
11806  SOYOMBO VOWEL SIGN OE
11807  SOYOMBO VOWEL SIGN AU
11808  SOYOMBO VOWEL SIGN AI

Vowel length mark
11809  SOYOMBO VOWEL LENGTH MARK

Consonants
1180A  SOYOMBO LETTER GA
       • Used for Sanskrit ka
1180B  SOYOMBO LETTER KA
       • Used for Sanskrit kha
1180C  SOYOMBO LETTER NGA
1180D  SOYOMBO LETTER JA
       • Used for Sanskrit ca
1180E  SOYOMBO LETTER CA
       • Used for Sanskrit cha
1180F  SOYOMBO LETTER NYA
11810  SOYOMBO LETTER DA
       • Used for Sanskrit ta
11811  SOYOMBO LETTER TA
       • Used for Sanskrit tha
11812  SOYOMBO LETTER NA
11813  SOYOMBO LETTER BA
       • Used for Sanskrit pa
11814  SOYOMBO LETTER PA
       • Used for Sanskrit pha
11815  SOYOMBO LETTER MA
11816  SOYOMBO LETTER YA
11817  SOYOMBO LETTER RA
11818  SOYOMBO LETTER VA
11819  SOYOMBO LETTER LA
1181A  SOYOMBO LETTER SHA
1181B  SOYOMBO LETTER SA
1181C  SOYOMBO LETTER HA

Final consonant signs
1181D  SOYOMBO CONSONANT SIGN G
1181E  SOYOMBO CONSONANT SIGN K
1181F  SOYOMBO CONSONANT SIGN NG
11820  SOYOMBO CONSONANT SIGN D
11821  SOYOMBO CONSONANT SIGN N
11822  SOYOMBO CONSONANT SIGN B
11823  SOYOMBO CONSONANT SIGN M
11824  SOYOMBO CONSONANT SIGN R
11825  SOYOMBO CONSONANT SIGN L
11826  SOYOMBO CONSONANT SIGN SH
11827  SOYOMBO CONSONANT SIGN S
11828  SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829  SOYOMBO LETTER GALIG GA
1182A  SOYOMBO LETTER GALIG GHA
1182B  SOYOMBO LETTER GALIG JA
1182C  SOYOMBO LETTER GALIG JHA
1182D  SOYOMBO LETTER GALIG TTA
1182E  SOYOMBO LETTER GALIG TTHA
1182F  SOYOMBO LETTER GALIG DDA
11830  SOYOMBO LETTER GALIG DDHA
11831  SOYOMBO LETTER GALIG NNA
11832  SOYOMBO LETTER GALIG DA
11833  SOYOMBO LETTER GALIG DHA
11834  SOYOMBO LETTER GALIG BA
11835  SOYOMBO LETTER GALIG BHA
11836  SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837  SOYOMBO LETTER GALIG TSA
11838  SOYOMBO LETTER GALIG TSHA
11839  SOYOMBO LETTER GALIG DZA
1183A  SOYOMBO LETTER GALIG ZHA
1183B  SOYOMBO LETTER GALIG ZA
1183C  SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D  SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E  SOYOMBO SIGN VOCALIC R
1183F  SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840  SOYOMBO SIGN ANUSVARA
11841  SOYOMBO SIGN VISARGA

Gemination mark
11842  SOYOMBO GEMINATION MARK

Prefixed letters
11843  SOYOMBO PREFIXED LETTER LA
11844  SOYOMBO PREFIXED LETTER SHA
11845  SOYOMBO PREFIXED LETTER SA
11846  SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847  SOYOMBO SUBJOINED LETTER GA
11848  SOYOMBO SUBJOINED LETTER KA
11849  SOYOMBO SUBJOINED LETTER NGA
1184A  SOYOMBO SUBJOINED LETTER JA
1184B  SOYOMBO SUBJOINED LETTER CA
1184C  SOYOMBO SUBJOINED LETTER NYA
1184D  SOYOMBO SUBJOINED LETTER DA
1184E  SOYOMBO SUBJOINED LETTER TA
1184F  SOYOMBO SUBJOINED LETTER NA
11850  SOYOMBO SUBJOINED LETTER BA
11851  SOYOMBO SUBJOINED LETTER PA
11852  SOYOMBO SUBJOINED LETTER MA
11853  SOYOMBO SUBJOINED LETTER YA
11854  SOYOMBO SUBJOINED LETTER RA
11855  SOYOMBO SUBJOINED LETTER VA
11856  SOYOMBO SUBJOINED LETTER LA
11857  SOYOMBO SUBJOINED LETTER SHA
11858  SOYOMBO SUBJOINED LETTER SA
11859  SOYOMBO SUBJOINED LETTER HA
1185A  SOYOMBO SUBJOINED LETTER GALIG GA
1185B  SOYOMBO SUBJOINED LETTER GALIG GHA
1185C  SOYOMBO SUBJOINED LETTER GALIG JA
1185D  SOYOMBO SUBJOINED LETTER GALIG JHA
1185E  SOYOMBO SUBJOINED LETTER GALIG TTA
1185F  SOYOMBO SUBJOINED LETTER GALIG TTHA
11860  SOYOMBO SUBJOINED LETTER GALIG DDA
11861  SOYOMBO SUBJOINED LETTER GALIG DDHA
11862  SOYOMBO SUBJOINED LETTER GALIG NNA
11863  SOYOMBO SUBJOINED LETTER GALIG DA
11864  SOYOMBO SUBJOINED LETTER GALIG DHA
11865  SOYOMBO SUBJOINED LETTER GALIG BA
11866  SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3)

Subjoined letters (continued)
11867  SOYOMBO SUBJOINED LETTER GALIG SSA
11868  SOYOMBO SUBJOINED LETTER GALIG TSA
11869  SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A  SOYOMBO SUBJOINED LETTER GALIG DZA
1186B  SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C  SOYOMBO SUBJOINED LETTER GALIG ZA
1186D  SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E  SOYOMBO DANDA
1186F  SOYOMBO DOUBLE DANDA
11870  SOYOMBO TSHEG
       → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871  SOYOMBO HEAD MARK
11872  SOYOMBO TERMINAL MARK

Symbol
11873  SOYOMBO SYMBOL SVAYAMBHU
       • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2)
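The names list places each subjoined letter at a fixed distance from its full-form letter, which the following Python sketch makes explicit. It is an illustration only: the function name `subjoined_form` is hypothetical, the code points are those proposed in this document (they may differ from any later published allocation), and the two offsets are read off the names list rather than stated by the proposal.

```python
# Code points as proposed in this document's names list (an assumption:
# a published standard may assign different values).
BASIC_FIRST, BASIC_LAST = 0x1180A, 0x1181C   # LETTER GA .. LETTER HA
GALIG_FIRST, GALIG_LAST = 0x11829, 0x1183C   # LETTER GALIG GA .. LETTER GALIG SMALL A
SUBJOINED_BASIC_FIRST = 0x11847              # SUBJOINED LETTER GA
SUBJOINED_GALIG_FIRST = 0x1185A              # SUBJOINED LETTER GALIG GA


def subjoined_form(cp: int) -> int:
    """Return the proposed subjoined code point for a full-form consonant.

    Hypothetical helper: raises ValueError for code points that have no
    subjoined counterpart in the names list (for example LETTER KSSA).
    """
    if BASIC_FIRST <= cp <= BASIC_LAST:
        return SUBJOINED_BASIC_FIRST + (cp - BASIC_FIRST)
    if GALIG_FIRST <= cp <= GALIG_LAST:
        return SUBJOINED_GALIG_FIRST + (cp - GALIG_FIRST)
    raise ValueError(f"U+{cp:04X} has no subjoined form in the proposed repertoire")


if __name__ == "__main__":
    # LETTER GA -> SUBJOINED LETTER GA, LETTER GALIG SMALL A -> its subjoined form
    for cp in (0x1180A, 0x1183C):
        print(f"{cp:04X} -> {subjoined_form(cp):04X}")
```

The two ranges are handled separately because the final consonant signs (1181D..11828) sit between the basic and galig letters and have no subjoined counterparts.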

[Diagram components: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996)

[Diagram components: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010)

Figure 7: Characters of the Soyombo script (from Kapaj 2002)

Figure 8: Characters of the Soyombo script (from Kapaj 2002)

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152)

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133)

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134)

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135)

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136)

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137)

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138)

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139)

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140)

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141)

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142)

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143)

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144)

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145)

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146)

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147)

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148)

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153)

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008)

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173)

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b)

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378)

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378)

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008)

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a)

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006)

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection): X
   C-Major extinct
   D-Attested extinct
   E-Minor extinct
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
1186F;SOYOMBO DOUBLE DANDA;Po;0;L;;;;;N;;;;;
11870;SOYOMBO TSHEG;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11872;SOYOMBO TERMINAL MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
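As a reading aid for the property records above, here is a minimal Python sketch that splits such records into named fields. It assumes the records follow the standard 15-field, semicolon-separated layout of UnicodeData.txt; the names `CharProps` and `parse_unicode_data_line` are hypothetical, and the sample records restate the rows above with the semicolons placed according to that assumed layout.

```python
from typing import NamedTuple


class CharProps(NamedTuple):
    """Subset of the fields carried by one UnicodeData.txt-style record."""
    code_point: int
    name: str
    general_category: str
    combining_class: int
    bidi_class: str
    mirrored: bool


def parse_unicode_data_line(line: str) -> CharProps:
    """Parse one semicolon-separated record (15 fields, most of them empty here)."""
    fields = line.strip().split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return CharProps(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        combining_class=int(fields[3]),
        bidi_class=fields[4],
        mirrored=(fields[9] == "Y"),
    )


# Sample records; the semicolon placement is an assumption (see lead-in).
SAMPLE = """\
1186D;SOYOMBO SUBJOINED LETTER GALIG SMALL A;Mn;0;NSM;;;;;N;;;;;
1186E;SOYOMBO DANDA;Po;0;L;;;;;N;;;;;
11871;SOYOMBO HEAD MARK;Po;0;ON;;;;;N;;;;;
11873;SOYOMBO SYMBOL SVAYAMBHU;So;0;L;;;;;N;;;;;
"""

records = [parse_unicode_data_line(row) for row in SAMPLE.splitlines()]
# Names of the records whose general category is a mark (Mn, Mc, Me).
combining = [r.name for r in records if r.general_category.startswith("M")]
```

Filtering on the general category, as in the last line, is one way to separate the combining marks from the punctuation and symbol characters in the repertoire.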

6.2 Linebreaking Properties

Linebreaking properties are given in the data format of LineBreak.txt:

11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1180A..1181C;AL # GA..HA
1181D..11828;CM # CONSONANT SIGN G..CONSONANT SIGN ANG
11829..1183D;AL # GALIG GA..KSSA
1183E..11842;CM # SIGN VOCALIC R..GEMINATION MARK
11843..1186D;CM # PREFIXED LETTER LA..SUBJOINED LETTER GALIG SMALL A
1186E..1186F;BA # DANDA..DOUBLE DANDA
11870;BA # TSHEG
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
11873;AL # SYMBOL SVAYAMBHU
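To show how the line-breaking assignments above could be consumed, the following Python sketch parses range records in the LineBreak.txt style (`first..last;CLASS # comment`) and looks up the class of a code point. The function names are hypothetical, the sample restates a few of the rows above under the assumption that they use the usual `..` and `;` separators, and the fallback class `XX` (unknown) is this sketch's choice for code points outside the listed ranges.

```python
from typing import List, Tuple

Range = Tuple[int, int, str]


def parse_line_break(text: str) -> List[Range]:
    """Parse LineBreak.txt-style rows into (first, last, class) tuples."""
    entries: List[Range] = []
    for raw in text.splitlines():
        data = raw.split("#", 1)[0].strip()   # drop the trailing comment
        if not data:
            continue
        cp_range, lb_class = (part.strip() for part in data.split(";"))
        if ".." in cp_range:
            first, last = cp_range.split("..")
        else:
            first = last = cp_range            # single code point
        entries.append((int(first, 16), int(last, 16), lb_class))
    return entries


def line_break_class(cp: int, entries: List[Range], default: str = "XX") -> str:
    """Return the class of cp, or the default when no range covers it."""
    for first, last, lb_class in entries:
        if first <= cp <= last:
            return lb_class
    return default


# A few of the proposed rows (separators assumed, see lead-in).
SAMPLE = """\
11800;AL # LETTER A
11801..11809;CM # VOWEL SIGN I..VOWEL LENGTH MARK
1186E..1186F;BA # DANDA..DOUBLE DANDA
11871;BB # HEAD MARK
11872;BA # TERMINAL MARK
"""

entries = parse_line_break(SAMPLE)
```

With these rows, the head mark is assigned BB (break before) while the danda, double danda and terminal mark are assigned BA (break after), matching the table.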

7 References

Atwood, Christopher P. 2004. Encyclopedia of Mongolia and the Mongol Empire. New York: Facts on File.

Boldsaikhan, B., B. Batsana, and Ts. Oyuntsetseg. 2005. Соёмбо Нууц ба Синергетик [= Secret of Soyombo and Synergetic]. (Түвэд монгол бичгийн эхийг орчуулан хавсаргав). Ed. by T. Bulgan. Ulaanbaatar: Shambala Association, System Science Research Institute, Mongolian University of Science and Technology.

Corff, Oliver. “Soyombo for LaTeX”. v. 0.9. September 2, 1996. http://userpage.fu-berlin.de/~corff/im/Soyombo/overview.Soyombo.html

“Histoire du livre”. 2010. “Histoire du livre et histoire de l’écriture: l’écriture entre pratique, symbole et économie” (26 August 2010). http://histoire-du-livre.blogspot.com/2010/08/lecriture-entre-le-symbole-et-la.html

Kapaj, Luigi. 2002. “Mongol Scripts”. http://www.viahistoria.com/SilverHorde/main.html?research=MongolScripts.html

Mongolwiki. 2008. “Сайн чанарын зураг байхгүй байна”. http://wiki.ecm-outsourcing.com/index.php?title=Дүрс:Soyombo_usgiin_bar.gif

Mongolia and Japan. 1998. “Addition of Soyombo Script”. N1855 L2/98-358. September 21, 1998. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n1855.pdf

Pandey, Anshuman. 2010a. “Preliminary Proposal to Encode the Xawtaa Dorboljin Script in ISO/IEC 10646”. N3956 L2/10-411. October 23, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3956.pdf

Pandey, Anshuman. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

Pandey, Anshuman. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

Pandey, Anshuman. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

Wikimedia. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

Wikimedia. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

Wikimedia. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions, or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.

12

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 1

1187FSoyombo11800

1180 1181 1182 1183 1184 1185 1186 1187

71680$71681$71682$71683$71684$71685$71686$71687$71688$71689716907169171692716937169471695

71696716977169871699717007170171702717037170471705717067170771708$71709$71710$71711

$71712$71713$71714$71715$71716$71717$71718$71719$7172071721717227172371724717257172671727

7172871729717307173171732717337173471735717367173771738717397174071741$71742$71743

$a$71745$71746$ 71747$ 71748$ 71749$ 71750$71751$71752$71753$71754$71755$71756$71757$71758$71759

$71760$71761$71762$71763$71764$71765$71766$71767$71768$71769$71770$71771$71772$71773$71774$71775

$71776$71777$71778$71779$71780$71781$71782$71783$71784$71785$71786$71787$71788$717897179071791

71792717937179471795

11800

11801

11802

11803

11804

11805

11806

11807

11808

11809

1180A

1180B

1180C

1180D

1180E

1180F

11810

11811

11812

11813

11814

11815

11816

11817

11818

11819

1181A

1181B

1181C

1181D

1181E

1181F

11820

11821

11822

11823

11824

11825

11826

11827

11828

11829

1182A

1182B

1182C

1182D

1182E

1182F

11830

11831

11832

11833

11834

11835

11836

11837

11838

11839

1183A

1183B

1183C

1183D

1183E

1183F

11840

11841

11842

11843

11844

11845

11846

11847

11848

11849

1184A

1184B

1184C

1184D

1184E

1184F

11850

11851

11852

11853

11854

11855

11856

11857

11858

11859

1185A

1185B

1185C

1185D

1185E

1185F

11860

11861

11862

11863

11864

11865

11866

11867

11868

11869

1186A

1186B

1186C

1186D

1186E

1186F

11870

11871

11872

11873

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Figure 1 Proposed code chart for Soyombo

13

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 2

11866Soyombo11800

11830 71728 SOYOMBO LETTER GALIG DDHA11831 71729 SOYOMBO LETTER GALIG NNA11832 71730 SOYOMBO LETTER GALIG DA11833 71731 SOYOMBO LETTER GALIG DHA11834 71732 SOYOMBO LETTER GALIG BA11835 71733 SOYOMBO LETTER GALIG BHA11836 71734 SOYOMBO LETTER GALIG SSA

Additions for Tibetan11837 71735 SOYOMBO LETTER GALIG TSA11838 71736 SOYOMBO LETTER GALIG TSHA11839 71737 SOYOMBO LETTER GALIG DZA1183A 71738 SOYOMBO LETTER GALIG ZHA1183B 71739 SOYOMBO LETTER GALIG ZA1183C 71740 SOYOMBO LETTER GALIG SMALL A

Consonant conjunct1183D 71741 SOYOMBO LETTER KSSA

Vowel signs for Sanskrit1183E $71742 SOYOMBO SIGN VOCALIC R1183F $71743 SOYOMBO SIGN VOCALIC L

Signs for Sanskrit11840 $a SOYOMBO SIGN ANUSVARA11841 $71745 SOYOMBO SIGN VISARGA

Gemination mark11842 $71746 SOYOMBO GEMINATION MARK

Prefixed letters11843 $ 71747 SOYOMBO PREFIXED LETTER LA11844 $ 71748 SOYOMBO PREFIXED LETTER SHA11845 $ 71749 SOYOMBO PREFIXED LETTER SA11846 $ 71750 SOYOMBO PREFIXED LETTER RA

Subjoined letters11847 $71751 SOYOMBO SUBJOINED LETTER GA11848 $71752 SOYOMBO SUBJOINED LETTER KA11849 $71753 SOYOMBO SUBJOINED LETTER NGA1184A $71754 SOYOMBO SUBJOINED LETTER JA1184B $71755 SOYOMBO SUBJOINED LETTER CA1184C $71756 SOYOMBO SUBJOINED LETTER NYA1184D $71757 SOYOMBO SUBJOINED LETTER DA1184E $71758 SOYOMBO SUBJOINED LETTER TA1184F $71759 SOYOMBO SUBJOINED LETTER NA11850 $71760 SOYOMBO SUBJOINED LETTER BA11851 $71761 SOYOMBO SUBJOINED LETTER PA11852 $71762 SOYOMBO SUBJOINED LETTER MA11853 $71763 SOYOMBO SUBJOINED LETTER YA11854 $71764 SOYOMBO SUBJOINED LETTER RA11855 $71765 SOYOMBO SUBJOINED LETTER VA11856 $71766 SOYOMBO SUBJOINED LETTER LA11857 $71767 SOYOMBO SUBJOINED LETTER SHA11858 $71768 SOYOMBO SUBJOINED LETTER SA11859 $71769 SOYOMBO SUBJOINED LETTER HA1185A $71770 SOYOMBO SUBJOINED LETTER GALIG GA1185B $71771 SOYOMBO SUBJOINED LETTER GALIG GHA1185C $71772 SOYOMBO SUBJOINED LETTER GALIG JA1185D $71773 SOYOMBO SUBJOINED LETTER GALIG JHA1185E $71774 SOYOMBO SUBJOINED LETTER GALIG TTA1185F $71775 SOYOMBO SUBJOINED LETTER GALIG TTHA11860 $71776 SOYOMBO SUBJOINED LETTER GALIG DDA11861 $71777 SOYOMBO SUBJOINED LETTER GALIG DDHA11862 $71778 SOYOMBO SUBJOINED LETTER GALIG NNA11863 $71779 SOYOMBO SUBJOINED LETTER GALIG DA11864 $71780 SOYOMBO SUBJOINED LETTER GALIG DHA11865 $71781 SOYOMBO SUBJOINED LETTER GALIG BA11866 $71782 SOYOMBO SUBJOINED LETTER GALIG BHA

Vowel carrier11800 71680 SOYOMBO LETTER A

Vowel signs11801 $71681 SOYOMBO VOWEL SIGN I11802 $71682 SOYOMBO VOWEL SIGN UE11803 $71683 SOYOMBO VOWEL SIGN U11804 $71684 SOYOMBO VOWEL SIGN E11805 $71685 SOYOMBO VOWEL SIGN O11806 $71686 SOYOMBO VOWEL SIGN OE11807 $71687 SOYOMBO VOWEL SIGN AU11808 $71688 SOYOMBO VOWEL SIGN AI

Vowel length mark11809 $71689 SOYOMBO VOWEL LENGTH MARK

Consonants1180A 71690 SOYOMBO LETTER GA

bull Used for Sanskrit ka1180B 71691 SOYOMBO LETTER KA

bull Used for Sanskrit kha1180C 71692 SOYOMBO LETTER NGA1180D 71693 SOYOMBO LETTER JA

bull Used for Sanskrit ca1180E 71694 SOYOMBO LETTER CA

bull Used for Sanskrit cha1180F 71695 SOYOMBO LETTER NYA11810 71696 SOYOMBO LETTER DA

bull Used for Sanskrit ta11811 71697 SOYOMBO LETTER TA

bull Used for Sanskrit tha11812 71698 SOYOMBO LETTER NA11813 71699 SOYOMBO LETTER BA

bull Used for Sanskrit pa11814 71700 SOYOMBO LETTER PA

bull Used for Sanskrit pha11815 71701 SOYOMBO LETTER MA11816 71702 SOYOMBO LETTER YA11817 71703 SOYOMBO LETTER RA11818 71704 SOYOMBO LETTER VA11819 71705 SOYOMBO LETTER LA1181A 71706 SOYOMBO LETTER SHA1181B 71707 SOYOMBO LETTER SA1181C 71708 SOYOMBO LETTER HA

Final consonant signs1181D $71709 SOYOMBO CONSONANT SIGN G1181E $71710 SOYOMBO CONSONANT SIGN K1181F $71711 SOYOMBO CONSONANT SIGN NG11820 $71712 SOYOMBO CONSONANT SIGN D11821 $71713 SOYOMBO CONSONANT SIGN N11822 $71714 SOYOMBO CONSONANT SIGN B11823 $71715 SOYOMBO CONSONANT SIGN M11824 $71716 SOYOMBO CONSONANT SIGN R11825 $71717 SOYOMBO CONSONANT SIGN L11826 $71718 SOYOMBO CONSONANT SIGN SH11827 $71719 SOYOMBO CONSONANT SIGN S11828 $71720 SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit11829 71721 SOYOMBO LETTER GALIG GA1182A 71722 SOYOMBO LETTER GALIG GHA1182B 71723 SOYOMBO LETTER GALIG JA1182C 71724 SOYOMBO LETTER GALIG JHA1182D 71725 SOYOMBO LETTER GALIG TTA1182E 71726 SOYOMBO LETTER GALIG TTHA1182F 71727 SOYOMBO LETTER GALIG DDA

Figure 2 Proposed names list for Soyombo (continued in Figure 3)14

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 3

11873Soyombo11867

11867 $71783 SOYOMBO SUBJOINED LETTER GALIG SSA11868 $71784 SOYOMBO SUBJOINED LETTER GALIG TSA11869 $71785 SOYOMBO SUBJOINED LETTER GALIG TSHA1186A $71786 SOYOMBO SUBJOINED LETTER GALIG DZA1186B $71787 SOYOMBO SUBJOINED LETTER GALIG ZHA1186C $71788 SOYOMBO SUBJOINED LETTER GALIG ZA1186D $71789 SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation1186E 71790 SOYOMBO DANDA1186F 71791 SOYOMBO DOUBLE DANDA11870 71792 SOYOMBO TSHEG

rarr 0F0B tibetan mark intersyllabic tsheg

Head marks11871 71793 SOYOMBO HEAD MARK11872 71794 SOYOMBO TERMINAL MARK

Symbol11873 71795 SOYOMBO SYMBOL SVAYAMBHU

bull national symbol of Mongolia

Figure 3 Proposed names list for Soyombo (continued from Figure 2)

15

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Base(Vd)

(Vab)

(Vbb) (Cf)

(VL)

Figure 4 Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996)

Base

(Vd) (Sv)

(Vab)

(Sn)

(C2 Cn)

(Vbb)

(Vl)

Figure 5 Structure of a Sanskrit syllable in Soyombo

16

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 6 Photograph of a chart of the Soyombo script (from ldquoHistoire du livrerdquo 2010)

17

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 7 Characters of the Soyombo script (from Kapaj 2002)

18

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 8 Characters of the Soyombo script (from Kapaj 2002)

19

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 9 Traditional chart of Soyombo (from Shagdarsuumlruumlng 2001 152)

20

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 10 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 133)

Figure 11 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 134)

21

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 12 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 135)

22

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 13 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 136)

23

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 14 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 137)

24

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 15 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 138)

25

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 16 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 139)

26

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 17 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 140)

27

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 18 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 141)

28

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 19 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 142)

29

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 20 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 143)

30

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 21 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 144)

31

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 22 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 145)

32

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection) X
   C-Major extinct
   D-Attested extinct
   E-Minor extinct
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: Combining vowel and final-consonant signs
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
   If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)


———. 2010b. “Preliminary Proposal to Encode the Soyombo Script in ISO/IEC 10646”. N3949 L2/10-399. October 30, 2010. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3949.pdf

———. 2011a. “Determining the Encoding Model for Soyombo Vowels”. N3986 L2/11-054. February 5, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n3986.pdf

———. 2011b. “Revised Preliminary Proposal to Encode Soyombo in the UCS”. N4026 L2/11-125. April 25, 2011. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n4026.pdf

Sato, Takayuki K. “Soyombo and Pagba (old Mongol scripts)”. N2163 L2/00-055. January 6, 2000. http://std.dkuug.dk/jtc1/sc2/wg2/docs/n2163.doc

Shagdarsürüng, Tseveliin. 2001. Study of Mongolian Scripts (Graphic Study or Grammatology). Enl. 2nd ed. Bibliotheca Mongolica: Monograph 1. Ed. by Sharaviin Choimaa. Ulaanbaatar: Center for Mongol Studies, National University of Mongolia.

Wikimedia. 2006. “1000 Tugriks - Recto”. http://en.wikipedia.org/wiki/File:1000_Tugriks_-_Recto.jpg

———. 2008. “Flag of Mongolia”. http://en.wikipedia.org/wiki/File:Flag_of_Mongolia.svg

———. 2009a. “Coat of Arms of Mongolia”. http://en.wikipedia.org/wiki/File:Coat_of_Arms_of_Mongolia.svg

———. 2009b. “Green Tara poem typed in Soyombo script”. http://en.wikipedia.org/wiki/File:Green_Tara_Poem_Soyombo.jpg

8 Acknowledgments

I would like to thank György Kara (Indiana University, Bloomington) for reviewing a draft of this proposal and for providing additional information about Soyombo.

This project was made possible in part by a grant from the United States National Endowment for the Humanities, which funded the Universal Scripts Project (part of the Script Encoding Initiative at the University of California, Berkeley). Any views, findings, conclusions or recommendations expressed in this publication do not necessarily reflect those of the National Endowment for the Humanities.


[Code chart: Soyombo, range 11800–1187F. Columns 1180–1187, rows 0–F; positions 11800–11873 are assigned. Printed using UniBook™ (http://www.unicode.org/unibook/), 28-Oct-2011.]

Figure 1: Proposed code chart for Soyombo
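The proposed range lies above U+FFFF, so each position in the chart would occupy a surrogate pair in UTF-16 and four bytes in UTF-8. The following Python sketch illustrates that arithmetic for the range 11800..1187F. The constant and function names are hypothetical, and the code points are those proposed in this document, which may differ from the allocation in the published standard.

```python
# Proposed Soyombo range from the code chart (assumption: the draft
# allocation in this proposal, not necessarily the final one).
BLOCK_START = 0x11800
BLOCK_END = 0x1187F


def in_proposed_block(code_point: int) -> bool:
    """True if the code point falls inside the proposed Soyombo range."""
    return BLOCK_START <= code_point <= BLOCK_END


def utf16_surrogates(code_point: int) -> tuple:
    """Return the (high, low) UTF-16 surrogate pair for a supplementary code point."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("not a supplementary-plane code point")
    offset = code_point - 0x10000
    # The top 10 bits select the high surrogate, the bottom 10 bits the low one.
    return 0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)


def utf8_bytes(code_point: int) -> bytes:
    """Encode a single code point as UTF-8 using Python's built-in codec."""
    return chr(code_point).encode("utf-8")


if __name__ == "__main__":
    high, low = utf16_surrogates(BLOCK_START)
    print(f"U+{BLOCK_START:04X} -> UTF-16 {high:04X} {low:04X}")
    print(f"U+{BLOCK_START:04X} -> UTF-8 {utf8_bytes(BLOCK_START).hex(' ')}")
```

The surrogate computation is written out by hand only to show the steps; in practice `chr(cp).encode("utf-16-be")` gives the same two code units.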


Vowel carrier
11800     SOYOMBO LETTER A

Vowel signs
11801  ◌  SOYOMBO VOWEL SIGN I
11802  ◌  SOYOMBO VOWEL SIGN UE
11803  ◌  SOYOMBO VOWEL SIGN U
11804  ◌  SOYOMBO VOWEL SIGN E
11805  ◌  SOYOMBO VOWEL SIGN O
11806  ◌  SOYOMBO VOWEL SIGN OE
11807  ◌  SOYOMBO VOWEL SIGN AU
11808  ◌  SOYOMBO VOWEL SIGN AI

Vowel length mark
11809  ◌  SOYOMBO VOWEL LENGTH MARK

Consonants
1180A     SOYOMBO LETTER GA
          • Used for Sanskrit ka
1180B     SOYOMBO LETTER KA
          • Used for Sanskrit kha
1180C     SOYOMBO LETTER NGA
1180D     SOYOMBO LETTER JA
          • Used for Sanskrit ca
1180E     SOYOMBO LETTER CA
          • Used for Sanskrit cha
1180F     SOYOMBO LETTER NYA
11810     SOYOMBO LETTER DA
          • Used for Sanskrit ta
11811     SOYOMBO LETTER TA
          • Used for Sanskrit tha
11812     SOYOMBO LETTER NA
11813     SOYOMBO LETTER BA
          • Used for Sanskrit pa
11814     SOYOMBO LETTER PA
          • Used for Sanskrit pha
11815     SOYOMBO LETTER MA
11816     SOYOMBO LETTER YA
11817     SOYOMBO LETTER RA
11818     SOYOMBO LETTER VA
11819     SOYOMBO LETTER LA
1181A     SOYOMBO LETTER SHA
1181B     SOYOMBO LETTER SA
1181C     SOYOMBO LETTER HA

Final consonant signs
1181D  ◌  SOYOMBO CONSONANT SIGN G
1181E  ◌  SOYOMBO CONSONANT SIGN K
1181F  ◌  SOYOMBO CONSONANT SIGN NG
11820  ◌  SOYOMBO CONSONANT SIGN D
11821  ◌  SOYOMBO CONSONANT SIGN N
11822  ◌  SOYOMBO CONSONANT SIGN B
11823  ◌  SOYOMBO CONSONANT SIGN M
11824  ◌  SOYOMBO CONSONANT SIGN R
11825  ◌  SOYOMBO CONSONANT SIGN L
11826  ◌  SOYOMBO CONSONANT SIGN SH
11827  ◌  SOYOMBO CONSONANT SIGN S
11828  ◌  SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829     SOYOMBO LETTER GALIG GA
1182A     SOYOMBO LETTER GALIG GHA
1182B     SOYOMBO LETTER GALIG JA
1182C     SOYOMBO LETTER GALIG JHA
1182D     SOYOMBO LETTER GALIG TTA
1182E     SOYOMBO LETTER GALIG TTHA
1182F     SOYOMBO LETTER GALIG DDA
11830     SOYOMBO LETTER GALIG DDHA
11831     SOYOMBO LETTER GALIG NNA
11832     SOYOMBO LETTER GALIG DA
11833     SOYOMBO LETTER GALIG DHA
11834     SOYOMBO LETTER GALIG BA
11835     SOYOMBO LETTER GALIG BHA
11836     SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837     SOYOMBO LETTER GALIG TSA
11838     SOYOMBO LETTER GALIG TSHA
11839     SOYOMBO LETTER GALIG DZA
1183A     SOYOMBO LETTER GALIG ZHA
1183B     SOYOMBO LETTER GALIG ZA
1183C     SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D     SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E  ◌  SOYOMBO SIGN VOCALIC R
1183F  ◌  SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840  ◌  SOYOMBO SIGN ANUSVARA
11841  ◌  SOYOMBO SIGN VISARGA

Gemination mark
11842  ◌  SOYOMBO GEMINATION MARK

Prefixed letters
11843  ◌  SOYOMBO PREFIXED LETTER LA
11844  ◌  SOYOMBO PREFIXED LETTER SHA
11845  ◌  SOYOMBO PREFIXED LETTER SA
11846  ◌  SOYOMBO PREFIXED LETTER RA

Subjoined letters
11847  ◌  SOYOMBO SUBJOINED LETTER GA
11848  ◌  SOYOMBO SUBJOINED LETTER KA
11849  ◌  SOYOMBO SUBJOINED LETTER NGA
1184A  ◌  SOYOMBO SUBJOINED LETTER JA
1184B  ◌  SOYOMBO SUBJOINED LETTER CA
1184C  ◌  SOYOMBO SUBJOINED LETTER NYA
1184D  ◌  SOYOMBO SUBJOINED LETTER DA
1184E  ◌  SOYOMBO SUBJOINED LETTER TA
1184F  ◌  SOYOMBO SUBJOINED LETTER NA
11850  ◌  SOYOMBO SUBJOINED LETTER BA
11851  ◌  SOYOMBO SUBJOINED LETTER PA
11852  ◌  SOYOMBO SUBJOINED LETTER MA
11853  ◌  SOYOMBO SUBJOINED LETTER YA
11854  ◌  SOYOMBO SUBJOINED LETTER RA
11855  ◌  SOYOMBO SUBJOINED LETTER VA
11856  ◌  SOYOMBO SUBJOINED LETTER LA
11857  ◌  SOYOMBO SUBJOINED LETTER SHA
11858  ◌  SOYOMBO SUBJOINED LETTER SA
11859  ◌  SOYOMBO SUBJOINED LETTER HA
1185A  ◌  SOYOMBO SUBJOINED LETTER GALIG GA
1185B  ◌  SOYOMBO SUBJOINED LETTER GALIG GHA
1185C  ◌  SOYOMBO SUBJOINED LETTER GALIG JA
1185D  ◌  SOYOMBO SUBJOINED LETTER GALIG JHA
1185E  ◌  SOYOMBO SUBJOINED LETTER GALIG TTA
1185F  ◌  SOYOMBO SUBJOINED LETTER GALIG TTHA
11860  ◌  SOYOMBO SUBJOINED LETTER GALIG DDA
11861  ◌  SOYOMBO SUBJOINED LETTER GALIG DDHA
11862  ◌  SOYOMBO SUBJOINED LETTER GALIG NNA
11863  ◌  SOYOMBO SUBJOINED LETTER GALIG DA
11864  ◌  SOYOMBO SUBJOINED LETTER GALIG DHA
11865  ◌  SOYOMBO SUBJOINED LETTER GALIG BA
11866  ◌  SOYOMBO SUBJOINED LETTER GALIG BHA

Figure 2: Proposed names list for Soyombo (continued in Figure 3)


11867  ◌  SOYOMBO SUBJOINED LETTER GALIG SSA
11868  ◌  SOYOMBO SUBJOINED LETTER GALIG TSA
11869  ◌  SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A  ◌  SOYOMBO SUBJOINED LETTER GALIG DZA
1186B  ◌  SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C  ◌  SOYOMBO SUBJOINED LETTER GALIG ZA
1186D  ◌  SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E     SOYOMBO DANDA
1186F     SOYOMBO DOUBLE DANDA
11870     SOYOMBO TSHEG
          → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871     SOYOMBO HEAD MARK
11872     SOYOMBO TERMINAL MARK

Symbol
11873     SOYOMBO SYMBOL SVAYAMBHU
          • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2)
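As a rough illustration of how the names list might be used, the Python sketch below assembles two character sequences from the proposed code points: the vowel carrier followed by a dependent vowel sign, and a base consonant followed by a subjoined letter. It assumes that the base character is stored first and the combining character after it; that is the usual convention for combining characters, but it is an assumption here. The helper names are hypothetical. The offsets 0x3D and 0x31 between a letter and its subjoined counterpart are read off Figures 2 and 3 and hold only for this proposed allocation.

```python
# Proposed code points from the names list (assumption: this draft allocation).
LETTER_A = 0x11800         # SOYOMBO LETTER A (vowel carrier)
VOWEL_SIGN_I = 0x11801     # SOYOMBO VOWEL SIGN I
LETTER_GA = 0x1180A        # SOYOMBO LETTER GA
LETTER_YA = 0x11816        # SOYOMBO LETTER YA
LETTER_HA = 0x1181C        # SOYOMBO LETTER HA
GALIG_GA = 0x11829         # SOYOMBO LETTER GALIG GA
GALIG_SMALL_A = 0x1183C    # SOYOMBO LETTER GALIG SMALL A


def subjoined(letter: int) -> int:
    """Map a proposed base consonant to its proposed subjoined counterpart."""
    if LETTER_GA <= letter <= LETTER_HA:
        return letter + 0x3D   # 1180A..1181C -> 11847..11859
    if GALIG_GA <= letter <= GALIG_SMALL_A:
        return letter + 0x31   # 11829..1183C -> 1185A..1186D
    raise ValueError(f"U+{letter:04X} has no subjoined form in the proposed list")


def sequence(*code_points: int) -> str:
    """Join code points into a string, base character first."""
    return "".join(chr(cp) for cp in code_points)


def describe(text: str) -> str:
    """Render a string as space-separated U+XXXX notation."""
    return " ".join(f"U+{ord(ch):04X}" for ch in text)


# Vowel carrier plus a dependent vowel sign.
independent_i = sequence(LETTER_A, VOWEL_SIGN_I)

# A two-consonant conjunct: LETTER GA with SUBJOINED LETTER YA beneath it.
conjunct_ga_ya = sequence(LETTER_GA, subjoined(LETTER_YA))

if __name__ == "__main__":
    print(describe(independent_i))
    print(describe(conjunct_ga_ya))
```

SOYOMBO LETTER KSSA (1183D) and the four prefixed letters have no counterpart under this mapping, so `subjoined` raises an error for them rather than guessing.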


[Diagram labels: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996).

[Diagram labels: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo.

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).


Page 13: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 1

1187FSoyombo11800

1180 1181 1182 1183 1184 1185 1186 1187

71680$71681$71682$71683$71684$71685$71686$71687$71688$71689716907169171692716937169471695

71696716977169871699717007170171702717037170471705717067170771708$71709$71710$71711

$71712$71713$71714$71715$71716$71717$71718$71719$7172071721717227172371724717257172671727

7172871729717307173171732717337173471735717367173771738717397174071741$71742$71743

$a$71745$71746$ 71747$ 71748$ 71749$ 71750$71751$71752$71753$71754$71755$71756$71757$71758$71759

$71760$71761$71762$71763$71764$71765$71766$71767$71768$71769$71770$71771$71772$71773$71774$71775

$71776$71777$71778$71779$71780$71781$71782$71783$71784$71785$71786$71787$71788$717897179071791

71792717937179471795

11800

11801

11802

11803

11804

11805

11806

11807

11808

11809

1180A

1180B

1180C

1180D

1180E

1180F

11810

11811

11812

11813

11814

11815

11816

11817

11818

11819

1181A

1181B

1181C

1181D

1181E

1181F

11820

11821

11822

11823

11824

11825

11826

11827

11828

11829

1182A

1182B

1182C

1182D

1182E

1182F

11830

11831

11832

11833

11834

11835

11836

11837

11838

11839

1183A

1183B

1183C

1183D

1183E

1183F

11840

11841

11842

11843

11844

11845

11846

11847

11848

11849

1184A

1184B

1184C

1184D

1184E

1184F

11850

11851

11852

11853

11854

11855

11856

11857

11858

11859

1185A

1185B

1185C

1185D

1185E

1185F

11860

11861

11862

11863

11864

11865

11866

11867

11868

11869

1186A

1186B

1186C

1186D

1186E

1186F

11870

11871

11872

11873

0

1

2

3

4

5

6

7

8

9

A

B

C

D

E

F

Figure 1 Proposed code chart for Soyombo

13

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 2

11866Soyombo11800

11830 71728 SOYOMBO LETTER GALIG DDHA11831 71729 SOYOMBO LETTER GALIG NNA11832 71730 SOYOMBO LETTER GALIG DA11833 71731 SOYOMBO LETTER GALIG DHA11834 71732 SOYOMBO LETTER GALIG BA11835 71733 SOYOMBO LETTER GALIG BHA11836 71734 SOYOMBO LETTER GALIG SSA

Additions for Tibetan11837 71735 SOYOMBO LETTER GALIG TSA11838 71736 SOYOMBO LETTER GALIG TSHA11839 71737 SOYOMBO LETTER GALIG DZA1183A 71738 SOYOMBO LETTER GALIG ZHA1183B 71739 SOYOMBO LETTER GALIG ZA1183C 71740 SOYOMBO LETTER GALIG SMALL A

Consonant conjunct1183D 71741 SOYOMBO LETTER KSSA

Vowel signs for Sanskrit1183E $71742 SOYOMBO SIGN VOCALIC R1183F $71743 SOYOMBO SIGN VOCALIC L

Signs for Sanskrit11840 $a SOYOMBO SIGN ANUSVARA11841 $71745 SOYOMBO SIGN VISARGA

Gemination mark11842 $71746 SOYOMBO GEMINATION MARK

Prefixed letters11843 $ 71747 SOYOMBO PREFIXED LETTER LA11844 $ 71748 SOYOMBO PREFIXED LETTER SHA11845 $ 71749 SOYOMBO PREFIXED LETTER SA11846 $ 71750 SOYOMBO PREFIXED LETTER RA

Subjoined letters11847 $71751 SOYOMBO SUBJOINED LETTER GA11848 $71752 SOYOMBO SUBJOINED LETTER KA11849 $71753 SOYOMBO SUBJOINED LETTER NGA1184A $71754 SOYOMBO SUBJOINED LETTER JA1184B $71755 SOYOMBO SUBJOINED LETTER CA1184C $71756 SOYOMBO SUBJOINED LETTER NYA1184D $71757 SOYOMBO SUBJOINED LETTER DA1184E $71758 SOYOMBO SUBJOINED LETTER TA1184F $71759 SOYOMBO SUBJOINED LETTER NA11850 $71760 SOYOMBO SUBJOINED LETTER BA11851 $71761 SOYOMBO SUBJOINED LETTER PA11852 $71762 SOYOMBO SUBJOINED LETTER MA11853 $71763 SOYOMBO SUBJOINED LETTER YA11854 $71764 SOYOMBO SUBJOINED LETTER RA11855 $71765 SOYOMBO SUBJOINED LETTER VA11856 $71766 SOYOMBO SUBJOINED LETTER LA11857 $71767 SOYOMBO SUBJOINED LETTER SHA11858 $71768 SOYOMBO SUBJOINED LETTER SA11859 $71769 SOYOMBO SUBJOINED LETTER HA1185A $71770 SOYOMBO SUBJOINED LETTER GALIG GA1185B $71771 SOYOMBO SUBJOINED LETTER GALIG GHA1185C $71772 SOYOMBO SUBJOINED LETTER GALIG JA1185D $71773 SOYOMBO SUBJOINED LETTER GALIG JHA1185E $71774 SOYOMBO SUBJOINED LETTER GALIG TTA1185F $71775 SOYOMBO SUBJOINED LETTER GALIG TTHA11860 $71776 SOYOMBO SUBJOINED LETTER GALIG DDA11861 $71777 SOYOMBO SUBJOINED LETTER GALIG DDHA11862 $71778 SOYOMBO SUBJOINED LETTER GALIG NNA11863 $71779 SOYOMBO SUBJOINED LETTER GALIG DA11864 $71780 SOYOMBO SUBJOINED LETTER GALIG DHA11865 $71781 SOYOMBO SUBJOINED LETTER GALIG BA11866 $71782 SOYOMBO SUBJOINED LETTER GALIG BHA

Vowel carrier11800 71680 SOYOMBO LETTER A

Vowel signs11801 $71681 SOYOMBO VOWEL SIGN I11802 $71682 SOYOMBO VOWEL SIGN UE11803 $71683 SOYOMBO VOWEL SIGN U11804 $71684 SOYOMBO VOWEL SIGN E11805 $71685 SOYOMBO VOWEL SIGN O11806 $71686 SOYOMBO VOWEL SIGN OE11807 $71687 SOYOMBO VOWEL SIGN AU11808 $71688 SOYOMBO VOWEL SIGN AI

Vowel length mark11809 $71689 SOYOMBO VOWEL LENGTH MARK

Consonants
1180A     SOYOMBO LETTER GA
          • Used for Sanskrit ka
1180B     SOYOMBO LETTER KA
          • Used for Sanskrit kha
1180C     SOYOMBO LETTER NGA
1180D     SOYOMBO LETTER JA
          • Used for Sanskrit ca
1180E     SOYOMBO LETTER CA
          • Used for Sanskrit cha
1180F     SOYOMBO LETTER NYA
11810     SOYOMBO LETTER DA
          • Used for Sanskrit ta
11811     SOYOMBO LETTER TA
          • Used for Sanskrit tha
11812     SOYOMBO LETTER NA
11813     SOYOMBO LETTER BA
          • Used for Sanskrit pa
11814     SOYOMBO LETTER PA
          • Used for Sanskrit pha
11815     SOYOMBO LETTER MA
11816     SOYOMBO LETTER YA
11817     SOYOMBO LETTER RA
11818     SOYOMBO LETTER VA
11819     SOYOMBO LETTER LA
1181A     SOYOMBO LETTER SHA
1181B     SOYOMBO LETTER SA
1181C     SOYOMBO LETTER HA

Final consonant signs
1181D  ◌  SOYOMBO CONSONANT SIGN G
1181E  ◌  SOYOMBO CONSONANT SIGN K
1181F  ◌  SOYOMBO CONSONANT SIGN NG
11820  ◌  SOYOMBO CONSONANT SIGN D
11821  ◌  SOYOMBO CONSONANT SIGN N
11822  ◌  SOYOMBO CONSONANT SIGN B
11823  ◌  SOYOMBO CONSONANT SIGN M
11824  ◌  SOYOMBO CONSONANT SIGN R
11825  ◌  SOYOMBO CONSONANT SIGN L
11826  ◌  SOYOMBO CONSONANT SIGN SH
11827  ◌  SOYOMBO CONSONANT SIGN S
11828  ◌  SOYOMBO CONSONANT SIGN ANG

Additional consonants for Sanskrit
11829     SOYOMBO LETTER GALIG GA
1182A     SOYOMBO LETTER GALIG GHA
1182B     SOYOMBO LETTER GALIG JA
1182C     SOYOMBO LETTER GALIG JHA
1182D     SOYOMBO LETTER GALIG TTA
1182E     SOYOMBO LETTER GALIG TTHA
1182F     SOYOMBO LETTER GALIG DDA

Figure 2: Proposed names list for Soyombo (continued in Figure 3).

Proposal to Encode the Soyombo Script in ISO/IEC 10646 (Anshuman Pandey)

Printed using UniBook™ (http://www.unicode.org/unibook/)

Date: 28-Oct-2011

Soyombo, 11867–11873

11867  ◌  SOYOMBO SUBJOINED LETTER GALIG SSA
11868  ◌  SOYOMBO SUBJOINED LETTER GALIG TSA
11869  ◌  SOYOMBO SUBJOINED LETTER GALIG TSHA
1186A  ◌  SOYOMBO SUBJOINED LETTER GALIG DZA
1186B  ◌  SOYOMBO SUBJOINED LETTER GALIG ZHA
1186C  ◌  SOYOMBO SUBJOINED LETTER GALIG ZA
1186D  ◌  SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation
1186E     SOYOMBO DANDA
1186F     SOYOMBO DOUBLE DANDA
11870     SOYOMBO TSHEG
          → 0F0B tibetan mark intersyllabic tsheg

Head marks
11871     SOYOMBO HEAD MARK
11872     SOYOMBO TERMINAL MARK

Symbol
11873     SOYOMBO SYMBOL SVAYAMBHU
          • national symbol of Mongolia

Figure 3: Proposed names list for Soyombo (continued from Figure 2).
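Each names-list entry pairs a hexadecimal code point with a character name. The following is a minimal sketch, not part of the proposal, of how such entries might be read into records. It assumes one entry per line in the order code point, optional dotted-circle marker (◌) for combining characters, then name. The names `Entry`, `ENTRY_RE`, and `parse_entry` are hypothetical. The code points are the tentative ones proposed in this document, so the sketch deliberately avoids looking them up in any published character database.

```python
import re
from typing import NamedTuple


class Entry(NamedTuple):
    code_point: int   # numeric value of the proposed code point
    name: str         # character name as given in the names list
    combining: bool   # True when the entry carries the dotted-circle marker


# Assumed layout: hex code point, optional "◌" marker, then an upper-case name.
ENTRY_RE = re.compile(r"^([0-9A-F]{4,6})\s+(◌\s+)?([A-Z][A-Z ]*[A-Z])$")


def parse_entry(line: str) -> Entry:
    """Parse one names-list line into an Entry (hypothetical helper)."""
    match = ENTRY_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a names-list entry: {line!r}")
    hex_code, marker, name = match.groups()
    return Entry(int(hex_code, 16), name, marker is not None)


# A few entries copied from the names list above.
SAMPLE = """\
11800     SOYOMBO LETTER A
11801  ◌  SOYOMBO VOWEL SIGN I
11847  ◌  SOYOMBO SUBJOINED LETTER GA
11873     SOYOMBO SYMBOL SVAYAMBHU
"""

entries = [parse_entry(line) for line in SAMPLE.splitlines() if line.strip()]
for entry in entries:
    # Show the hexadecimal and decimal forms of each code point side by side.
    print(f"{entry.code_point:X} = {entry.code_point}  {entry.name}")
```

Annotation lines (those beginning with • or →) do not match the pattern and would need separate handling in a fuller parser.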


[Diagram of syllable components: Base, (Vd), (Vab), (Vbb), (Cf), (VL)]

Figure 4: Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996).

[Diagram of syllable components: Base, (Vd), (Sv), (Vab), (Sn), (C2 Cn), (Vbb), (Vl)]

Figure 5: Structure of a Sanskrit syllable in Soyombo.

Figure 6: Photograph of a chart of the Soyombo script (from “Histoire du livre” 2010).

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the symbol (from Wikimedia 2009a).

Figure 36: An imprint of the symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html.

See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs.
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:


11830     SOYOMBO LETTER GALIG DDHA
11831     SOYOMBO LETTER GALIG NNA
11832     SOYOMBO LETTER GALIG DA
11833     SOYOMBO LETTER GALIG DHA
11834     SOYOMBO LETTER GALIG BA
11835     SOYOMBO LETTER GALIG BHA
11836     SOYOMBO LETTER GALIG SSA

Additions for Tibetan
11837     SOYOMBO LETTER GALIG TSA
11838     SOYOMBO LETTER GALIG TSHA
11839     SOYOMBO LETTER GALIG DZA
1183A     SOYOMBO LETTER GALIG ZHA
1183B     SOYOMBO LETTER GALIG ZA
1183C     SOYOMBO LETTER GALIG SMALL A

Consonant conjunct
1183D     SOYOMBO LETTER KSSA

Vowel signs for Sanskrit
1183E  ◌  SOYOMBO SIGN VOCALIC R
1183F  ◌  SOYOMBO SIGN VOCALIC L

Signs for Sanskrit
11840  ◌  SOYOMBO SIGN ANUSVARA
11841  ◌  SOYOMBO SIGN VISARGA

Gemination mark
11842  ◌  SOYOMBO GEMINATION MARK

Prefixed letters
11843  ◌  SOYOMBO PREFIXED LETTER LA
11844  ◌  SOYOMBO PREFIXED LETTER SHA
11845  ◌  SOYOMBO PREFIXED LETTER SA
11846  ◌  SOYOMBO PREFIXED LETTER RA


Page 15: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Printed using UniBooktrade(httpwwwunicodeorgunibook)

Date 28-Oct-2011 3

11873Soyombo11867

11867 $71783 SOYOMBO SUBJOINED LETTER GALIG SSA11868 $71784 SOYOMBO SUBJOINED LETTER GALIG TSA11869 $71785 SOYOMBO SUBJOINED LETTER GALIG TSHA1186A $71786 SOYOMBO SUBJOINED LETTER GALIG DZA1186B $71787 SOYOMBO SUBJOINED LETTER GALIG ZHA1186C $71788 SOYOMBO SUBJOINED LETTER GALIG ZA1186D $71789 SOYOMBO SUBJOINED LETTER GALIG SMALL A

Punctuation1186E 71790 SOYOMBO DANDA1186F 71791 SOYOMBO DOUBLE DANDA11870 71792 SOYOMBO TSHEG

rarr 0F0B tibetan mark intersyllabic tsheg

Head marks11871 71793 SOYOMBO HEAD MARK11872 71794 SOYOMBO TERMINAL MARK

Symbol11873 71795 SOYOMBO SYMBOL SVAYAMBHU

bull national symbol of Mongolia

Figure 3 Proposed names list for Soyombo (continued from Figure 2)

15

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Base(Vd)

(Vab)

(Vbb) (Cf)

(VL)

Figure 4 Structure of a Mongolian syllable in Soyombo (adapted from Corff 1996)

Base

(Vd) (Sv)

(Vab)

(Sn)

(C2 Cn)

(Vbb)

(Vl)

Figure 5 Structure of a Sanskrit syllable in Soyombo

16

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 6 Photograph of a chart of the Soyombo script (from ldquoHistoire du livrerdquo 2010)

17

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 7 Characters of the Soyombo script (from Kapaj 2002)

18

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 8 Characters of the Soyombo script (from Kapaj 2002)

19

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 9 Traditional chart of Soyombo (from Shagdarsuumlruumlng 2001 152)

20

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 10 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 133)

Figure 11 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 134)

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the symbol (from Wikimedia 2009a).

Figure 36: An imprint of the symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2
PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
   If YES, is a rationale for such use provided? Yes
   If YES, reference: Combining vowel and final-consonant signs.
   Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
   If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
   If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
   If YES, are the equivalent corresponding unified ideographic characters identified?
   If YES, reference:

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

Proposal to Encode the Soyombo Script in ISO/IEC 10646 Anshuman Pandey

Figure 7: Characters of the Soyombo script (from Kapaj 2002).

Figure 8: Characters of the Soyombo script (from Kapaj 2002).

Figure 9: Traditional chart of Soyombo (from Shagdarsürüng 2001: 152).

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI), Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary  B.1-Specialized (small collection)  B.2-Specialized (large collection)  X  C-Major extinct  D-Attested extinct  E-Minor extinct  F-Archaic Hieroglyphic or Ideographic  G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

Proposal to Encode the Soyombo Script in ISO/IEC 10646 Anshuman Pandey

Figure 10: Description of Soyombo vowels (from Shagdarsürüng 2001: 133).

Figure 11: Description of Soyombo vowels (from Shagdarsürüng 2001: 134).

Figure 12: Description of Soyombo vowels (from Shagdarsürüng 2001: 135).

Figure 13: Description of Soyombo vowels (from Shagdarsürüng 2001: 136).

Figure 14: Description of Soyombo vowels (from Shagdarsürüng 2001: 137).

Figure 15: Description of Soyombo consonants (from Shagdarsürüng 2001: 138).

Figure 16: Description of Soyombo consonants (from Shagdarsürüng 2001: 139).

Figure 17: Description of Soyombo consonants (from Shagdarsürüng 2001: 140).

Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS
FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI), Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary
   B.1-Specialized (small collection)
   B.2-Specialized (large collection): X
   C-Major extinct
   D-Attested extinct
   E-Minor extinct
   F-Archaic Hieroglyphic or Ideographic
   G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:



Page 24: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 14 Description of Soyombo vowels (from Shagdarsuumlruumlng 2001 137)

24

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 15 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 138)

25

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 16 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 139)

26

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 17 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 140)

27

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 18 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 141)

28

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 19 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 142)

29

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 20 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 143)

30

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 21 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 144)

31

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 22 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 145)

32

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 23 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 146)


Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).


Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).


Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.


Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).


Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).


Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.


Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).


Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).


ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below; see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes, please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical – Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:



Page 27: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 17 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 140)

27

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 18 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 141)

28

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 19 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 142)

29

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 20 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 143)

30

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 21 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 144)

31

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 22 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 145)

32

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 23 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 146)

33

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 24 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 147)

Figure 25 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 148)

34

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 26 Character elements used in Soyombo characters (from Shagdarsuumlruumlng 2001 153)

35

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 27 Method of writing Soyombo vertically (from Shagdarsuumlruumlng 2001 150) The specimenat top is a seal containing vertical text in which the syllables of words are written as conjuncts

36

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 28 A folio from a Soyombo manuscript (from Mongolwiki 2008)

Figure 29 A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsuumlruumlng2001 173)

37

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 30 Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b)

38

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 31 Xylograph (block print) of a book cover in Soyombo Mongolian Square Mongolianand Cyrillic (from Boldsaikhan 2005 330) The title is Sanskrit written in Soyombo Mongal-svayambhu-jyoti-varna-lipih TheMongolian Square represents Tibetan the Mongolian representsMongolian and the Cyrillic represents Modern (Khalkha) Mongolian

39

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 32 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

Figure 33 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

40

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 34 The flag of Mongolia showing the symbol (from Wikimedia 2008)

Figure 35 The coat of arms ofMongolia showing the symbol (fromWikimedia 2009a)

Figure 36 An imprint of the symbol in the center on the recto face of a 1000 Mongo-lian toumlgroumlg (tugrik) note (from Wikimedia 2006)

41

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)


Figure 18: Description of Soyombo consonants (from Shagdarsürüng 2001: 141).

Figure 19: Description of Soyombo consonants (from Shagdarsürüng 2001: 142).

Figure 20: Description of Soyombo consonants (from Shagdarsürüng 2001: 143).

Figure 21: Description of Soyombo consonants (from Shagdarsürüng 2001: 144).

Figure 22: Description of Soyombo consonants (from Shagdarsürüng 2001: 145).

Figure 23: Description of Soyombo consonants (from Shagdarsürüng 2001: 146).

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the symbol (from Wikimedia 2008).

Page 31: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 21 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 144)

31

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 22 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 145)

32

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 23 Description of Soyombo consonants (from Shagdarsuumlruumlng 2001 146)

Figure 24: Description of Soyombo consonants (from Shagdarsürüng 2001: 147).

Figure 25: Description of Soyombo consonants (from Shagdarsürüng 2001: 148).

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).


ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical - General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs.
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:




Proposal to Encode the Soyombo Script in ISO/IEC 10646 Anshuman Pandey

Figure 26: Character elements used in Soyombo characters (from Shagdarsürüng 2001: 153).

Figure 27: Method of writing Soyombo vertically (from Shagdarsürüng 2001: 150). The specimen at top is a seal containing vertical text in which the syllables of words are written as conjuncts.

Figure 28: A folio from a Soyombo manuscript (from Mongolwiki 2008).

Figure 29: A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsürüng 2001: 173).

Figure 30: Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b).

Figure 31: Xylograph (block print) of a book cover in Soyombo, Mongolian Square, Mongolian, and Cyrillic (from Boldsaikhan 2005: 330). The title is Sanskrit written in Soyombo: Mongal-svayambhu-jyoti-varna-lipih. The Mongolian Square represents Tibetan, the Mongolian represents Mongolian, and the Cyrillic represents Modern (Khalkha) Mongolian.

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).

ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 ¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary:
   B1-Specialized (small collection):
   B2-Specialized (large collection): X
   C-Major extinct:
   D-Attested extinct:
   E-Minor extinct:
   F-Archaic Hieroglyphic or Ideographic:
   G-Obscure or questionable usage symbols:
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts, etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration, etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths, etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:

Page 36: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 27 Method of writing Soyombo vertically (from Shagdarsuumlruumlng 2001 150) The specimenat top is a seal containing vertical text in which the syllables of words are written as conjuncts

36

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 28 A folio from a Soyombo manuscript (from Mongolwiki 2008)

Figure 29 A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsuumlruumlng2001 173)

37

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 30 Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b)

38

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 31 Xylograph (block print) of a book cover in Soyombo Mongolian Square Mongolianand Cyrillic (from Boldsaikhan 2005 330) The title is Sanskrit written in Soyombo Mongal-svayambhu-jyoti-varna-lipih TheMongolian Square represents Tibetan the Mongolian representsMongolian and the Cyrillic represents Modern (Khalkha) Mongolian

39

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 32 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

Figure 33 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

40

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 34 The flag of Mongolia showing the symbol (from Wikimedia 2008)

Figure 35 The coat of arms ofMongolia showing the symbol (fromWikimedia 2009a)

Figure 36 An imprint of the symbol in the center on the recto face of a 1000 Mongo-lian toumlgroumlg (tugrik) note (from Wikimedia 2006)

41

ISOIEC JTC 1SC 2WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS

FOR ADDITIONS TO THE REPERTOIRE OF ISOIEC 106461

Please fill all the sections A B and C below Please read Principles and Procedures Document (P amp P) from httpwwwdkuugdkJTC1SC2WG2docsprincipleshtml for

guidelines and details before filling this form Please ensure you are using the latest Form from httpwwwdkuugdkJTC1SC2WG2docssummaryformhtml

See also httpwwwdkuugdkJTC1SC2WG2docsroadmapshtml for latest Roadmaps A Administrative 1 Title Proposal to Encode the Soyombo Script in ISOIEC 10646 2 Requesters name Script Encoding Initiative (SEI) Anshuman Pandey (pandeyumichedu) 3 Requester type (Member bodyLiaisonIndividual contribution) Liaison contribution 4 Submission date 2011-10-25 5 Requesters reference (if applicable) 6 Choose one of the following This is a complete proposal Yes (or) More information will be provided later B Technical ndash General 1 Choose one of the following a This proposal is for a new script (set of characters) Yes Proposed name of script Soyombo b The proposal is for addition of character(s) to an existing block Name of the existing block 2 Number of characters in proposal 113 3 Proposed category (select one from below - see section 22 of PampP document) A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) X C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols 4 Is a repertoire including character names provided Yes a If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP document Yes b Are the character shapes attached in a legible form suitable for review Yes 5 Fonts related a Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the

standard

Anshuman Pandey b Identify the party granting a license for use of the font by the editors (include address e-mail ftp-site etc) Anshuman Pandey (pandeyumichedu) 6 References a Are references (to other character sets dictionaries descriptive texts etc) provided Yes b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attached Yes 7 Special encoding issues Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searching indexing transliteration etc (if yes please enclose information) Yes 8 Additional Information Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties are Casing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contexts Compatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorg for such information on other scripts Also see Unicode Character Database ( httpwwwunicodeorgreportstr44 ) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard

1 Form number N3902-F (Original 1994-10-14 Revised 1995-01 1995-04 1996-04 1996-08 1999-03 2001-05 2001-09 2003-11 2005-01 2005-09 2005-10 2007-03 2008-05 2009-11 2011-03)

C Technical - Justification 1 Has this proposal for addition of character(s) been submitted before No If YES explain 2 Has contact been made to members of the user community (for example National Body user groups of the script or characters other experts etc) Yes If YES with whom Gyoumlrgy Kara ltgkaraindianaedugt If YES available relevant documents See text of proposal 3 Information on the user community for the proposed characters (for example size demographics information technology use or publishing use) is included Yes Reference Size of user community is unknown Script is used in print and digital publications 4 The context of use for the proposed characters (type of use common or rare) Common Reference The script is used for writing Mongolian Sanskrit and Tibetan 5 Are the proposed characters in current use by the user community Yes If YES where Reference Mongolia See text of proposal for details 6 After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMP No If YES is a rationale provided If YES reference 7 Should the proposed characters be kept together in a contiguous range (rather than being scattered) Yes 8 Can any of the proposed characters be considered a presentation form of an existing character or character sequence No If YES is a rationale for its inclusion provided If YES reference 9 Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters No If YES is a rationale for its inclusion provided If YES reference 10 Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character No If YES is a rationale for its inclusion provided If YES reference 11 Does the proposal include use of combining characters andor use of composite sequences Yes If YES is a rationale for such use provided Yes If YES reference Combining vowel and final-consonant signs Is a list 
of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference 12 Does the proposal contain characters with any special properties such as control function or similar semantics No If YES describe in detail (include attachment if necessary) 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference

Page 37: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 28 A folio from a Soyombo manuscript (from Mongolwiki 2008)

Figure 29 A manuscript containing text in Soyombo and Mongolian Square (from Shagdarsuumlruumlng2001 173)

37

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 30 Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b)

38

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 31 Xylograph (block print) of a book cover in Soyombo Mongolian Square Mongolianand Cyrillic (from Boldsaikhan 2005 330) The title is Sanskrit written in Soyombo Mongal-svayambhu-jyoti-varna-lipih TheMongolian Square represents Tibetan the Mongolian representsMongolian and the Cyrillic represents Modern (Khalkha) Mongolian

39

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 32 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

Figure 33 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

40

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 34 The flag of Mongolia showing the symbol (from Wikimedia 2008)

Figure 35 The coat of arms ofMongolia showing the symbol (fromWikimedia 2009a)

Figure 36 An imprint of the symbol in the center on the recto face of a 1000 Mongo-lian toumlgroumlg (tugrik) note (from Wikimedia 2006)

41

ISOIEC JTC 1SC 2WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS

FOR ADDITIONS TO THE REPERTOIRE OF ISOIEC 106461

Please fill all the sections A B and C below Please read Principles and Procedures Document (P amp P) from httpwwwdkuugdkJTC1SC2WG2docsprincipleshtml for

guidelines and details before filling this form Please ensure you are using the latest Form from httpwwwdkuugdkJTC1SC2WG2docssummaryformhtml

See also httpwwwdkuugdkJTC1SC2WG2docsroadmapshtml for latest Roadmaps A Administrative 1 Title Proposal to Encode the Soyombo Script in ISOIEC 10646 2 Requesters name Script Encoding Initiative (SEI) Anshuman Pandey (pandeyumichedu) 3 Requester type (Member bodyLiaisonIndividual contribution) Liaison contribution 4 Submission date 2011-10-25 5 Requesters reference (if applicable) 6 Choose one of the following This is a complete proposal Yes (or) More information will be provided later B Technical ndash General 1 Choose one of the following a This proposal is for a new script (set of characters) Yes Proposed name of script Soyombo b The proposal is for addition of character(s) to an existing block Name of the existing block 2 Number of characters in proposal 113 3 Proposed category (select one from below - see section 22 of PampP document) A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) X C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols 4 Is a repertoire including character names provided Yes a If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP document Yes b Are the character shapes attached in a legible form suitable for review Yes 5 Fonts related a Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the

standard

Anshuman Pandey b Identify the party granting a license for use of the font by the editors (include address e-mail ftp-site etc) Anshuman Pandey (pandeyumichedu) 6 References a Are references (to other character sets dictionaries descriptive texts etc) provided Yes b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attached Yes 7 Special encoding issues Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searching indexing transliteration etc (if yes please enclose information) Yes 8 Additional Information Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties are Casing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contexts Compatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorg for such information on other scripts Also see Unicode Character Database ( httpwwwunicodeorgreportstr44 ) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard

1 Form number N3902-F (Original 1994-10-14 Revised 1995-01 1995-04 1996-04 1996-08 1999-03 2001-05 2001-09 2003-11 2005-01 2005-09 2005-10 2007-03 2008-05 2009-11 2011-03)

C Technical - Justification 1 Has this proposal for addition of character(s) been submitted before No If YES explain 2 Has contact been made to members of the user community (for example National Body user groups of the script or characters other experts etc) Yes If YES with whom Gyoumlrgy Kara ltgkaraindianaedugt If YES available relevant documents See text of proposal 3 Information on the user community for the proposed characters (for example size demographics information technology use or publishing use) is included Yes Reference Size of user community is unknown Script is used in print and digital publications 4 The context of use for the proposed characters (type of use common or rare) Common Reference The script is used for writing Mongolian Sanskrit and Tibetan 5 Are the proposed characters in current use by the user community Yes If YES where Reference Mongolia See text of proposal for details 6 After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMP No If YES is a rationale provided If YES reference 7 Should the proposed characters be kept together in a contiguous range (rather than being scattered) Yes 8 Can any of the proposed characters be considered a presentation form of an existing character or character sequence No If YES is a rationale for its inclusion provided If YES reference 9 Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters No If YES is a rationale for its inclusion provided If YES reference 10 Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character No If YES is a rationale for its inclusion provided If YES reference 11 Does the proposal include use of combining characters andor use of composite sequences Yes If YES is a rationale for such use provided Yes If YES reference Combining vowel and final-consonant signs Is a list 
of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference 12 Does the proposal contain characters with any special properties such as control function or similar semantics No If YES describe in detail (include attachment if necessary) 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference

Page 38: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 30 Poem for the Green Tara typeset in a digitized Soyombo font (from Wikimedia 2009b)

38

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 31 Xylograph (block print) of a book cover in Soyombo Mongolian Square Mongolianand Cyrillic (from Boldsaikhan 2005 330) The title is Sanskrit written in Soyombo Mongal-svayambhu-jyoti-varna-lipih TheMongolian Square represents Tibetan the Mongolian representsMongolian and the Cyrillic represents Modern (Khalkha) Mongolian

39

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 32 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

Figure 33 Folios of a Soyombo manuscript (from Boldsaikhan 2005 378)

40

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 34 The flag of Mongolia showing the symbol (from Wikimedia 2008)

Figure 35 The coat of arms ofMongolia showing the symbol (fromWikimedia 2009a)

Figure 36 An imprint of the symbol in the center on the recto face of a 1000 Mongo-lian toumlgroumlg (tugrik) note (from Wikimedia 2006)

41

ISOIEC JTC 1SC 2WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS

FOR ADDITIONS TO THE REPERTOIRE OF ISOIEC 106461

Please fill all the sections A B and C below Please read Principles and Procedures Document (P amp P) from httpwwwdkuugdkJTC1SC2WG2docsprincipleshtml for

guidelines and details before filling this form Please ensure you are using the latest Form from httpwwwdkuugdkJTC1SC2WG2docssummaryformhtml

See also httpwwwdkuugdkJTC1SC2WG2docsroadmapshtml for latest Roadmaps A Administrative 1 Title Proposal to Encode the Soyombo Script in ISOIEC 10646 2 Requesters name Script Encoding Initiative (SEI) Anshuman Pandey (pandeyumichedu) 3 Requester type (Member bodyLiaisonIndividual contribution) Liaison contribution 4 Submission date 2011-10-25 5 Requesters reference (if applicable) 6 Choose one of the following This is a complete proposal Yes (or) More information will be provided later B Technical ndash General 1 Choose one of the following a This proposal is for a new script (set of characters) Yes Proposed name of script Soyombo b The proposal is for addition of character(s) to an existing block Name of the existing block 2 Number of characters in proposal 113 3 Proposed category (select one from below - see section 22 of PampP document) A-Contemporary B1-Specialized (small collection) B2-Specialized (large collection) X C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols 4 Is a repertoire including character names provided Yes a If YES are the names in accordance with the ldquocharacter naming guidelinesrdquo in Annex L of PampP document Yes b Are the character shapes attached in a legible form suitable for review Yes 5 Fonts related a Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the

standard

Anshuman Pandey b Identify the party granting a license for use of the font by the editors (include address e-mail ftp-site etc) Anshuman Pandey (pandeyumichedu) 6 References a Are references (to other character sets dictionaries descriptive texts etc) provided Yes b Are published examples of use (such as samples from newspapers magazines or other sources) of proposed characters attached Yes 7 Special encoding issues Does the proposal address other aspects of character data processing (if applicable) such as input presentation sorting searching indexing transliteration etc (if yes please enclose information) Yes 8 Additional Information Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script Examples of such properties are Casing information Numeric information Currency information Display behaviour information such as line breaks widths etc Combining behaviour Spacing behaviour Directional behaviour Default Collation behaviour relevance in Mark Up contexts Compatibility equivalence and other Unicode normalization related information See the Unicode standard at httpwwwunicodeorg for such information on other scripts Also see Unicode Character Database ( httpwwwunicodeorgreportstr44 ) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard

1 Form number N3902-F (Original 1994-10-14 Revised 1995-01 1995-04 1996-04 1996-08 1999-03 2001-05 2001-09 2003-11 2005-01 2005-09 2005-10 2007-03 2008-05 2009-11 2011-03)

C Technical - Justification 1 Has this proposal for addition of character(s) been submitted before No If YES explain 2 Has contact been made to members of the user community (for example National Body user groups of the script or characters other experts etc) Yes If YES with whom Gyoumlrgy Kara ltgkaraindianaedugt If YES available relevant documents See text of proposal 3 Information on the user community for the proposed characters (for example size demographics information technology use or publishing use) is included Yes Reference Size of user community is unknown Script is used in print and digital publications 4 The context of use for the proposed characters (type of use common or rare) Common Reference The script is used for writing Mongolian Sanskrit and Tibetan 5 Are the proposed characters in current use by the user community Yes If YES where Reference Mongolia See text of proposal for details 6 After giving due considerations to the principles in the PampP document must the proposed characters be entirely in the BMP No If YES is a rationale provided If YES reference 7 Should the proposed characters be kept together in a contiguous range (rather than being scattered) Yes 8 Can any of the proposed characters be considered a presentation form of an existing character or character sequence No If YES is a rationale for its inclusion provided If YES reference 9 Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters No If YES is a rationale for its inclusion provided If YES reference 10 Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character No If YES is a rationale for its inclusion provided If YES reference 11 Does the proposal include use of combining characters andor use of composite sequences Yes If YES is a rationale for such use provided Yes If YES reference Combining vowel and final-consonant signs Is a list 
of composite sequences and their corresponding glyph images (graphic symbols) provided If YES reference 12 Does the proposal contain characters with any special properties such as control function or similar semantics No If YES describe in detail (include attachment if necessary) 13 Does the proposal contain any Ideographic compatibility characters No If YES are the equivalent corresponding unified ideographic characters identified If YES reference

Page 39: ᠰᠣᠶᠣᠮᠪᠣᠪᠢᠴᠢᠭ᠌ - DKUUG standardizingstd.dkuug.dk/JTC1/sc2/WG2/docs/n4142.pdf · N2163L2/00-055).ThescriptwasallocatedtotheRoadmaptotheSupplementaryMultilingualPlane(SMP)

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 31 Xylograph (block print) of a book cover in Soyombo Mongolian Square Mongolianand Cyrillic (from Boldsaikhan 2005 330) The title is Sanskrit written in Soyombo Mongal-svayambhu-jyoti-varna-lipih TheMongolian Square represents Tibetan the Mongolian representsMongolian and the Cyrillic represents Modern (Khalkha) Mongolian

39

Proposal to Encode the Soyombo Script in ISOIEC 10646 Anshuman Pandey

Figure 32: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).

Figure 33: Folios of a Soyombo manuscript (from Boldsaikhan 2005: 378).


Figure 34: The flag of Mongolia showing the Soyombo symbol (from Wikimedia 2008).

Figure 35: The coat of arms of Mongolia showing the Soyombo symbol (from Wikimedia 2009a).

Figure 36: An imprint of the Soyombo symbol in the center on the recto face of a 1000 Mongolian tögrög (tugrik) note (from Wikimedia 2006).


ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646¹

Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps.

A. Administrative

1. Title: Proposal to Encode the Soyombo Script in ISO/IEC 10646
2. Requester's name: Script Encoding Initiative (SEI); Anshuman Pandey (pandey@umich.edu)
3. Requester type (Member body/Liaison/Individual contribution): Liaison contribution
4. Submission date: 2011-10-25
5. Requester's reference (if applicable):
6. Choose one of the following:
   This is a complete proposal: Yes
   (or) More information will be provided later:

B. Technical – General

1. Choose one of the following:
   a. This proposal is for a new script (set of characters): Yes
      Proposed name of script: Soyombo
   b. The proposal is for addition of character(s) to an existing block:
      Name of the existing block:
2. Number of characters in proposal: 113
3. Proposed category (select one from below - see section 2.2 of P&P document):
   A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection) X; C-Major extinct; D-Attested extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols
4. Is a repertoire including character names provided? Yes
   a. If YES, are the names in accordance with the "character naming guidelines" in Annex L of P&P document? Yes
   b. Are the character shapes attached in a legible form suitable for review? Yes
5. Fonts related:
   a. Who will provide the appropriate computerized font to the Project Editor of 10646 for publishing the standard? Anshuman Pandey
   b. Identify the party granting a license for use of the font by the editors (include address, e-mail, ftp-site, etc.): Anshuman Pandey (pandey@umich.edu)
6. References:
   a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided? Yes
   b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached? Yes
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation, sorting, searching, indexing, transliteration etc. (if yes please enclose information)? Yes
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script. Examples of such properties are: Casing information, Numeric information, Currency information, Display behaviour information such as line breaks, widths etc., Combining behaviour, Spacing behaviour, Directional behaviour, Default Collation behaviour, relevance in Mark Up contexts, Compatibility equivalence and other Unicode normalization related information. See the Unicode standard at http://www.unicode.org for such information on other scripts. Also see Unicode Character Database (http://www.unicode.org/reports/tr44/) and associated Unicode Technical Reports for information needed for consideration by the Unicode Technical Committee for inclusion in the Unicode Standard.

¹ Form number: N3902-F (Original 1994-10-14; Revised 1995-01, 1995-04, 1996-04, 1996-08, 1999-03, 2001-05, 2001-09, 2003-11, 2005-01, 2005-09, 2005-10, 2007-03, 2008-05, 2009-11, 2011-03)

C. Technical - Justification

1. Has this proposal for addition of character(s) been submitted before? No
   If YES, explain:
2. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other experts, etc.)? Yes
   If YES, with whom? György Kara <gkara@indiana.edu>
   If YES, available relevant documents: See text of proposal
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or publishing use) is included? Yes
   Reference: Size of user community is unknown. Script is used in print and digital publications.
4. The context of use for the proposed characters (type of use; common or rare): Common
   Reference: The script is used for writing Mongolian, Sanskrit, and Tibetan.
5. Are the proposed characters in current use by the user community? Yes
   If YES, where? Reference: Mongolia. See text of proposal for details.
6. After giving due considerations to the principles in the P&P document, must the proposed characters be entirely in the BMP? No
   If YES, is a rationale provided?
   If YES, reference:
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)? Yes
8. Can any of the proposed characters be considered a presentation form of an existing character or character sequence? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
9. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed characters? No
   If YES, is a rationale for its inclusion provided?
   If YES, reference:
10. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character? No
    If YES, is a rationale for its inclusion provided?
    If YES, reference:
11. Does the proposal include use of combining characters and/or use of composite sequences? Yes
    If YES, is a rationale for such use provided? Yes
    If YES, reference: Combining vowel and final-consonant signs
    Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
    If YES, reference:
12. Does the proposal contain characters with any special properties such as control function or similar semantics? No
    If YES, describe in detail (include attachment if necessary):
13. Does the proposal contain any Ideographic compatibility characters? No
    If YES, are the equivalent corresponding unified ideographic characters identified?
    If YES, reference:



