<?xml version="1.0" encoding="utf-8"?>
<lgr xmlns="urn:ietf:params:xml:ns:lgr-1.0">
  <meta>
    <version comment="Proposed LGR for Devanagari">1</version>
    <date>2017-11-21</date>
    <language>und-Deva</language>
    <scope type="domain">.</scope>
    <unicode-version>6.3.0</unicode-version>
    <description type="text/html"><![CDATA[
    <h1>Label Generation Rules for Devanagari script</h1>
    
    <h2>Overview</h2>
    <p>This file contains Label Generation Rules (LGR) for the Devanagari script as would be appropriate for the 
    Root zone. For more details on this proposal, see &ldquo;Proposal for a Devanagari Script Root Zone Label Generation 
    Rule-Set (LGR)&rdquo; [Proposal]. The format of this file follows [RFC 7940].</p>
    
    <h2>Repertoire</h2>
    <p>The Neo-Brahmi Generation Panel (NBGP) proposes 83 unique code points for the Devanagari 
    LGR [Proposal], in addition to the two sequences U+0931 U+094D U+092F and U+0931 U+094D U+0939. 
    These sequences restrict U+0931 (DEVANAGARI LETTER RRA) to a specific 
    context; it cannot occur outside them. Accordingly, although U+0931 is not 
    listed by itself, it brings the total of distinct code points to 84.</p>
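
    <p>As an illustration only, the restriction on U+0931 described above could be checked along 
    the following lines. This is a minimal Python sketch and not part of the LGR; the function name 
    <code>rra_in_context</code> and the constant names are hypothetical.</p>
<pre>
```python
RRA, HALANT = 0x0931, 0x094D
EYELASH_TAILS = {0x092F, 0x0939}  # YA and HA, per the two sequences above


def rra_in_context(label):
    """Return True if every U+0931 starts one of the two permitted sequences."""
    cps = [ord(ch) for ch in label]
    for i, cp in enumerate(cps):
        if cp != RRA:
            continue
        # Slices never raise, even when RRA is at the end of the label.
        has_halant = cps[i + 1:i + 2] == [HALANT]
        has_tail = bool(EYELASH_TAILS.intersection(cps[i + 2:i + 3]))
        if not (has_halant and has_tail):
            return False
    return True
```
</pre>
    <p>Under these assumptions, a label consisting of U+0931 alone would be rejected, while 
    U+0931 U+094D U+092F would pass this particular check.</p>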
    
    <p>A number of other sequences have been defined in connection with the definition of variants, 
     bringing the total number of repertoire entries to 92 (see &ldquo;Variants&rdquo; below).</p>

     <p>The repertoire includes code points used by languages written in Devanagari that
     fall within [EGIDS] scale 1 to 4. Boro, Braj, Dhundari, Mundari and Kharia are additionally covered. 
     Though listed in EGIDS scale 4, Saraiki is not covered, because the Devanagari 
     script is &ldquo;no longer in use&rdquo; by the Saraiki community. For more details, see Section 5 &ldquo;Repertoire&rdquo; in [Proposal].</p>
	
    <p>The repertoire is based on [MSR-2], which is a subset of Unicode 6.3 [Unicode 6.3].</p>

    <h2>Variants</h2>
    <p>According to Section 6 &ldquo;Variants&rdquo; in [Proposal], this LGR defines variants that are
    &ldquo;Confusing due to deviation from normally perceived character formations by the larger linguistic 
    community&rdquo;. These are not cases of mere visual similarity, as they involve deviations from 
    the widely accepted norms of Devanagari akshar formation. They can confuse even a
    careful observer and are hence proposed as variants. They fall into two broad categories:</p>
    <ul>
    <li> Vowel/Vowel sign followed by Nukta</li>
    <li> Unique Vowels and Vowel Signs required for Kashmiri</li>
    </ul>
    <p>Variant Disposition: All variants are of type &ldquo;blocked&rdquo;, making labels that 
    differ only by these variants mutually exclusive: whichever of two such labels 
    is chosen first would be delegated, while the other label should be blocked.</p>
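
    <p>The effect of blocked variants can be sketched as follows. This Python fragment is an 
    illustrative assumption, not part of the LGR: it lists only a few of the single-code-point 
    variant pairs defined in this file, and the names <code>index_label</code> and 
    <code>conflicts</code> are hypothetical. Variants that involve sequences are not handled.</p>
<pre>
```python
# A few single-code-point variant pairs from this LGR (partial list).
VARIANT_PAIRS = [
    (0x090E, 0x0910),  # LETTER SHORT E / LETTER AI
    (0x0914, 0x0975),  # LETTER AU / LETTER AW
    (0x0946, 0x0947),  # VOWEL SIGN SHORT E / VOWEL SIGN E
    (0x094C, 0x094F),  # VOWEL SIGN AU / VOWEL SIGN AW
]

# Map the second member of each pair onto the first one.
CANONICAL = {second: first for first, second in VARIANT_PAIRS}


def index_label(label):
    """Replace every listed variant by its canonical code point."""
    return "".join(chr(CANONICAL.get(ord(ch), ord(ch))) for ch in label)


def conflicts(label_a, label_b):
    """True if two distinct labels differ only by the listed variants."""
    return label_a != label_b and index_label(label_a) == index_label(label_b)
```
</pre>
    <p>With this sketch, two labels that differ only in U+0946 versus U+0947 would conflict, so 
    once one of them is delegated the other would be blocked.</p>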
    
    <p>In addition to these, cross-script variant analysis of Devanagari has been carried out by the NBGP. Possible cross-script
    variant cases were found with the Gurmukhi script and are listed in Appendix 1 of the [Proposal].</p>
      
    <h2>Character Classes</h2>
    <p>Devanagari is an alphasyllabary and the heart of the writing system is 
    the <em>akshar</em>. It is this unit that users of the script recognize instinctively.
    The writing system of Devanagari can be summed up as composed of Consonants, the Implicit 
    Vowel Killer (Halant), Vowels, Anusvara, Candrabindu, Nukta and Visarga.</p>
    
    <p>Consonants: Devanagari consonants have an implicit schwa /&#x0259;/ included in them. 
    To make a full syllable, consonants may be followed by certain code points from one or more 
    of the other groups (see &ldquo;WLE rules&rdquo; below). See Section &ldquo;3.3.1 The Consonants&rdquo; of the [Proposal].</p>
    
    <p>Halant: All consonants have an implicit vowel (schwa) within them.
    A special sign is needed to denote that this implicit vowel is stripped off. 
    This is known as the Halant (U+094D). The Halant thus joins two consonants and 
    creates conjuncts, which generally combine 2 to 4 consonants. 
    In rare cases it can join up to 5 consonants. However, this LGR does not enforce any limit.
    See Section 3.3.2 &ldquo;The Implicit Vowel Killer: Halant&rdquo; in [Proposal].</p>

    <p>Vowels: There are separate code points for vowels that are pronounced 
    independently at the beginning of a syllable or after a vowel sound. To indicate 
    a vowel sound other than the implicit schwa following a consonant, a vowel sign (Matra) is 
    attached to the consonant. There is an equivalent Matra for each vowel except U+0905. 
    See Section &ldquo;3.3.3 Vowels&rdquo; of the [Proposal].</p>
    
    <p>Anusvara: The Anusvara (showing a nasal at the end of a syllable) can follow a vowel, 
    matra, consonant or nukta. See Section &ldquo;3.3.4 The Anusvara&rdquo; of the [Proposal].</p>
    
    <p>Candrabindu: A Candrabindu denotes nasalization of the preceding vowel. Present-day 
    Hindi users tend to replace the Candrabindu by the Anusvara. 
    See Section &ldquo;3.3.5 Nasalization: Candrabindu&rdquo; of the [Proposal].</p>
    
    <p>Nukta: The nukta sign is placed below a limited set of consonants 
    to represent sounds found only in words borrowed from Perso-Arabic. 
    It is also placed under U+0921 and U+0922 to indicate 
    flapped sounds. In addition, the Santali language uses a Nukta adjoined to certain vowels and vowel signs. 
    See Section &ldquo;3.3.6 Nukta&rdquo; of the [Proposal].</p>

    <p>Visarga: The Visarga (U+0903), representing an aspiration at the end of a syllable, is 
    frequently used in Sanskrit. See Section &ldquo;3.3.7 Visarga and Avagraha&rdquo; of the [Proposal].</p>

      
    <h2>Whole Label Evaluation (WLE) rules</h2>
    <h3>Default Whole Label Evaluation Rules</h3>
    <p>The LGR includes the set of required default WLE rules and actions applicable to 
        the Root Zone and defined in [MSR-2]. They are marked with &#x235F;.</p> 
        
    <h3>Devanagari specific Rules</h3>
    <p>These rules ensure that the Devanagari label conforms
    to akshar formation norms for the Devanagari script.  These norms are exclusively 
    presented as context rules.</p>
    
    <p>The following symbols are used in the names and comments for WLE rules: </p>
    <ul>
      <li>C    &#x2192;    Consonant</li>
      <li>M    &#x2192;    Matra</li>
      <li>V    &#x2192;    Vowel</li>
      <li>B    &#x2192;    Anusvara (Bindu)</li> 
      <li>D    &#x2192;    Candrabindu</li>
      <li>X    &#x2192;    Visarga</li>
      <li>H    &#x2192;    Halant / Virama</li>
      <li>N    &#x2192;    Nukta</li>
        <li>S    &#x2192;    Eyelash Reph (C1HC2) where:
        <ul> 
          <li>C1 is U+0931 DEVANAGARI LETTER RRA</li>
          <li>H is U+094D DEVANAGARI SIGN VIRAMA</li>
          <li>C2 is either
                <ul>
                <li>U+092F DEVANAGARI LETTER YA</li>
                <li>or U+0939 DEVANAGARI LETTER HA</li>
                </ul>
          </li>
        </ul>
        </li></ul>
    
    <p>The rules are: </p>
     <ul>
     <li>1.    N: must be preceded by one of a specific set of Cs, Vs and Ms</li>
     <li>2.    H: must be preceded by C or CN</li>
     <li>3.    M: must be preceded by C or CN</li>
     <li>4.    X: must be preceded by one of V, C, N or M</li>
     <li>5.    B: must be preceded by one of V, C, N or M</li>
     <li>6.    D: must be preceded by one of V, C, N or M</li>
     <li>7.    V: can NOT be preceded by H</li>
     </ul>
    
     <p>See Section &ldquo;7 Whole Label Evaluation Rules (WLE)&rdquo; of the [Proposal].</p>
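
     <p>The seven rules listed above can be approximated by the following Python sketch. It is 
     illustrative only and not part of the LGR: the class sets for C, V and M are deliberately 
     partial, all function and variable names are hypothetical, and sequences defined in the 
     repertoire (including the Eyelash Reph sequences) are ignored. Code points outside the 
     listed sets are treated as unconstrained.</p>
<pre>
```python
CONSONANTS = {0x0915, 0x0916, 0x0917, 0x091C, 0x0921, 0x0930}  # partial C class
VOWELS = {0x0905, 0x0906, 0x0907, 0x0913}                      # partial V class
MATRAS = {0x093E, 0x093F, 0x0940, 0x094B}                      # partial M class
CANDRABINDU, ANUSVARA, VISARGA = 0x0901, 0x0902, 0x0903
NUKTA, HALANT = 0x093C, 0x094D

# Rule 1: the code points that may directly precede a Nukta.
NUKTA_BASES = {0x0915, 0x0916, 0x0917, 0x091A, 0x091B, 0x091C, 0x0921,
               0x0922, 0x092B, 0x0906, 0x0913, 0x093E, 0x094B}


def follows_c_or_cn(cps, i):
    """True if position i is preceded by C, or by N that itself follows C."""
    if i == 0:
        return False
    if cps[i - 1] in CONSONANTS:
        return True
    return i >= 2 and cps[i - 1] == NUKTA and cps[i - 2] in CONSONANTS


def is_well_formed(label):
    """Apply the simplified rules 1 to 7 to every code point of the label."""
    cps = [ord(ch) for ch in label]
    for i, cp in enumerate(cps):
        prev = cps[i - 1] if i else None
        if cp == NUKTA:
            ok = prev in NUKTA_BASES                      # rule 1
        elif cp == HALANT or cp in MATRAS:
            ok = follows_c_or_cn(cps, i)                  # rules 2 and 3
        elif cp in (VISARGA, ANUSVARA, CANDRABINDU):
            ok = (prev in VOWELS or prev in CONSONANTS
                  or prev in MATRAS or prev == NUKTA)     # rules 4 to 6
        elif cp in VOWELS:
            ok = prev != HALANT                           # rule 7
        else:
            ok = True
        if not ok:
            return False
    return True
```
</pre>
     <p>Under these assumptions, a consonant followed by a Matra passes, whereas a label that 
     starts with a Matra or places a vowel directly after a Halant does not.</p>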
    
    <h2>Overall Development Process and Methodology</h2>
    <p>The Neo-Brahmi Generation Panel covers many different scripts 
    belonging to separate Unicode blocks. Each of these scripts will be assigned 
    a separate LGR; however, the Neo-Brahmi GP will ensure that the fundamental philosophy 
    behind building each LGR is in sync with that of all other Brahmi-derived 
    scripts. This is the Devanagari LGR, which caters to multiple languages written 
    in Devanagari that fall within EGIDS scale 1 to 4.</p>
    
    <p>For additional details and contributors, see Sections 4 and  8 of [Proposal].</p>
    
    <h2>References</h2> 
    <p>References [0] and up (below [100]) refer to the Unicode Standard version in which the corresponding code 
    points were initially encoded. References [100] and up correspond to sources given 
    in [Proposal] to justify the inclusion of the corresponding code points. 
    A single code point or range may have multiple source reference values.</p>
    
    <p>In addition, the following references are cited in this document:</p>
    <dl class="references">

     <dt>[MSR-2]</dt>
     <dd>Integration Panel, "Maximal Starting Repertoire &ndash; MSR-2 Overview and Rationale", 14 April 2015
     https://www.icann.org/en/system/files/files/msr-2-overview-14apr15-en.pdf</dd>

     <dt>[Proposal]</dt> 
     <dd>Neo-Brahmi Generation Panel, &ldquo;Proposal for a Devanagari Script Root Zone Label 
     Generation Rule-Set (LGR)&rdquo;, [date TBD][link TBD]</dd>

     <dt>[RFC 7940]</dt>
     <dd>Davies, K. and A. Freytag, "Representing Label Generation Rulesets Using XML", 
     RFC 7940, August 2016, http://www.rfc-editor.org/info/rfc7940. </dd> 

     <dt>[EGIDS]</dt>
     <dd>Expanded Graded Intergenerational Disruption Scale, https://www.ethnologue.com/about/language-status (Accessed on 13th Nov. 2017)</dd>
     
     <dt>[Unicode 6.3]</dt>
     <dd>The Unicode Consortium. The Unicode Standard, Version 6.3.0, (Mountain View, CA: The Unicode Consortium, 2013. ISBN 978-1-936213-08-5) 
     http://www.unicode.org/versions/Unicode6.3.0/</dd>
    </dl>

    <p>For more details on references [0] and up and [100] and up, refer to the Table of References below.</p>
]]></description>
    <references>
      <reference id="0" comment="Any code point originally encoded in Unicode 1.1">The Unicode Standard 1.1</reference>
      <reference id="8" comment="Any code point originally encoded in Unicode 5.0">The Unicode Standard 5.0</reference>
      <reference id="9" comment="Any code point originally encoded in Unicode 5.1">The Unicode Standard 5.1</reference>
      <reference id="11" comment="Any code point originally encoded in Unicode 6.0">The Unicode Standard 6.0</reference>
      <reference id="100">DEVAN&#x0100;GAR&#x012A; VIP TEAM: VARIANT ISSUES REPORT, 3rd Oct. 2011, https://archive.icann.org/en/topics/new-gtlds/devanagari-vip-issues-report-03oct11-en.pdf (Accessed on 10th Oct. 2017)</reference>
      <reference id="101">Omniglot, "Hindi", https://www.omniglot.com/writing/hindi.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="102">Omniglot, "Marathi", https://www.omniglot.com/writing/marathi.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="103">Omniglot, "Sanskrit", https://www.omniglot.com/writing/sanskrit.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="104">Omniglot, "Sindhi", https://www.omniglot.com/writing/sindhi.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="105">Omniglot, "Kashmiri", https://www.omniglot.com/writing/kashmiri.htm (Accessed on 10th Oct. 2017)</reference>
      <reference id="106">Unicode 10.0.0, "South and Central Asia-I, Page 456 (R5 and R5a)", http://www.unicode.org/versions/Unicode10.0.0/ch12.pdf (Accessed on 13th Nov. 2017)</reference>
      <reference id="107">Unicode Indic Group, "Devanagari Eyelash Ra", http://unicode.org/~emuller/iwg/p8/utcdoc.html (Accessed on 13th Nov. 2017)</reference>
	  <reference id="108">M.K. Raina, "How to read and write Kashmiri in Devanagari?", http://www.koshur.org/pdf/Let%20Us%20Learn%20Kashmiri.pdf (Accessed on 12th Dec. 2017)</reference>
      <reference id="109">Central Hindi Directorate-Ministry of HRD-Govt. of India, "DEVAN&#x0100;GAR&#x012A; ALPHABET AND ITS ROMANIZATION", http://hindinideshalaya.nic.in/english/hindi_orgin/devnagarithesysmbols.html (Accessed on 12th Dec. 2017)</reference>
	  <reference id="110">Omniglot, "Bodo", https://www.omniglot.com/writing/bodo.htm (Accessed on 12th Dec. 2017)</reference>
	  <reference id="111">Omniglot, "Maithili", https://www.omniglot.com/writing/maithili.htm (Accessed on 12th Dec. 2017)</reference>
    </references>
  </meta>
  <data>
    <char cp="0901" when="follows-only-V-or-C-or-N-or-M" tag="chandrabindu" ref="0 101 102 103" comment="DEVANAGARI SIGN CANDRABINDU" />
    <char cp="0902" when="follows-only-V-or-C-or-N-or-M" tag="anusvara" ref="0 101 102 103" comment="DEVANAGARI SIGN ANUSVARA">
      <var cp="093A" type="blocked" />
    </char>
    <char cp="0903" when="follows-only-V-or-C-or-N-or-M" tag="visarga" ref="0 101 102 103" comment="DEVANAGARI SIGN VISARGA" />
    <char cp="0905" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104" comment="DEVANAGARI LETTER A" />
    <char cp="0905 0902" not-when="preceded-by-H" ref="0 105 108" comment="DEVANAGARI LETTER A + DEVANAGARI SIGN ANUSVARA">
      <var cp="0973" type="blocked" />
    </char>
    <char cp="0906" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104" comment="DEVANAGARI LETTER AA">
      <var cp="0906 093C" type="blocked" />
    </char>
    <char cp="0906 0902" not-when="preceded-by-H" ref="0 105 108" comment="DEVANAGARI LETTER AA + DEVANAGARI SIGN ANUSVARA">
      <var cp="0974" type="blocked" />
    </char>
    <char cp="0906 093C" not-when="preceded-by-H" ref="0" comment="DEVANAGARI LETTER AA + NUKTA">
      <var cp="0906" type="blocked" />
    </char>
    <char cp="0907" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104" comment="DEVANAGARI LETTER I" />
    <char cp="0908" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104" comment="DEVANAGARI LETTER II" />
    <char cp="0909" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104" comment="DEVANAGARI LETTER U" />
    <char cp="090A" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104" comment="DEVANAGARI LETTER UU" />
    <char cp="090B" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103" comment="DEVANAGARI LETTER VOCALIC R" />
    <char cp="090D" not-when="preceded-by-H" tag="vowel" ref="0 101" comment="DEVANAGARI LETTER CANDRA E" />
    <char cp="090E" not-when="preceded-by-H" tag="vowel" ref="0 105 108" comment="DEVANAGARI LETTER SHORT E">
      <var cp="0910" type="blocked" />
    </char>
    <char cp="090F" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER E" />
    <char cp="0910" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER AI">
      <var cp="090E" type="blocked" />
    </char>
    <char cp="0911" not-when="preceded-by-H" tag="vowel" ref="0 100 108" comment="DEVANAGARI LETTER CANDRA O" />
    <char cp="0912" not-when="preceded-by-H" tag="vowel" ref="0 105 108" comment="DEVANAGARI LETTER SHORT O" />
    <char cp="0913" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER O">
      <var cp="0913 093C" type="blocked" />
    </char>
    <char cp="0913 093C" not-when="preceded-by-H" ref="0" comment="DEVANAGARI LETTER O + NUKTA">
      <var cp="0913" type="blocked" />
    </char>
    <char cp="0914" not-when="preceded-by-H" tag="vowel" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER AU">
      <var cp="0975" type="blocked" />
    </char>
    <char cp="0915" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER KA" />
    <char cp="0916" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER KHA" />
    <char cp="0917" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER GA" />
    <char cp="0918" tag="consonant" ref="0 101 102 103 104" comment="DEVANAGARI LETTER GHA" />
    <char cp="0919" tag="consonant" ref="0 101 102 103" comment="DEVANAGARI LETTER NGA" />
    <char cp="091A" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER CA" />
    <char cp="091B" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER CHA" />
    <char cp="091C" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER JA" />
    <char cp="091D" tag="consonant" ref="0 101 102 103 104" comment="DEVANAGARI LETTER JHA" />
    <char cp="091E" tag="consonant" ref="0 101 102 103" comment="DEVANAGARI LETTER NYA" />
    <char cp="091F" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER TTA" />
    <char cp="0920" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER TTHA" />
    <char cp="0921" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER DDA" />
    <char cp="0922" tag="consonant" ref="0 101 102 103 104" comment="DEVANAGARI LETTER DDHA" />
    <char cp="0923" tag="consonant" ref="0 101 102 103 104" comment="DEVANAGARI LETTER NNA" />
    <char cp="0924" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER TA" />
    <char cp="0925" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER THA" />
    <char cp="0926" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER DA" />
    <char cp="0927" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER DHA" />
    <char cp="0928" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER NA" />
    <char cp="092A" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER PA" />
    <char cp="092B" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER PHA" />
    <char cp="092C" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER BA" />
    <char cp="092D" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER BHA" />
    <char cp="092E" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER MA" />
    <char cp="092F" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER YA" />
    <char cp="0930" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER RA" />
    <char cp="0931 094D 092F" ref="0 106 107" comment="Section 7, Case of Eyelash Reph: To enable Eyelash Reph Formation with Rra+Halanta+Ya and prevent stand-alone RRA from occurring" />
    <char cp="0931 094D 0939" ref="0 106 107" comment="Section 7, Case of Eyelash Reph: To enable Eyelash Reph Formation with Rra+Halanta+Ha and prevent stand-alone RRA from occurring" />
    <char cp="0932" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER LA" />
    <char cp="0933" tag="consonant" ref="0 102 103" comment="DEVANAGARI LETTER LLA" />
    <char cp="0935" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER VA" />
    <char cp="0936" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER SHA" />
    <char cp="0937" tag="consonant" ref="0 101 102 103 104" comment="DEVANAGARI LETTER SSA" />
    <char cp="0938" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER SA" />
    <char cp="0939" tag="consonant" ref="0 101 102 103 104 105 108" comment="DEVANAGARI LETTER HA" />
    <char cp="093A" when="follows-only-C-or-CN" tag="matra" ref="11 105 108" comment="DEVANAGARI VOWEL SIGN OE">
      <var cp="0902" type="blocked" />
    </char>
    <char cp="093B" when="follows-only-C-or-CN" tag="matra" ref="11 105 108" comment="DEVANAGARI VOWEL SIGN OOE">
      <var cp="093E 0902" type="blocked" />
    </char>
    <char cp="093C" when="follows-only-specific-C-or-V-or-M" tag="nukta" ref="0 101 104 105 108 109 110 111" comment="DEVANAGARI SIGN NUKTA" />
    <char cp="093E" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103" comment="DEVANAGARI VOWEL SIGN AA">
      <var cp="093E 093C" type="blocked" />
    </char>
    <char cp="093E 0902" when="follows-only-C-or-CN" comment="DEVANAGARI VOWEL SIGN AA + DEVANAGARI SIGN ANUSVARA">
      <var cp="093B" type="blocked" />
    </char>
    <char cp="093E 093C" when="follows-only-C-or-CN" ref="0" comment="DEVANAGARI VOWEL SIGN AA + NUKTA">
      <var cp="093E" type="blocked" />
    </char>
    <char cp="093F" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103" comment="DEVANAGARI VOWEL SIGN I; stands to the left of the consonant" />
    <char cp="0940" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103" comment="DEVANAGARI VOWEL SIGN II" />
    <char cp="0941" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103" comment="DEVANAGARI VOWEL SIGN U" />
    <char cp="0942" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103" comment="DEVANAGARI VOWEL SIGN UU" />
    <char cp="0943" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103" comment="DEVANAGARI VOWEL SIGN VOCALIC R" />
    <char cp="0945" when="follows-only-C-or-CN" tag="matra" ref="0 100 101 108" comment="DEVANAGARI VOWEL SIGN CANDRA E = candra" />
    <char cp="0946" when="follows-only-C-or-CN" tag="matra" ref="0 105 108" comment="DEVANAGARI VOWEL SIGN SHORT E">
      <var cp="0947" type="blocked" />
    </char>
    <char cp="0947" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103 105 108" comment="DEVANAGARI VOWEL SIGN E">
      <var cp="0946" type="blocked" />
    </char>
    <char cp="0948" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103" comment="DEVANAGARI VOWEL SIGN AI" />
    <char cp="0949" when="follows-only-C-or-CN" tag="matra" ref="0 100 108" comment="DEVANAGARI VOWEL SIGN CANDRA O" />
    <char cp="094A" when="follows-only-C-or-CN" tag="matra" ref="0 105 108" comment="DEVANAGARI VOWEL SIGN SHORT O" />
    <char cp="094B" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103 105 108" comment="DEVANAGARI VOWEL SIGN O">
      <var cp="094B 093C" type="blocked" />
    </char>
    <char cp="094B 093C" when="follows-only-C-or-CN" comment="DEVANAGARI VOWEL SIGN O + NUKTA">
      <var cp="094B" type="blocked" />
    </char>
    <char cp="094C" when="follows-only-C-or-CN" tag="matra" ref="0 101 102 103 105 108" comment="DEVANAGARI VOWEL SIGN AU">
      <var cp="094F" type="blocked" />
    </char>
    <char cp="094D" when="follows-only-C-or-CN" tag="halant" ref="0 101 102 103 105 108" comment="DEVANAGARI SIGN VIRAMA = halant; suppresses inherent vowel" />
    <char cp="094F" when="follows-only-C-or-CN" tag="matra" ref="0 105 108" comment="DEVANAGARI VOWEL SIGN AW">
      <var cp="094C" type="blocked" />
    </char>
    <char cp="0956" when="follows-only-C-or-CN" tag="matra" ref="11 105 108" comment="DEVANAGARI VOWEL SIGN UE" />
    <char cp="0957" when="follows-only-C-or-CN" tag="matra" ref="11 105 108" comment="DEVANAGARI VOWEL SIGN UUE" />
    <char cp="0972" not-when="preceded-by-H" tag="vowel" ref="9 100 108" comment="DEVANAGARI LETTER CANDRA A" />
    <char cp="0973" not-when="preceded-by-H" tag="vowel" ref="11 105 108" comment="DEVANAGARI LETTER OE">
      <var cp="0905 0902" type="blocked" />
    </char>
    <char cp="0974" not-when="preceded-by-H" tag="vowel" ref="11 105 108" comment="DEVANAGARI LETTER OOE">
      <var cp="0906 0902" type="blocked" />
    </char>
    <char cp="0975" not-when="preceded-by-H" tag="vowel" ref="11 105 108" comment="DEVANAGARI LETTER AW">
      <var cp="0914" type="blocked" />
    </char>
    <char cp="0976" not-when="preceded-by-H" tag="vowel" ref="11 105 108" comment="DEVANAGARI LETTER UE" />
    <char cp="0977" not-when="preceded-by-H" tag="vowel" ref="11 105 108" comment="DEVANAGARI LETTER UUE" />
    <char cp="097B" tag="consonant" ref="8 104" comment="DEVANAGARI LETTER GGA" />
    <char cp="097C" tag="consonant" ref="8 104" comment="DEVANAGARI LETTER JJA" />
    <char cp="097E" tag="consonant" ref="8 104" comment="DEVANAGARI LETTER DDDA" />
    <char cp="097F" tag="consonant" ref="8 104" comment="DEVANAGARI LETTER BBA" />
  </data>
  <!--Rules section goes here-->
  <rules>
    <!--Character class definitions go here-->
    <class name="consonant" from-tag="consonant" />
    <class name="vowel" from-tag="vowel" />
    <class name="matra" from-tag="matra" />
    <class name="halant" from-tag="halant" />
    <class name="nukta" from-tag="nukta" />
    <!--Whole label evaluation and context rules go here-->
    <rule name="leading-combining-mark" comment="Default rule from MSR-2 &#x235F;">
      <start />
      <union>
        <class property="gc:Mn" />
        <class property="gc:Mc" />
      </union>
    </rule>
    <rule name="follows-only-specific-C-or-V-or-M" comment="Section 7, WLE 1: Rule for Nukta which restricts its preceding characters">
      <look-behind>
        <choice>
          <char cp="0915" comment="DEVANAGARI LETTER KA" />
          <char cp="0916" comment="DEVANAGARI LETTER KHA" />
          <char cp="0917" comment="DEVANAGARI LETTER GA" />
          <char cp="091A" comment="DEVANAGARI LETTER CA" />
          <char cp="091B" comment="DEVANAGARI LETTER CHA" />
          <char cp="091C" comment="DEVANAGARI LETTER JA" />
          <char cp="0921" comment="DEVANAGARI LETTER DDA" />
          <char cp="0922" comment="DEVANAGARI LETTER DDHA" />
          <char cp="092B" comment="DEVANAGARI LETTER PHA" />
          <char cp="0906" comment="Vowel: DEVANAGARI LETTER AA (Required in Santali language)" />
          <char cp="0913" comment="Vowel: DEVANAGARI LETTER O (Required in Santali language)" />
          <char cp="093E" comment="Matra: DEVANAGARI VOWEL SIGN AA (Required in Santali language)" />
          <char cp="094B" comment="Matra: DEVANAGARI VOWEL SIGN O (Required in Santali language)" />
        </choice>
      </look-behind>
      <anchor />
    </rule>
    <rule name="follows-only-C-or-CN" comment="Section 7, WLE 2, 3: These characters can only be preceded by consonant or a nukta which is preceded by a consonant">
      <look-behind>
        <choice>
          <class by-ref="consonant" />
          <rule comment="nukta-preceded-by-consonant">
            <class by-ref="consonant" />
            <class by-ref="nukta" />
          </rule>
        </choice>
      </look-behind>
      <anchor />
    </rule>
    <rule name="follows-only-V-or-C-or-N-or-M" comment="Section 7, WLE 4, 5, 6: These characters can only be preceded by either Vowel or Consonant or Nukta or Matra">
      <look-behind>
        <choice>
          <class by-ref="vowel" />
          <class by-ref="consonant" />
          <class by-ref="nukta" />
          <class by-ref="matra" />
        </choice>
      </look-behind>
      <anchor />
    </rule>
    <rule name="preceded-by-H" comment="Section 7, WLE 7: A vowel is not allowed to follow an H">
      <look-behind>
        <class by-ref="halant" />
      </look-behind>
      <anchor />
    </rule>
    <!--Action elements go here - order defines precedence-->
    <action disp="invalid" match="leading-combining-mark" />
    <action disp="invalid" any-variant="out-of-repertoire-var" comment="any variant label with a code point out of repertoire is invalid &#x235F;" />
    <action disp="blocked" any-variant="blocked" comment="default action MSR-2 &#x235F;" />
    <action disp="allocatable" any-variant="allocatable" comment="default action MSR-2 &#x235F;" />
    <action disp="valid" comment="catch all; default action from MSR-2 &#x235F;" />
  </rules>
</lgr>