Proposal-Hebrew-Lgr-24apr19-En

Save as HTML

This document is mechanically formatted from the XML file for the LGR. It provides additional summary data and explanatory text. The XML file remains the sole normative specification of the LGR.

LGR Version 3
Date 2019-04-24
Language(s) und-Hebr
Scope(s) domain: .
Unicode Version 6.3.0

Table of Contents

  1. Description
  2. Repertoire
  3. Variant Sets
  4. Classes, Rules and Actions
    1. Character Classes
    2. Whole label evaluation and context rules
    3. Actions
  5. Table of References

Description

Repertoire

Summary

Number of elements in repertoire 27
Number of ranges in repertoire 3
Number of code point sequences 0

Repertoire by Code Point

The following table lists the repertoire by code point (or code point sequence). The data in the Script and Name column are extracted from the Unicode character database. Where the comment in the original LGR is equal to the character name, it has been suppressed.

For any code point or sequence for which a variant is defined, the link to the associated variant set, or if mapped to itself, the variant type of that mapping is provided in the Variants column.

# Code
Point
Glyph Script Name Tags Required Context Variants Comment References
1 U+05D0 … U+05D9 אי Hebrew HEBREW LETTER ALEF … HEBREW LETTER YOD [101]
2 U+05DA ך Hebrew HEBREW LETTER FINAL KAF set 1 Final form [101]
3 U+05DB כ Hebrew HEBREW LETTER KAF set 1 Nominal form [101]
4 U+05DC ל Hebrew HEBREW LETTER LAMED [101]
5 U+05DD ם Hebrew HEBREW LETTER FINAL MEM set 2 Final form [101]
6 U+05DE מ Hebrew HEBREW LETTER MEM set 2 Nominal form [101]
7 U+05DF ן Hebrew HEBREW LETTER FINAL NUN set 3 Final form [101]
8 U+05E0 נ Hebrew HEBREW LETTER NUN set 3 Nominal form [101]
9 U+05E1 … U+05E2 סע Hebrew HEBREW LETTER SAMEKH … HEBREW LETTER AYIN [101]
10 U+05E3 ף Hebrew HEBREW LETTER FINAL PE set 4 Final form [101]
11 U+05E4 פ Hebrew HEBREW LETTER PE set 4 Nominal form [101]
12 U+05E5 ץ Hebrew HEBREW LETTER FINAL TSADI set 5 Final form [101]
13 U+05E6 צ Hebrew HEBREW LETTER TSADI set 5 Nominal form [101]
14 U+05E7 … U+05EA קת Hebrew HEBREW LETTER QOF … HEBREW LETTER TAV [101]

Legend

Code Point
A code point or code point sequence.
Name
Shows the character or sequence name from the Unicode Character Database.
Glyph
The shape displayed depends on the fonts available to your browser.
Script
Shows the script property value from the Unicode Character Database. Combining marks may have the value Inherited and code points used with more than one script may have the value Common.
References
Links to the references associated with the code point or sequence, if any.
Tags
LGR-defined tag values. Any tags matching the Unicode script property are suppressed in this view.
Required Context
Link to the rule defining the required context a code point or sequence must satisfy. If prefixed by "not:", identifies a context that must not occur.
Variants
A link to the variant set the code point or sequence is a member of, except where a coded point or sequence maps only to itself, in which case the type of that mapping is listed.
Comment
If the comment in this row consists only of the code point or sequence name it is suppressed in this view.

Variant Sets

Summary

Number of variant sets 5
Largest variant set 2
Ordinary Variants by Type blocked (10)

The following tables list all variant sets defined in this LGR, except for singleton sets. Each table lists all variant mapping pairs of the set; one per row. Mappings are assumed to be symmetric: each row documents both forward (���) and reverse (���) mapping directions. In each table, the mappings are sorted by Source value in ascending code point order; shading is used to group mappings from the same source code point or sequence.

Where the type of both forward and reverse mappings are the same, a single value is given in the Type(s) column, otherwise the types for forward and reverse mappings, as well as comments and references are listed above one another.

A mapping where source and target are the same is reflexive. Variant sets consisting of only a single reflexive mapping are not shown as a set. Instead, the variant type of the mapping is listed in the Variants column of the Repertoire by Code Point table. Reflexive mappings that are part of a larger set are indicated with a ���������.

In any LGR with variant specifications that are well behaved, all members within each variant set are defined as variants of each other; the mappings in each set are symmetric and transitive; and all variant sets are disjoint.

Common Legend

Source
Source of the mapping pair.
Target
Destination of the mapping pair.
Glyph
The shape displayed for source or target depends on the fonts available to your browser.
��� - forward
Indicates that variant Type, References and Comment apply to the mapping from source to target.
��� - reverse
Indicates that variant Type, References and Comment apply to the reverse mapping from target to source.
��� - both
Indicates that variant Type, References and Comment apply to both forward and reverse mapping.
��� - reflexive
Indicates that variant Type, References and Comment are for a reflexive mapping where source equals target.
���� - not in LGR
Indicates that variant is not in LGR.
Type
The type of the variant mapping. There are some predefined variant types such as ���allocatable��� and ���blocked���, while others are defined specifically for each LGR.
References
One or more reference IDs (optional). A "/" separates references for reverse / forward mappings, if different.
Comment
A descriptive comment (optional). A "/" separates comments for reverse / forward mappings, if different.

Variant Set 1 ��� 2 Members - 1 Mappings

# Source Glyph Target Glyph   Type(s) References Comment
1 U+05DA ך U+05DB כ ��� blocked Nominal form variant / Final form variant

Variant Set 2 ��� 2 Members - 1 Mappings

# Source Glyph Target Glyph   Type(s) References Comment
1 U+05DD ם U+05DE מ ��� blocked Nominal form variant / Final form variant

Variant Set 3 ��� 2 Members - 1 Mappings

# Source Glyph Target Glyph   Type(s) References Comment
1 U+05DF ן U+05E0 נ ��� blocked Nominal form variant / Final form variant

Variant Set 4 ��� 2 Members - 1 Mappings

# Source Glyph Target Glyph   Type(s) References Comment
1 U+05E3 ף U+05E4 פ ��� blocked Nominal form variant / Final form variant

Variant Set 5 ��� 2 Members - 1 Mappings

# Source Glyph Target Glyph   Type(s) References Comment
1 U+05E5 ץ U+05E6 צ ��� blocked Nominal form variant / Final form variant

Classes, Rules and Actions

Character Classes

The following table lists all top-level classes with their definition and the regular expression defining their members.

Name Definition Count Members References Comment

Legend

Members or Ranges
Lists the members of the class as code points (xxx) or as ranges of code points (xxx-yyy). Any class too numerous to list in full is elided with "...".
Tag=ttt
An anonymous class implicitly defined based on tag value.
[: :] - named character set
Reference to a named character set [:name:].
(���,���,\,���) - set operators
Sets may be combined by set operators (��� = intersection, ��� = union, \ = difference, ��� = symmetric difference).

Whole label evaluation and context rules

The following table lists all the top-level, or named rules defined in the LGR and indicates whether they are used as trigger in an action or as context (when or not-when) for a code point. (Any use of context rules for variants is not indicated).

Name Regular Expression Used as
Trigger
Used as
Context
Anchor References Comment
leading-combining-mark (start) ([:class property:gc=Mn:]���[:class property:gc=Mc:]) True False False default WLE rule matching labels with leading combining marks ���

Legend

Used as Trigger
This rule triggers one of the actions listed below.
Used as Context
This rule defines a required context for a code point.
Anchor
This has a placeholder for the code point for which it is evaluated.
Regular Expression
A regular expression equivalent to the rule, shown in the standard notation with some extensions as noted:
��� - context anchor
In a regex the ��� signifies a placeholder for the actual code point, when a context is evaluated. The code point must occur at the position corresponding to the anchor. Rules containing an anchor cannot be used as triggers.
(...)��� - look-behind
If present encloses required context preceding the anchor.
���(...) - look-ahead
If present encloses required context following the anchor.
(: :) - rule reference
Non-recursive reference to a named rule.
[: :] - character set either named, implicit or property
Reference to a named character set [:name:], an implicit character set [:class tag=val:] or a given Unicode property [:class property:prop=val:]. A leading "^" before name or tag indicates the set complement.
(|) - choice operator
When there are various choices in a rule, choices are separated by the set operator (|) and each choice is represented by a set enclosed in parenthesis.
(���,���,\,���) - set operators
Sets may be combined by set operators (��� = intersection, ��� = union, \ = difference, ��� = symmetric difference).
�� - empty set
Indicated that the following set is empty because of the result of set operations or because non of its elements are part of the repertoire defined here.
An empty set that is not optional means that a rule can never match.
{m}, {m, n}, {m,} - count
Indicates that the preceding element is evaluated from m to n times. Only {m} means the preceding element is evaluated exactly m times (equivalent to {m,m}), {m,} means the preceding element is evaluated at least m times.
No count indicated the elements is evaluated once (equivalent to "{1}").

Actions

The following table lists the actions that are used to assign dispositions to labels and variant labels, based on the specified conditions. The order of actions defines their precedence: the first action triggered by a label is the one defining its disposition.

# Condition Rule / Variant Set   Disposition References Comment
1 if label match leading-combining-mark ��� invalid labels with leading combining marks are invalid ���
2 if at least one variant is in {out-of-repertoire-var} ��� invalid any variant label with a code point out of repertoire is invalid ���
3 if at least one variant is in {blocked} ��� blocked any variant label containing blocked variants is blocked ���
4 if all variants are in {allocatable} ��� allocatable variant labels with all variants allocatable are allocatable ���
5 if any label (catch-all) ��� valid catch all (default action) ���

Legend

{...} - variant type set
In the "Rule/Variant Set" column the notation {...} means a set of variant types.

Table of References

[0] The Unicode Standard 6.3.0, http://unicode.org/versions/Unicode6.3.0/
[100] Proposal for Hebrew Script Root Zone LGR, Version 1.3
https://www.icann.org/en/system/files/files/proposal-hebrew-lgr-24apr19-en.pdf
[101] Omniglot Hebrew
http://omniglot.com/writing/hebrew.htm