UProperty
interface UProperty
| android.icu.lang.UProperty |
Selection constants for Unicode properties.
These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.
The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).
For details about the properties see UAX #44: Unicode Character Database.
Important: If ICU is built with UCD files from Unicode versions below 3.2, then properties marked with "new" are not or not fully available. Check UCharacter.getUnicodeVersion() to be sure.
Summary
| Nested classes | |
|---|---|
| abstract |
Selector constants for UCharacter. |
| Constants | |
|---|---|
| static Int |
String property Age. |
| static Int |
Binary property Alphabetic. |
| static Int |
Binary property ASCII_Hex_Digit (0-9 A-F a-f). |
| static Int |
Binary property of strings Basic_Emoji. |
| static Int |
Enumerated property Bidi_Class. |
| static Int |
Binary property Bidi_Control. |
| static Int |
Binary property Bidi_Mirrored. |
| static Int |
String property Bidi_Mirroring_Glyph. |
| static Int |
String property Bidi_Paired_Bracket (new in Unicode 6.3). |
| static Int |
Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). |
| static Int |
First constant for binary Unicode properties. |
| static Int |
Enumerated property Block. |
| static Int |
Enumerated property Canonical_Combining_Class. |
| static Int |
Binary property Cased. |
| static Int |
String property Case_Folding. |
| static Int |
Binary property Case_Ignorable. |
| static Int |
Binary property Case_Sensitive. |
| static Int |
Binary property Changes_When_Casefolded. |
| static Int |
Binary property Changes_When_Casemapped. |
| static Int |
Binary property Changes_When_Lowercased. |
| static Int |
Binary property Changes_When_NFKC_Casefolded. |
| static Int |
Binary property Changes_When_Titlecased. |
| static Int |
Binary property Changes_When_Uppercased. |
| static Int |
Binary property Dash. |
| static Int |
Enumerated property Decomposition_Type. |
| static Int |
Binary property Default_Ignorable_Code_Point (new). |
| static Int |
Binary property Deprecated (new). |
| static Int |
Binary property Diacritic. |
| static Int |
First constant for double Unicode properties. |
| static Int |
Enumerated property East_Asian_Width. |
| static Int |
Binary property Emoji. |
| static Int |
Binary property Emoji_Component. |
| static Int |
Binary property of strings Emoji_Keycap_Sequence. |
| static Int |
Binary property Emoji_Modifier. |
| static Int |
Binary property Emoji_Modifier_Base. |
| static Int |
Binary property Emoji_Presentation. |
| static Int |
Binary property Extended_Pictographic. |
| static Int |
Binary property Extender. |
| static Int |
Binary property Full_Composition_Exclusion. |
| static Int |
Enumerated property General_Category. |
| static Int |
Bitmask property General_Category_Mask. |
| static Int |
Binary property Grapheme_Base (new). |
| static Int |
Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). |
| static Int |
Binary property Grapheme_Extend (new). |
| static Int |
Binary property Grapheme_Link (new). |
| static Int |
Enumerated property Hangul_Syllable_Type, new in Unicode 4. |
| static Int |
Binary property Hex_Digit. |
| static Int |
Binary property Hyphen. |
| static Int |
Binary property Ideographic. |
| static Int |
Binary property IDS_Binary_Operator (new). |
| static Int |
Binary property IDS_Trinary_Operator (new). |
| static Int |
Binary property IDS_Unary_Operator. |
| static Int |
Binary property ID_Compat_Math_Continue. |
| static Int |
Binary property ID_Compat_Math_Start. |
| static Int |
Binary property ID_Continue. |
| static Int |
Binary property ID_Start. |
| static Int |
Enumerated property Indic_Positional_Category. |
| static Int |
Enumerated property Indic_Syllabic_Category. |
| static Int |
First constant for enumerated/integer Unicode properties. |
| static Int |
Enumerated property Joining_Group. |
| static Int |
Enumerated property Joining_Type. |
| static Int |
Binary property Join_Control. |
| static Int |
Enumerated property Lead_Canonical_Combining_Class. |
| static Int |
Enumerated property Line_Break. |
| static Int |
Binary property Logical_Order_Exception (new). |
| static Int |
Binary property Lowercase. |
| static Int |
String property Lowercase_Mapping. |
| static Int |
First constant for bit-mask Unicode properties. |
| static Int |
Binary property Math. |
| static Int |
String property Name. |
| static Int |
Binary property NFC_Inert. |
| static Int |
Enumerated property NFC_Quick_Check. |
| static Int |
Binary property NFD_Inert. |
| static Int |
Enumerated property NFD_Quick_Check. |
| static Int |
Binary property NFKC_Inert. |
| static Int |
Enumerated property NFKC_Quick_Check. |
| static Int |
Binary property NFKD_Inert. |
| static Int |
Enumerated property NFKD_Quick_Check. |
| static Int |
Binary property Noncharacter_Code_Point. |
| static Int |
Enumerated property Numeric_Type. |
| static Int |
Double property Numeric_Value. |
| static Int |
First constant for Unicode properties with unusual value types. |
| static Int |
Binary property Pattern_Syntax (new in Unicode 4.1). |
| static Int |
Binary property Pattern_White_Space (new in Unicode 4.1). |
| static Int |
Binary property alnum (a C/POSIX character class). |
| static Int |
Binary property blank (a C/POSIX character class). |
| static Int |
Binary property graph (a C/POSIX character class). |
| static Int |
Binary property print (a C/POSIX character class). |
| static Int |
Binary property xdigit (a C/POSIX character class). |
| static Int |
Binary property Prepended_Concatenation_Mark. |
| static Int |
Binary property Quotation_Mark. |
| static Int |
Binary property Radical (new). |
| static Int |
Binary property Regional_Indicator. |
| static Int |
Binary property of strings RGI_Emoji. |
| static Int |
Binary property of strings RGI_Emoji_Flag_Sequence. |
| static Int |
Binary property of strings RGI_Emoji_Modifier_Sequence. |
| static Int |
Binary property of strings RGI_Emoji_Tag_Sequence. |
| static Int |
Binary property of strings RGI_Emoji_ZWJ_Sequence. |
| static Int |
Enumerated property Script. |
| static Int |
Miscellaneous property Script_Extensions (new in Unicode 6.0). |
| static Int |
Binary Property Segment_Starter. |
| static Int |
Enumerated property Sentence_Break (new in Unicode 4.1). |
| static Int |
String property Simple_Case_Folding. |
| static Int |
String property Simple_Lowercase_Mapping. |
| static Int |
String property Simple_Titlecase_Mapping. |
| static Int |
String property Simple_Uppercase_Mapping. |
| static Int |
Binary property Soft_Dotted (new). |
| static Int |
First constant for string Unicode properties. |
| static Int |
Binary property STerm (new in Unicode 4.0.1). |
| static Int |
Binary property Terminal_Punctuation. |
| static Int |
String property Titlecase_Mapping. |
| static Int |
Enumerated property Trail_Canonical_Combining_Class. |
| static Int |
Binary property Unified_Ideograph (new). |
| static Int |
Binary property Uppercase. |
| static Int |
String property Uppercase_Mapping. |
| static Int |
Binary property Variation_Selector (new in Unicode 4.0.1). |
| static Int |
Enumerated property Vertical_Orientation. |
| static Int |
Binary property White_Space. |
| static Int |
Enumerated property Word_Break (new in Unicode 4.1). |
| static Int |
Binary property XID_Continue. |
| static Int |
Binary property XID_Start. |
Constants
AGE
static val AGE: Int
String property Age. Corresponds to UCharacter.getAge(int).
Value: 16384ALPHABETIC
static val ALPHABETIC: Int
Binary property Alphabetic.
Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isalpha().
Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.
Value: 0ASCII_HEX_DIGIT
static val ASCII_HEX_DIGIT: Int
Binary property ASCII_Hex_Digit (0-9 A-F a-f).
Value: 1BASIC_EMOJI
static val BASIC_EMOJI: Int
Binary property of strings Basic_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 65BIDI_CLASS
static val BIDI_CLASS: Int
Enumerated property Bidi_Class. Same as UCharacter.getDirection(int), returns UCharacterDirection values.
Value: 4096BIDI_CONTROL
static val BIDI_CONTROL: Int
Binary property Bidi_Control.
Format controls which have specific functions in the Bidi Algorithm.
Value: 2BIDI_MIRRORED
static val BIDI_MIRRORED: Int
Binary property Bidi_Mirrored.
Characters that may change display in RTL text.
Property for UCharacter.isMirrored().
See Bidi Algorithm; UTR 9.
Value: 3BIDI_MIRRORING_GLYPH
static val BIDI_MIRRORING_GLYPH: Int
String property Bidi_Mirroring_Glyph. Corresponds to UCharacter.getMirror(int).
Value: 16385BIDI_PAIRED_BRACKET
static val BIDI_PAIRED_BRACKET: Int
String property Bidi_Paired_Bracket (new in Unicode 6.3). Corresponds to UCharacter.getBidiPairedBracket.
Value: 16397BIDI_PAIRED_BRACKET_TYPE
static val BIDI_PAIRED_BRACKET_TYPE: Int
Enumerated property Bidi_Paired_Bracket_Type (new in Unicode 6.3). Used in UAX #9: Unicode Bidirectional Algorithm (http://www.unicode.org/reports/tr9/) Returns UCharacter.BidiPairedBracketType values.
Value: 4117BINARY_START
static val BINARY_START: Int
First constant for binary Unicode properties.
Value: 0BLOCK
static val BLOCK: Int
Enumerated property Block. Same as UCharacter.UnicodeBlock.of(int), returns UCharacter.UnicodeBlock values.
Value: 4097CANONICAL_COMBINING_CLASS
static val CANONICAL_COMBINING_CLASS: Int
Enumerated property Canonical_Combining_Class. Same as UCharacter.getCombiningClass(int), returns 8-bit numeric values.
Value: 4098CASED
static val CASED: Int
Binary property Cased. For Lowercase, Uppercase and Titlecase characters.
Value: 49CASE_FOLDING
static val CASE_FOLDING: Int
String property Case_Folding. Corresponds to UCharacter.foldCase(String, boolean).
Value: 16386CASE_IGNORABLE
static val CASE_IGNORABLE: Int
Binary property Case_Ignorable. Used in context-sensitive case mappings.
Value: 50CASE_SENSITIVE
static val CASE_SENSITIVE: Int
Binary property Case_Sensitive.
Either the source of a case mapping or in the target of a case mapping. Not the same as the general category Cased_Letter.
Value: 34CHANGES_WHEN_CASEFOLDED
static val CHANGES_WHEN_CASEFOLDED: Int
Binary property Changes_When_Casefolded.
Value: 54CHANGES_WHEN_CASEMAPPED
static val CHANGES_WHEN_CASEMAPPED: Int
Binary property Changes_When_Casemapped.
Value: 55CHANGES_WHEN_LOWERCASED
static val CHANGES_WHEN_LOWERCASED: Int
Binary property Changes_When_Lowercased.
Value: 51CHANGES_WHEN_NFKC_CASEFOLDED
static val CHANGES_WHEN_NFKC_CASEFOLDED: Int
Binary property Changes_When_NFKC_Casefolded.
Value: 56CHANGES_WHEN_TITLECASED
static val CHANGES_WHEN_TITLECASED: Int
Binary property Changes_When_Titlecased.
Value: 53CHANGES_WHEN_UPPERCASED
static val CHANGES_WHEN_UPPERCASED: Int
Binary property Changes_When_Uppercased.
Value: 52DASH
static val DASH: Int
Binary property Dash.
Variations of dashes.
Value: 4DECOMPOSITION_TYPE
static val DECOMPOSITION_TYPE: Int
Enumerated property Decomposition_Type. Returns UCharacter.DecompositionType values.
Value: 4099DEFAULT_IGNORABLE_CODE_POINT
static val DEFAULT_IGNORABLE_CODE_POINT: Int
Binary property Default_Ignorable_Code_Point (new).
Property that indicates codepoint is ignorable in most processing.
Codepoints (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)
Value: 5DEPRECATED
static val DEPRECATED: Int
Binary property Deprecated (new).
The usage of deprecated characters is strongly discouraged.
Value: 6DIACRITIC
static val DIACRITIC: Int
Binary property Diacritic.
Characters that linguistically modify the meaning of another character to which they apply.
Value: 7DOUBLE_START
static val DOUBLE_START: Int
First constant for double Unicode properties.
Value: 12288EAST_ASIAN_WIDTH
static val EAST_ASIAN_WIDTH: Int
Enumerated property East_Asian_Width. See http://www.unicode.org/reports/tr11/ Returns UCharacter.EastAsianWidth values.
Value: 4100EMOJI
static val EMOJI: Int
Binary property Emoji. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 57EMOJI_COMPONENT
static val EMOJI_COMPONENT: Int
Binary property Emoji_Component. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 61EMOJI_KEYCAP_SEQUENCE
static val EMOJI_KEYCAP_SEQUENCE: Int
Binary property of strings Emoji_Keycap_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 66EMOJI_MODIFIER
static val EMOJI_MODIFIER: Int
Binary property Emoji_Modifier. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 59EMOJI_MODIFIER_BASE
static val EMOJI_MODIFIER_BASE: Int
Binary property Emoji_Modifier_Base. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 60EMOJI_PRESENTATION
static val EMOJI_PRESENTATION: Int
Binary property Emoji_Presentation. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 58EXTENDED_PICTOGRAPHIC
static val EXTENDED_PICTOGRAPHIC: Int
Binary property Extended_Pictographic. See http://www.unicode.org/reports/tr51/#Emoji_Properties
Value: 64EXTENDER
static val EXTENDER: Int
Binary property Extender.
Extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.
Value: 8FULL_COMPOSITION_EXCLUSION
static val FULL_COMPOSITION_EXCLUSION: Int
Binary property Full_Composition_Exclusion.
CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.
Value: 9GENERAL_CATEGORY
static val GENERAL_CATEGORY: Int
Enumerated property General_Category. Same as UCharacter.getType(int), returns UCharacterCategory values.
Value: 4101GENERAL_CATEGORY_MASK
static val GENERAL_CATEGORY_MASK: Int
Bitmask property General_Category_Mask. This is the General_Category property returned as a bit mask. When used in UCharacter.getIntPropertyValue(c), returns bit masks for UCharacterCategory values where exactly one bit is set. When used with UCharacter.getPropertyValueName() and UCharacter.getPropertyValueEnum(), a multi-bit mask is used for sets of categories like "Letters".
Value: 8192GRAPHEME_BASE
static val GRAPHEME_BASE: Int
Binary property Grapheme_Base (new).
For programmatic determination of grapheme cluster boundaries. [0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ
Value: 10GRAPHEME_CLUSTER_BREAK
static val GRAPHEME_CLUSTER_BREAK: Int
Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.GraphemeClusterBreak values.
Value: 4114GRAPHEME_EXTEND
static val GRAPHEME_EXTEND: Int
Binary property Grapheme_Extend (new).
For programmatic determination of grapheme cluster boundaries.
Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ
Value: 11GRAPHEME_LINK
static val GRAPHEME_LINK: Int
Binary property Grapheme_Link (new).
For programmatic determination of grapheme cluster boundaries.
Value: 12HANGUL_SYLLABLE_TYPE
static val HANGUL_SYLLABLE_TYPE: Int
Enumerated property Hangul_Syllable_Type, new in Unicode 4. Returns UCharacter.HangulSyllableType values.
Value: 4107HEX_DIGIT
static val HEX_DIGIT: Int
Binary property Hex_Digit.
Characters commonly used for hexadecimal numbers.
Value: 13HYPHEN
static val HYPHEN: Int
Binary property Hyphen.
Dashes used to mark connections between pieces of words, plus the Katakana middle dot.
Value: 14IDEOGRAPHIC
static val IDEOGRAPHIC: Int
Binary property Ideographic.
CJKV ideographs.
Value: 17IDS_BINARY_OPERATOR
static val IDS_BINARY_OPERATOR: Int
Binary property IDS_Binary_Operator (new).
For programmatic determination of Ideographic Description Sequences.
Value: 18IDS_TRINARY_OPERATOR
static val IDS_TRINARY_OPERATOR: Int
Binary property IDS_Trinary_Operator (new).
For programmatic determination of Ideographic Description Sequences.
Value: 19IDS_UNARY_OPERATOR
static val IDS_UNARY_OPERATOR: Int
Binary property IDS_Unary_Operator. For programmatic determination of Ideographic Description Sequences.
Value: 72ID_COMPAT_MATH_CONTINUE
static val ID_COMPAT_MATH_CONTINUE: Int
Binary property ID_Compat_Math_Continue.
Used in mathematical identifier profile in UAX #31.
Value: 74ID_COMPAT_MATH_START
static val ID_COMPAT_MATH_START: Int
Binary property ID_Compat_Math_Start.
Used in mathematical identifier profile in UAX #31.
Value: 73ID_CONTINUE
static val ID_CONTINUE: Int
Binary property ID_Continue.
Characters that can continue an identifier.
ID_Start+Mn+Mc+Nd+Pc
Value: 15ID_START
static val ID_START: Int
Binary property ID_Start.
Characters that can start an identifier.
Lu+Ll+Lt+Lm+Lo+Nl
Value: 16INDIC_POSITIONAL_CATEGORY
static val INDIC_POSITIONAL_CATEGORY: Int
Enumerated property Indic_Positional_Category. New in Unicode 6.0 as provisional property Indic_Matra_Category; renamed and changed to informative in Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicPositionalCategory.txt
Value: 4118INDIC_SYLLABIC_CATEGORY
static val INDIC_SYLLABIC_CATEGORY: Int
Enumerated property Indic_Syllabic_Category. New in Unicode 6.0 as provisional; informative since Unicode 8.0. See http://www.unicode.org/reports/tr44/#IndicSyllabicCategory.txt
Value: 4119INT_START
static val INT_START: Int
First constant for enumerated/integer Unicode properties.
Value: 4096JOINING_GROUP
static val JOINING_GROUP: Int
Enumerated property Joining_Group. Returns UCharacter.JoiningGroup values.
Value: 4102JOINING_TYPE
static val JOINING_TYPE: Int
Enumerated property Joining_Type. Returns UCharacter.JoiningType values.
Value: 4103JOIN_CONTROL
static val JOIN_CONTROL: Int
Binary property Join_Control.
Format controls for cursive joining and ligation.
Value: 20LEAD_CANONICAL_COMBINING_CLASS
static val LEAD_CANONICAL_COMBINING_CLASS: Int
Enumerated property Lead_Canonical_Combining_Class. ICU-specific property for the ccc of the first code point of the decomposition, or lccc(c)=ccc(NFD(c)[0]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
Value: 4112LINE_BREAK
static val LINE_BREAK: Int
Enumerated property Line_Break. Returns UCharacter.LineBreak values.
Value: 4104LOGICAL_ORDER_EXCEPTION
static val LOGICAL_ORDER_EXCEPTION: Int
Binary property Logical_Order_Exception (new).
Characters that do not use logical order and require special handling in most processing.
Value: 21LOWERCASE
static val LOWERCASE: Int
Binary property Lowercase.
Same as UCharacter.isULowercase(), different from UCharacter.islower().
Ll+Other_Lowercase
Value: 22LOWERCASE_MAPPING
static val LOWERCASE_MAPPING: Int
String property Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(String).
Value: 16388MASK_START
static val MASK_START: Int
First constant for bit-mask Unicode properties.
Value: 8192NAME
static val NAME: Int
String property Name. Corresponds to UCharacter.getName(int).
Value: 16389NFC_INERT
static val NFC_INERT: Int
Binary property NFC_Inert. ICU-specific property for characters that are inert under NFC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
Value: 39See Also
NFC_QUICK_CHECK
static val NFC_QUICK_CHECK: Int
Enumerated property NFC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4110NFD_INERT
static val NFD_INERT: Int
Binary property NFD_Inert. ICU-specific property for characters that are inert under NFD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions. There is one such property per normalization form. These properties are computed as follows - an inert character is: a) unassigned, or ALL of the following: b) of combining class 0. c) not decomposed by this normalization form. AND if NFC or NFKC, d) can never compose with a previous character. e) can never compose with a following character. f) can never change if another character is added. Example: a-breve might satisfy all but f, but if you add an ogonek it changes to a-ogonek + breve See also com.ibm.text.UCD.NFSkippable in the ICU4J repository, and icu/source/common/unormimp.h .
Value: 37NFD_QUICK_CHECK
static val NFD_QUICK_CHECK: Int
Enumerated property NFD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4108NFKC_INERT
static val NFKC_INERT: Int
Binary property NFKC_Inert. ICU-specific property for characters that are inert under NFKC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
Value: 40See Also
NFKC_QUICK_CHECK
static val NFKC_QUICK_CHECK: Int
Enumerated property NFKC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4111NFKD_INERT
static val NFKD_INERT: Int
Binary property NFKD_Inert. ICU-specific property for characters that are inert under NFKD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.
Value: 38See Also
NFKD_QUICK_CHECK
static val NFKD_QUICK_CHECK: Int
Enumerated property NFKD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.
Value: 4109NONCHARACTER_CODE_POINT
static val NONCHARACTER_CODE_POINT: Int
Binary property Noncharacter_Code_Point.
Code points that are explicitly defined as illegal for the encoding of characters.
Value: 24NUMERIC_TYPE
static val NUMERIC_TYPE: Int
Enumerated property Numeric_Type. Returns UCharacter.NumericType values.
Value: 4105NUMERIC_VALUE
static val NUMERIC_VALUE: Int
Double property Numeric_Value. Corresponds to UCharacter.getUnicodeNumericValue(int).
Value: 12288OTHER_PROPERTY_START
static val OTHER_PROPERTY_START: Int
First constant for Unicode properties with unusual value types.
Value: 28672PATTERN_SYNTAX
static val PATTERN_SYNTAX: Int
Binary property Pattern_Syntax (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)
Value: 42PATTERN_WHITE_SPACE
static val PATTERN_WHITE_SPACE: Int
Binary property Pattern_White_Space (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)
Value: 43POSIX_ALNUM
static val POSIX_ALNUM: Int
Binary property alnum (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 44POSIX_BLANK
static val POSIX_BLANK: Int
Binary property blank (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 45POSIX_GRAPH
static val POSIX_GRAPH: Int
Binary property graph (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 46POSIX_PRINT
static val POSIX_PRINT: Int
Binary property print (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 47POSIX_XDIGIT
static val POSIX_XDIGIT: Int
Binary property xdigit (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.
Value: 48PREPENDED_CONCATENATION_MARK
static val PREPENDED_CONCATENATION_MARK: Int
Binary property Prepended_Concatenation_Mark.
Value: 63QUOTATION_MARK
static val QUOTATION_MARK: Int
Binary property Quotation_Mark.
Value: 25RADICAL
static val RADICAL: Int
Binary property Radical (new).
For programmatic determination of Ideographic Description Sequences.
Value: 26REGIONAL_INDICATOR
static val REGIONAL_INDICATOR: Int
Binary property Regional_Indicator.
Value: 62RGI_EMOJI
static val RGI_EMOJI: Int
Binary property of strings RGI_Emoji. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 71RGI_EMOJI_FLAG_SEQUENCE
static val RGI_EMOJI_FLAG_SEQUENCE: Int
Binary property of strings RGI_Emoji_Flag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 68RGI_EMOJI_MODIFIER_SEQUENCE
static val RGI_EMOJI_MODIFIER_SEQUENCE: Int
Binary property of strings RGI_Emoji_Modifier_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 67RGI_EMOJI_TAG_SEQUENCE
static val RGI_EMOJI_TAG_SEQUENCE: Int
Binary property of strings RGI_Emoji_Tag_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 69RGI_EMOJI_ZWJ_SEQUENCE
static val RGI_EMOJI_ZWJ_SEQUENCE: Int
Binary property of strings RGI_Emoji_ZWJ_Sequence. See https://www.unicode.org/reports/tr51/#Emoji_Sets
Value: 70SCRIPT
static val SCRIPT: Int
Enumerated property Script. Same as UScript.getScript(int), returns UScript values.
Value: 4106SCRIPT_EXTENSIONS
static val SCRIPT_EXTENSIONS: Int
Miscellaneous property Script_Extensions (new in Unicode 6.0). Some characters are commonly used in multiple scripts. For more information, see UAX #24: http://www.unicode.org/reports/tr24/. Corresponds to UScript.hasScript and UScript.getScriptExtensions.
Value: 28672SEGMENT_STARTER
static val SEGMENT_STARTER: Int
Binary Property Segment_Starter. ICU-specific property for characters that are starters in terms of Unicode normalization and combining character sequences. They have ccc=0 and do not occur in non-initial position of the canonical decomposition of any character (like " in NFD(a-umlaut) and a Jamo T in an NFD(Hangul LVT)). ICU uses this property for segmenting a string for generating a set of canonically equivalent strings, e.g. for canonical closure while processing collation tailoring rules.
Value: 41SENTENCE_BREAK
static val SENTENCE_BREAK: Int
Enumerated property Sentence_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.SentenceBreak values.
Value: 4115SIMPLE_CASE_FOLDING
static val SIMPLE_CASE_FOLDING: Int
String property Simple_Case_Folding. Corresponds to UCharacter.foldCase(int, boolean).
Value: 16390SIMPLE_LOWERCASE_MAPPING
static val SIMPLE_LOWERCASE_MAPPING: Int
String property Simple_Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(int).
Value: 16391SIMPLE_TITLECASE_MAPPING
static val SIMPLE_TITLECASE_MAPPING: Int
String property Simple_Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(int).
Value: 16392SIMPLE_UPPERCASE_MAPPING
static val SIMPLE_UPPERCASE_MAPPING: Int
String property Simple_Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(int).
Value: 16393SOFT_DOTTED
static val SOFT_DOTTED: Int
Binary property Soft_Dotted (new).
Characters with a "soft dot", like i or j.
An accent placed on these characters causes the dot to disappear.
Value: 27STRING_START
static val STRING_START: Int
First constant for string Unicode properties.
Value: 16384S_TERM
static val S_TERM: Int
Binary property STerm (new in Unicode 4.0.1). Sentence Terminal. Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/)
Value: 35TERMINAL_PUNCTUATION
static val TERMINAL_PUNCTUATION: Int
Binary property Terminal_Punctuation.
Punctuation characters that generally mark the end of textual units.
Value: 28TITLECASE_MAPPING
static val TITLECASE_MAPPING: Int
String property Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(String).
Value: 16394TRAIL_CANONICAL_COMBINING_CLASS
static val TRAIL_CANONICAL_COMBINING_CLASS: Int
Enumerated property Trail_Canonical_Combining_Class. ICU-specific property for the ccc of the last code point of the decomposition, or lccc(c)=ccc(NFD(c)[last]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.
Value: 4113UNIFIED_IDEOGRAPH
static val UNIFIED_IDEOGRAPH: Int
Binary property Unified_Ideograph (new).
For programmatic determination of Ideographic Description Sequences.
Value: 29UPPERCASE
static val UPPERCASE: Int
Binary property Uppercase.
Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().
Lu+Other_Uppercase
Value: 30UPPERCASE_MAPPING
static val UPPERCASE_MAPPING: Int
String property Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(String).
Value: 16396VARIATION_SELECTOR
static val VARIATION_SELECTOR: Int
Binary property Variation_Selector (new in Unicode 4.0.1). Indicates all those characters that qualify as Variation Selectors. For details on the behavior of these characters, see StandardizedVariants.html and 15.6 Variation Selectors.
Value: 36VERTICAL_ORIENTATION
static val VERTICAL_ORIENTATION: Int
Enumerated property Vertical_Orientation. Used for UAX #50 Unicode Vertical Text Layout (https://www.unicode.org/reports/tr50/). New as a UCD property in Unicode 10.0.
Value: 4120WHITE_SPACE
static val WHITE_SPACE: Int
Binary property White_Space.
Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace(). Space characters+TAB+CR+LF-ZWSP-ZWNBSP
Value: 31WORD_BREAK
static val WORD_BREAK: Int
Enumerated property Word_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UCharacter.WordBreak values.
Value: 4116XID_CONTINUE
static val XID_CONTINUE: Int
Binary property XID_Continue.
ID_Continue modified to allow closure under normalization forms NFKC and NFKD.
Value: 32XID_START
static val XID_START: Int
Binary property XID_Start.
ID_Start modified to allow closure under normalization forms NFKC and NFKD.
Value: 33