net.sf.saxon.codenorm
Class NormalizerData
java.lang.Object
net.sf.saxon.codenorm.NormalizerData
public class NormalizerData
extends java.lang.Object
Accesses the Normalization Data used for Forms C and D.
Copyright © 1998-1999 Unicode, Inc. All Rights Reserved.
The Unicode Consortium makes no expressed or implied warranty of any
kind, and assumes no liability for errors or omissions.
No liability is assumed for incidental and consequential damages
in connection with or arising out of the use of the information here.
static int | NOT_COMPOSITE - Constant for use in getPairwiseComposition
|
int | getCanonicalClass(int ch) - Gets the combining class of a character from the
Unicode Character Database.
|
char | getPairwiseComposition(int first, int second) - Returns the composite of the two characters.
|
void | getRecursiveDecomposition(boolean canonical, int ch, StringBuffer buffer) - Gets recursive decomposition of a character from the
Unicode Character Database.
|
NOT_COMPOSITE
public static final int NOT_COMPOSITE
Constant for use in getPairwiseComposition
getCanonicalClass
public int getCanonicalClass(int ch)
Gets the combining class of a character from the
Unicode Character Database.
ch
- the source character
getPairwiseComposition
public char getPairwiseComposition(int first,
int second)
Returns the composite of the two characters. If the two
characters don't combine, returns NOT_COMPOSITE.
Only has to worry about BMP characters, since those are the only ones that can ever compose.
first
- first character (e.g. 'c')second
- second character (e.g. '¸' cedilla)
getRecursiveDecomposition
public void getRecursiveDecomposition(boolean canonical,
int ch,
StringBuffer buffer)
Gets recursive decomposition of a character from the
Unicode Character Database.
canonical
- If true
bit is on in this byte, then selects the recursive
canonical decomposition, otherwise selects
the recursive compatibility and canonical decomposition.ch
- the source characterbuffer
- buffer to be filled with the decomposition