public class CharUtilities extends Object
| Modifier and Type | Field and Description | 
|---|---|
| static char | CARRIAGE_RETURN - carriage return |
| static char | CODE_EOT - Character code used to signal a character boundary in inline content, such as an inline with borders and padding or a nested block object. |
| static int | EOT - Character class: Boundary between text runs |
| static char | IDEOGRAPHIC_SPACE - Ideographic space |
| static char | LINE_SEPARATOR - line-separator |
| static int | LINEFEED - Character class: Line feed |
| static char | LINEFEED_CHAR - linefeed character |
| static char | LRE - left-to-right embedding |
| static char | LRM - left-to-right mark |
| static char | LRO - left-to-right override |
| static char | MISSING_IDEOGRAPH - missing ideograph |
| static char | NBSPACE - non-breaking space |
| static char | NEXT_LINE - next line control character |
| static int | NONWHITESPACE - Character class: non-whitespace |
| static char | NOT_A_CHARACTER - Unicode value indicating that the character is "not a character". |
| static char | NULL_CHAR - null char |
| static char | OBJECT_REPLACEMENT_CHARACTER - Object replacement character |
| static char | PARAGRAPH_SEPARATOR - paragraph-separator |
| static char | PDF - pop directional formatting |
| static char | RLE - right-to-left embedding |
| static char | RLM - right-to-left mark |
| static char | RLO - right-to-left override |
| static char | SOFT_HYPHEN - soft hyphen |
| static char | SPACE - normal space |
| static char | TAB - normal tab |
| static int | UCWHITESPACE - Character class: Unicode white space |
| static char | WORD_JOINER - word joiner |
| static int | XMLWHITESPACE - Character class: XML whitespace |
| static char | ZERO_WIDTH_JOINER - zero-width joiner |
| static char | ZERO_WIDTH_NOBREAK_SPACE - zero-width no-break space (= byte order mark) |
| static char | ZERO_WIDTH_SPACE - zero-width space |
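
The character-class fields in the table above (UCWHITESPACE, LINEFEED, EOT, NONWHITESPACE, XMLWHITESPACE) are the values that classOf(int) is documented to return. The following is a minimal sketch of how such a lookup could work, not the library's implementation. The class name `CharClassSketch`, the numeric values of the constants, the value of `CODE_EOT`, and the mapping of individual characters to classes are all assumptions made for illustration, and `Character.isSpaceChar` is used as a JDK stand-in for "any other Unicode space".

```java
/** Hypothetical stand-in for a character-class lookup; not the library's source. */
final class CharClassSketch {
    // Hypothetical class codes: the real constant values are not given on this page.
    static final int UCWHITESPACE = 0;
    static final int LINEFEED = 1;
    static final int EOT = 2;
    static final int NONWHITESPACE = 3;
    static final int XMLWHITESPACE = 4;

    // Assumed marker value for a text-run boundary; the real CODE_EOT is not shown.
    static final char CODE_EOT = 0;

    private CharClassSketch() { }

    /** Returns one of the class codes above for the given code point. */
    static int classOf(int c) {
        switch (c) {
            case CODE_EOT:
                return EOT;
            case '\n':
                return LINEFEED;
            case ' ':
            case '\r':
            case '\t':
                // Assumption: the remaining XML whitespace characters share one class.
                return XMLWHITESPACE;
            default:
                // JDK stand-in: true for Unicode categories Zs, Zl and Zp.
                return Character.isSpaceChar(c) ? UCWHITESPACE : NONWHITESPACE;
        }
    }

    public static void main(String[] args) {
        System.out.println("LF is LINEFEED: " + (classOf('\n') == LINEFEED));
        System.out.println("U+3000 is UCWHITESPACE: " + (classOf(0x3000) == UCWHITESPACE));
        System.out.println("'a' is NONWHITESPACE: " + (classOf('a') == NONWHITESPACE));
    }
}
```

Giving the line feed its own class, separate from the other XML whitespace characters, lets a caller treat it as a potential line break while collapsing the rest; whether the real method draws the line in exactly this place is not stated here.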
| Modifier | Constructor and Description | 
|---|---|
| protected | CharUtilities() - Utility class: Constructor prevents instantiating when subclassed. |
| Modifier and Type | Method and Description | 
|---|---|
| static String | charToNCRef(int c) - Convert a single Unicode scalar value to an XML numeric character reference. |
| static int | classOf(int c) - Return the appropriate CharClass constant for the type of the passed character. |
| static Iterable<Integer> | codepointsIter(CharSequence s) - Creates an iterator over the code points of a CharSequence. |
| static Iterable<Integer> | codepointsIter(CharSequence s, int beginIndex, int endIndex) - Creates an iterator over the code points of a sub-sequence of a CharSequence. |
| static boolean | containsSurrogatePairAt(CharSequence chars, int index) - Tells whether there is a surrogate pair starting from the given index in the CharSequence. |
| static String | format(int c) - Format a character for debugging output: prefixed with "0x", padded left with '0', and either 4 or 6 hex characters in width according to whether it is in the BMP or not. |
| static int | incrementIfNonBMP(int codePoint) - Returns 1 if codePoint is not in the BMP. |
| static boolean | isAdjustableSpace(int c) - Method to determine if the character is an adjustable space. |
| static boolean | isAlphabetic(int c) - Indicates whether a character is classified as "Alphabetic" by the Unicode standard. |
| static boolean | isAnySpace(int c) - Determines if the character represents any kind of space. |
| static boolean | isBmpCodePoint(int codePoint) - Determine whether the specified character (Unicode code point) is in the Basic Multilingual Plane (BMP). |
| static boolean | isBreakableSpace(int c) - Helper method to determine if the character is a space with normal behavior. |
| static boolean | isExplicitBreak(int c) - Indicates whether the given character is an explicit break character. |
| static boolean | isFixedWidthSpace(int c) - Method to determine if the character is a (breakable) fixed-width space. |
| static boolean | isNonBreakableSpace(int c) - Method to determine if the character is a nonbreaking space. |
| static boolean | isSameSequence(CharSequence cs1, CharSequence cs2) - Determine if two character sequences contain the same characters. |
| static boolean | isSurrogatePair(char ch) - Determine if the given character is part of a surrogate pair. |
| static boolean | isZeroWidthSpace(int c) - Method to determine if the character is a zero-width space. |
| static String | padLeft(String s, int width, char pad) - Pad a string S on the left out to width W using padding character PAD. |
| static String | toNCRefs(String s) - Convert a string to a sequence of ASCII or XML numeric character references. |
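
The formatting and escaping helpers in the table above (padLeft, format, charToNCRef, toNCRefs) can be approximated with plain JDK calls. The sketch below is written from the one-line descriptions only and is not the library's code. The class name `NcrSketch` is hypothetical, and several details are assumptions: the hexadecimal `&#x...;` reference form with upper-case digits, lower-case digits in `format`, and keeping only printable ASCII (0x20 to 0x7E) unescaped in `toNCRefs`.

```java
/** Hypothetical re-creation of the described helpers using only the JDK. */
final class NcrSketch {
    private NcrSketch() { }

    /** Pads s on the left with pad until it is at least width characters long. */
    static String padLeft(String s, int width, char pad) {
        StringBuilder sb = new StringBuilder();
        for (int i = s.length(); i < width; i++) {
            sb.append(pad);
        }
        return sb.append(s).toString();
    }

    /** "0x" followed by 4 hex digits for BMP code points, 6 otherwise. */
    static String format(int c) {
        int width = Character.isBmpCodePoint(c) ? 4 : 6;
        return "0x" + padLeft(Integer.toHexString(c), width, '0');
    }

    /** Assumption: hexadecimal reference form with upper-case digits. */
    static String charToNCRef(int c) {
        return "&#x" + Integer.toHexString(c).toUpperCase(java.util.Locale.ROOT) + ";";
    }

    /** Assumption: printable ASCII is kept, everything else becomes a reference. */
    static String toNCRefs(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (cp >= 0x20 && cp <= 0x7E) {
                sb.append((char) cp);
            } else {
                sb.append(charToNCRef(cp));
            }
            // Advance by two chars when cp was encoded as a surrogate pair.
            i += Character.charCount(cp);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(format(0x41));
        System.out.println(format(0x1F600));
        System.out.println(toNCRefs("caf\u00E9"));
    }
}
```

Iterating by code point rather than by `char` matters for `toNCRefs`: a character outside the BMP occupies two `char` values in a Java string but should produce a single numeric character reference.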
public static final char CODE_EOT
public static final int UCWHITESPACE
public static final int LINEFEED
public static final int EOT
public static final int NONWHITESPACE
public static final int XMLWHITESPACE
public static final char NULL_CHAR
public static final char LINEFEED_CHAR
public static final char CARRIAGE_RETURN
public static final char TAB
public static final char SPACE
public static final char NBSPACE
public static final char NEXT_LINE
public static final char ZERO_WIDTH_SPACE
public static final char WORD_JOINER
public static final char ZERO_WIDTH_JOINER
public static final char LRM
public static final char RLM
public static final char LRE
public static final char RLE
public static final char PDF
public static final char LRO
public static final char RLO
public static final char ZERO_WIDTH_NOBREAK_SPACE
public static final char SOFT_HYPHEN
public static final char LINE_SEPARATOR
public static final char PARAGRAPH_SEPARATOR
public static final char MISSING_IDEOGRAPH
public static final char IDEOGRAPHIC_SPACE
public static final char OBJECT_REPLACEMENT_CHARACTER
public static final char NOT_A_CHARACTER
protected CharUtilities()
public static int classOf(int c)
c - character to inspect
public static boolean isBreakableSpace(int c)
c - character to inspect
public static boolean isZeroWidthSpace(int c)
c - the character to check
public static boolean isFixedWidthSpace(int c)
c - the character to check
public static boolean isNonBreakableSpace(int c)
c - character to check
public static boolean isAdjustableSpace(int c)
c - character to check
public static boolean isAnySpace(int c)
c - character to check
public static boolean isAlphabetic(int c)
c - the character
public static boolean isExplicitBreak(int c)
c - the character to check
public static String charToNCRef(int c)
c - a unicode scalar value
public static String toNCRefs(String s)
s - a java string (encoded in UTF-16)
public static String padLeft(String s, int width, char pad)
s - string to pad
width - width of field to add padding
pad - character to use for padding
public static String format(int c)
c - character code
public static boolean isSameSequence(CharSequence cs1, CharSequence cs2)
cs1 - first character sequence
cs2 - second character sequence
public static boolean isBmpCodePoint(int codePoint)
codePoint - the character (Unicode code point) to be tested
Returns: true if the specified code point is between Character#MIN_VALUE and Character#MAX_VALUE inclusive; false otherwise
See also: Character.isBmpCodePoint(int), from Java 1.7
public static int incrementIfNonBMP(int codePoint)
codePoint - the code point to test
Returns: 1 if codePoint > 0xFFFF, 0 otherwise
public static boolean isSurrogatePair(char ch)
ch - character to be checked
public static boolean containsSurrogatePairAt(CharSequence chars, int index)
Tells whether there is a surrogate pair starting from the given index in the CharSequence. If the character at index is a high surrogate, then the character at index+1 is checked to be a low surrogate. If a malformed surrogate pair is encountered, an IllegalArgumentException is thrown.
 high surrogate [0xD800 - 0xDBFF] low surrogate [0xDC00 - 0xDFFF]
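
The check described above can be sketched with the JDK's `Character.isHighSurrogate` and `Character.isLowSurrogate`. This is an illustration under stated assumptions, not the library's source: the class name `SurrogateSketch` and the helper `countCodePoints` are hypothetical, and treating a low surrogate with no preceding high surrogate as malformed is an assumption about what "malformed" covers.

```java
/** Hypothetical sketch of a surrogate-pair check over a CharSequence. */
final class SurrogateSketch {
    private SurrogateSketch() { }

    /** Returns true if a well-formed surrogate pair starts at index. */
    static boolean containsSurrogatePairAt(CharSequence chars, int index) {
        char ch = chars.charAt(index);
        if (Character.isHighSurrogate(ch)) {
            boolean hasNext = index + 1 < chars.length();
            if (hasNext && Character.isLowSurrogate(chars.charAt(index + 1))) {
                return true;
            }
            throw new IllegalArgumentException("isolated high surrogate at index " + index);
        }
        if (Character.isLowSurrogate(ch)) {
            // Assumption: a low surrogate without a preceding high surrogate is malformed.
            throw new IllegalArgumentException("isolated low surrogate at index " + index);
        }
        return false;
    }

    /** Hypothetical helper: counts code points by stepping over each pair as a unit. */
    static int countCodePoints(CharSequence chars) {
        int count = 0;
        for (int i = 0; i < chars.length(); i++) {
            if (containsSurrogatePairAt(chars, i)) {
                i++; // skip the low surrogate that completes the pair
            }
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        // One BMP letter, one character outside the BMP (U+1F600), one BMP letter.
        String s = "a\uD83D\uDE00b";
        System.out.println(countCodePoints(s));
    }
}
```

The extra increment inside the loop is the same idea the page attributes to incrementIfNonBMP: when a code point lies outside the BMP, a `char`-indexed loop has to advance by one additional position.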
chars - CharSequence to check
index - index in the CharSequence where to start the check
Throws: IllegalArgumentException - if surrogate pairs are used incorrectly
public static Iterable<Integer> codepointsIter(CharSequence s)
Creates an iterator over the code points of a CharSequence.
s - CharSequence to iterate over
See also: codepointsIter(CharSequence, int, int)
public static Iterable<Integer> codepointsIter(CharSequence s, int beginIndex, int endIndex)
s - CharSequence to iterate over
beginIndex - lower range
endIndex - upper range
Copyright © 2025 Apache Software Foundation. All rights reserved.