Package com.ibm.icu.charset
Class CharsetISO2022
java.lang.Object
java.nio.charset.Charset
com.ibm.icu.charset.CharsetICU
com.ibm.icu.charset.CharsetISO2022
- All Implemented Interfaces:
Comparable<Charset>
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionprivate class
ISO-2022-CNprivate class
ISO-2022-JPprivate class
ISO-2022-KRprivate class
private class
ISO-2022-JPprivate class
ISO-2022-KRprivate static class
private static class
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate static final byte
private static final byte
private static final byte
private static final byte
private static final byte[]
private static final byte[]
private static final byte[]
private static final byte[]
private static final byte[]
private static final byte[]
private static final byte[]
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte[][]
private static final byte[][]
ISO2022-CN Dataprivate static final int[]
private static final byte[]
protected byte[][]
private static final byte[]
ISO-2022-CNprivate static final byte
private static final byte
private static final char
private static final char
private static final byte
private static final char
private static final char[]
private static final char
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte[]
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final short[]
private static final byte[]
from unicodeprivate static final byte
private static final byte
private static final short
private static final byte[]
private static final byte[]
Recognized escape sequenes are(B ASCII .A ISO-8859-1 .F ISO-8859-7 (J JISX-201 (I JISX-201 $B JISX-208 $@ JISX-208 $(D JISX-212 $A GB2312 $(C KSC5601 private static final byte[]
private static final byte[]
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private static final byte
private int
Fields inherited from class com.ibm.icu.charset.CharsetICU
codepage, conversionType, hasFromUnicodeFallback, hasToUnicodeFallback, icuCanonicalName, maxBytesPerChar, maxCharsPerByte, minBytesPerChar, name, options, platform, ROUNDTRIP_AND_FALLBACK_SET, ROUNDTRIP_SET, subChar, subChar1, subCharLen, unicodeMask
-
Constructor Summary
ConstructorsConstructorDescriptionCharsetISO2022
(String icuCanonicalName, String javaCanonicalName, String[] aliases) -
Method Summary
Modifier and TypeMethodDescriptionprivate static int
_2022FromGR94DBCS
(int value) private CoderResult
changeState_2022
(CharsetDecoderICU decoder, ByteBuffer source, int var) private static short
CSM
(short cs) private static int
getEndOfBuffer_2022
(ByteBuffer source) private static byte
getKey_2022
(byte c, int[] key, int[] offset) (package private) void
getUnicodeSetImpl
(UnicodeSet setFillIn, int which) This follows ucnv.c method ucnv_detectUnicodeSignature() to detect the start of the stream for example U+FEFF (the Unicode BOM/signature character) that can be ignored.private static boolean
IS_2022_CONTROL
(int c) private static boolean
IS_JP_DBCS
(byte cs) private void
ISO2022InitCN
(int version) private void
ISO2022InitJP
(int version) private void
ISO2022InitKR
(int version) private int
MBCSSimpleGetNextUChar
(UConverterSharedData sharedData, ByteBuffer source, boolean useFallback) (package private) static int
MBCSSingleFromUChar32
(UConverterSharedData sharedData, int c, int[] retval, boolean useFallback) private void
private void
private static CoderResult
toUnicodeCallback
(CharsetDecoderICU cnv, int sourceChar, int targetUniChar) Methods inherited from class com.ibm.icu.charset.CharsetICU
contains, forNameICU, getCharset, getCompleteUnicodeSet, getNonSurrogateUnicodeSet, getUnicodeSet, isFixedWidth, isSurrogate
Methods inherited from class java.nio.charset.Charset
aliases, availableCharsets, canEncode, compareTo, decode, defaultCharset, displayName, displayName, encode, encode, equals, forName, forName, hashCode, isRegistered, isSupported, name, toString
-
Field Details
-
myConverterData
-
variant
private int variant -
SHIFT_IN_STR
private static final byte[] SHIFT_IN_STR -
CR
private static final byte CR- See Also:
-
LF
private static final byte LF- See Also:
-
HWKANA_START
private static final char HWKANA_START- See Also:
-
HWKANA_END
private static final char HWKANA_END- See Also:
-
GR96_START
private static final char GR96_START- See Also:
-
GR96_END
private static final char GR96_END- See Also:
-
INVALID_STATE
private static final byte INVALID_STATE- See Also:
-
ASCII
private static final byte ASCII- See Also:
-
SS2_STATE
private static final byte SS2_STATE- See Also:
-
SS3_STATE
private static final byte SS3_STATE- See Also:
-
ISO8859_1
private static final byte ISO8859_1- See Also:
-
ISO8859_7
private static final byte ISO8859_7- See Also:
-
JISX201
private static final byte JISX201- See Also:
-
JISX208
private static final byte JISX208- See Also:
-
JISX212
private static final byte JISX212- See Also:
-
GB2312
private static final byte GB2312- See Also:
-
KSC5601
private static final byte KSC5601- See Also:
-
HWKANA_7BIT
private static final byte HWKANA_7BIT- See Also:
-
GB2312_1
private static final byte GB2312_1- See Also:
-
ISO_IR_165
private static final byte ISO_IR_165- See Also:
-
CNS_11643
private static final byte CNS_11643- See Also:
-
CNS_11643_0
private static final byte CNS_11643_0- See Also:
-
CNS_11643_1
private static final byte CNS_11643_1- See Also:
-
CNS_11643_2
private static final byte CNS_11643_2- See Also:
-
CNS_11643_3
private static final byte CNS_11643_3- See Also:
-
CNS_11643_4
private static final byte CNS_11643_4- See Also:
-
CNS_11643_5
private static final byte CNS_11643_5- See Also:
-
CNS_11643_6
private static final byte CNS_11643_6- See Also:
-
CNS_11643_7
private static final byte CNS_11643_7- See Also:
-
jpCharsetMasks
private static final short[] jpCharsetMasks -
UCNV_2022_MAX_CONVERTERS
private static final byte UCNV_2022_MAX_CONVERTERS- See Also:
-
ESC_2022
private static final byte ESC_2022- See Also:
-
INVALID_2022
private static final byte INVALID_2022- See Also:
-
VALID_NON_TERMINAL_2022
private static final byte VALID_NON_TERMINAL_2022- See Also:
-
VALID_TERMINAL_2022
private static final byte VALID_TERMINAL_2022- See Also:
-
VALID_MAYBE_TERMINAL_2022
private static final byte VALID_MAYBE_TERMINAL_2022- See Also:
-
normalize_esq_chars_2022
private static final byte[] normalize_esq_chars_2022 -
MAX_STATES_2022
private static final short MAX_STATES_2022- See Also:
-
escSeqStateTable_Key_2022
private static final int[] escSeqStateTable_Key_2022 -
escSeqStateTable_Value_2022
private static final byte[] escSeqStateTable_Value_2022 -
ISO_2022_JP
private static final byte ISO_2022_JP- See Also:
-
ISO_2022_KR
private static final byte ISO_2022_KR- See Also:
-
ISO_2022_CN
private static final byte ISO_2022_CN- See Also:
-
nextStateToUnicodeJP
private static final byte[] nextStateToUnicodeJPRecognized escape sequenes are(B ASCII .A ISO-8859-1 .F ISO-8859-7 (J JISX-201 (I JISX-201 $B JISX-208 $@ JISX-208 $(D JISX-212 $A GB2312 $(C KSC5601 -
nextStateToUnicodeCN
private static final byte[] nextStateToUnicodeCN -
jpCharsetPref
private static final byte[] jpCharsetPreffrom unicode -
escSeqChars
private static final byte[][] escSeqChars -
hwkana_fb
private static final char[] hwkana_fb -
fromUSubstitutionChar
protected byte[][] fromUSubstitutionChar -
GB_2312_80_STR
private static final byte[] GB_2312_80_STRISO-2022-CN -
ISO_IR_165_STR
private static final byte[] ISO_IR_165_STR -
CNS_11643_1992_Plane_1_STR
private static final byte[] CNS_11643_1992_Plane_1_STR -
CNS_11643_1992_Plane_2_STR
private static final byte[] CNS_11643_1992_Plane_2_STR -
CNS_11643_1992_Plane_3_STR
private static final byte[] CNS_11643_1992_Plane_3_STR -
CNS_11643_1992_Plane_4_STR
private static final byte[] CNS_11643_1992_Plane_4_STR -
CNS_11643_1992_Plane_5_STR
private static final byte[] CNS_11643_1992_Plane_5_STR -
CNS_11643_1992_Plane_6_STR
private static final byte[] CNS_11643_1992_Plane_6_STR -
CNS_11643_1992_Plane_7_STR
private static final byte[] CNS_11643_1992_Plane_7_STR -
escSeqCharsCN
private static final byte[][] escSeqCharsCNISO2022-CN Data
-
-
Constructor Details
-
CharsetISO2022
-
-
Method Details
-
ISO2022InitJP
private void ISO2022InitJP(int version) -
ISO2022InitCN
private void ISO2022InitCN(int version) -
ISO2022InitKR
private void ISO2022InitKR(int version) -
IS_2022_CONTROL
private static boolean IS_2022_CONTROL(int c) -
_2022FromGR94DBCS
private static int _2022FromGR94DBCS(int value) -
IS_JP_DBCS
private static boolean IS_JP_DBCS(byte cs) -
CSM
private static short CSM(short cs) -
getEndOfBuffer_2022
-
changeState_2022
-
getKey_2022
private static byte getKey_2022(byte c, int[] key, int[] offset) -
toUnicodeCallback
private static CoderResult toUnicodeCallback(CharsetDecoderICU cnv, int sourceChar, int targetUniChar) -
newDecoder
- Specified by:
newDecoder
in classCharset
-
newEncoder
- Specified by:
newEncoder
in classCharset
-
setInitialStateToUnicodeKR
private void setInitialStateToUnicodeKR() -
setInitialStateFromUnicodeKR
-
getUnicodeSetImpl
Description copied from class:CharsetICU
This follows ucnv.c method ucnv_detectUnicodeSignature() to detect the start of the stream for example U+FEFF (the Unicode BOM/signature character) that can be ignored. Detects Unicode signature byte sequences at the start of the byte stream and returns number of bytes of the BOM of the indicated Unicode charset. 0 is returned when no Unicode signature is recognized.- Specified by:
getUnicodeSetImpl
in classCharsetICU
-