Package org.jcodings.specific
Class BaseSJISEncoding
- java.lang.Object
  - org.jcodings.Encoding
    - org.jcodings.AbstractEncoding
      - org.jcodings.MultiByteEncoding
        - org.jcodings.CanBeTrailTableEncoding
          - org.jcodings.specific.BaseSJISEncoding
- All Implemented Interfaces:
java.lang.Cloneable
- Direct Known Subclasses:
NonStrictSJISEncoding, SJISEncoding, Windows_31JEncoding
abstract class BaseSJISEncoding extends CanBeTrailTableEncoding
-
-
Field Summary
Fields
Modifier and Type / Field
private static int[] CR_Hiragana
private static int[] CR_Katakana
private static BytesHash<java.lang.Integer> CTypeNameHash
private static int[][] PropertyList
(package private) static boolean[] SJIS_CAN_BE_TRAIL_TABLE
(package private) static int[] SjisEncLen
-
Fields inherited from class org.jcodings.CanBeTrailTableEncoding
CanBeTrailTable
-
-
Constructor Summary
Constructors
Modifier / Constructor
protected BaseSJISEncoding(java.lang.String name, int[][] Trans)
-
Method Summary
Modifier and Type / Method / Description
int codeToMbc(int code, byte[] bytes, int p)
  Extracts code point into its multibyte representation
int codeToMbcLength(int code)
  Returns character length given a code point. Oniguruma equivalent: code_to_mbclen
int[] ctypeCodeRange(int ctype, IntHolder sbOut)
  Returns code range for a given character type. Oniguruma equivalent: get_ctype_code_range
java.lang.String getCharsetName()
  The name of the equivalent Java Charset for this encoding.
private static int getLowerCase(int code)
boolean isCodeCType(int code, int ctype)
  Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)
int mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower)
  onigenc_ascii_mbc_case_fold
int mbcToCode(byte[] bytes, int p, int end)
  Returns code point for a character. Oniguruma equivalent: mbc_to_code
int propertyNameToCType(byte[] bytes, int p, int end)
  onigenc_minimum_property_name_to_ctype; notably overridden by unicode encodings
private static boolean SJIS_ISMB_TRAIL(int code)
-
Methods inherited from class org.jcodings.CanBeTrailTableEncoding
isReverseMatchAllowed, leftAdjustCharHead
-
Methods inherited from class org.jcodings.MultiByteEncoding
caseMap, isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLength
-
Methods inherited from class org.jcodings.AbstractEncoding
applyAllCaseFold, asciiApplyAllCaseFold, asciiCaseFoldCodesByString, asciiMbcCaseFold, caseFoldCodesByString, isCodeCTypeInternal, isNewLine
-
Methods inherited from class org.jcodings.Encoding
asciiToLower, asciiToUpper, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, length, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
Field Detail
-
CR_Hiragana
private static final int[] CR_Hiragana
-
CR_Katakana
private static final int[] CR_Katakana
-
PropertyList
private static final int[][] PropertyList
-
CTypeNameHash
private static final BytesHash<java.lang.Integer> CTypeNameHash
-
SJIS_CAN_BE_TRAIL_TABLE
static final boolean[] SJIS_CAN_BE_TRAIL_TABLE
-
SjisEncLen
static final int[] SjisEncLen
-
-
Method Detail
-
getCharsetName
public java.lang.String getCharsetName()
Description copied from class: Encoding
The name of the equivalent Java Charset for this encoding. Defaults to the name of the encoding. Subclasses can override this to provide a different name.
- Overrides:
getCharsetName in class Encoding
- Returns:
the name of the equivalent Java Charset for this encoding
-
mbcToCode
public int mbcToCode(byte[] bytes, int p, int end)
Description copied from class: Encoding
Returns code point for a character. Oniguruma equivalent: mbc_to_code
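For a Shift_JIS family encoding, the "code point" returned here is most plausibly the packed byte value of the character rather than a Unicode scalar. The following is a minimal sketch under that assumption, not the jcodings source: the class `SjisDecodeSketch` and its helper `length` are hypothetical names, and the lead-byte ranges (0x81-0x9F and 0xE0-0xFC) come from the Shift_JIS byte layout.

```java
// Sketch only: illustrates one way mbcToCode could behave for Shift_JIS.
// SjisDecodeSketch and length(...) are hypothetical, not part of jcodings.
final class SjisDecodeSketch {
    // Assumed lead-byte ranges for two-byte characters: 0x81-0x9F and 0xE0-0xFC.
    static int length(int firstByte) {
        int b = firstByte & 0xff;
        return ((b >= 0x81 && b <= 0x9f) || (b >= 0xe0 && b <= 0xfc)) ? 2 : 1;
    }

    // Packs the bytes of the character starting at p into one int,
    // most significant byte first. Stops early if the buffer ends.
    static int mbcToCode(byte[] bytes, int p, int end) {
        int len = length(bytes[p]);
        int code = bytes[p] & 0xff;
        for (int i = 1; i < len && p + i < end; i++) {
            code = (code << 8) | (bytes[p + i] & 0xff);
        }
        return code;
    }

    public static void main(String[] args) {
        // HIRAGANA LETTER A in Shift_JIS (0x82 0xA0), followed by ASCII 'A'.
        byte[] text = {(byte) 0x82, (byte) 0xa0, 'A'};
        System.out.printf("0x%04x%n", mbcToCode(text, 0, text.length));
        System.out.printf("0x%04x%n", mbcToCode(text, 2, text.length));
    }
}
```

Masking each byte with `0xff` matters because Java bytes are signed; without it, bytes at or above 0x80 would sign-extend and corrupt the packed value.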
-
codeToMbcLength
public int codeToMbcLength(int code)
Description copied from class: Encoding
Returns character length given a code point. Oniguruma equivalent: code_to_mbclen
- Specified by:
codeToMbcLength in class Encoding
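As an illustration of what "character length given a code point" can mean for a two-byte encoding, here is a hedged sketch. It assumes codes are packed Shift_JIS byte values, so anything below 0x100 is one byte and anything up to 0xFFFF is two. The class name, the `INVALID` sentinel, and the rejection of bare lead bytes are assumptions for illustration; the real method may report errors differently.

```java
// Sketch only: maps a packed Shift_JIS code to its encoded length in bytes.
// SjisCodeLengthSketch and INVALID are hypothetical, not jcodings names.
final class SjisCodeLengthSketch {
    static final int INVALID = -1; // hypothetical error value

    // Assumed lead-byte ranges: 0x81-0x9F and 0xE0-0xFC.
    static boolean isLeadByte(int b) {
        return (b >= 0x81 && b <= 0x9f) || (b >= 0xe0 && b <= 0xfc);
    }

    static int codeToMbcLength(int code) {
        if (code < 0) return INVALID;
        if (code < 0x100) {
            // A lone lead byte is not a complete character.
            return isLeadByte(code) ? INVALID : 1;
        }
        if (code <= 0xffff) return 2;
        return INVALID; // too large for a two-byte encoding
    }

    public static void main(String[] args) {
        System.out.println(codeToMbcLength(0x41));
        System.out.println(codeToMbcLength(0x82a0));
    }
}
```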
-
SJIS_ISMB_TRAIL
private static boolean SJIS_ISMB_TRAIL(int code)
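The name suggests a lookup into SJIS_CAN_BE_TRAIL_TABLE. A minimal sketch of such a predicate follows, assuming the usual Shift_JIS trail-byte ranges (0x40-0x7E and 0x80-0xFC); the class and member names are hypothetical and the table contents are an assumption, not copied from jcodings.

```java
// Sketch only: a 256-entry table answering "can this byte be a trail byte?".
// SjisTrailSketch, CAN_BE_TRAIL and isTrail are hypothetical names.
final class SjisTrailSketch {
    static final boolean[] CAN_BE_TRAIL = new boolean[256];

    static {
        // Assumed trail-byte ranges: 0x40-0x7E and 0x80-0xFC (0x7F excluded).
        for (int b = 0x40; b <= 0xfc; b++) {
            CAN_BE_TRAIL[b] = (b != 0x7f);
        }
    }

    static boolean isTrail(int code) {
        return CAN_BE_TRAIL[code & 0xff];
    }

    public static void main(String[] args) {
        System.out.println(isTrail('A'));  // ASCII letters fall inside the trail range
        System.out.println(isTrail(0x7f));
    }
}
```

Because the trail range overlaps printable ASCII, a byte such as 0x41 cannot be classified in isolation, which is why an encoding like this needs backward-scanning helpers such as leftAdjustCharHead.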
-
codeToMbc
public int codeToMbc(int code, byte[] bytes, int p)
Description copied from class: Encoding
Extracts code point into its multibyte representation
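The sketch below shows one way a packed code could be written back out as bytes: the high byte is emitted only when it is non-zero, then the low byte, and the number of bytes written is returned. This is an illustration under the assumption that codes are packed Shift_JIS byte values; `SjisEncodeSketch` is a hypothetical class, not the jcodings implementation.

```java
// Sketch only: writes a packed Shift_JIS code into a byte buffer.
// SjisEncodeSketch is a hypothetical name, not part of jcodings.
final class SjisEncodeSketch {
    // Writes the bytes for code starting at bytes[p]; returns the byte count.
    static int codeToMbc(int code, byte[] bytes, int p) {
        int start = p;
        if ((code & 0xff00) != 0) {
            bytes[p++] = (byte) (code >>> 8); // lead byte
        }
        bytes[p++] = (byte) code;             // trail byte, or the only byte
        return p - start;
    }

    public static void main(String[] args) {
        byte[] buf = new byte[4];
        int n = codeToMbc(0x82a0, buf, 0);
        n += codeToMbc('A', buf, n);
        System.out.println(n + " bytes written");
    }
}
```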
-
getLowerCase
private static int getLowerCase(int code)
-
mbcCaseFold
public int mbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower)
Description copied from class: AbstractEncoding
onigenc_ascii_mbc_case_fold
- Overrides:
mbcCaseFold in class AbstractEncoding
- Parameters:
flag - case fold flag
pp - an IntHolder that points at character head
lower - a buffer where to extract case folded character
Oniguruma equivalent: mbc_case_fold
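To make the parameter contract concrete, here is a rough sketch of the calling pattern: read one character at the position held by `pp`, write its folded form into `lower`, advance `pp`, and return the folded length. Several things are assumptions: a one-element `int[]` stands in for `IntHolder`, the `flag` argument is omitted, only ASCII letters are folded, and multibyte characters are copied unchanged. Whatever folding the private `getLowerCase` helper performs is not modeled here.

```java
// Sketch only: ASCII-only case folding over a Shift_JIS byte buffer.
// SjisCaseFoldSketch is hypothetical; pp[0] stands in for IntHolder.value.
final class SjisCaseFoldSketch {
    // Assumed lead-byte ranges: 0x81-0x9F and 0xE0-0xFC.
    static int length(int firstByte) {
        int b = firstByte & 0xff;
        return ((b >= 0x81 && b <= 0x9f) || (b >= 0xe0 && b <= 0xfc)) ? 2 : 1;
    }

    static int mbcCaseFold(byte[] bytes, int[] pp, int end, byte[] lower) {
        int p = pp[0];
        int b = bytes[p] & 0xff;
        if (b < 0x80) {
            // ASCII: lowercase A-Z, pass everything else through.
            lower[0] = (byte) ((b >= 'A' && b <= 'Z') ? b + 0x20 : b);
            pp[0] = p + 1;
            return 1;
        }
        // Non-ASCII: copy the whole character unchanged (assumption).
        int len = Math.min(length(b), end - p);
        System.arraycopy(bytes, p, lower, 0, len);
        pp[0] = p + len;
        return len;
    }

    public static void main(String[] args) {
        byte[] text = {'Q', (byte) 0x82, (byte) 0xa0};
        int[] pp = {0};
        byte[] lower = new byte[2];
        while (pp[0] < text.length) {
            int n = mbcCaseFold(text, pp, text.length, lower);
            System.out.println(n + " byte(s), next position " + pp[0]);
        }
    }
}
```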
-
propertyNameToCType
public int propertyNameToCType(byte[] bytes, int p, int end)
Description copied from class: AbstractEncoding
onigenc_minimum_property_name_to_ctype; notably overridden by unicode encodings
- Overrides:
propertyNameToCType in class AbstractEncoding
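The fields CTypeNameHash, CR_Hiragana and CR_Katakana hint that this override resolves extra property names beyond the standard set. A minimal sketch of such a lookup is shown below. Everything specific in it is an assumption made for illustration: the class name, the use of a `HashMap` keyed by `String` instead of `BytesHash`, the numeric ctype values, and the exception type thrown for unknown names.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Sketch only: resolves a property name held in a byte slice to a ctype index.
// All names and numeric values here are hypothetical.
final class SjisPropertySketch {
    static final int MAX_STD_CTYPE = 14;           // assumed count of standard ctypes
    static final int HIRAGANA = MAX_STD_CTYPE + 1; // hypothetical extra ctype
    static final int KATAKANA = MAX_STD_CTYPE + 2; // hypothetical extra ctype

    static final Map<String, Integer> NAMES = new HashMap<>();

    static {
        NAMES.put("Hiragana", HIRAGANA);
        NAMES.put("Katakana", KATAKANA);
    }

    // Looks up the name stored in bytes[p..end).
    static int propertyNameToCType(byte[] bytes, int p, int end) {
        String name = new String(bytes, p, end - p, StandardCharsets.US_ASCII);
        Integer ctype = NAMES.get(name);
        if (ctype == null) {
            throw new IllegalArgumentException("invalid character property name: " + name);
        }
        return ctype;
    }

    public static void main(String[] args) {
        byte[] name = "Katakana".getBytes(StandardCharsets.US_ASCII);
        System.out.println(propertyNameToCType(name, 0, name.length));
    }
}
```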
-
isCodeCType
public boolean isCodeCType(int code, int ctype)
Description copied from class: Encoding
Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)
- Specified by:
isCodeCType in class Encoding
- Parameters:
code - a code point of a character
ctype - a character type to check against
Oniguruma equivalent: is_code_ctype
-
ctypeCodeRange
public int[] ctypeCodeRange(int ctype, IntHolder sbOut)
Description copied from class: Encoding
Returns code range for a given character type. Oniguruma equivalent: get_ctype_code_range
- Specified by:
ctypeCodeRange in class Encoding
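A code range returned by a method like this can also back a membership test such as isCodeCType. The sketch below assumes the Oniguruma-style layout in which element 0 holds the number of ranges and the remaining elements are inclusive (from, to) pairs. The Hiragana bounds are the Shift_JIS codes for the first and last hiragana letters and are given for illustration only; the class and method names are hypothetical.

```java
// Sketch only: membership test against a flat code-range table.
// SjisCodeRangeSketch and its members are hypothetical names.
final class SjisCodeRangeSketch {
    // Assumed layout: {count, from1, to1, from2, to2, ...}, bounds inclusive.
    static final int[] CR_HIRAGANA = {1, 0x829f, 0x82f1}; // illustrative values

    static boolean isInRange(int[] ranges, int code) {
        int n = ranges[0];
        for (int i = 0; i < n; i++) {
            int from = ranges[1 + 2 * i];
            int to = ranges[2 + 2 * i];
            if (code >= from && code <= to) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(isInRange(CR_HIRAGANA, 0x82a0)); // HIRAGANA LETTER A
        System.out.println(isInRange(CR_HIRAGANA, 0x8340)); // a katakana code
    }
}
```

A linear scan is enough for tables this small; a table with many ranges would more likely be searched with a binary search over the sorted pairs.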
-
-