Class CharsetProber
java.lang.Object
org.mozilla.universalchardet.prober.CharsetProber
- Direct Known Subclasses:
Big5Prober, EscCharsetProber, EUCJPProber, EUCKRProber, EUCTWProber, GB18030Prober, HebrewProber, Latin1Prober, MBCSGroupProber, SBCSGroupProber, SingleByteCharsetProber, SJISProber, UTF8Prober
-
Nested Class Summary
Nested Classes
Modifier and Type    Class                         Description
static enum          CharsetProber.ProbingState
-
Field Summary
Fields
Modifier and Type      Field                 Description
static final int       ASCII_A
static final int       ASCII_A_CAPITAL
static final int       ASCII_GT
static final int       ASCII_LT
static final int       ASCII_SP
static final int       ASCII_Z
static final int       ASCII_Z_CAPITAL
static final float     SHORTCUT_THRESHOLD
-
Constructor Summary
Constructors
Constructor        Description
CharsetProber()
-
Method Summary
Modifier and Type                      Method                                                             Description
static ByteBuffer                      filterWithEnglishLetters(byte[] buf, int offset, int length)
static ByteBuffer                      filterWithoutEnglishLetters(byte[] buf, int offset, int length)
abstract String                        getCharSetName()
abstract float                         getConfidence()
abstract CharsetProber.ProbingState    getState()
abstract CharsetProber.ProbingState    handleData(byte[] buf, int offset, int length)
private static boolean                 isAscii(byte b)
private static boolean                 isAsciiSymbol(byte b)
abstract void                          reset()
abstract void                          setOption()
-
Field Details
-
SHORTCUT_THRESHOLD
public static final float SHORTCUT_THRESHOLD
- See Also:
-
ASCII_A
public static final int ASCII_A
- See Also:
-
ASCII_Z
public static final int ASCII_Z
- See Also:
-
ASCII_A_CAPITAL
public static final int ASCII_A_CAPITAL
- See Also:
-
ASCII_Z_CAPITAL
public static final int ASCII_Z_CAPITAL
- See Also:
-
ASCII_LT
public static final int ASCII_LT
- See Also:
-
ASCII_GT
public static final int ASCII_GT
- See Also:
-
ASCII_SP
public static final int ASCII_SP
- See Also:
-
-
Constructor Details
-
CharsetProber
public CharsetProber()
-
-
Method Details
-
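getCharSetName
public abstract String getCharSetName()
-
handleData
public abstract CharsetProber.ProbingState handleData(byte[] buf, int offset, int length)
-
getState
public abstract CharsetProber.ProbingState getState()
-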
reset
public abstract void reset()
-
getConfidence
public abstract float getConfidence()
-
setOption
public abstract void setOption()
-
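The abstract methods above suggest a feed-and-poll lifecycle: reset the prober, pass it byte ranges through handleData, watch the returned state, then read getConfidence and getCharSetName. The sketch below illustrates one way a caller might drive that cycle. It does not use the library: `Prober`, `AsciiOnlyProber`, and `detect` are hypothetical stand-ins, the state names `DETECTING`, `FOUND_IT`, and `NOT_ME` are an assumption (this page names the enum but not its constants), and the 0.5 confidence value is arbitrary. setOption is left out because the page does not say what it configures.

```java
import java.nio.charset.StandardCharsets;

public class ProberSketch {
    // Assumed state names; the page names the enum but not its constants.
    enum ProbingState { DETECTING, FOUND_IT, NOT_ME }

    // Hypothetical local stand-in for the abstract methods listed above
    // (not the library class itself).
    abstract static class Prober {
        abstract String getCharSetName();
        abstract ProbingState handleData(byte[] buf, int offset, int length);
        abstract ProbingState getState();
        abstract void reset();
        abstract float getConfidence();
    }

    // Toy prober: rules itself out as soon as it sees a byte with the high bit set.
    static class AsciiOnlyProber extends Prober {
        private ProbingState state = ProbingState.DETECTING;
        private int accepted = 0;

        @Override String getCharSetName() { return "US-ASCII"; }

        @Override ProbingState handleData(byte[] buf, int offset, int length) {
            for (int i = offset; i < offset + length && state == ProbingState.DETECTING; i++) {
                if ((buf[i] & 0x80) != 0) {
                    state = ProbingState.NOT_ME;
                } else {
                    accepted++;
                }
            }
            return state;
        }

        @Override ProbingState getState() { return state; }

        @Override void reset() {
            state = ProbingState.DETECTING;
            accepted = 0;
        }

        // The 0.5 figure is arbitrary; a real prober would compute its own score.
        @Override float getConfidence() {
            return (state == ProbingState.NOT_ME || accepted == 0) ? 0.0f : 0.5f;
        }
    }

    // Feeds data in chunks and stops early once the prober leaves DETECTING.
    static String detect(Prober prober, byte[] data, int chunkSize) {
        prober.reset();
        for (int off = 0; off < data.length; off += chunkSize) {
            int len = Math.min(chunkSize, data.length - off);
            if (prober.handleData(data, off, len) != ProbingState.DETECTING) {
                break;
            }
        }
        return prober.getConfidence() > 0.0f ? prober.getCharSetName() : null;
    }

    public static void main(String[] args) {
        Prober prober = new AsciiOnlyProber();
        byte[] ascii = "plain text".getBytes(StandardCharsets.US_ASCII);
        byte[] latin1 = "caf\u00e9".getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(detect(prober, ascii, 4));   // prints "US-ASCII"
        System.out.println(detect(prober, latin1, 4));  // prints "null"
    }
}
```

Calling reset at the start of `detect` lets one prober instance be reused across inputs, which is presumably why reset is part of the abstract contract.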
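filterWithoutEnglishLetters
public static ByteBuffer filterWithoutEnglishLetters(byte[] buf, int offset, int length)
-
filterWithEnglishLetters
public static ByteBuffer filterWithEnglishLetters(byte[] buf, int offset, int length)
-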
isAscii
private static boolean isAscii(byte b)
-
isAsciiSymbol
private static boolean isAsciiSymbol(byte b)
-
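The page lists only the signatures of the two predicates and the filter methods, so the following is a sketch of one plausible reading, not the library's code. It assumes that isAscii tests for a clear high bit, that isAsciiSymbol means "an ASCII byte that is not an English letter", and that filterWithoutEnglishLetters drops symbol-delimited segments made purely of ASCII while keeping segments that contain at least one high byte. The constant values are the standard ASCII code points implied by the field names. The class name `FilterSketch` is hypothetical, and flipping the buffer before returning it is a choice made here for readability.

```java
import java.nio.ByteBuffer;

public class FilterSketch {
    // Standard ASCII code points matching the field names documented above.
    static final int ASCII_A = 0x61;          // 'a'
    static final int ASCII_Z = 0x7A;          // 'z'
    static final int ASCII_A_CAPITAL = 0x41;  // 'A'
    static final int ASCII_Z_CAPITAL = 0x5A;  // 'Z'
    static final int ASCII_SP = 0x20;         // ' '

    // Assumption: "ASCII" means the high bit is clear.
    static boolean isAscii(byte b) {
        return (b & 0x80) == 0;
    }

    // Assumption: a "symbol" is any ASCII byte that is not an English letter.
    static boolean isAsciiSymbol(byte b) {
        int c = b & 0xFF;
        boolean lower = c >= ASCII_A && c <= ASCII_Z;
        boolean upper = c >= ASCII_A_CAPITAL && c <= ASCII_Z_CAPITAL;
        return isAscii(b) && !lower && !upper;
    }

    // Keeps only symbol-delimited segments that contain a non-ASCII byte.
    // A kept segment that ends at a symbol is followed by a single space.
    static ByteBuffer filterWithoutEnglishLetters(byte[] buf, int offset, int length) {
        ByteBuffer out = ByteBuffer.allocate(length);
        boolean sawHighByte = false;
        int segmentStart = offset;
        int end = offset + length;
        for (int cur = offset; cur < end; cur++) {
            byte b = buf[cur];
            if (!isAscii(b)) {
                sawHighByte = true;
            } else if (isAsciiSymbol(b)) {
                if (sawHighByte) {
                    // Segment has a high byte: copy it and replace the delimiter with a space.
                    out.put(buf, segmentStart, cur - segmentStart);
                    out.put((byte) ASCII_SP);
                    sawHighByte = false;
                }
                // Either way, the next segment starts after this symbol.
                segmentStart = cur + 1;
            }
        }
        if (sawHighByte) {
            // Trailing segment with a high byte and no closing symbol.
            out.put(buf, segmentStart, end - segmentStart);
        }
        out.flip(); // make the written bytes readable from position 0
        return out;
    }

    public static void main(String[] args) {
        byte[] input = {'a', 'b', 'c', ' ', (byte) 0xE4, (byte) 0xF6, ' ', 'd', 'e', 'f'};
        ByteBuffer kept = filterWithoutEnglishLetters(input, 0, input.length);
        System.out.println(kept.remaining()); // prints 3: 0xE4, 0xF6, and a space
    }
}
```

Under these assumptions the output never exceeds the input length, since each kept segment consumes its delimiter from the input and emits one space in its place.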