Package de.danielnaber.jwordsplitter
Class GermanWordSplitter
java.lang.Object
de.danielnaber.jwordsplitter.AbstractWordSplitter
de.danielnaber.jwordsplitter.GermanWordSplitter
Split German compound words. Based on an embedded dictionary, or on an
external plain text dictionary.
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate GermanInterfixDisambiguatorprivate static final Stringprivate static final Collection<String> Interfixes = FugenelementeFields inherited from class de.danielnaber.jwordsplitter.AbstractWordSplitter
words -
Constructor Summary
ConstructorsConstructorDescriptionGermanWordSplitter(boolean hideInterfixCharacters) GermanWordSplitter(boolean hideInterfixCharacters, File plainTextDict) GermanWordSplitter(boolean hideInterfixCharacters, InputStream plainTextDict) GermanWordSplitter(boolean hideInterfixCharacters, Set<String> words) -
Method Summary
Modifier and TypeMethodDescriptionprotected intprotected GermanInterfixDisambiguatorprotected Collection<String> Interfix elements in lowercase, e.g.getWordList(InputStream stream) private voidinit()Methods inherited from class de.danielnaber.jwordsplitter.AbstractWordSplitter
addException, getAllSplits, getAllSplits, getSubWords, setExceptionFile, setMaximumWordLength, setMinimumWordLength, setStrictMode, splitWord, splitWord
-
Field Details
-
EXCEPTION_DICT
- See Also:
-
INTERFIXES
Interfixes = Fugenelemente -
disambiguator
-
-
Constructor Details
-
GermanWordSplitter
- Throws:
IOException
-
GermanWordSplitter
public GermanWordSplitter(boolean hideInterfixCharacters, InputStream plainTextDict) throws IOException - Throws:
IOException
-
GermanWordSplitter
- Throws:
IOException
-
GermanWordSplitter
- Throws:
IOException- Since:
- 4.2
-
-
Method Details
-
init
- Throws:
IOException
-
getWordList
- Specified by:
getWordListin classAbstractWordSplitter- Throws:
IOException
-
getWordList
- Specified by:
getWordListin classAbstractWordSplitter
-
getDisambiguator
- Specified by:
getDisambiguatorin classAbstractWordSplitter
-
getDefaultMinimumWordLength
protected int getDefaultMinimumWordLength()- Specified by:
getDefaultMinimumWordLengthin classAbstractWordSplitter
-
getInterfixCharacters
Description copied from class:AbstractWordSplitterInterfix elements in lowercase, e.g. at least "s" for German.- Specified by:
getInterfixCharactersin classAbstractWordSplitter
-