Class GermanTagger
java.lang.Object
org.languagetool.tagging.BaseTagger
org.languagetool.tagging.de.GermanTagger
- All Implemented Interfaces:
org.languagetool.tagging.Tagger
- Direct Known Subclasses:
SwissGermanTagger
public class GermanTagger
extends org.languagetool.tagging.BaseTagger
German part-of-speech tagger, requires data file in
de/german.dict in the classpath.
The POS tagset is described in
tagset.txt-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate GermanCompoundTokenizerprivate final org.languagetool.tagging.ManualTaggerFields inherited from class org.languagetool.tagging.BaseTagger
conversionLocale, wordTagger -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprivate List<org.languagetool.tagging.TaggedWord> private List<org.languagetool.AnalyzedToken> getAnalyzedTokens(List<org.languagetool.tagging.TaggedWord> taggedWords, String word) private List<org.languagetool.AnalyzedToken> getAnalyzedTokens(List<org.languagetool.tagging.TaggedWord> taggedWords, String word, List<String> compoundParts) private List<org.languagetool.AnalyzedToken> getImperativeForm(String word, List<String> sentenceTokens, int pos) private org.languagetool.AnalyzedTokengetNoInfoToken(String word) private List<org.languagetool.AnalyzedToken> getSubstantivatedForms(String word, List<String> sentenceTokens, int pos) private void@Nullable org.languagetool.AnalyzedTokenReadingsReturn only the first reading of the given word ornull.private booleanmatchesUppercaseAdjective(String unknownUppercaseToken) private StringsanitizeWord(String word) List<org.languagetool.tagging.TaggedWord> List<org.languagetool.AnalyzedTokenReadings> List<org.languagetool.AnalyzedTokenReadings> Methods inherited from class org.languagetool.tagging.BaseTagger
additionalTags, asAnalyzedToken, asAnalyzedTokenList, asAnalyzedTokenListForTaggedWords, createNullToken, createToken, getAnalyzedTokens, getDictionary, getDictionaryPath, getWordTagger, overwriteWithManualTagger
-
Field Details
-
removalTagger
private final org.languagetool.tagging.ManualTagger removalTagger -
compoundTokenizer
-
-
Constructor Details
-
GermanTagger
public GermanTagger()
-
-
Method Details
-
addStem
-
sanitizeWord
-
getManualAdditionsFileName
- Specified by:
getManualAdditionsFileNamein classorg.languagetool.tagging.BaseTagger
-
getManualRemovalsFileName
- Overrides:
getManualRemovalsFileNamein classorg.languagetool.tagging.BaseTagger
-
lookup
@Nullable public @Nullable org.languagetool.AnalyzedTokenReadings lookup(String word) throws IOException Return only the first reading of the given word ornull.- Throws:
IOException
-
tag
-
matchesUppercaseAdjective
-
tag
public List<org.languagetool.AnalyzedTokenReadings> tag(List<String> sentenceTokens) throws IOException - Specified by:
tagin interfaceorg.languagetool.tagging.Tagger- Overrides:
tagin classorg.languagetool.tagging.BaseTagger- Throws:
IOException
-
tag
public List<org.languagetool.AnalyzedTokenReadings> tag(List<String> sentenceTokens, boolean ignoreCase) throws IOException - Throws:
IOException
-
getImperativeForm
-
getSubstantivatedForms
-
initializeIfRequired
- Throws:
IOException
-
getNoInfoToken
-
getAnalyzedTokens
-
getAnalyzedTokens
-