Index
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form
A
- add(String) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.Add n-gram to profile
- add(String) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
- addChar(char) - Method in class com.optimaize.langdetect.cybozu.util.NGram
- addCharSequence(LangProfile, CharSequence) - Static method in class com.optimaize.langdetect.cybozu.util.Util
- addGram(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
-
Shortcut for addGram(ngram, 1).
- addGram(String, int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
-
If the builder already has this ngram, the given frequency is added to the current count.
- addOpt(String, String, String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
- addText(CharSequence) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
-
In order to use this you must set the
LanguageProfileBuilder.ngramExtractorfirst. - affixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
-
Sets prefixFactor() and suffixFactor() both to the given value.
- alpha - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- alpha - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- alpha(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
- ALPHA_DEFAULT - Static variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- ALPHA_WIDTH - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
-
TODO document what this is for, and why that value is chosen.
- append(char) - Method in class com.optimaize.langdetect.text.TextObject
- append(Reader) - Method in class com.optimaize.langdetect.text.TextObject
-
Append the target text for language detection.
- append(CharSequence) - Method in class com.optimaize.langdetect.text.TextObject
-
Append the target text for language detection.
- append(CharSequence, int, int) - Method in class com.optimaize.langdetect.text.TextObject
- applyPadding(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
- arglist - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
- assignLang(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
B
- backwards() - Static method in class com.optimaize.langdetect.ngram.NgramExtractors
-
The old way of doing n-grams.
- BACKWARDS - Static variable in class com.optimaize.langdetect.ngram.NgramExtractors
- BackwardsCompatibleNgramFilter - Class in com.optimaize.langdetect.ngram
-
Filters those that were not generated by the old n-gram generator.
- BackwardsCompatibleNgramFilter() - Constructor for class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
- BASE_FREQ - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
-
TODO document what this is for, and why that value is chosen.
- batchTest() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
Batch Test of Language Detection (--batchtest option)
- buf_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
- build() - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
- build() - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- build() - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
- BuiltInLanguages - Class in com.optimaize.langdetect.profiles
- BuiltInLanguages() - Constructor for class com.optimaize.langdetect.profiles.BuiltInLanguages
- BUNDLE_NAME - Static variable in class com.optimaize.langdetect.cybozu.util.Messages
C
- capitalword_ - Variable in class com.optimaize.langdetect.cybozu.util.NGram
- charAt(int) - Method in class com.optimaize.langdetect.text.TextObject
- CharNormalizer - Class in com.optimaize.langdetect.cybozu.util
-
Some character normalization (and exclusion) functionality.
- CharNormalizer() - Constructor for class com.optimaize.langdetect.cybozu.util.CharNormalizer
- CharNormalizerTextFilterImpl - Class in com.optimaize.langdetect.text
-
Deprecated.can't be used because it would be a big loss to not inline this code.
- CharNormalizerTextFilterImpl() - Constructor for class com.optimaize.langdetect.text.CharNormalizerTextFilterImpl
-
Deprecated.
- CJK_CLASS - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
-
CJK Kanji Normalization Mapping
- cjk_map - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
- clear() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
- closeQuietly(Closeable) - Static method in class com.optimaize.langdetect.frma.IOUtils
-
Deprecated.use java7 closeable
- closeTag(LangProfile) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
- com.optimaize.langdetect - package com.optimaize.langdetect
- com.optimaize.langdetect.cybozu - package com.optimaize.langdetect.cybozu
-
Original language detection classes from https://code.google.com/p/language-detection/
- com.optimaize.langdetect.cybozu.util - package com.optimaize.langdetect.cybozu.util
-
Provides the utility classes for language detection.
- com.optimaize.langdetect.frma - package com.optimaize.langdetect.frma
- com.optimaize.langdetect.i18n - package com.optimaize.langdetect.i18n
- com.optimaize.langdetect.ngram - package com.optimaize.langdetect.ngram
-
Provides functionality for handling n-grams.
- com.optimaize.langdetect.profiles - package com.optimaize.langdetect.profiles
-
Provides functionality for loading, storing and creating
LanguageProfiles. - com.optimaize.langdetect.profiles.util - package com.optimaize.langdetect.profiles.util
- com.optimaize.langdetect.text - package com.optimaize.langdetect.text
-
Provides functionality for concatenating and cleaning text that is used as a) learning text to produces b) for the text for which the language is to be guessed.
invalid reference
com.optimaize.langdetect.LanguageProfile - CommandLineInterface - Class in com.optimaize.langdetect.cybozu
-
LangDetect Command Line Interface.
- CommandLineInterface() - Constructor for class com.optimaize.langdetect.cybozu.CommandLineInterface
- CommonTextObjectFactories - Class in com.optimaize.langdetect.text
-
Contains some standard
TextObjectFactorys ready to use for common use cases. - CommonTextObjectFactories() - Constructor for class com.optimaize.langdetect.text.CommonTextObjectFactories
- compareTo(DetectedLanguage) - Method in class com.optimaize.langdetect.DetectedLanguage
-
See class header.
- CONV_THRESHOLD - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
-
TODO document what this is for, and why that value is chosen.
- convert(LangProfile) - Static method in class com.optimaize.langdetect.profiles.OldLangProfileConverter
- count() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
- count_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
- countByScript(CharSequence) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
- create() - Method in class com.optimaize.langdetect.text.TextObjectFactory
- create(NgramExtractor) - Static method in class com.optimaize.langdetect.LanguageDetectorBuilder
- create(Collection<LanguageProfile>, Collection<Integer>) - Static method in class com.optimaize.langdetect.NgramFrequencyData
D
- DEFAULT_ALPHA - Static variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
smoothing default parameter (ELE)
- DEFAULT_SEED - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
-
This is used when no custom seed was passed in.
- detect(CharSequence) - Method in interface com.optimaize.langdetect.LanguageDetector
-
Returns the best detected language if the algorithm is very confident.
- detect(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
- detectBlock(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
- detectBlockLongText(List<String>) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
-
This is the original algorithm used for all text length.
- detectBlockShortText(Map<String, Integer>) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
- DetectedLanguage - Class in com.optimaize.langdetect
-
Holds information about a detected language: the locale (language) and the probability.
- DetectedLanguage(LdLocale, double) - Constructor for class com.optimaize.langdetect.DetectedLanguage
- detectLang() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
Language detection test for each file (--detectlang option)
E
- equals(Object) - Method in class com.optimaize.langdetect.i18n.LdLocale
- equals(Object) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- extractCountedGrams(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
- extractGrams(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
-
Creates the n-grams for a given text in the order they occur.
- extractNGrams(CharSequence, OldNgramExtractor.Filter) - Static method in class com.optimaize.langdetect.ngram.OldNgramExtractor
-
Deprecated.
F
- filter - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
- filter(NgramFilter) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
- filter(CharSequence) - Method in class com.optimaize.langdetect.text.CharNormalizerTextFilterImpl
-
Deprecated.
- filter(CharSequence) - Method in class com.optimaize.langdetect.text.MultiTextFilter
- filter(CharSequence) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
- filter(CharSequence) - Method in interface com.optimaize.langdetect.text.TextFilter
- filter(CharSequence) - Method in class com.optimaize.langdetect.text.UrlTextFilter
- filters - Variable in class com.optimaize.langdetect.text.MultiTextFilter
- findMost(Map<Character.UnicodeScript, Long>) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
- forDetectingOnLargeText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
- forDetectingShortCleanText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
- forIndexing() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
- forIndexingCleanText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
- forText(CharSequence) - Method in class com.optimaize.langdetect.text.TextObjectFactory
- forThreshold(double) - Static method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
-
If a script has less than this fraction of content compared to the most used one, its text is removed.
- freq - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.Key = ngram, value = count.
- FREQ_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
- fromString(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
G
- generate(String, File) - Static method in class com.optimaize.langdetect.frma.GenProfile
-
Loads a text file and generate a language profile from its content.
- generateProfile() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
Generate Language Profile from a text file.
- GenProfile - Class in com.optimaize.langdetect.cybozu
-
Load Wikipedia's abstract XML as corpus and generate its language profile in JSON format.
- GenProfile - Class in com.optimaize.langdetect.frma
-
Generate a language profile from any given text file.
- GenProfile() - Constructor for class com.optimaize.langdetect.cybozu.GenProfile
- GenProfile() - Constructor for class com.optimaize.langdetect.frma.GenProfile
- get(int) - Method in class com.optimaize.langdetect.cybozu.util.NGram
-
TODO this method has some weird, undocumented behavior to ignore ngrams with upper case.
- getFreq() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.
- getFrequency(String) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
- getFrequency(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getGramLengths() - Method in class com.optimaize.langdetect.ngram.NgramExtractor
- getGramLengths() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Tells what the n in n-grams are used here.
- getGramLengths() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getInstance() - Static method in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
- getInstance() - Static method in class com.optimaize.langdetect.ngram.StandardNgramFilter
- getInstance() - Static method in class com.optimaize.langdetect.text.UrlTextFilter
- getLanguage() - Method in class com.optimaize.langdetect.i18n.LdLocale
- getLanguage(int) - Method in class com.optimaize.langdetect.NgramFrequencyData
- getLanguageList() - Method in class com.optimaize.langdetect.NgramFrequencyData
- getLanguages() - Static method in class com.optimaize.langdetect.profiles.BuiltInLanguages
- getLocale() - Method in class com.optimaize.langdetect.DetectedLanguage
- getLocale() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
- getLocale() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getMaxGramCount(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Tells how often the n-gram with the highest amount of occurrences used in this profile occurred.
- getMaxGramCount(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getMinGramCount(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Tells how often the n-gram with the lowest amount of occurrences used in this profile occurred.
- getMinGramCount(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getName() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.
- getNumGramOccurrences(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Tells how often all n-grams of a certain length occurred, combined.
- getNumGramOccurrences(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getNumGrams() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Tells how many n-grams there are for all n-gram sizes combined.
- getNumGrams() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getNumGrams(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Tells how many different n-grams there are for a certain n-gram size.
- getNumGrams(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- getNWords() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.
- getParamDouble(String, double) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
Returns the double, or the default is absent.
- getParamLongOrNull(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
- getProbabilities(CharSequence) - Method in interface com.optimaize.langdetect.LanguageDetector
-
Returns all languages with at least some likeliness.
- getProbabilities(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
- getProbabilities(String) - Method in class com.optimaize.langdetect.NgramFrequencyData
-
Don't modify this data structure! (Can't make array immutable...)
- getProbability() - Method in class com.optimaize.langdetect.DetectedLanguage
- getRegion() - Method in class com.optimaize.langdetect.i18n.LdLocale
- getScript() - Method in class com.optimaize.langdetect.i18n.LdLocale
- getShortTextLanguages() - Static method in class com.optimaize.langdetect.profiles.BuiltInLanguages
- getString(String) - Static method in class com.optimaize.langdetect.cybozu.util.Messages
- gramLength(int) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
- gramLengths - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
- gramLengths(Integer...) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
- grams_ - Variable in class com.optimaize.langdetect.cybozu.util.NGram
- guessNumDistinctiveGrams(int, int) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
-
This is trying to be smart.
H
- hashCode() - Method in class com.optimaize.langdetect.i18n.LdLocale
- hashCode() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- hasParam(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
I
- increment(Map<Character.UnicodeScript, Long>, Character.UnicodeScript) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
- initProbability() - Method in class com.optimaize.langdetect.LanguageDetectorImpl
-
Initialize the map of language probabilities.
- INSTANCE - Static variable in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
- INSTANCE - Static variable in class com.optimaize.langdetect.ngram.StandardNgramFilter
- INSTANCE - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
- internalReader - Static variable in class com.optimaize.langdetect.profiles.LanguageProfileReader
- IOUtils - Class in com.optimaize.langdetect.frma
-
Deprecated.
- IOUtils() - Constructor for class com.optimaize.langdetect.frma.IOUtils
-
Deprecated.Private constructor to prevent instantiation.
- isSpace() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
- iterateGrams() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Iterates all ngram strings with frequency.
- iterateGrams() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- iterateGrams(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
-
Iterates all gramLength-gram strings with frequency.
- iterateGrams(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- ITERATION_LIMIT - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
-
TODO document what this is for, and why that value is chosen.
L
- langlist - Variable in class com.optimaize.langdetect.NgramFrequencyData
-
All the loaded languages, in exactly the same order as the data is in the double[] in wordLangProbMap.
- LangProfile - Class in com.optimaize.langdetect.cybozu.util
-
Deprecated.replaced by LanguageProfile
- LangProfile() - Constructor for class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.Constructor for JSONIC
- LangProfile(String) - Constructor for class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.Normal Constructor
- LangProfileReader - Class in com.optimaize.langdetect.frma
-
Reads
LangProfiles. - LangProfileReader() - Constructor for class com.optimaize.langdetect.frma.LangProfileReader
- LangProfileWriter - Class in com.optimaize.langdetect.frma
-
Writes a
LangProfileto an output stream (file). - LangProfileWriter() - Constructor for class com.optimaize.langdetect.frma.LangProfileWriter
- langsAdded - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- language - Variable in class com.optimaize.langdetect.i18n.LdLocale
- LanguageDetector - Interface in com.optimaize.langdetect
-
Guesses the language of an input string or text.
- LanguageDetectorBuilder - Class in com.optimaize.langdetect
-
Builder for
LanguageDetector. - LanguageDetectorBuilder(NgramExtractor) - Constructor for class com.optimaize.langdetect.LanguageDetectorBuilder
- LanguageDetectorImpl - Class in com.optimaize.langdetect
-
This class is immutable and thus thread-safe.
- LanguageDetectorImpl(NgramFrequencyData, double, Optional<Long>, int, double, double, double, double, Map<LdLocale, Double>, NgramExtractor) - Constructor for class com.optimaize.langdetect.LanguageDetectorImpl
-
Use the
LanguageDetectorBuilder. - LanguageLister - Class in com.optimaize.langdetect.profiles.util
-
This is just a utility to update the code with the existing languages.
- LanguageLister() - Constructor for class com.optimaize.langdetect.profiles.util.LanguageLister
- languagePriorities(Map<LdLocale, Double>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
-
TODO document exactly.
- LanguageProfile - Interface in com.optimaize.langdetect.profiles
-
A language profile knows the locale (language), and contains the n-grams and some statistics.
- LanguageProfileBuilder - Class in com.optimaize.langdetect.profiles
-
Builder for
LanguageProfile. - LanguageProfileBuilder(LdLocale) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- LanguageProfileBuilder(String) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileBuilder
-
Deprecated.
- LanguageProfileImpl - Class in com.optimaize.langdetect.profiles
-
This class is immutable.
- LanguageProfileImpl(LdLocale, Map<Integer, Map<String, Integer>>) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileImpl
-
Use the builder.
- LanguageProfileImpl.Stats - Class in com.optimaize.langdetect.profiles
- LanguageProfileReader - Class in com.optimaize.langdetect.profiles
-
Reads
LanguageProfiles. - LanguageProfileReader() - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileReader
- languageProfiles - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- LanguageProfileWriter - Class in com.optimaize.langdetect.profiles
-
Writes a
LanguageProfileto an output stream or file. - LanguageProfileWriter() - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileWriter
- languages - Static variable in class com.optimaize.langdetect.profiles.BuiltInLanguages
- langWeightingMap - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- LATIN1_EXCLUDED - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
- LdLocale - Class in com.optimaize.langdetect.i18n
-
A language-detector implementation of a Locale, similar to the java.util.Locale.
- LdLocale(String, Optional<String>, Optional<String>) - Constructor for class com.optimaize.langdetect.i18n.LdLocale
- length() - Method in class com.optimaize.langdetect.text.TextObject
- LESS_FREQ_RATIO - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.Explanation by example: If the most frequent n-gram occurs 1 mio times, then 1'000'000 / this (100'000) = 10.
- load(String, File) - Static method in class com.optimaize.langdetect.cybozu.GenProfile
-
Load Wikipedia abstract database file and generate its language profile
- locale - Variable in class com.optimaize.langdetect.DetectedLanguage
- locale - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- locale - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- logger - Static variable in class com.optimaize.langdetect.cybozu.GenProfile
- logger - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
- looksLikeGeoCode3166_1(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
- looksLikeGeoCodeNumeric(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
- looksLikeLanguageProfileFile(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
- looksLikeLanguageProfileName(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
- looksLikeScriptCode(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
M
- MAIL_REGEX - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
- main(String[]) - Static method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
Command Line Interface
- main(String[]) - Static method in class com.optimaize.langdetect.profiles.util.LanguageLister
- makeDetector() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
Using all language profiles from the given directory.
- makeInternalPrioMap(Map<LdLocale, Double>, List<LdLocale>) - Static method in class com.optimaize.langdetect.cybozu.util.Util
- makePathForClassLoader(String, String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
- makeProfileFileName(LdLocale) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
- makeStats(Map<Integer, Map<String, Integer>>) - Static method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- maxGramCounts - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
-
Key = gram length (1-3 or so).
- maxTextLength - Variable in class com.optimaize.langdetect.text.TextObject
- maxTextLength - Variable in class com.optimaize.langdetect.text.TextObjectFactory
- maxTextLength - Variable in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
- maxTextLength(int) - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
- Messages - Class in com.optimaize.langdetect.cybozu.util
-
This is
Messagesclass generated by Eclipse automatically. - Messages() - Constructor for class com.optimaize.langdetect.cybozu.util.Messages
- minGramCounts - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
-
Key = gram length (1-3 or so).
- minimalConfidence - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- minimalConfidence - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- minimalConfidence(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
-
LanguageDetector.detect(java.lang.CharSequence)returns a language if the best detected language has at least this probability. - minimalFrequency - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- minimalFrequency(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- MINIMUM_FREQ - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.n-grams that occur less than this often can be removed using omitLessFreq().
- MultiTextFilter - Class in com.optimaize.langdetect.text
-
Groups multiple
TextFilters as one and runs them in the given order. - MultiTextFilter(List<TextFilter>) - Constructor for class com.optimaize.langdetect.text.MultiTextFilter
N
- N_GRAM - Static variable in class com.optimaize.langdetect.cybozu.util.NGram
-
ngrams are created from 1gram to this amount, currently 2grams and 3grams.
- N_TRIAL - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
-
TODO document what this is for, and why that value is chosen.
- N_WORDS_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
- name - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.The language name (identifier).
- NAME_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
- NGram - Class in com.optimaize.langdetect.cybozu.util
-
TODO document.
- NGram() - Constructor for class com.optimaize.langdetect.cybozu.util.NGram
- ngramExtractor - Static variable in class com.optimaize.langdetect.cybozu.util.Util
- ngramExtractor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- ngramExtractor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- ngramExtractor - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- ngramExtractor(NgramExtractor) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- NgramExtractor - Class in com.optimaize.langdetect.ngram
-
Class for extracting n-grams out of a text.
- NgramExtractor(List<Integer>, NgramFilter, Character) - Constructor for class com.optimaize.langdetect.ngram.NgramExtractor
- NgramExtractors - Class in com.optimaize.langdetect.ngram
-
Provides easy access to commonly used NgramExtractor configs.
- NgramExtractors() - Constructor for class com.optimaize.langdetect.ngram.NgramExtractors
- NgramFilter - Interface in com.optimaize.langdetect.ngram
-
Filters out some undesired n-grams.
- ngramFrequencyData - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- NgramFrequencyData - Class in com.optimaize.langdetect
-
Contains frequency information for n-grams coming from multiple
LanguageProfiles. - NgramFrequencyData(Map<String, double[]>, List<LdLocale>) - Constructor for class com.optimaize.langdetect.NgramFrequencyData
- ngrams - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- ngrams - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- normalize(char) - Static method in class com.optimaize.langdetect.cybozu.util.CharNormalizer
-
Character Normalization (and exclusion).
- normalizeProb(double[]) - Static method in class com.optimaize.langdetect.cybozu.util.Util
-
normalize probabilities and check convergence by the maximum probability
- numOccurrences - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
-
Key = gram length (1-3 or so).
- nWords - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.Tells how many occurrences of n-grams exist per gram length.
O
- OldLangProfileConverter - Class in com.optimaize.langdetect.profiles
-
Converts an old
LangProfileto a newLanguageProfile. - OldLangProfileConverter() - Constructor for class com.optimaize.langdetect.profiles.OldLangProfileConverter
- OldNgramExtractor - Class in com.optimaize.langdetect.ngram
-
Deprecated.
- OldNgramExtractor() - Constructor for class com.optimaize.langdetect.ngram.OldNgramExtractor
-
Deprecated.
- OldNgramExtractor.Filter - Interface in com.optimaize.langdetect.ngram
-
Deprecated.
- omitLessFreq() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.Removes ngrams that occur fewer times than MINIMUM_FREQ to get rid of rare ngrams.
- opt_with_value - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
for Command line easy parser
- opt_without_value - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
P
- parse(String[]) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
Command line easy parser
- prefixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- prefixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- prefixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
-
To weight n-grams that are on the left border of a word differently from n-grams in the middle of words, assign a value here.
- priorMap - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
-
User-defined language priorities, in the same order as
langlist. - probability - Variable in class com.optimaize.langdetect.DetectedLanguage
- PROBABILITY_SORTING_COMPARATOR - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
- probabilityThreshold - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- probabilityThreshold - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- probabilityThreshold(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
-
LanguageDetector.getProbabilities(java.lang.CharSequence)does not return languages with less probability than this. - PROFILES_DIR - Static variable in class com.optimaize.langdetect.profiles.LanguageProfileReader
R
- read(File) - Method in class com.optimaize.langdetect.frma.LangProfileReader
-
Reads a
LangProfilefrom a File in UTF-8. - read(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Reads a
LanguageProfilefrom a File in UTF-8. - read(InputStream) - Method in class com.optimaize.langdetect.frma.LangProfileReader
-
Reads a
LangProfilefrom an InputStream in UTF-8. - read(InputStream) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Reads a
LanguageProfilefrom an InputStream in UTF-8. - read(ClassLoader, String, Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Load profiles from the classpath in a specific directory.
- read(String, Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Same as
LanguageProfileReader.read(ClassLoader, String, java.util.Collection)using the class loader of this class. - read(Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Same as
LanguageProfileReader.read(ClassLoader, String, java.util.Collection)using the class loader of this class, and the default profiles directory of this library. - readAll() - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Deprecated.renamed to readAllBuiltIn()
- readAll(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Loads all profiles from the specified directory.
- readAllBuiltIn() - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
-
Reads all built-in language profiles from the "languages" folder (shipped with the jar).
- readBuiltIn(LdLocale) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
- readBuiltIn(Collection<LdLocale>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
- readFilesFromClassPathFolder(String) - Static method in class com.optimaize.langdetect.profiles.util.LanguageLister
- region - Variable in class com.optimaize.langdetect.i18n.LdLocale
- remove(CharSequence, Set<Character.UnicodeScript>) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
- RemoveMinorityScriptsTextFilter - Class in com.optimaize.langdetect.text
-
Removes text written in scripts that are not the dominant script of the text.
- RemoveMinorityScriptsTextFilter(double) - Constructor for class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
- removeNgramsWithLessFrequency() - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
- requireParamString(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
- RESOURCE_BUNDLE - Static variable in class com.optimaize.langdetect.cybozu.util.Messages
S
- script - Variable in class com.optimaize.langdetect.i18n.LdLocale
- searchFile(File, String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
-
File search (easy glob)
- seed - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- seed - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- seed(long) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
- seed(Optional<Long>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
- serialVersionUID - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.
- setFreq(Map<String, Integer>) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.
- setName(String) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.
- setNWords(int[]) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
-
Deprecated.
- setTag(String) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
- shortTextAlgorithm - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- shortTextAlgorithm - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- shortTextAlgorithm(int) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
-
Defaults to 0, which means don't use this feature.
- shortTextLanguages - Static variable in class com.optimaize.langdetect.profiles.BuiltInLanguages
- sortProbability(double[]) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
-
Returns the detected languages sorted by probabilities descending.
- standard() - Static method in class com.optimaize.langdetect.ngram.NgramExtractors
-
The new standard n-gram algorithm.
- STANDARD - Static variable in class com.optimaize.langdetect.ngram.NgramExtractors
- StandardNgramFilter - Class in com.optimaize.langdetect.ngram
-
Filters what is generally not desired.
- StandardNgramFilter() - Constructor for class com.optimaize.langdetect.ngram.StandardNgramFilter
- stats - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- Stats(Map<Integer, Long>, Map<Integer, Long>, Map<Integer, Long>) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
- stringBuilder - Variable in class com.optimaize.langdetect.text.TextObject
- subSequence(int, int) - Method in class com.optimaize.langdetect.text.TextObject
- suffixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
- suffixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
- suffixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
-
Defaults to 1.0, which means don't use this feature.
T
- tag_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
- TagExtractor - Class in com.optimaize.langdetect.cybozu.util
-
TagExtractoris a class which extracts inner texts of specified tag. - TagExtractor(String, int) - Constructor for class com.optimaize.langdetect.cybozu.util.TagExtractor
- target_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
- textFilter - Variable in class com.optimaize.langdetect.text.TextObject
- textFilter - Variable in class com.optimaize.langdetect.text.TextObjectFactory
- TextFilter - Interface in com.optimaize.langdetect.text
-
Allows to filter content from a text to be ignored for the n-gram analysis.
- textFilters - Variable in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
- TextObject - Class in com.optimaize.langdetect.text
-
A convenient text object implementing CharSequence and Appendable.
- TextObject(TextFilter, int) - Constructor for class com.optimaize.langdetect.text.TextObject
- textObjectFactory - Static variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
- textObjectFactory - Static variable in class com.optimaize.langdetect.frma.GenProfile
- TextObjectFactory - Class in com.optimaize.langdetect.text
-
Factory for
TextObjects. - TextObjectFactory(TextFilter, int) - Constructor for class com.optimaize.langdetect.text.TextObjectFactory
- TextObjectFactoryBuilder - Class in com.optimaize.langdetect.text
-
Builder for
TextObjectFactory. - TextObjectFactoryBuilder() - Constructor for class com.optimaize.langdetect.text.TextObjectFactoryBuilder
- textPadding - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
- textPadding(char) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
-
To ensure having border grams, this character is added to the left and right of the text.
- threshold - Variable in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
- threshold_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
- toString() - Method in class com.optimaize.langdetect.DetectedLanguage
- toString() - Method in class com.optimaize.langdetect.i18n.LdLocale
-
The output of this can be fed to the fromString() method.
- toString() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
- toString() - Method in class com.optimaize.langdetect.text.TextObject
U
- unicodeEncode(String) - Static method in class com.optimaize.langdetect.cybozu.util.Util
-
unicode encoding (for verbose mode)
- updateLangProb(double[], String, int, double) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
-
update language probabilities with N-gram string(N=1,2,3)
- URL_REGEX - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
- UrlTextFilter - Class in com.optimaize.langdetect.text
-
Removes URLs and email addresses from the text.
- UrlTextFilter() - Constructor for class com.optimaize.langdetect.text.UrlTextFilter
- use(String) - Method in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
- use(String) - Method in interface com.optimaize.langdetect.ngram.NgramFilter
- use(String) - Method in interface com.optimaize.langdetect.ngram.OldNgramExtractor.Filter
-
Deprecated.Allows to skip some n-grams.
- use(String) - Method in class com.optimaize.langdetect.ngram.StandardNgramFilter
- Util - Class in com.optimaize.langdetect.cybozu.util
-
A place for sharing code.
- Util() - Constructor for class com.optimaize.langdetect.cybozu.util.Util
V
- values - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
W
- withProfile(LanguageProfile) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
- withProfiles(Iterable<LanguageProfile>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
- withTextFilter(TextFilter) - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
-
Adds the given TextFilter to be run on
TextObject.append(java.io.Reader)methods. - wordLangProbMap - Variable in class com.optimaize.langdetect.NgramFrequencyData
-
Key = ngram Value = array with probabilities per loaded language, in the same order as
langlist. - wordProbToString(double[], List<LdLocale>) - Static method in class com.optimaize.langdetect.cybozu.util.Util
- write(LangProfile, OutputStream) - Method in class com.optimaize.langdetect.frma.LangProfileWriter
-
Writes a
LangProfileto an OutputStream in UTF-8. - write(LanguageProfile, OutputStream) - Method in class com.optimaize.langdetect.profiles.LanguageProfileWriter
-
Writes a
LanguageProfileto an OutputStream in UTF-8. - writeToDirectory(LanguageProfile, File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileWriter
-
Writes a
LanguageProfileto a folder using the language name as the file name.
_
- _extractCounted(CharSequence, int, int, Map<String, Integer>) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form