Package org.languagetool.languagemodel
Class LuceneLanguageModel
java.lang.Object
org.languagetool.languagemodel.BaseLanguageModel
org.languagetool.languagemodel.LuceneLanguageModel
- All Implemented Interfaces:
AutoCloseable,LanguageModel
Like
LuceneSingleIndexLanguageModel, but can merge the results of
lookups in several independent indexes to one result.- Since:
- 2.7
-
Field Summary
FieldsFields inherited from interface org.languagetool.languagemodel.LanguageModel
GOOGLE_SENTENCE_END, GOOGLE_SENTENCE_START -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoidclose()longGet the occurrence count fortoken.longGet the occurrence count for the given token sequence.private static @Nullable File[]getSubDirectoriesOrNull(File topIndexDir) longtoString()static voidvalidateDirectory(File topIndexDir) Methods inherited from class org.languagetool.languagemodel.BaseLanguageModel
getPseudoProbability, getPseudoProbabilityStupidBackoff
-
Field Details
-
lms
-
-
Constructor Details
-
LuceneLanguageModel
- Parameters:
topIndexDir- a directory which contains either: 1) sub directories called1grams,2grams,3grams, which are Lucene indexes with ngram occurrences as created byorg.languagetool.dev.FrequencyIndexCreatoror 2) sub directoriesindex-1,index-2etc that contain the sub directories described under 1)
-
-
Method Details
-
validateDirectory
-
getSubDirectoriesOrNull
-
getCount
Description copied from class:BaseLanguageModelGet the occurrence count for the given token sequence.- Specified by:
getCountin classBaseLanguageModel
-
getCount
Description copied from class:BaseLanguageModelGet the occurrence count fortoken.- Specified by:
getCountin classBaseLanguageModel
-
getTotalTokenCount
public long getTotalTokenCount()- Specified by:
getTotalTokenCountin classBaseLanguageModel
-
close
public void close() -
toString
-