All Classes Interface Summary Class Summary Enum Summary Annotation Types Summary
| Class |
Description |
| AbstractArrayEncodedNgramLanguageModel<W> |
|
| AbstractContextEncodedNgramLanguageModel<W> |
|
| AbstractNgramLanguageModel<W> |
|
| AbstractNgramMap<T> |
|
| AbstractTMap<T extends java.lang.Comparable> |
Contains some limited shared functionality between Custom[type]Maps
|
| AbstractTMap.Functionality<T extends java.lang.Comparable> |
|
| AbstractTMap.MapType |
|
| Annotations |
|
| Annotations.Option |
|
| Annotations.OutputParameter |
Just a fancy-pants comment.
|
| Annotations.PrintMemoryCount |
Fields annotated with this annotation will have their memory usage added
to the memory usage map returned by countApproximateMemoryUsage.
|
| Annotations.SkipMemoryCount |
|
| ArpaLmReader<W> |
A parser for ARPA LM files.
|
| ArpaLmReaderCallback<V> |
Callback that is called for each n-gram in the collection
|
| ArrayEncodedCachingLmWrapper<W> |
|
| ArrayEncodedDirectMappedLmCache |
A direct-mapped cache.
|
| ArrayEncodedLmCache |
|
| ArrayEncodedNgramLanguageModel<W> |
Top-level interface for an n-gram language model which accepts n-gram in an
array-of-integers encoding.
|
| ArrayEncodedNgramLanguageModel.DefaultImplementations |
|
| ArrayEncodedProbBackoffLm<W> |
Language model implementation which uses Kneser-Ney-style backoff
computation.
|
| BitList |
|
| BitStream |
Wraps a portion of a long[] array with iterator-like functionality over a
stream of bits.
|
| BitUtils |
|
| BoundedList<E> |
List which returns special boundary symbols when get() is called outside the
range of the list.
|
| CompressedNgramMap<T> |
|
| CompressibleProbBackoffValueContainer |
|
| CompressibleValueContainer<V> |
|
| CompressionUtils |
|
| ComputeLogProbabilityOfTextStream |
Computes the log probability of a list of files.
|
| ConfigOptions |
Stores some configuration options, with useful defaults.
|
| ContextEncodedCachingLmWrapper<T> |
|
| ContextEncodedDirectMappedLmCache |
|
| ContextEncodedLmCache |
|
| ContextEncodedNgramLanguageModel<W> |
Interface for language models which expose the internal context-encoding for
more efficient queries.
|
| ContextEncodedNgramLanguageModel.DefaultImplementations |
|
| ContextEncodedNgramLanguageModel.LmContextInfo |
Simple class for returning context offsets
|
| ContextEncodedNgramMap<T> |
|
| ContextEncodedProbBackoffLm<W> |
Language model implementation which uses Kneser-Ney style backoff
computation.
|
| Counter<E> |
A map from objects to doubles.
|
| CountValueContainer |
|
| CustomWidthArray |
An array with a custom word "width" in bits.
|
| FirstPassCallback<V extends LongRepresentable<V>> |
Reader callback which adds n-grams to an NgramMap
|
| GoogleLmReader<W> |
Reads in n-gram count collections in the format that the Google n-grams Web1T
corpus comes in.
|
| HashNgramMap<T> |
|
| Indexer<E extends java.lang.Comparable<E>> |
Maintains a two-way map between a set of objects and contiguous integers from
0 to the number of objects.
|
| IOUtils |
Some IO utility functions.
|
| Iterators |
Utilities for dealing with Iterators
|
| Iterators.IteratorIterator<T> |
Wraps a two-level iteration scenario in an iterator.
|
| Iterators.Transform<S,T> |
Wraps a base iterator with a transformation function.
|
| KneserNeyCountValueContainer |
Stored type and token counts necessary for estimating a Kneser-Ney language
model
|
| KneserNeyCountValueContainer.KneserNeyCounts |
Warning: type counts are stored internally as 32-bit ints.
|
| KneserNeyFileWritingLmReaderCallback<W> |
Class for producing a Kneser-Ney language model in ARPA format from raw text.
|
| KneserNeyLmReaderCallback<W> |
Class for producing a Kneser-Ney language model in ARPA format from raw text.
|
| LmReader<V,C extends LmReaderCallback<V>> |
|
| LmReaderCallback<V> |
Callback that is called for each n-gram in the collection
|
| LmReaders |
This class contains a number of static methods for reading/writing/estimating
n-gram language models.
|
| Logger |
Basic logging singleton class.
|
| Logger.CompoundLogger |
Convenience class for stringing together loggers.
|
| Logger.LogInterface |
Logging interface.
|
| Logger.NullLogger |
Default logging goes nowhere.
|
| Logger.SystemLogger |
Logs to System.out and System.err
|
| LongArray |
|
| LongArray.StaticMethods |
|
| LongHashSet |
Open address hash map with linear probing.
|
| LongRef |
|
| LongRepresentable<T> |
|
| LongToIntHashMap |
Open address hash map with linear probing.
|
| MakeKneserNeyArpaFromText |
Estimates a Kneser-Ney language model from raw text, and writes the language
model out in ARPA-format.
|
| MakeLmBinaryFromArpa |
Given a language model in ARPA format, builds a binary representation of the
language model and writes it to disk.
|
| MakeLmBinaryFromGoogle |
Given a directory in Google n-grams format, builds a binary representation of
a stupid-backoff language model language model and writes it to disk.
|
| MakeNgramMapBinaryFromGoogle |
Like MakeLmBinaryFromGoogle, except it only writes the NgramMap
portion of the LM, meaning the binary does not contain the vocabulary.
|
| MosesPhraseTable |
Experimental class for reading Moses phrase tables and storing them
efficiently in memory using a trie.
|
| MosesPhraseTable.TargetSideTranslation |
|
| MosesPhraseTableReader<W> |
|
| MosesPhraseTableReaderCallback<W> |
Class for representing phrase tables efficiently in memory.
|
| MurmurHash |
Taken/modified from
http://d3s.mff.cuni.cz/~holub/sw/javamurmurhash/MurmurHash.java
|
| NgramIterableWrapper<W,V> |
Wraps an NgramMap as an Iterable, so it is easy to iterate over the n-grams
and associated values.
|
| NgramLanguageModel<W> |
Base interface for an n-gram language model, which exposes only inefficient
convenience methods.
|
| NgramLanguageModel.StaticMethods |
|
| NgramMap<V> |
|
| NgramMap.Entry<T> |
|
| NgramMapAddingCallback<V> |
Reader callback which adds n-grams to an NgramMap
|
| NgramMapWrapper<W,V> |
Wraps an NgramMap as a Java Map, with ngrams of all orders mixed together.
|
| NgramOrderedLmReaderCallback<V> |
Callback that is called for each n-gram in the collection
|
| NgramsForOrderIterableWrapper<W,V> |
Wraps an NgramMap as an Iterable, so it is easy to iterate over the n-grams
of a particular order.
|
| NgramsForOrderMapWrapper<W,V> |
Wraps an NgramMap as a Java Map, but only ngrams of a particular order.
|
| Pair<F,S> |
A generic-typed pair of objects.
|
| PhraseTableCounts |
|
| PhraseTableValueContainer |
Stored type and token counts necessary for estimating a Kneser-Ney language
model
|
| PhraseTableValueContainer.FeaturePhraseTableValues |
|
| PhraseTableValueContainer.PhraseTableValues |
|
| PhraseTableValueContainer.TargetTranslationsValues |
|
| ProbBackoffPair |
|
| ProbBackoffValueContainer |
|
| StringWordIndexer |
Implementation of a WordIndexer in which words are represented as strings.
|
| StrUtils |
|
| StupidBackoffLm<W> |
Language model implementation which uses stupid backoff (Brants et al., 2007)
computation.
|
| TextReader<W> |
Class for reading raw text files.
|
| TIntMap<T extends java.lang.Comparable> |
Provides a map from objects to non-negative integers.
|
| UncompressedProbBackoffValueContainer |
|
| UnrankedUncompressedProbBackoffValueContainer |
|
| ValueContainer<V> |
Manages storage of arbitrary values in an NgramMap
|
| VariableLengthBitCompressor |
|
| WordIndexer<W> |
Enumerates words in the vocabulary of a language model.
|
| WordIndexer.StaticMethods |
|