Class TwoStepsGOV3Function<T>
java.lang.Object
it.unimi.dsi.fastutil.objects.AbstractObject2LongFunction<T>
it.unimi.dsi.sux4j.mph.AbstractHashFunction<T>
it.unimi.dsi.sux4j.mph.TwoStepsGOV3Function<T>
- All Implemented Interfaces:
it.unimi.dsi.fastutil.Function<T,Long>, it.unimi.dsi.fastutil.objects.Object2LongFunction<T>, it.unimi.dsi.fastutil.Size64, Serializable, Function<T, Long>, ToLongFunction<T>
public class TwoStepsGOV3Function<T>
extends AbstractHashFunction<T>
implements Serializable, it.unimi.dsi.fastutil.Size64
A function stored using two GOV3Functions: one for
frequent values and one for infrequent values. This naive idea turns out to be very effective in reducing the function
size when the distribution of values is skewed (as happens, for example, in a
TwoStepsLcpMonotoneMinimalPerfectHashFunction).
To create an instance, we perform a pre-scan of the values to be assigned. If possible, we find the best possible
r such that the 2^r − 1 most frequent values can be stored in a GOV3Function
and suitably remapped when read. The function uses 2^r − 1 as an escape symbol for all other
values, which are stored in a separate function.
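The pre-scan described above can be illustrated with a small, self-contained sketch. It is not the library's code: the class name TwoStepWidthSketch, the method bestR, and the cost model (r bits per key for the first function, the full width for each escaped key, and 64 bits per remap entry, ignoring the constant overhead of the underlying functions) are assumptions made for illustration only.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: names and cost model are illustrative assumptions,
// not the actual implementation of TwoStepsGOV3Function.
class TwoStepWidthSketch {
    /** Returns the r minimizing the assumed cost, or 0 if a single function looks cheaper. */
    static int bestR(long[] values, int width) {
        // Pre-scan: count how often each value occurs.
        Map<Long, Long> counts = new HashMap<>();
        for (long v : values) counts.merge(v, 1L, Long::sum);

        // Frequencies in decreasing order (rank 0 is the most frequent value).
        List<Long> freq = new ArrayList<>(counts.values());
        freq.sort(Collections.reverseOrder());

        long n = values.length;
        long bestCost = n * width; // assumed cost of one function of full width
        int best = 0;
        for (int r = 1; r < width; r++) {
            // At most 2^r - 1 values fit in the first function; 2^r - 1 itself is the escape.
            int frequent = (int) Math.min(freq.size(), (1L << r) - 1);
            long covered = 0;
            for (int i = 0; i < frequent; i++) covered += freq.get(i);
            // Assumed cost: first function + escaped keys in the second function + remap array.
            long cost = n * r + (n - covered) * width + frequent * 64L;
            if (cost < bestCost) {
                bestCost = cost;
                best = r;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // 900 keys share one value; the remaining 100 keys have distinct 20-bit values.
        long[] skewed = new long[1000];
        for (int i = 0; i < skewed.length; i++) skewed[i] = i < 900 ? 7 : 100000 + i;
        System.out.println(bestR(skewed, 20));
    }
}
```

Under this assumed cost model a heavily skewed input favors a small r, whereas an input whose values are all distinct gives 0, meaning that a single function would be used.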
Warning: during the construction phase, a filter
will be set on the BucketedHashStore used to store the keys. If you are passing a store,
you will have to reset it to its previous state.
- Since:
- 4.0
- Author:
- Sebastiano Vigna
Nested Class Summary
Nested Classes -
Field Summary
Fields (Modifier and Type, Field, Description)
- protected final int escape: The escape value returned by firstFunction to suggest that secondFunction should be queried instead, provided that there is a first function.
- protected final GOV3Function<T> firstFunction: The first function, or null.
- protected final long n: The number of keys.
- protected final double rankMean: The mean of the rank distribution.
- protected final long[] remap: A mapping from values of the first function to actual values, provided that there is a first function.
- protected final GOV3Function<T> secondFunction: The second function.
- protected long seed: The seed to be used when converting keys to signatures.
- static final long serialVersionUID
- protected final it.unimi.dsi.bits.TransformationStrategy<? super T> transform: The transformation strategy to turn objects of type T into bit vectors.
- protected final int width: The width of the output of this function, in bits.
Fields inherited from class it.unimi.dsi.fastutil.objects.AbstractObject2LongFunction
defRetValue
-
Constructor Summary
Constructors (Modifier, Constructor, Description)
- protected TwoStepsGOV3Function(Iterable<? extends T> keys, it.unimi.dsi.bits.TransformationStrategy<? super T> transform, it.unimi.dsi.fastutil.longs.LongBigList values, File tempDir, BucketedHashStore<T> bucketedHashStore): Creates a new two-step function for the given keys and values.
-
Method Summary
Methods inherited from class AbstractHashFunction
containsKey, size
Methods inherited from class it.unimi.dsi.fastutil.objects.AbstractObject2LongFunction
defaultReturnValue, defaultReturnValue
Methods inherited from class Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface it.unimi.dsi.fastutil.Function
apply, clear
Methods inherited from interface it.unimi.dsi.fastutil.objects.Object2LongFunction
andThen, andThenByte, andThenChar, andThenDouble, andThenFloat, andThenInt, andThenLong, andThenObject, andThenReference, andThenShort, applyAsLong, composeByte, composeChar, composeDouble, composeFloat, composeInt, composeLong, composeObject, composeReference, composeShort, get, getOrDefault, getOrDefault, put, put, remove, removeLong
Methods inherited from interface it.unimi.dsi.fastutil.Size64
size
-
Field Details
-
serialVersionUID
public static final long serialVersionUID
-
n
protected final long n
The number of keys.
-
transform
protected final it.unimi.dsi.bits.TransformationStrategy<? super T> transform
The transformation strategy to turn objects of type T into bit vectors.
-
firstFunction
protected final GOV3Function<T> firstFunction
The first function, or null. The special output value escape denotes that secondFunction should be queried instead.
-
secondFunction
protected final GOV3Function<T> secondFunction
The second function. All queries for which firstFunction returns escape (or simply all queries, if firstFunction is null) will be rerouted here.
-
remap
protected final long[] remap
A mapping from values of the first function to actual values, provided that there is a first function.
-
escape
protected final int escape
The escape value returned by firstFunction to suggest that secondFunction should be queried instead, provided that there is a first function.
-
seed
protected long seed
The seed to be used when converting keys to signatures.
-
width
protected final int width
The width of the output of this function, in bits.
-
rankMean
protected final double rankMean
The mean of the rank distribution.
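How firstFunction, secondFunction, remap and escape interact at query time can be pictured with the following minimal sketch. It is only an illustration under stated assumptions: the class TwoStepLookupSketch is hypothetical, plain java.util.function.ToLongFunction instances stand in for the two GOV3Functions, and the actual class works on key signatures rather than calling the functions directly.

```java
import java.util.Map;
import java.util.function.ToLongFunction;

// Hypothetical stand-in for the two-step query logic; not the library's code.
class TwoStepLookupSketch<T> {
    final ToLongFunction<T> firstFunction;  // may be null, as documented for the field
    final ToLongFunction<T> secondFunction; // receives every escaped query
    final long[] remap;                     // rank returned by firstFunction -> actual value
    final int escape;                       // output of firstFunction meaning "ask secondFunction"

    TwoStepLookupSketch(ToLongFunction<T> firstFunction, ToLongFunction<T> secondFunction, long[] remap, int escape) {
        this.firstFunction = firstFunction;
        this.secondFunction = secondFunction;
        this.remap = remap;
        this.escape = escape;
    }

    long getLong(T key) {
        if (firstFunction != null) {
            long rank = firstFunction.applyAsLong(key);
            // A frequent value: translate its rank through remap.
            if (rank != escape) return remap[(int) rank];
        }
        // An infrequent value, or no first function at all.
        return secondFunction.applyAsLong(key);
    }

    public static void main(String[] args) {
        // With r = 2 the ranks 0..2 are frequent values and 3 is the escape.
        Map<String, Long> first = Map.of("a", 0L, "b", 1L, "c", 3L);
        Map<String, Long> second = Map.of("c", 123456L);
        TwoStepLookupSketch<String> f =
            new TwoStepLookupSketch<String>(first::get, second::get, new long[] { 10, 20, 30 }, 3);
        System.out.println(f.getLong("a")); // prints 10
        System.out.println(f.getLong("c")); // prints 123456
    }
}
```

As with the real structure, the result for a key outside the original key set is not meaningful; the sketch does not attempt to detect such keys.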
-
-
Constructor Details
-
TwoStepsGOV3Function
protected TwoStepsGOV3Function(Iterable<? extends T> keys, it.unimi.dsi.bits.TransformationStrategy<? super T> transform, it.unimi.dsi.fastutil.longs.LongBigList values, File tempDir, BucketedHashStore<T> bucketedHashStore) throws IOException
Creates a new two-step function for the given keys and values.
- Parameters:
keys - the keys in the domain of the function.
transform - a transformation strategy for the keys.
values - values to be assigned to each key, in the same order as the iterator returned by keys; if null, the assigned value will be the ordinal number of each key.
tempDir - a temporary directory for the store files, or null for the standard temporary directory.
bucketedHashStore - a bucketed hash store containing the keys associated with their rank, or null; the store can be unchecked, but in this case keys and transform must be non-null.
- Throws:
IOException
-
-
Method Details
-
getLong
-
getLongBySignature
public long getLongBySignature(long[] signature)
-
size64
public long size64()
- Specified by:
size64 in interface it.unimi.dsi.fastutil.Size64
- Overrides:
size64 in class AbstractHashFunction<T>
-
numBits
public long numBits()
Returns the number of bits used by this structure.
- Returns:
- the number of bits used by this structure.
-
main
public static void main(String[] arg) throws NoSuchMethodException, IOException, com.martiansoftware.jsap.JSAPException
- Throws:
NoSuchMethodException
IOException
com.martiansoftware.jsap.JSAPException
-