Package it.unimi.dsi.sux4j.mph
Class TwoStepsLcpMonotoneMinimalPerfectHashFunction<T>
- java.lang.Object
-
- it.unimi.dsi.fastutil.objects.AbstractObject2LongFunction<K>
-
- it.unimi.dsi.sux4j.mph.AbstractHashFunction<T>
-
- it.unimi.dsi.sux4j.mph.TwoStepsLcpMonotoneMinimalPerfectHashFunction<T>
-
- All Implemented Interfaces:
it.unimi.dsi.fastutil.Function<T,java.lang.Long>,it.unimi.dsi.fastutil.objects.Object2LongFunction<T>,it.unimi.dsi.fastutil.Size64,java.io.Serializable,java.util.function.Function<T,java.lang.Long>,java.util.function.ToLongFunction<T>
public class TwoStepsLcpMonotoneMinimalPerfectHashFunction<T> extends AbstractHashFunction<T> implements it.unimi.dsi.fastutil.Size64, java.io.Serializable
A monotone minimal perfect hash implementation based on fixed-size bucketing that uses longest common prefixes as distributors, and store their lengths using aTwoStepsGOV3Function.This implementation should use a few less bits per elements than
LcpMonotoneMinimalPerfectHashFunction, but it is a bit slower as one or two additional functions must be queried.See the package overview for a comparison with other implementations. Similarly to a
GOV3Function, an instance of this class may be signed.- See Also:
- Serialized Form
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classTwoStepsLcpMonotoneMinimalPerfectHashFunction.Builder<T>A builder class forTwoStepsLcpMonotoneMinimalPerfectHashFunction.
-
Field Summary
Fields Modifier and Type Field Description protected intbucketSizeThe size of a bucket.protected intbucketSizeMaskThe mask forlog2BucketSizebits.protected GOV3Function<it.unimi.dsi.bits.BitVector>lcp2BucketA function mapping each longest common prefix to its bucket.protected TwoStepsGOV3Function<it.unimi.dsi.bits.BitVector>lcpLengthsA function mapping each element to the length of the longest common prefix of its bucket.protected intlog2BucketSizeFast.ceilLog2(int)ofbucketSize.protected longnThe number of elements.protected GOV3Function<it.unimi.dsi.bits.BitVector>offsetsA function mapping each element to the offset inside its bucket.protected longseedThe seed returned by theBucketedHashStore.static longserialVersionUIDprotected longsignatureMaskThe mask to compare signatures, or zero for no signatures.protected it.unimi.dsi.fastutil.longs.LongBigListsignaturesThe signatures.protected it.unimi.dsi.bits.TransformationStrategy<? super T>transformThe transformation strategy.
-
Constructor Summary
Constructors Modifier Constructor Description protectedTwoStepsLcpMonotoneMinimalPerfectHashFunction(java.lang.Iterable<? extends T> keys, long numKeys, it.unimi.dsi.bits.TransformationStrategy<? super T> transform, int signatureWidth, java.io.File tempDir)Creates a new two-steps LCP monotone minimal perfect hash function for the given keys.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description longgetLong(java.lang.Object o)longgetLongByBitVectorAndSignature(it.unimi.dsi.bits.BitVector bitVector, long[] signature)static voidmain(java.lang.String[] arg)longnumBits()Returns the number of bits used by this structure.longsize64()-
Methods inherited from class it.unimi.dsi.sux4j.mph.AbstractHashFunction
containsKey, size
-
Methods inherited from class it.unimi.dsi.fastutil.objects.AbstractObject2LongFunction
defaultReturnValue, defaultReturnValue
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface it.unimi.dsi.fastutil.objects.Object2LongFunction
andThen, andThenByte, andThenChar, andThenDouble, andThenFloat, andThenInt, andThenLong, andThenObject, andThenReference, andThenShort, applyAsLong, composeByte, composeChar, composeDouble, composeFloat, composeInt, composeLong, composeObject, composeReference, composeShort, get, getOrDefault, getOrDefault, put, put, remove, removeLong
-
-
-
-
Field Detail
-
serialVersionUID
public static final long serialVersionUID
- See Also:
- Constant Field Values
-
n
protected final long n
The number of elements.
-
bucketSize
protected final int bucketSize
The size of a bucket.
-
log2BucketSize
protected final int log2BucketSize
Fast.ceilLog2(int)ofbucketSize.
-
bucketSizeMask
protected final int bucketSizeMask
The mask forlog2BucketSizebits.
-
offsets
protected final GOV3Function<it.unimi.dsi.bits.BitVector> offsets
A function mapping each element to the offset inside its bucket.
-
lcpLengths
protected final TwoStepsGOV3Function<it.unimi.dsi.bits.BitVector> lcpLengths
A function mapping each element to the length of the longest common prefix of its bucket.
-
lcp2Bucket
protected final GOV3Function<it.unimi.dsi.bits.BitVector> lcp2Bucket
A function mapping each longest common prefix to its bucket.
-
transform
protected final it.unimi.dsi.bits.TransformationStrategy<? super T> transform
The transformation strategy.
-
seed
protected final long seed
The seed returned by theBucketedHashStore.
-
signatureMask
protected final long signatureMask
The mask to compare signatures, or zero for no signatures.
-
signatures
protected final it.unimi.dsi.fastutil.longs.LongBigList signatures
The signatures.
-
-
Constructor Detail
-
TwoStepsLcpMonotoneMinimalPerfectHashFunction
protected TwoStepsLcpMonotoneMinimalPerfectHashFunction(java.lang.Iterable<? extends T> keys, long numKeys, it.unimi.dsi.bits.TransformationStrategy<? super T> transform, int signatureWidth, java.io.File tempDir) throws java.io.IOException
Creates a new two-steps LCP monotone minimal perfect hash function for the given keys.- Parameters:
keys- the keys to hash.numKeys- the number of keys, or -1 if the number of keys is not known (will be computed).transform- a transformation strategy for the keys.signatureWidth- a signature width, or 0 for no signature.tempDir- a temporary directory for the store files, ornullfor the standard temporary directory.- Throws:
java.io.IOException
-
-
Method Detail
-
size64
public long size64()
- Specified by:
size64in interfaceit.unimi.dsi.fastutil.Size64- Overrides:
size64in classAbstractHashFunction<T>
-
numBits
public long numBits()
Returns the number of bits used by this structure.- Returns:
- the number of bits used by this structure.
-
getLong
public long getLong(java.lang.Object o)
- Specified by:
getLongin interfaceit.unimi.dsi.fastutil.objects.Object2LongFunction<T>
-
getLongByBitVectorAndSignature
public long getLongByBitVectorAndSignature(it.unimi.dsi.bits.BitVector bitVector, long[] signature)
-
main
public static void main(java.lang.String[] arg) throws java.lang.NoSuchMethodException, java.io.IOException, com.martiansoftware.jsap.JSAPException- Throws:
java.lang.NoSuchMethodExceptionjava.io.IOExceptioncom.martiansoftware.jsap.JSAPException
-
-