Class Variance
- All Implemented Interfaces:
DoubleConsumer, DoubleSupplier, IntSupplier, LongSupplier, DoubleStatistic, StatisticAccumulator<Variance>, StatisticResult
\[ \tfrac{1}{n-1} \sum_{i=1}^n (x_i-\overline{x})^2 \]
where \( \overline{x} \) is the sample mean, and \( n \) is the number of samples.
- The result is
NaNif no values are added. - The result is
NaNif any of the values isNaNor infinite. - The result is
NaNif the sum of the squared deviations from the mean is infinite. - The result is zero if there is one finite value in the data set.
The use of the term \( n − 1 \) is called Bessel's correction. This is an unbiased
estimator of the variance of a hypothetical infinite population. If the
biased option is enabled the normalisation factor is
changed to \( \frac{1}{n} \) for a biased estimator of the sample variance.
The accept(double) method uses a recursive updating algorithm based on West's
algorithm (see Chan and Lewis (1979)).
The of(double...) method uses the corrected two-pass algorithm from
Chan et al, (1983).
Note that adding values using accept and then executing
getAsDouble will
sometimes give a different, less accurate, result than executing
of with the full array of values. The former approach
should only be used when the full array of values is not available.
Supports up to 263 (exclusive) observations. This implementation does not check for overflow of the count.
This class is designed to work with (though does not require) streams.
Note that this instance is not synchronized. If
multiple threads access an instance of this class concurrently, and at least
one of the threads invokes the accept or
combine method, it must be synchronized externally.
However, it is safe to use accept
and combine
as accumulator and combiner functions of
Collector on a parallel stream,
because the parallel instance of Stream.collect()
provides the necessary partitioning, isolation, and merging of results for
safe and efficient parallel execution.
References:
- Chan and Lewis (1979) Computing standard deviations: accuracy. Communications of the ACM, 22, 526-531. doi: 10.1145/359146.359152
- Chan, Golub and Levesque (1983) Algorithms for Computing the Sample Variance: Analysis and Recommendations. American Statistician, 37, 242-247. doi: 10.2307/2683386
- Since:
- 1.1
- See Also:
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate booleanFlag to control if the statistic is biased, or should use a bias correction.private final SumOfSquaredDeviationsAn instance ofSumOfSquaredDeviations, which is used to compute the variance. -
Constructor Summary
ConstructorsModifierConstructorDescriptionprivateVariance()Create an instance.(package private)Creates an instance with the sum of squared deviations from the mean. -
Method Summary
Modifier and TypeMethodDescriptionvoidaccept(double value) Updates the state of the statistic to reflect the addition ofvalue.Combines the state of theotherstatistic into this one.static Variancecreate()Creates an instance.doubleGets the variance of all input values.static Varianceof(double... values) Returns an instance populated using the inputvalues.static VarianceofRange(double[] values, int from, int to) Returns an instance populated using the specified range ofvalues.setBiased(boolean v) Sets the value of the biased flag.Methods inherited from class Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface DoubleConsumer
andThenMethods inherited from interface StatisticResult
getAsBigInteger, getAsInt, getAsLong
-
Field Details
-
ss
An instance ofSumOfSquaredDeviations, which is used to compute the variance. -
biased
private boolean biasedFlag to control if the statistic is biased, or should use a bias correction.
-
-
Constructor Details
-
Variance
private Variance()Create an instance. -
Variance
Variance(SumOfSquaredDeviations ss) Creates an instance with the sum of squared deviations from the mean.- Parameters:
ss- Sum of squared deviations.
-
-
Method Details
-
create
Creates an instance.The initial result is
NaN.- Returns:
Varianceinstance.
-
of
-
ofRange
Returns an instance populated using the specified range ofvalues.Note:
Variancecomputed usingacceptmay be different from this variance.See
Variancefor details on the computing algorithm.- Parameters:
values- Values.from- Inclusive start of the range.to- Exclusive end of the range.- Returns:
Varianceinstance.- Throws:
IndexOutOfBoundsException- if the sub-range is out of bounds- Since:
- 1.2
-
accept
public void accept(double value) Updates the state of the statistic to reflect the addition ofvalue.- Specified by:
acceptin interfaceDoubleConsumer- Parameters:
value- Value.
-
getAsDouble
public double getAsDouble()Gets the variance of all input values.When no values have been added, the result is
NaN.- Specified by:
getAsDoublein interfaceDoubleSupplier- Returns:
- variance of all values.
-
combine
Description copied from interface:StatisticAccumulatorCombines the state of theotherstatistic into this one.- Specified by:
combinein interfaceStatisticAccumulator<Variance>- Parameters:
other- Another statistic to be combined.- Returns:
thisinstance after combiningother.
-
setBiased
Sets the value of the biased flag. The default value isfalse.If
falsethe sum of squared deviations from the sample mean is normalised byn - 1wherenis the number of samples. This is Bessel's correction for an unbiased estimator of the variance of a hypothetical infinite population.If
truethe sum of squared deviations is normalised by the number of samplesn.Note: This option only applies when
n > 1. The variance ofn = 1is always 0.This flag only controls the final computation of the statistic. The value of this flag will not affect compatibility between instances during a
combineoperation.- Parameters:
v- Value.- Returns:
thisinstance
-