Package astLib :: Module astStats

Module astStats

module for performing statistical calculations.

http://astlib.sourceforge.net

This module (as you may notice) provides very few statistical routines. It does, however, provide biweight (robust) estimators of location and scale, as described in Beers et al. 1990 (AJ, 100, 32), in addition to a robust least squares fitting routine that uses the biweight transform.

Some routines may fail if they are passed lists with few items and encounter a `divide by zero' error. Where this occurs, the function will return None. An error message will be printed to the console when this happens if astStats.REPORT_ERRORS=True (the default). Testing if an astStats function returns None can be used to handle errors in scripts.

For extensive statistics modules, the Python bindings for GNU R (http://rpy.sourceforge.net), or SciPy (http://www.scipy.org) are suggested.

Functions

[hide private]

float

mean(dataList)
Calculates the mean average of a list of numbers.

source code

float

weightedMean(dataList)
Calculates the weighted mean average of a two dimensional list (value, weight) of numbers.

source code

float

stdev(dataList)
Calculates the (sample) standard deviation of a list of numbers.

source code

float

rms(dataList)
Calculates the root mean square of a list of numbers.

source code

float

weightedStdev(dataList)
Calculates the weighted (sample) standard deviation of a list of numbers.

source code

float

median(dataList)
Calculates the median of a list of numbers.

source code

float

modeEstimate(dataList)
Returns an estimate of the mode of a set of values by mode=(3*median)-(2*mean).

source code

float

MAD(dataList)
Calculates the Median Absolute Deviation of a list of numbers.

source code

float

normalizdMAD(dataList)
Calculates the normalized Median Absolute Deviation of a list of numbers which, for a Gaussian distribution, is related to the standard deviation by 1.4826.

source code

float

biweightLocation(dataList, tuningConstant=6.0)
Calculates the biweight location estimator (like a robust average) of a list of numbers.

source code

float

biweightScale(dataList, tuningConstant=9.0)
Calculates the biweight scale estimator (like a robust standard deviation) of a list of numbers.

source code

float

biweightScale_test(dataList, tuningConstant=9.0)
Calculates the biweight scale estimator (like a robust standard deviation) of a list of numbers.

source code

dictionary

biweightClipped(dataList, tuningConstant, sigmaCut)
Iteratively calculates biweight location and scale, using sigma clipping, for a list of values.

source code

list

biweightTransform(dataList, tuningConstant)
Calculates the biweight transform for a set of values.

source code

float

gapperEstimator(dataList)
Calculates the Gapper Estimator (like a robust standard deviation) on a list of numbers.

source code

dictionary

OLSFit(dataList)
Performs an ordinary least squares fit on a two dimensional list of numbers.

source code

dictionary

clippedMeanStdev(dataList, sigmaCut=3.0, maxIterations=10.0)
Calculates the clipped mean and stdev of a list of numbers.

source code

dictionary

clippedMedianStdev(dataList, sigmaCut=3.0, maxIterations=10.0)
Calculates the clipped mean and stdev of a list of numbers.

source code

dictionary

clippedWeightedLSFit(dataList, sigmaCut)
Performs a weighted least squares fit on a list of numbers with sigma clipping.

source code

dictionary

weightedLSFit(dataList, weightType)
Performs a weighted least squares fit on a three dimensional list of numbers [x, y, y error].

source code

dictionary

biweightLSFit(dataList, tuningConstant, sigmaCut=None)
Performs a weighted least squares fit, where the weights used are the biweight transforms of the residuals to the previous best fit .i.e.

source code

list

cumulativeBinner(data, binMin, binMax, binTotal)
Bins the input data cumulatively.

source code

list

binner(data, binMin, binMax, binTotal)
Bins the input data..

source code

list

weightedBinner(data, weights, binMin, binMax, binTotal)
Bins the input data, recorded frequency is sum of weights in bin.

source code

tuple

bootstrap(data, statistic, resamples=1000, alpha=0.05, output='ci', **kwargs)
Returns the bootstrap estimate of the confidence interval for the given statistic. source code

tuple

runningStatistic(x, y, statistic='mean', binNumber=10, **kwargs)
Calculates the value given by statistic in bins of x. source code

numpy.array

slice_sampler(px, N=1, x=None)
Provides N samples from a user-defined discreet distribution.

source code

Variables

[hide private]

REPORT_ERRORS = True

__package__ = 'astLib'

Function Details

Module astStats

mean(dataList)

weightedMean(dataList)

stdev(dataList)

rms(dataList)

weightedStdev(dataList)

median(dataList)

modeEstimate(dataList)

MAD(dataList)

normalizdMAD(dataList)

biweightLocation(dataList, tuningConstant=6.0)

biweightScale(dataList, tuningConstant=9.0)

biweightScale_test(dataList, tuningConstant=9.0)

biweightClipped(dataList, tuningConstant, sigmaCut)

biweightTransform(dataList, tuningConstant)

gapperEstimator(dataList)

OLSFit(dataList)

clippedMeanStdev(dataList, sigmaCut=3.0, maxIterations=10.0)

clippedMedianStdev(dataList, sigmaCut=3.0, maxIterations=10.0)

clippedWeightedLSFit(dataList, sigmaCut)

weightedLSFit(dataList, weightType)

biweightLSFit(dataList, tuningConstant, sigmaCut=None)

cumulativeBinner(data, binMin, binMax, binTotal)

binner(data, binMin, binMax, binTotal)

weightedBinner(data, weights, binMin, binMax, binTotal)

bootstrap(data, statistic, resamples=1000, alpha=0.05, output='ci', **kwargs)

runningStatistic(x, y, statistic='mean', binNumber=10, **kwargs)

slice_sampler(px, N=1, x=None)

bootstrap(data, statistic, resamples=1000, alpha=0.05, output=`'ci'`, **kwargs)

runningStatistic(x, y, statistic=`'mean'`, binNumber=10, **kwargs)