Algorithmic Platform | Trading Strategies | OMS | AI | Trading Simulators

What does it mean to not have a “mean”? — Working with fat-tailed distributions

What does it mean to not have a “mean”? — Working with fat-tailed distributions

Categories: Technology, Trading | Tags: trading, infrastructure, high-frequency trading

Many traders instinctively look to the mean for value and variance for risk, but there are good reasons why these statistics are not always the most appropriate.

The log returns of an exchange traded asset generally show a smooth, unimodal distribution, and it is natural to look to the mean for an estimate of value, and the variance to estimate risk. These values are easily calculated from sample data, but it is important to consider how informative they are when it comes to understanding the underlying distribution.

Additionally, market returns are known to be fat-tailed, and failing to properly account for this can lead to drastically underestimating the likelihood of extreme events — as LTCM and others have found out to their cost.

The family of ⍺-stable distributions range from the Gaussian, through increasingly fat-tailed distributions, and naturally incorporate skew. They model financial data well, but there are some remarkable implications for the choice of summary statistics, since, for all but the Gaussian, the variance is undefined, and when the tails are sufficiently fat, even the mean is undefined!

Stable distributions

Stable distributions have been used to analyse financial data at least since Benoit Mandelbrot, of fractal fame, wrote on this as early as 1963.

The family of ⍺-stable distributions have their shape determined by a parameter ⍺, starting at 2 for a Gaussian, and producing increasingly fat tails as it decreases to 0.

The tails are asymptotically proportional to a power law.

Article Content

A second parameter, β, that ranges between +/-1, determines the skew of the distribution. When ⍺ = 2 this has no effect, but as ⍺ decreases, the introduced skew becomes more pronounced.

Article Content

Analysing samples from a stable distribution

Let’s generate some random samples from a symmetric stable distribution with ⍺ = 1.2, and calculate the sample mean and deviation. As the table below shows, even up to 1 million samples, the mean remains unclear, and the deviation does not converge.

Article Content

If we fatten the tails further by reducing ⍺ to 0.95, even the mean fails to converge.

Article Content

Clearly, these measures are not providing meaningful information about the underlying distribution.

How can this be overcome?

Replacing the mean and standard deviation by the corresponding robust measures, the median and median absolute deviation (MAD), results in the values shown opposite for the ⍺ = 1.2 samples.

Article Content

Both these measures converge nicely, and are effective alternatives.

As the second table shows, they remain effective with the fatter tails of ⍺ = 0.95.

Article Content

In the presence of skew, the median and MAD are able to capture and quantify the central shift and width of the distribution, as seen below when ⍺ = 1.4, β = 0.6

Article Content

What about real market data?

The following chart shows 1 day of BHP trading on the ASX. The corresponding histogram of 2 minute returns includes a stable distribution fit with ⍺ = 1.5, β = 0.24.

Article Content

Next are the same charts for 1 day of trading of BTCUSD. The stable distribution fit has ⍺ = 1.64, β = 0.04.

Article Content

It is clear that the stable distribution is quite successful in recognising the fat tails, much more so than a Gaussian, and hence modelling with these distributions is better suited for capturing the associated extra risk.

Written by Ken Wessen

You Might Also Like

We use cookies to enhance your experience. By continuing to browse, you accept our Corporate Social Responsibility Policy.