You currently DO NOT have javascript enabled, to view our site this must be enabled, read more

Education

How Statistics Can Confuse Investors

Common statistics hold plenty of pitfalls for investors; using properly constructed statistics, but in the absence of other relevant facts, can lead to poor decisions

Paul Kaplan 13 April, 2018 | 12:36AM

Quoting Benjamin Disraeli, Mark Twain famously quipped, “There are three kinds of lies: lies, damned lies, and statistics.” In the field of investments, in which we rely heavily on statistical analysis to evaluate the merits of investment strategies and products, Twain’s point is all too relevant.

Correlation Is Not Causation

One statistic that is all too easy to be misleading is correlation, starting with its definition. How many times have we heard that correlation measures the tendency for two variables to move up and down together? That’s not quite right. What correlation actually measures is the degree to which two variables, each in excess of its own average, are statistically related.

The other major mistake often made with respect to correlation is causation. Seeing that two variables are statistically related, we too easily jump to the conclusion that there is a causal relationship between them. But correlation and causation are two very different things.

Academic David Leinweber drove this point home in a paper in the mid-1990s that showed that there was a very high correlation between the annual level of the S&P 500 and the annual production of butter in Bangladesh. The author presented the results for the period 1981 through 1993 and found that the correlation over this period was about 87%.

I wanted to see if I could find a similar correlation for the S&P/TSX Composite over a more recent period. It didn’t take me long to discover that for the Canadian stock market, it’s the butter production of Brazil from 1994 through 2017 that does the trick.

I drew the level of the S&P/TSX Composite for each year as a red circle and a blue line that shows the level of the S&P/TSX Composite predicted by the annual butter production in Brazil. They appear to be strongly related. As in Leinweber’s example, the correlation is about 87%.

If you are thinking that there must be some trick to finding dairy production numbers that are correlated with stock market indexes, you’re right. The trick is to use trended variables.

Over any period of time, if two variables are trending upward, such as a stock market index and production in a growing dairy industry, they are positively correlated, even if there is no causal link between them.

The solution to trended variables is to remove the trends. With both stock market indexes and production levels, the natural way to remove the trends is to take the percentage rate of change of each variable. I did that for the annual levels of the S&P/TSX Composite and annual Brazilian butter production.

Correlation is not causation Common statistics hold plenty of pitfalls for investors

I then plotted the annual percentage rates of change of both of these variables. Now, we get the expected result of almost no correlation; just an insignificant 5%.

But even if we have constructed the variables properly, correlation is still not causation. If A and B are correlated, it could be that there is a third variable, C, related to both of them, that we cannot observe.

When trying to find causation, one must look to economic reasoning, not just statistical links. This is especially important to keep in mind when evaluating quantitative investment strategies, especially those behind new strategic-beta exchange-traded funds. Any causal explanation must be made apart from the statistics.

Watch Your Back Test

Correlation is not the only statistic in which statistical significance can get confused with causation. A common statistical procedure in investment management is back-testing. Back tests are run because the period of live performance is often quite limited, non-existent if the strategy has yet to go live. The idea is that if a strategy back-tests well, it should do well in real time.

But this can only be the case if there are causal links between what the strategy does at each point in time and its subsequent performance. There needs to be an economic rationale for the strategy before back-testing it. There are several issues that should be considered when evaluating a back test, especially one involving a factor-based strategy:

Positive Results Bias

As my colleague Ben Johnson says, “There is no such thing as a bad-looking back test.” When we are presented with impressive back-test results, we don’t know how many other strategies or factors were tried that didn’t turn out very well.

Zoo of Factors

There are so many factors to choose from that John Cochrane of the University of Chicago coined the term “zoo of factors.” Given this zoo of hundreds of factors, anyone with the right data set, a computer, and some programming knowledge could back-test any number of factor-based strategies in short order and report only the favourable results. As the late Nobel-prize winning economist Ronald H. Coase said, “If you torture the data long enough, it will confess.”

Simulation, Not Reality

The purpose of a back test is to see how a strategy would have performed in the past. But we can never know for sure how it would have done. Some back tests try to be more realistic by including assumptions such as trading costs, but many do not. But no matter what assumptions are made, a back test is a simulation, not a historical fact.

No Controlled Experiments

In the hard sciences, empirical work mainly consists of running controlled experiments in which the effects of factors other than the variable being studied are minimized or eliminated. Unfortunately, economists generally cannot perform controlled experiments. Instead, they do statistical analysis on historical data, with the assumption that the underlying processes that generate the data remained the same over time. This is the assumption of stationarity. Back-testing is an example of this kind of analysis.

Beware Statistics

It is all too easy to calculate nonsensical statistics, such as the correlation between a stock market index and butter production. Furthermore, statistical analysis, in the absence of economic analysis, can be misused to demonstrate almost anything, as can happen with back tests that have no linkage to economic reason.

Finally, using properly constructed statistics, but in the absence of other relevant facts, can lead to poor decisions, such as recommending a fund to an investor who lacks the patience to hold it through the rough patches. By paying careful attention to these issues, we can prevent statistics from being the third kind of lie.

About Author

Paul Kaplan Paul Kaplan is Director of Research for Morningstar Canada.

About Us

Our Story

Company Website

Our Signature Methodologies

Career

Connect With Us

Global Contacts

Advertising Opportunities

Get Help

FAQ

Ask Us

The Morningstar Star Rating for Stocks is assigned based on an analyst's estimate of a stocks fair value. It is projection/opinion and not a statement of fact. Morningstar assigns star ratings based on an analyst’s estimate of a stock's fair value. Four components drive the Star Rating: (1) our assessment of the firm’s economic moat, (2) our estimate of the stock’s fair value, (3) our uncertainty around that fair value estimate and (4) the current market price. This process culminates in a single-point star rating that is updated daily. A 5-star represents a belief that the stock is a good value at its current price; a 1-star stock isn't. If our base-case assumptions are true the market price will converge on our fair value estimate over time, generally within three years. Investments in securities are subject to market and other risks. Past performance of a security may or may not be sustained in future and is no indication of future performance. For detail information about the Morningstar Star Rating for Stocks, please visit here

Quantitative Fair Value Estimate represents Morningstar’s estimate of the per share dollar amount that a company’s equity is worth today. The Quantitative Fair Value Estimate is based on a statistical model derived from the Fair Value Estimate Morningstar’s equity analysts assign to companies which includes a financial forecast of the company. The Quantitative Fair Value Estimate is calculated daily. It is a projection/opinion and not a statement of fact. Investments in securities are subject to market and other risks. Past performance of a security may or may not be sustained in future and is no indication of future performance. For detail information about the Quantiative Fair Value Estimate, please visit here

The Morningstar Medalist Rating is the summary expression of Morningstar’s forward-looking analysis of investment strategies as offered via specific vehicles using a rating scale of Gold, Silver, Bronze, Neutral, and Negative. The Medalist Ratings indicate which investments Morningstar believes are likely to outperform a relevant index or peer group average on a risk-adjusted basis over time. Investment products are evaluated on three key pillars (People, Parent, and Process) which, when coupled with a fee assessment, forms the basis for Morningstar’s conviction in those products’ investment merits and determines the Medalist Rating they’re assigned. Pillar ratings take the form of Low, Below Average, Average, Above Average, and High. Pillars may be evaluated via an analyst’s qualitative assessment (either directly to a vehicle the analyst covers or indirectly when the pillar ratings of a covered vehicle are mapped to a related uncovered vehicle) or using algorithmic techniques. Vehicles are sorted by their expected performance into rating groups defined by their Morningstar Category and their active or passive status. When analysts directly cover a vehicle, they assign the three pillar ratings based on their qualitative assessment, subject to the oversight of the Analyst Rating Committee, and monitor and reevaluate them at least every 14 months. When the vehicles are covered either indirectly by analysts or by algorithm, the ratings are assigned monthly. For more detailed information about these ratings, including their methodology, please go to here

The Morningstar Medalist Ratings are not statements of fact, nor are they credit or risk ratings. The Morningstar Medalist Rating (i) should not be used as the sole basis in evaluating an investment product, (ii) involves unknown risks and uncertainties which may cause expectations not to occur or to differ significantly from what was expected, (iii) are not guaranteed to be based on complete or accurate assumptions or models when determined algorithmically, (iv) involve the risk that the return target will not be met due to such things as unforeseen changes in changes in management, technology, economic development, interest rate development, operating and/or material costs, competitive pressure, supervisory law, exchange rate, tax rates, exchange rate changes, and/or changes in political and social conditions, and (v) should not be considered an offer or solicitation to buy or sell the investment product. A change in the fundamental factors underlying the Morningstar Medalist Rating can mean that the rating is subsequently no longer accurate.

For information on the historical Morningstar Medalist Rating for any managed investment Morningstar covers, please contact your local Morningstar office.

For more detailed information about conflicts of interest, including EU MAR disclosures, please see the “Morningstar Medalist Rating Analyst Conflict of Interest & Other Disclosures for EMEA”here

Select Your Site Edition To make sure the site is relevant to you, we need to know if you’re an individual investor or a financial professional.