0

In the book "An Introduction to Statistical Learning with Applications in Python, Trevor Hastie et al., Springer", there's the following paragraph:

The left-hand panel of Figure 1.2 displays two boxplots of the previous day’s percentage changes in the stock index: one for the 648 days for which the market increased on the subsequent day, and one for the 602 days for which the market decreased.

Percentage change in S&P Index

Figure 1.2 (the left-hand panel)

It's not obvious to me why then the authors concluded:

The two plots look almost identical, suggesting that there is no simple strategy for using yesterday’s movement in the S&P to predict today’s returns.

Suppose in the boxplot for the "Up" bucket on the right, the median drops to $-4$ for example, then my understand is that in this case there's some strategy to predict today's return based on yesterday's movement. If so then what is that and why it's not applicable for the case in the image?

Tran Khanh
  • 155
  • 6

1 Answers1

2

If the median for the 'up' bucket is much lower than for the 'down' bucket, this means that days that have a negative market return are more likely to be followed by a day with a positive return. This information can be used to predict the market (and possibly make a profit), i.e. buy/go long for the next day if the current day's return is negative and sell/go short for the next day if the current day's return is positive. Since in this image the distributions for the return on the previous day looks roughly the same it contains no information on what you can expect the market to do the day after.

Oxbowerce
  • 7,472
  • 2
  • 8
  • 23
  • The boxplots are for 648 days and 602 days, meaning that the days are fixed regardless of where the medians are, so how is it possible that "this means that days that have a negative market return are more likely to be followed by a day with a positive return"? – Tran Khanh Mar 15 '24 at 01:42