What decisive factors have been driving the stock markets since 2017? IT IS NOT Economic Indicators


Abstract

Stock prices fluctuate according to changes in supply and demand. Investopedia, known as an extensive collection of online financial resources, listed the most common catalysts driving daily stock price movements: fundamentals, technical factors, and market sentiment. This study will be analyzing which factors mostly correlate with fluctuations in stock prices. The independent variables this study will use are EPS and PE ratios, which will be representing the fundamental aspects of stock prices; technical factors, represented by CPI and GDP numbers as well as substitute asset classes demand (government debt and median price of houses sold), and finally the VIX index which will monitor sentiment. The results of the study will serve the purpose of not only helping people better understand market fluctuations and which factors have a higher influence on stock prices but also create a model that can predict how prices will behave under certain market conditions. Anyway, this study considers the S&P 500 daily share price as the benchmark to representing the dependent variable (stock prices) simply because the index is considered the best representation of America’s stock market, with the country’s 500 biggest companies by market cap being quoted on the exchange. In essence, the study results show that although some results proved to be as expected, such as stock prices being positively correlated to positive economic news, many results proved to reject initial hypotheses, such as the VIX and alternative asset demand being positively correlated to stock prices or fundamental factors having a negative correlation. Lastly, economic news and fundamentals proved to be less influential towards stock prices than what was previously believed.

Introduction

            Although many people consider the stock market a way to easy riches, being profitable in the market is an enigma very few seasoned investors have managed to solve. According to a study led by eToro, a large European multinational broker, 80% of their 27 million active users lost money in the stock market, averaging -36.3% returns over 12 months (Lyck, 2020). Therefore, the stock market is all but easy to trade, reasons ranging from stock prices behaving in a random fashion, investors misinterpreting information, or the efficient market theory keeping traders from making money long-term. Whichever the reason, this study strives to understand which factors have the largest impact on S&P 500 prices, so that investors can improve their chances of better predicting price movements according to the different fundamental, technical and sentiment market environments.

            The next section will have a literature review that will analyze any past research led on the topic of stock price fluctuations and their catalysts. Knowing additional information on the topic may make it easier to form hypotheses about the results of this study. Secondly, there will be a section that discusses the study regression model, including any information related to data and methodology. Thirdly, will come a section that discusses the regression results and analyze any special characteristics that could have tweaked the results. Lastly, we will interpret the final data and understand what conclusion can be made about stock market price variations from these results.

Literature Review

What drives stock price movements? Review of Financial Studies

            A journal written by Chen et al. in 2013, addresses the most important factor that the public believes move stock market prices: expected earnings. In fact, the article does this by analyzing how sensitive stock prices are to expected company cash flows by computing numbers using the implied cost of equity capital (ICC) method at a firm and aggregate (market) level to generalize results to the wider stock market indices. One upside of this study is that the ICC method the researchers employ during their study is a superior model to the predictive regression method which they discuss about in their journal but refuse to use due to the inefficiencies it may produce to the study results. For example, the predictive regression method is sensitive to sample period, predictive variables, and cash flow inputs. Despite that, a clear inefficiency of the journal is the independent variables it includes, which may not be directly comparable to the variables used in the research of this paper. For example, Chen et al. use expected cash flows as the explanatory variable, while this study uses EPS, and PE ratios. Another issue that might impact results is the fact that the study employs cash flow forecasts rather than hard data to lead research, making the study unpredictable to a certain extent.  

The results of the study show that stock returns are greatly impacted by the cash flow news component with the greatest impact on price for investment horizons over two years.

What moves stock prices?

            A second example of a paper written by David et al. that tries to estimate the variance in stock prices according to certain kinds of news concludes that stock prices are greatly affected by a diverse variety of catalysts. Firstly, non-public information greatly impacts stock prices as larger price moves mainly occur during days without any identifiable major news release, casting doubt that predictable stock catalysts mentioned above, such as future cash flows are as effective an indicator for price moves. Thirdly, it is shown that non-economic news has no significant impact on stock prices, therefore showing that the main news drivers of stock prices are economic data releases. The study yields the abovementioned results by physically analyzing how major news impacted S&P prices in the period ranging from 1941 and 1987, but also by using models to understand how much of a variation in stock prices is explained by news, mainly through using the coefficient of variation, and VAR evidence.

            In essence, this paper by David et al. relates to my study as it gives an insight into a different catalyst that potentially impacts stock prices, news type. Additionally, by showing that economic data is the leading news driver to stock prices implies that the dependent variables, such as CPI and GDP numbers employed in my study will most likely be positively related and be significant drivers to changes in dependent variable (S&P500 prices).

Data and Methodology

            This study stives to analyze the best regression model to estimate the correlation between the explanatory variables mentioned in the abstract and the dependent variable (S&P500 price). After having designed the most appropriate regression model for the price, this study strives to understand which of the explanatory variables causes the largest change in y (is the leading catalyst of stock price movement) with the goal of allowing investors to better understand which factors drive stock price movements and to better predict stock price behavior under certain macroeconomic conditions. As shown by the sources quoted in the literature review, stock prices are highly correlated to future cash flow numbers and economic news, while being, virtually, unaffected by non-economic news. After having compiled the seven most accepted catalysts to stock price changes and having analyzed past research on the topic, I expect nominal GDP, EPS, PE ratios, and CPI numbers to have the largest positive correlation to SP500 price. The study uses OLS methodology (ordinary least squares) to yield the best predictive model through minimizing the sum of squared residuals and employing a 5% significant level to decide if data in the prediction model is statistically significant.

            Figure 1 shows the daily S&P 500 price since November 2017. The data is retrieved from finance.yahoo.com and is measured in US dollars per share of S&P500. As shown in figure 1, the S&P price frequency is clustered mostly between the 2000-3000 mark. However, figure 2 shows how in the years the price of the index entered a rally that carried its price from a low of 2237.4 to a high of 4796.56.

            To understand price change, various datasets were gathered from time series data sources for the period 2017-2022. Stocks are moved by three possible catalysts fundamentals, technical factors, and market sentiment signs. Major fundamental indicators include earnings per share (EPS) and price-to-earnings ratio (PE). Data for both EPS and PE ratios are taken from Ycharts.com, a source that specializes in time series financial data, and are shown monthly since 2017. Both financial ratios are generated from only S&P500 companies, so that overall statistics of the index can better emulate original data. Figures 3 and 4 show the distribution histogram for the S&P500 PE ratio and EPS ratio accordingly. Like Figure 1, both histograms are right-skewed, meaning that at a first glance, S&P500 prices might be largely linked to these financial ratios. 

            Technical indicators include CPI numbers, nominal GDP numbers, as well as alternative asset demand, such as public debt and real estate prices. The CPI data was retrieved from the U.S. Bureau of Labor Statistics and is measured on a percent basis monthly since 2017. Data for nominal GDP was taken from the Federal Reserve Economic Data website and is measured in billions of dollars on a quarterly basis that dates to January 2017. Past research shows that economic news has a great impact on stock prices, so I expect GDP data and CPI numbers to have a large impact on stock prices. Data for alternative asset demand is retrieved from Savills.com which tries to estimate the market capitalization of different assets. As shown by the source, total equities value stands at $109.2 trillion with both real estate and the debt obligations market dwarfing its size with a value of $258 trillion and $123.5 trillion accordingly. Therefore, it is important to understand the relationship between equities demand and alternative asset demand to see if a change in demand for other assets affects demand for equities and, therefore, changes stock prices. Data for real estate prices is retrieved from Ycharts.com and the US existing home median sales price in USD since 2017 is used to measure the variable. While to understand the variable of debt obligations, the data is retrieved from the Federal Reserve Economic Data Website that supplies the information for daily nominal outstanding debt (notes and bonds) on a weekly average measured in millions of dollars. The idea with alternative asset demand is that with an increasing alternative asset demand, less money is available to invest in equities, which drag S&P500 prices lower. Therefore, the correlation will be negative.

            The VIX is an index that represents S&P500 sentiment through expectations of the relative strength of short-term price changes, which also allows traders to make a 30-day projection of future volatility. The VIX is also called the fear index because volatility increases as investors start doubting market conditions, therefore, making the VIX a valuable sentiment indicator. VIX information is retrieved from Cboe.com and is measured monthly in points which are calculated by combining the weighted prices of index put and call options for the index in a period of 30 days. The idea is that the VIX will be negatively correlated with the S&P500 because when investors are fearful, VIX prices tend to increase while S&P500 prices tend to be suppressed by lower demand.  

Summary statistics for all explanatory variables are shown in figure 5.

Regression Results

            Although the data and methodology section stated that the regression model for this paper would include seven explanatory variables, there were complications upon testing that forced the model to change to accommodate and improve the results. The final regression model is:

*

*The regression model accounts for the exclusion of influential values found in the study

Analysis of assumptions of the error term

A series of tests were run to test for the violation of the four assumptions of the error term used in the regression model.

Linearity Assumption

            Figure 6 shows the graph of residuals of the regression to fitted values, a scatterplot that shows no correlation between the points. This means the explanatory variables have a linear relationship.

Homoskedasticity

            The test for homoskedasticity includes a scatterplot of residuals that shows no cone-shaped formation and a Breusch-Pagan test that detects no heteroskedasticity. Figure 8 shows the test result.

Normality Assumption

            Figure 9 shows the results of the Shapiro-Wilk test for normality. Unfortunately, the result allows us to reject the null hypothesis of normality, which does not allow us to confirm normality for the model. Although normality cannot be proved through test, normality can be proven through the Central Limit Theorem. For our model, we should have about 70 observations to prove normality through the theorem. Therefore, normality is proven through the Central Limit Theorem.

Independent Error Terms
            The Durbin Watson Test yields a result that rejects the null hypothesis of no autocorrelation. A possible solution to this is to use either first differing or a Prais-Winsten transformation. Nevertheless, both transformed results yield indeterminate results, meaning that we cannot assume that the error terms are independent.  

Influential Observations

            The regression model also included three influential observations, all of which were signaled a Cook’s D statistic test as shown in figure 9. The three most influential values were also justified in figure 7, where 3 points are marked as outliers outside the -2 to 2 standard deviations in the graph of standardized residuals. These influential observations were removed from the regression model.

Multicollinearity

            At first, multicollinearity was a problem in the regression model that included all seven explanatory variables initially mentioned in the paper. However, after careful examination of the regression model and variables, GDP, EPS, and Realestateprice variables were responsible for the most collinearity in the model. These variables could be omitted from the final model to fix multicollinearity, while also being replaced by other variables that emulated the former’s implications. For example, EPS numbers are like PE numbers and the relationship the regression model analyzes in the latter, can be expected in EPS alike, both positive. In other words, EPS can be expected to behave similarly to PE ratios in the regression model. Other examples include the demand for an alternative asset class like real estate where demand can easily be measured from bonds or the economic news characteristic of GDP where good economic data can pretty much be observed in CPI numbers and where the use of both is unnecessary, especially considering that GDP was one of the explanatory variables that skewed multicollinearity the most. In essence, many of the explanatory variables in the original model could be described as superfluous as they likely reiterated results another closely related variable already included in the model would. Final multicollinearity numbers were tested through Stata and the results can be seen in figure 10. No variable has multicollinearity over 5.

Results

            First, figure 12 shows that all variables, except for CPI, are statistically significant at a 5% confidence level. However, by excluding the CPI variable from the model, r-squared would be negatively impacted, so the variable will not be removed. Figure 11 regression (4) shows the full model for this paper. The regression results suggest that the model explains about 60% of the variation in S&P500 price from the explanatory variables. As can be seen from the figure, adding the other explanatory variables that were removed later in the study make the results extremely accurate with an r-squared of 0.895 in regression (1). However, the problem would arise with proving most OLS assumptions, yielding a model that is accurate on the surface but impractical. In the full regression model, the only explanatory variable that is positively correlated with the logarithm of S&P500 prices is the amount of bonds, such that for every million dollars increase in the amount of bonds outstanding, the logarithm of the price of each S&P500 share increases by 0.00000148 percentage points. Despite being negatively correlated, regression (4) shows that the explanatory variables most strictly related to the logarithm of S&P500 prices are VIX, PE ratios, and CPI, in the sequence in which they are listed. For every dollar increase in the logarithm of SP500 prices, there was a decrease of 0.00245 percentage points in VIX.

            Regression (3) eliminates the logarithm for the dependent variable, showing how explanatory variables are correlated to S&P prices. Firstly, it is unexpected to see PE ratios and EPS negatively correlated to the S&P. Expectations were that as S&P prices increased, so would current and future earnings in an optimistic investing environment. A possible reason for this behavior could be that with higher prices, PE ratios are getting dragged down by decreasing expected earnings per share, which are affected either by an environment which greatly overestimates companies’ net income during overoptimistic stock market runs, where companies are, additionally employing more of their capital and earnings to expand activities and start new ventures or where earnings per share are decreased by companies issuing more stock during periods of high prices (market rallies). Secondly, economic news seems to have been correlated to a smaller extent compared to what was predicted, while the sentiment indicator VIX was the most correlated. The reason for this behavior could be that economic data does not reflect the complete picture for investor sentiment, after all different investors might interpret economic data differently, while the VIX truly shows how investors feel towards the current market.

Conclusion

            This study sought to find which variable, among seven that investors consider the most impactful on stock prices, were the most correlated to the S&P500. The results of this study could be used by investors to not only understand what macroeconomic conditions move jointly with stock prices, but also better understand market conditions and what can be expected of stock prices under different economic environments.  The explanatory variables were nominal GDP numbers, CPI numbers, average S&P company EPS, average S&P company PE, VIX prices, median house sales price, and outstanding public debt. Results concluded that the sentiment indicator, VIX, had the largest correlation to stock price movements, followed by fundamental factors, such as PE ratios, economic news, such as CPI, and alternative asset demand represented by bonds. Past research was used to predict that economic news and expected future earnings had a positive correlation to stock prices. Nonetheless, the results showed that the relationship between these factors was weaker than expected and sentiment indicators, such as VIX was better correlated. One finding that also did not align with one of the initial hypotheses was the impact of alternative asset demand. Initially, alternative asset demand was supposed to have a large correlation to stock prices as investors must choose between either buying stocks or investing in alternative assets, such as real estate or public debt. The results of the regression proved otherwise. Although correlation between alternative asset demand and stock prices was weak, the correlation was positive nonetheless, meaning that stock prices increased as investors demanded more bonds and real estate investments. A reason for this might be that either investors who invested in other classes, also subsequently invested in stocks to diversify, hereby driving up stock prices or people tended to invest in alternative asset classes when the investing environment was ideal at the same time that stock prices were increasing.

            Finally, the research also showed many shortcomings. Firstly, research quoted in the literature review that was outdated, articles written in 1988 and 2013, which analyzed stock prices in times, where a different environment might have impacted stock prices. In other words, the research in this paper was led with data from the period 2017-2022, which might not align the research led in years 1988 and 2013, meaning the predictions made with outdated articles might be inaccurate. Secondly, the economic news used in the model, GDP, and CPI, are very limited in showing their impact on stock prices for many reasons. First, although GDP and CPI are considered some of the most important economic indicators of the stock market, different market conditions may weigh other economic indicators more heavily at different times. Second, other non-public economic or government manipulated data might be affecting stock prices, which might be affecting stock prices without letting investors know. Third, the regression model posed a challenge, whether to eliminate some explanatory variables which posed a problem with the properties of ordinary least squares or whether to leave these variables to significantly increase the r squared.

References

Assets: Securities held outright: U.S. Treasury Securities: Notes and bonds, nominal: Week average. FRED. (2022, December 1). Retrieved December 6, 2022, from https://fred.stlouisfed.org/series/WSHONBNA

Chen, L., Da, Z., & Zhao, X. (2013). What drives stock price movements? Review of Financial Studies, 26(4), 841–876. https://doi.org/10.1093/rfs/hht005

Cutler, D. M., Poterba, J. M., & Summers, L. H. (1988). WHAT MOVES STOCK PRICES? . NBER WORKING PAPER SERIES .

Gross domestic product. FRED. (2022, November 30). Retrieved December 6, 2022, from https://fred.stlouisfed.org/series/GDP

Harper, D. R. (2022, July 28). What drives the stock market? Investopedia. Retrieved December 4, 2022, from https://www.investopedia.com/articles/basics/04/100804.asp

Historical Data for Cboe VIX® Index and Other Volatility Indices . Cboe Global Markets. (2022). Retrieved December 6, 2022, from https://www.cboe.com/tradable_products/vix/vix_historical_data/

Larson, S. J., & Madura, J. (2003). What drives stock price behavior following extreme one-day returns. Journal of Financial Research, 26(1), 113–127. https://doi.org/10.1111/1475-6803.00048

Lyck, M. (2020, December 28). Why 80% of day traders lose money. Medium. Retrieved December 4, 2022, from https://marklyck.medium.com/why-80-of-day-traders-lose-money-78d51b10fe25#:~:text=According%20to%20the%20stock%20platform,quitting%20within%20just%20two%20years.

S&P 500 Earnings Per Share. S&P 500 earnings per share. (2022, July 1). Retrieved December 6, 2022, from https://ycharts.com/indicators/sp_500_eps

S&P 500 PE Ratio by Month. S&P 500 pe ratio by month. (2022). Retrieved December 6, 2022, from https://www.multpl.com/s-p-500-pe-ratio/table/by-month

Tostevin, P., & Oakley, M. (2021, December 7). The total value of Global Real Estate. Savills Impacts. Retrieved December 7, 2022, from https://www.savills.com/impacts/market-trends/the-total-value-of-global-real-estate.html

U.S. Bureau of Labor Statistics. (2022). Consumer price index historical tables for U.S. city average. U.S. Bureau of Labor Statistics. Retrieved December 6, 2022, from https://www.bls.gov/regions/mid-atlantic/data/consumerpriceindexhistorical_us_table.htm

US Existing Home Median Sales Price. US existing home median sales price. (2022). Retrieved December 6, 2022, from https://ycharts.com/indicators/us_existing_home_median_sales_price

Yahoo! (2022, December 6). S&P 500 (^GSPC) historical data. Yahoo! Finance. Retrieved December 6, 2022, from https://finance.yahoo.com/quote/%5EGSPC/history/?guccounter=1&guce_referrer=aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8&guce_referrer_sig=AQAAALK1EOooWZOsuEhwGxMWtKYaErDH9u4V_waeemkJYEat5Pvpxl2jeE3CS1r5G6Uf79brojhyy5ArIk1S7V0ck98TPOhK28g-F1wRzRQOr0mcC-hribVCfvKKz5_yEHxxtGYf2ZtzvkTTuStEUrDFM63PEK7Ah835Ic4-ARKX2yE2


Appendix

Figure 1- Histogram of daily price of S&P500 index since November 2017

Figure 2- Summary statistics of S&P500 price in the period 2017-2022

Variable

Obs

Mean

Std. dev

Min

Max

Sp500price

1259

3424.475

674.2795

2237.4

4796.56

Figure 3- Average S&P500 PE ratio over the period 2017-2022

Figure 4- Average S&P500 EPS ratio over period 2017-2022

Figure 5- Summary statistics for regression model explanatory variables

Variable

n

Mean

S.D.

Min

.25

Mdn

.75

Max

Sp500price

1259

3424.47

674.28

2237.40

2822.48

3230.78

4023.61

4796.56

real estate price

50

3.2+05

49362.06

2.5+05

2.7+05

3.1e+05

3.6e+05

4.1e+05

VIX

2191

99.41

17.56

61.76

86.60

96.28

110.29

207.59

Gdp

23

21683.48

1919.17

19148.19

20155.49

21362.43

23046.93

25663.29

CPI

70

261.39

15.24

242.84

251.59

256.87

269.20

298.01

PE

72

24.54

4.56

19.38

21.65

23.20

25.16

39.26

EPS

50

27.94

9.15

11.88

21.72

24.87

33.02

53.94

Bonds

1041

1.7e+06

1.3e+06

3.9e+05

4.6e+05

1.6e+06

2.3e+06

5.0e+06

Figure 6- Residual to Fitted Values (linearity test)

Figure 7- Standardized Residuals to fitted values (Heteroskedasticity test)

Figure 8- Breusch-Pagan test for heteroskedasticity

Figure 9- D Cook’s statistic test

Figure 10- Multicollinearity Results for Regression Model

Variable

VIF

1/VIF

PE

1.12

0.895076

VIX

1,10

0.906977

CPI

1.07

0.932951

Bonds

1.06

0.946335

Mean VIF

1.09

 

 

Figure 11- Full regression model results

Variables

(1)

sp500price

(2)

sp500price

(3)

sp500price

(4)

sp500price

Vix

0.222

-0.206

 

0.772

-0.00245***

Cpi

0.1

 

0.268

-7.14e-05

Bonds

0.0116***

0.00820***

0.00964***

1.48e-06***

Eps

-0.359

 

-0.0141

 

Pe

-6.484*

 

-8.669**

-0.00130**

Gdp

0.00949**

0.00603

0.0109**

 

Realestateprice

0.000520*

0.000540**

 

 

Constant

-2,184*

-877.2

-1,290

7.537***

Observations

22

22

22

66

R-squared

0.895

0.856

0.861

0.358

Standard errors in parentheses

*** p<0.01, ** p<0.05, * p<0.1

 

 

 

Figure 12- Regression results table



Comments

Popular posts from this blog

What the CFA Franc is about: A quick summary

Bitcoin Posts a Buy Signal

Assessing the World Implications of Russia's Black Sea Grain Deal Withdrawal