International evidence on stock returns and dividend growth predictability using dividend yields

ABSTRACT This paper examines stock returns and dividend growth predictability using dividend yields in seven developed markets: United States of America (US), United Kingdom (UK), Japan, France, Germany, Italy, and Spain. Altogether, these countries account for around 85% of the Morgan Stanley Capital International (MSCI) World Index. The use of the long time series with up-to-date data allows the comparison not only between countries, but also across periods, putting into perspective the existence or not of noticeable changes since the 1980’s. The majority of the literature on this topic is US-centered. This emphasis on the US is even more pronounced when it comes to examining the relationship between the dividend unpredictability and dividend smoothing. There is also the need to know if the relationships already documented for the post-Second World War (WWII) period still hold during the last three decades, when stock markets were subjected to a high level of turbulence worldwide. The relationship between dividend yields and returns and dividend growth is central to understand the functioning of capital markets, and has considerable implications for capital asset pricing and investment strategies. Overall, the results show that even for developed capital markets there is no clear pattern on the predictive ability of dividend yields on stock returns and dividend growth, instead these relationships seem to be time-dependent and country-specific. For each country, the predictive ability of the dividend yield is examined in a first-order structural VAR framework by applying bootstrap significance tests and the degree of dividend smoothing is assessed using four partial-adjustment models for the dividend behavior. Additionally, an out-of-sample analysis is conducted using pseudo-R2 and a normal mean squared prediction error (MSPE) adjusted statistic. For the post-WWII period, returns are predictable, but dividends are unpredictable in the US and the UK, while the opposite pattern is observed in Spain and Italy. In Germany, there is some evidence of short-term predictability for both returns and dividends, while in France only returns are predictable. In Japan, neither variable can be forecasted. The dividend smoothing results show that dividends are more persistent in the US and the UK, however, there is no clear connection between dividend smoothness and predictability for the other countries. An important conclusion to retain from the out-of-sample analysis is that the predictability of returns after the WWII, especially present in the US, appeared to have been missing in the last three decades, most probably due to the turmoil experienced by the stock markets during this last period.


INTRODUCTION
The relationship between dividend yield and returns and dividend growth is a central issue to understand the functioning of capital markets and has considerable implications for capital asset pricing and portfolio investment strategies. This research topic is not new, but still there are no consensual general findings and the discussion on if the main results obtained for the United States of America (US) are applicable to other countries remains quite actual.
For the US, the mainstream of the literature find that the dividend yield has some predictive power on returns, but the predictive power decreases substantially when the dividend growth is the explained variable. A possible explanation proposed in the literature is the dividend smoothing practices undertaken by US firms.
This paper contributes to the existing literature by providing additional international evidence on the predictive power of the dividend yield on returns and dividend growth. More precisely, the paper analysis these relationships for the US, United Kingdom (UK), Japan, France, Germany, Italy, and Spain, using the longest time series available. The aim of the paper is not to construct significant and robust forecasts of returns and dividend growth, which, arguably would be better achieved by including other economic and financial variables as regressors, than just only the dividend yield. This paper is in line with Cochrane (2008), who constructs a joint test for the return and dividend growth predictability using just the dividend yield as the regressor. We also try to figure out if there is some pattern relating dividend smoothness measures and dividend growth predictability, as in Chen, Da and Priestley (2012). Finally, to give more robustness to our results, we conduct an out-of-sample analysis on the forecasting ability of the dividend yields. Although the methodologies used in our paper have been used elsewhere, our paper has some strengths. Namely, it uses up-to-date data including the post 2007 financial crisis period, it compares the results across the most important stock markets in the world and not only for the US or the UK, and it combines in-and out-of-sample analyses.
The remaining of this study is structured into five sections. Section 2 shows a brief literature review on the topic of dividend yield predictive power. Section 3 presents the data and provides some descriptive statistics. Section 4 outlines the basic theoretical concepts on the relationships between dividend yield and returns and dividend growth, the models aimed to assess dividend smoothing, and the methodology used to measure the out-of-sample forecast ability of the dividend yield. Section 5 shows the results and section 6 concludes de paper.

LITERATURE REVIEW
In the last decades, many authors have studied the predictability of returns and dividend growth by the dividend yield [see, for instance, Ferson and Harvey (1991), Campbell and Ammer (1993), Cochrane (2001Cochrane ( , 2008, Lettau and Ludvigson (2005), Lettau and Van Nieuwerburgh (2007), and, more recently, Maio and Santa-Clara (2015), Golez and Koudijs (2018), Jagannathan and Liu (2019), le Bris, Goetzmann, and Pouget (2019), and Piatti and Trojani (2019)]. The main finding is that the dividend yield strongly predicts stock returns, but it does not predict dividend growth rates. However, this result has been increasingly contested by other papers, showing that dividend yields also predict dividend growth. Campbell and Shiller (1988) report that in the US the dividend-price ratio significantly forecasted one-year dividend growth until 1986. Ang (2002) reaches the same conclusion using data until 2000, however, he also finds that for horizons beyond one year there is no significant dividend growth predictability by the dividend-price ratio. A more recent study by Chen (2009) presents some evidence that the dividend yield did, in fact, predict aggregate US dividend growth in the period before the Second World War (WWII), but this predictive power vanishes away in the post-war period. Binsbergen et al. (2010), using the present-value model framework, show that US dividends are predictable by the whole history of dividend yields. Piatti and Trojani (2019) apply bootstrap tests to the monthly US stock market data for the period 1946 to 2010, and confirm that the postwar return (dividend) predictability evidence in benchmark present-value models is similarly strong (weak) as in standard predictive regression tests, consistently with the  Maio and Santa-Clara (2015) point out that the generalized conviction that variation in dividend yields is exclusively related to expected returns and not to expected dividend growth, although valid for the aggregate stock market, is not true for portfolios of small and value stocks, where dividend yields are related mainly to future dividend changes. The authors show that what explains time variation in the dividend-to-price ratio of small stocks is the predictability of future dividend growth, while in the case of big stocks, it is all about return predictability, especially at longer horizons. Jagannathan and Liu (2019) show that the US dividend growth in the post-war period is predictable in a model that incorporates information about the corporate payout policy in the dividend growth rate function. The main idea is that high earnings relative to dividends imply that firms have been retaining earnings in the past and thus are expected to pay more dividends in the future. Using US data, the authors show that their model predicts 42.4 to 46.4% of the variation in annual dividend growth rates between 1946 and 2016 in-sample and 39.5 to 41.3% of the variation in annual dividend growth rates between 1976 and 2016 out-of-sample.
Most studies on return predictability by the dividend yield (or dividend-price ratio) use US data. This emphasis on the US is even more pronounced when it comes to examining dividend growth predictability [see Paye and Timmermann (2006)]. However, there are some studies on other countries. Campbell (2003) conducts a comprehensive study on asset price determination within a consumption-based framework using international data. He finds some evidence on the dividend growth predictability by the dividend-price ratio in several countries, but not in the US. Engsted and Pedersen (2010) study the dividend yield power in predicting Scandinavian dividend growth. They show that the degree of predictability depends on whether real or nominal variables are used in the analysis.
The different results on the ability of the dividend yield to predict dividend growth put into perspective that this ability is probably conditional to the sample period and it may be country-specific. For instance, Lettau and Ludvigson (2005) provide a potential justification for the absence of predictability of the dividend growth by the dividend-price ratio in the period after the WWII. They conclude that the forecasts of dividends and the forecasts of the excess stock returns are positively correlated with the business cycles. The variations, both in expected returns and in expected dividend growth, are compensated on the dividend-price ratio. They also provide an explanation for the consumption-wealth ratio having a higher power than the log dividend-price ratio in predicting the excess stock market returns over medium-term horizons. Chiang (2008) shows that when the dividends do not capture the relevant future cash flows, the expected dividend growth is not predictable by the dividend yields. They argue that this is due to the flatness of the dividend series, which in turn results from the manipulation and shifts in the financial policies of firms. Golez and Koudijs (2018) conclude that return predictability from dividend yields has been a robust characteristic of financial markets over the last four centuries, and argue that a possible justification for the absence of dividend predictability is the growing disconnection between earnings and dividends. A lower dividend-to-earnings ratio implies that firms push the eventual payouts to shareholders into the future. Given that expected returns appear to be less persistent than expected dividend growth rates, postponing dividend payments increases the relative sensitivity of dividend yields to shocks in expected returns and reduces the ability of dividend yields to predict dividend growth rates. Le Bris et al. (2019) study the pricing of the Bazacle Company of Toulouse, the earliest documented shareholding corporation, over the 1372 to 1946 period, using share prices and net dividends, showing that the changes in expectations of future dividends explain a significant fraction of price variations. This finding is consistent with the relative importance of cash flow versus discount rate news for individual firms.
Probably, the most established explication for the absence of dividend predictability has to do with the way companies define their dividend policies and more specifically to the practice of dividend smoothing. Chen et al. (2012) report that dividend smoothing can destroy the dividend predictability in a finite sample. In fact, there is a noticeable difference in firms' dividend policies in the US before and after the WWII, with the dividend payouts being much smoother in the post-war period. Thus, the dividend smoothing strategies pursued by US firms after the WWII is a plausible explanation to why in the US dividend growth is predictable before the WWII but not afterwards.
Dividend smoothing literature is heavily weighted towards the US and the international evidence on the relationship between the dividend unpredictability and dividend smoothing is scarce. Usually, authors study separately the two issues and, if there is evidence on International evidence on stock returns and dividend growth predictability using dividend yields 4 R. Cont. Fin. -USP, São Paulo dividend unpredictability and on dividend smoothing for a particular country, they conclude that probably there is a causal relationship from the later to the former. Rangvid, Schmeling, and Schrimpf (2014) provide a reference study on this relationship for the period 1973 to 2009, using a comprehensive list of countries. The authors show that dividend predictability is weaker in large and developed markets where dividends are more smoothed, the typical firm is larger, and volatility is lower, hence concluding that the apparent lack of dividend predictability in the US does not uniformly extend to other countries. Our study provides additional evidence on the predictive power of the dividend yield and its relationship with dividend smoothing for seven of the most developed economies in the world using up-to-date data.

DATA DESCRIPTION
This paper examines the predictability of dividend growth and stock returns in the US, the UK, Japan, France, Germany, Italy, and Spain using the time series of annual stock prices, dividends, and earnings since its availability until 2016. Annual data is used instead of monthly or quarterly data because at these finer frequencies the dividend measurement errors are more severe. Accordingly, annual data has the proper frequency to be used in long run investment strategies with horizons of several years.
For these countries, we collected from the Global Financial Data database the time series of the "total stock returns index", denoted by RI t , "stock prices index" (P t ), "stocks-dividend yields" (DY t ) (in percentage), "stocks -earnings yields" (E/P t ), and "consumer prices index" (CPI t ) (the subscript refers to the end of the year t). The data was obtained at the aggregated level of each country general stock index provided by Global Financial Data. The sampling period is different for each country, and for some countries the "stocks-earnings yields" are only available several years after all the other series. Figure 1 shows the sample periods of all the series for each country. The raw data was then used to compute the appropriate series. The dividend series are obtained as D t = (DY t /100) P t , hence the [G] Prices are measured at the end of the year. yearly dividend growth is given by DG t = (D t /D t-1 ) -1. The price-earnings ratios are obtained by simply inverting the earning yields series (P/E) t = ((E/P) t ) -1 and the earnings series are computed as E t = (E/P) t P t . The arithmetic returns are given by R t = (RI t /RI t-1 ) -1. All the nominal variables, including the dividend growth and returns, were then deflated by the national consumer price indices (CPI) in order to obtain their real values. The descriptive statistics of the real arithmetic return, real dividend growth, and dividend yield for each country are shown in Table 1. For those countries (US, UK, Japan, and France) where there is data prior to the WWII, the overall sample is divided into two segments: the pre-WWII period, from the start of the samples until 1945, and the post-war period, from 1946-2016.
For the overall sample, considering the US, the UK, Japan, and France, the mean annual return ranges from 5.7% for France to 8.9% for Japan, while the mean dividend growth, always lower than the mean return in the respective country, ranges from 0.9% for Japan to 3.8% for France. For these countries, the mean return and the mean dividend growth show an increasing pattern from the pre-to the post-WWII period, while their standard deviations (SD) do not show a clear pattern. For the dividend yield, the mean values have decreased while the SD have increased from the first sub-sample to the second sub-sample. The first-order autocorrelation coefficient of returns became negative in the period after the WWII, except for Japan, while this coefficient for the dividend growth has increased in the post-WWII period. The statistics of Germany, Italy, and Spain report to sampling periods after the WWII and hence are better comparable with the post-WWII subsample of the other four countries. The most out of line statistic is the negative correlation coefficient of dividend growth for Italy. The coefficient φ(1) shows that the dividend yield is highly persistent, independently of the country or sampling period, ranging from 0.5 (US, pre-WWII period) to 0.90 (Japan, overall sample).

MODELS SPECIFICATIONS
This section introduces the theoretical concepts and models that are central to the examination of the power of dividend yield in predicting stock returns and dividend growth in different time horizons. Additionally, it presents the concept of dividend smoothing and the models of dividend policy estimated in the next section and introduces a simple but widely recognized methodology to assess the predictive power of dividends out-of-sample.

The Dividend Yield Model for Returns and
Dividend Growth Following Campbell and Shiller (1988), one can derive the following one-period log-linear return approximation: where r t+1 , d t+1 , and p t+1 denote the log-return, the logdividend, and log-price of a given stock at period t + 1, respectively, and c is a linearization constant. Hence, ∆d t+1 is the logarithmic dividend growth rate and where E [d -p] is the expected logarithmic dividend yield that can be simply estimated by the average of the historical logarithmic dividend yield. Therefore, ρ is timeindependent and is typically close to unity.
The present value relationship can be obtained by solving equation 1 forward, taking conditional expectations and imposing a no-bubble condition in the dividend yield, i.e., lim ��� � �� ��� � ��� � � 0 . Accordingly, This equation indicates that the current dividend yield can be seen as the discounted value of all future returns r t+j+1 and future dividend growth rates ∆d t+j+1 , both discounted at a constant rate ρ (minus a constant c/ (1 -ρ)). Equation 3 implies that the dividend yield predicts future returns and/or future dividend growth.
To examine the predictability of returns and dividend growth by the dividend yield, we use the first-order VAR representation of the returns, dividend growth, and dividend yields, as proposed by Cochrane (2008): Table 1 Cont.
The predictability of returns and dividend growth can be assessed by standard marginal tests. However, Cochrane (2008) highlights that the identity in equation 1 applies to each data point, thus connecting the regression coefficients and the errors in the VAR system (equations 4, 5 and 6). Thus, the projection of equation 1 on d t -p t imply that the regression coefficients must obey to the approximate identity and the errors in the VAR are linked via ��� � � ��� � � ��� �� .
Assuming that the dividend yield process is not explosive, φ ≤ 1/ρ, a null hypothesis with both unpredictable returns and unpredictable dividend growth is impossible. In other words, assuming no bubbles, if the dividend yield does not predict the future stock returns (future dividend growth), then it must predict future dividend growth (future stock returns). Hence, under the null hypothesis of no return predictability, b r = 0, the dividend growth must be predictable, that is b d must be negative (null I).
Conversely, if the dividend yield does not predict the future dividend growth, b d = 0, then it must predict future returns, that is b r must be positive (null II).

Dividend Smoothing
Some authors argue that a possible explanation for the unpredictability of the dividend growth is the guidelines used by companies, aiming to smooth the dividends paid to shareholders, that is, the firms tend to determine the dividend payout taking into account current earnings and past dividend payouts, hence flattening the dividend time series.
The most used dividend smoothness measure is given by where σ(∆d) is the SD of dividend growth and σ(∆e) is the SD of earnings growth. A higher value of S means that the dividend smoothness is lower.
In order to investigate the presence of dividend smoothing in our sample, we apply the same framework as Chen et al. (2012). This framework includes four models: the partial-adjustment model for the dividend behavior proposed by Lintner (1956), two variants of this model introduced by Chen et al. (2012), and a fourth model proposed by Marsh and Merton (1987). The Lintner's model and the other two variants show the speed of adjustment (SA) of the dividend payout to a shock in the firm's earnings. The first model is the following: where ∆D t is the change in the level of dividends, E t is the level of earnings, and D t-1 is the lagged dividend payout. In this model -α 2 is the so-called SA parameter. A positive shock in the firm's earnings results in an additional dividend payout.
The second model is the following: where ∆E t is the change in the level of earnings and ∆D t-1 is the change in the dividend payout lagged one period. In this model 1 -β 2 is the SA, thus β 2 can be interpreted as a measure of dividend smoothness. The third model is the following: In this model, γ 2 can be interpreted as the dividend smoothness metric. A higher value γ 2 mean a smoother dividend payout.
The fourth model, proposed by Marsh and Merton (1987), is the following: where D t+1 is the next period dividend and P t is the price at t. This model allows capturing the intensity of the dividend response to permanent earning changes.
Here λ 1 can be interpreted as the dividend smoothness metric, such that a higher λ 1 corresponds to less dividend smoothing.

Out-of-Sample Predictive Power
This subsection describes the procedure adopted to generate forecasts and to evaluate their accuracy. We use an expanding window to predict the returns and dividend growth rates, based on equation 4 and 5, respectively.

12
International evidence on stock returns and dividend growth predictability using dividend yields are the return and dividend growth averages using data up to time t -1.
We assess if the models exhibit a higher predictive ability than the historical average using the mean squared prediction errors (MSPE)-adjusted statistic of Clark and West (2007). This test is an approximately normal modified version of the MSPE statistic, which the authors show to be undersized. Its null hypothesis stipulates that the MSPE of both the model and the historical average are equal, whereas, according to the alternative hypothesis, the model predictions are more accurate. The most convenient way to implement this one-sided test is to compute and then regress � �,� and � �,� on a constant and use the resulting t-statistics for a 0 coefficient. The null hypotheses of equal predictive ability are rejected at the 5% confidence level, if the t-statistics exceed 1.645 (one-sided test).

EMPIRICAL RESULTS
This section presents the estimation results of the models presented before. The VAR systems are estimated by ordinary least squares (OLS). Notice, however, that the VAR errors should be serially uncorrelated, but may present significant cross-correlations, hence the OLS estimators of b r , b d , and φ are consistent but biased in small samples.
Accordingly, in this framework, hypotheses testing should be conducted using computing intensive methods, such as bootstrap resampling or Monte Carlo simulation. Table 2 presents the results from the VAR(1), equations 4, 5 and 6, for overall samples of the seven countries under scrutiny using real variables (deflated by the inflation rate).  (White, 1980)

, the column "Corr. of Residuals" refers to the matrix, where the elements in the diagonal are the standard deviations and the elements off-diagonal are the cross-correlations of residuals, and Pm and Pc are probabilities obtained by the Monte Carlo simulations of the system given by equations 4, 5, and 6, considering
as in Cochrane (2008). These probabilities were obtained from 10,000 simulations of the joint distribution of {b r , b d }. For the returns rows, Pm is given by ��� and Pc is given by . For the dividend-growth rows, Pm is given by and Pc is given by are the values of the parameters implied by the identity b r = 1 - As expected, the constant discount factor (ρ) is close to unity, ranging from 0.958 for the UK and Spain to 0.973 for Japan. The coefficient φ for the dividend yield equation is quite high but below unity, ranging from 0.698 for Germany to 0.959 for Japan, showing a higher degree of persistence in the dividend yield process. All the estimates of φ are significant at the 1% level and the dividend yield equation has the highest R 2 for all countries. In absolute terms, these coefficients of determination are high, except for Spain, where R 2 = 0.008. The coefficients for the returns and dividend growth equations, i.e., The proportion of the variability of the oneperiod-ahead dividend growth explained by the dividend yield is only marginally lower than the corresponding proportion for the one-period-ahead return for the UK, Japan, and France. So there is evidence that the dividend growth is more predictable by the dividend yield than returns only for the US, Germany, and Italy.
The column labelled Pm in Table 2 exhibits the p-values corresponding to the null hypothesis of no return predictability and no dividend growth predictability, based on a single parameter, whereas Pc presents the results from the Cochrane (2008) joint test. The traditional test (Pm) reveals that returns are only predictable in the UK and France, which corroborates the findings in Cochrane (2008) that this test lacks power to detect return predictability. The joint test is able to reject the null hypothesis of no predictability, at the 5% significance level for all the countries, except Japan and Italy. Regarding the dividend growth, the single parameter test rejects the null hypothesis for the US, Germany, Italy, and Spain. According to the joint test, this hypothesis is rejected for an additional country, Japan.
The columns 8 to 10, labelled "Corr. of residuals", present the SD of the residuals on the diagonal and the cross-correlations off the diagonal. The return SD range from 0.18, for the US, to 0.46, for France. The unusually high return SD for France may be explained by the turbulent period during the WWII. The residuals of the dividend growth equation show a similar pattern: they are higher for France and Japan than for the remaining countries. The errors for the return and dividend yield equations are negatively correlated for all the countries, but the absolute values of the correlations for Japan and France are considerably lower than the ones for the remaining countries. This is important because the negative correlation between these errors generates a negative correlation between the estimates b r and φ, which increases the power of the joint test relative to the marginal one-sided test. The return and dividend growth errors are positively correlated for all the countries, except France and Spain. This phenomenon and the unexpected coefficient in the dividend growth equation can be seen as "red flags" for the case of France.
The lack of return and dividend growth predictability may be attributable to dividend smoothing practices and stock repurchases, as has been pointed out by Cochrane (2008), among others. That is, if prices move today in response to dividend news several years into the future, then this information would not be captured by the 1-year VAR presented in equations 4, 5 and 6 because of these news would not be reflected in next year's dividend. In order to address this issue, we tested if the long-horizon returns and dividend growth rates can be forecasted based on the dividend yield. These results are presented in tables 3 and 4, respectively. These tables also present the results for the two sub-samples, i.e., pre-and post-WWII. In the full sample, returns are predictable for all time horizons in the US and the UK and they are unpredictable in Japan and Italy. For the remaining countries, the evidence is mixed: the null hypothesis of no predictability is rejected in Spain (one year), France (one and five years), and Germany (one, five and 20 years). In the subsample analysis, there is evidence of an increase in predictability for the US and the UK in the post-WWII period, whereas France exhibits the reverse pattern. For Japan, returns are unpredictable in both sub-periods.
Regarding dividend growth predictability, Table 4 reveals that it is present in Spain at all time horizons, and it is completely absent in Japan and France. Dividends are forecastable at time horizons up to 10 years in the US, at one, two, five, and 20 years in Italy, and only at one year in the UK and Germany. In the subsample analysis, we can observe a decrease in dividend growth predictability for the US and the UK in the post-WWII period, and there is no discernible trend in the ability to forecast dividends in France and Japan.    In sum, we cannot conclude that long-horizon tests provide an overwhelming increase in power relative to the one-year tests. This is at odds with Cochrane (2008), who shows that there is a significant increase in power if the time horizon is extended beyond 15 years.
Hereafter, we discuss the results from the various dividend smoothing measures presented in section 4.2. We also analyze the connection between these measures and the predictability results. Note that if the dividends are strongly smoothed, the link between the dividend yield and dividend growth will be broken. Thus, the dividend growth will not be forecastable and b d will tend to 0. Dividend smoothing also increases the dividend yield autocorrelation (φ), which renders its effect on return predictability (b r ) ambiguous (see equation 7). Table 5 reports the estimated dividend behavior models mentioned in the last section and the smoothness parameter. The models are estimated using OLS, with Newey-West corrected standard errors. We had to exclude Germany due to the lack of data.
The first part of this table reveals that dividends have become more stable in the US after WWII, according to all the measures considered: the volatility of dividends relative to earnings decreased from 0.525 to 0.234, the SA decreased for the models Lintner, Chen-Da-Priestley 1, and Marsh-Merton, and the smoothness parameter increased from 0.249 to 0.374 for Chen-Da-Priestley 2. Comparing these results with Table 4, we conclude that, as expected, more dividend smoothing implies less dividend predictability, in accordance with Chen et al. (2012).
In the cross-country comparison, we choose to focus on the post-WWII values for the US and the UK, because for the remaining countries our data does not cover the pre-war period.
The first column in Table 5 shows the volatility of dividend growth relative to the volatility of earnings growth. By this measure, dividends are the most stable in the US (0.234) and the most volatile in Spain (0.968). It is noticeable that the dividend volatility in Spain is more than twice as high as the dividend volatility in every other country. These results corroborate Rangvid et al. (2014), who found that dividends are more stable in larger markets, and Renneboog and Trojanowski (2007) and Denis and Osobov (2008), who showed that dividends are smoother in the US than in France, Germany, and Japan. The SA in models Lintner and Chen-Da-Priestley 1 assesses how fast firms adjust their dividends in response to an earnings shock. In Lintner, the SA is the fastest for Italy (0.276), followed by Japan and France (0.181), and the slowest for the US (0.102). According to Chen-Da-Priestley 1, dividends are more persistent in the US and the UK than in the other countries. Curiously, French and Spanish companies increase their dividends, following a positive earnings shock, by more than 100% of the longterm dividend hike implied by the target payout ratio.
In Chen-Da-Priestley 2, the coefficient on the lagged change in dividends is positive and highly significant for the US and the UK, which indicates that firms in these countries engage in strong dividend smoothing. Unexpectedly, for Japan, France, and Spain, this coefficient is negative, but it is not significantly different from 0 at the 5% significance level.
The coefficient λ 1 , in the Marsh-Merton model, captures the response of dividends to permanent earnings changes. US and UK firms are the slowest to react to an earnings shock, as in most other models, while French firms are the fastest ones.
Overall, the different measures of dividend smoothing show some consistency, as they all rank the US and the UK among the countries where dividend persistence is stronger. However, there is some incoherence for the remaining countries. Spain presents the highest dividend volatility and the lowest dividend smoothing according to Chen-Da-Priestley 2, but Lintner shows that this is the second country that most smooths dividends. For France, Chen-Da-Priestley 1 and the Marsh-Merton models place it as the country that least practices dividend smoothing, but its dividend volatility is the second lowest.
These results provide some support to the hypothesis that dividend growth is unpredictable in countries where firms smooth their dividends. Dividend growth is unforecastable after the WWII in the US, and in the UK it is predictable only at the one-year horizon. However, dividends are strongly predictable in Spain and Italy, and unpredictable in France and Japan, even though these countries exhibit a similar degree of dividend smoothing.
Our results regarding the accuracy of the return and dividend growth predictions cover the period from 1900 to 2016, for the US and France, 1946 to 2016 for the UK and Japan, and 1981 to 2016 for Germany. The short data span available for both Spain and Italy prevents the realization of an out-of-sample analysis for these countries.
Figures 2 and 3 exhibit the differences between the cumulative prediction errors based on the historical averages and the models, for the returns and dividend growth rates, respectively.  Those segments of the lines that present increasing (decreasing) slopes should be interpreted as periods when the models outperform (underperform) the historical averages. Returns are highly predictable in Japan, after WWII until the late fifties, but thereafter the model's performance looks dismal. There is also some evidence of return predictability in France after the WWII and in the UK in the mid-seventies. Figure 3 reveals that the dividend growth rate was predictable in the US, during the first half of the 20th century, in Japan, after the WWII, and in Germany throughout most of the period considered. Table 6 presents the pseudo R 2 out-of-sample and its significance according to the MSPE-adjusted statistic.  Clark and West (2007) to test the models' predictive ability is also shown in the table. Significance at the 1, 5, and 10% levels are denoted by *, **, and ***, respectively.
Source: Elaborated by the authors. Table 6 shows that the pseudo R 2 is positive for the return predictions in the UK (significant at the 5% level) in the full out-of-sample period. It is also positive and significant, at the 1% level, for the dividend growth forecasts in the US and Germany. During the first sub-period, the pseudo R 2 is only positive for the dividend growth predictability in the US. The subperiod following the WWII shows evidence of return predictability in several countries. The pseudo R 2 is positive in the UK, the US, and France, and it is statistically significant at the levels of 1 and 5% in the last two countries, respectively. In the last period comprised between 1981 and 2016 predictability looks almost absent. The only exception is the dividend growth rate in Germany, which has a positive and highly significant pseudo R 2 .

CONCLUSION
The finance literature claims that the dividend yield variation can be explained by news about future returns and future dividends, which means that one can trace price movements back to visible news about dividends or cash flows. In the present study, we confirm some of the much highly reproduced results for the US. In fact, for the US aggregate stock market, the expected future returns account for most of the observed variation in the dividend yields.
We extended the analysis of previous studies by considering six additional countries that, jointly with the US, represent close to 85% of the Morgan Stanley Capital International (MSCI) World Index. Our results for the post-WWII period reveal that returns are predictable in the US and the UK, but dividends are unpredictable, while the opposite pattern is observed in Spain and Italy. In Germany, there is some evidence of short-term predictability for both returns and dividends, while in France only returns are predictable. In Japan, neither variable can be forecasted.
The dividend smoothing results show that dividends are more persistent in the US and the UK than in the remaining countries. The various measures of dividend volatility do not provide a consistent ranking for Japan, France, Italy, and Spain (Germany was excluded from the analysis due to the lack of data). However, we provide mild support to the hypothesis that dividend predictability is lower in countries where dividends are strongly smoothed, as there is no dividend predictability in the US and the UK, where dividend smoothing is strongest, but we cannot establish a clear connection between dividend volatility and predictability for the other countries.
Cross-country comparisons of dividend smoothing measures must be conducted with caution. Different accounting standards across countries may compromise earnings comparability, which adds noise to our estimators of dividend persistence. Besides that, one should point out that our database does not cover the same time span in different countries. Nevertheless, our study provides some novel insights regarding the cross-country analysis between dividend smoothing and predictability and it opens the door to further research that, using a more comprehensive data or more sophisticated methods, can either confirm or disprove them.
An important conclusion to retain from the out-ofsample analysis is that the predictability of returns after the WWII, especially present in the US, appeared to have been missing in the last three decades, most probably due to the turmoil experienced by the stock markets during this last period.
Overall, the results show that, even for developed capital markets, there is no clear pattern on the predictive ability of dividend yields on stock returns and dividend growth, instead these relationships seem to be timedependent and country-specific.