Forthcoming, International Journal of Forecasting 

Working Paper Version

Presented at: NBER-NSF Time Series Conference at Rice University, 2021,  European Winter Meeting of the Econometric Society at the University of Barcelona School of Economics, 2021 

Working Papers

with Seung C. Ahn

Abstract: This paper considers a forecasting model in which a target variable is a linear function of K latent factors among many predictors (N). The target variable is forecasted by regression with factors generated by the Partial Least Squares (PLS) method. Our asymptotic analysis shows that the optimal number (q*) of PLS factors for forecasting can be much smaller than K. Using more than q* PLS factors can cause an over-fitting problem, which deteriorates the out-of-sample forecasting accuracy while yielding high in-sample fit. Our Monte Carlo simulation results confirm these asymptotic results. Furthermore, our simulation exercises and topical empirical analysis indicate that using q* PLS factors is not necessarily desirable in practice unless very large samples are used. Using smaller than q* PLS factors often produces more accurate forecasting results. Especially, a single PLS factor very often outperforms q* PLS factors, even when q*>1.

Note: This paper was circulated and presented as "Forecasting with Partial Least Squares When a Large Number of Predictors Are Available". 

Presented at: Econometrics Seminar at Georgetown University, 2022, Econometrics Seminar at the University of York, 2022, Forecasting Seminar at George Washington University, 2022, North American Summer Meeting of Econometric Society at the University of Miami, 2022, European Summer Meeting of Econometric Society at Bocconi University, 2022, Asian Summer Meeting of Econometric Society at Keio University and the University of Tokyo, 2022, NBER-NSF Time Series Conference at Boston University, 2022, SNDE Symposium for Young Researchers, 2022 

with Seung C. Ahn

Abstract: In this paper, we develop a novel supervised factor estimation method called Single Component Analysis (SCA). We consider a contemporaneous and forecasting model where a single dependent or target variable of interest exists. As the name implies, SCA produces a one-dimensional factor that asymptotically estimates all factors governing the dependent variable or target variable. The SCA method incorporates the supervised aspects of Partial Least Squares (PLS) that maximizes the covariance with the dependent or target variable. However, our SCA factor does not suffer from overfitting problem opposed to many supervised methods. Consistency of SCA factor under the two models is shown when both sample sizes N and T increase to infinity. Simulation evidence demonstrates that SCA outperforms other alternatives and shows robust forecasting performance without an overfitting issue. Empirical application on forecasting major macroeconomic and finance variables in big data also confirms promising predictive results of SCA.

Presented at: Royal Economic Society Annual Conference at Queen's University Belfast, 2024, NBER-NSF Time Series Conference at the University of Pennsylvania, 2024 (Scheduled)

Work in Progress

Measuring Macroeconomic Uncertainty with Various Factor-Augmented Forecasting

Abstract: This paper investigates measuring macroeconomic uncertainty, using factor-augmented forecasting. In this paper, uncertainty is measured by the conditional volatility of various macroeconomic series, that was not predicted from factor-augmented forecasting. Various factor-augmented forecasting methods are used to construct uncertainty measure, in order to remove as much predictable variations as possible from economic series. Failure to do so will overestimate the economic uncertainty, since it will wrongly include forecastable variations as a part of uncertainty. Target-specific factor estimation methods that incorporate the information of a target variable when factors are estimated, generate less forecasting errors and hence produce more accurate uncertainty measures. However, all the uncertainty measures constructed by factor- augmented forecasting demonstrate similar properties, show more persistent and correlated uncertainty periods with real activity, compared to other uncertainty proxies.