Real-time growth estimation with reinforcement learning

Survey data and asset prices can be combined to estimate high-frequency growth expectations. This is a specific form of nowcasting that implicitly captures all types of news on the economy, not just official data releases. Methods for estimation include the Kalman filter, MIDAS regression, and reinforcement learning. Since reinforcement learning is model-free, it can estimate expectations more efficiently than model-based alternatives. A recent paper suggests that this efficiency gain substantially improves nowcasts of growth expectations. Nowcasting with reinforcement learning can be applied to expectations for a variety of macro variables.

(more…)

Nowcasting with MIDAS regressions

Nowcasting macro-financial indicators requires combining low-frequency and high-frequency time series. Mixed data sampling (MIDAS) regressions explain a low-frequency variable based on high-frequency variables and their lags. For instance, the dependent variable could be quarterly GDP and the explanatory variables could be monthly activity or daily market data. The most common MIDAS predictions rely on distributed lags of higher-frequency regressors to avoid parameter proliferation. Analogously, reverse MIDAS models predict a high-frequency dependent variable based on low-frequency explanatory variables. Compared to state-space models (view post here), MIDAS simplifies specification and theory-based restrictions for nowcasting. The R package ‘midasr’ estimates models for multiple frequencies and weighting schemes. In practice, MIDAS has been used for nowcasting financial market volatility, GDP growth, inflation trends and fiscal trends.
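
The core of MIDAS is the parsimonious weighting of high-frequency lags. As a minimal sketch (not the ‘midasr’ implementation), the exponential Almon polynomial below collapses many lags into a single regressor governed by just two parameters; the values of `theta1` and `theta2` are illustrative, and in practice they would be estimated by non-linear least squares:

```python
import math

def exp_almon_weights(n_lags, theta1=0.1, theta2=-0.05):
    """Exponential Almon lag polynomial: weights on high-frequency lags
    decline smoothly and sum to one, so many lags are governed by only
    two parameters (avoiding parameter proliferation)."""
    raw = [math.exp(theta1 * k + theta2 * k * k) for k in range(n_lags)]
    total = sum(raw)
    return [w / total for w in raw]

def midas_aggregate(high_freq, n_lags, **theta):
    """Collapse the last n_lags high-frequency observations into a single
    low-frequency regressor; the most recent observation gets weight w[0]."""
    w = exp_almon_weights(n_lags, **theta)
    lags = high_freq[-n_lags:][::-1]
    return sum(wi * xi for wi, xi in zip(w, lags))
```

The aggregated regressor (e.g. twelve monthly lags condensed into one number per quarter) can then enter an ordinary regression on the quarterly dependent variable.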

(more…)

Nowcasting for financial markets

Nowcasting is a modern approach to monitoring economic conditions in real time. It makes financial market trading more efficient because economic dynamics drive corporate profits, financial flows and policy decisions, and account for a large part of asset price fluctuations. The main technology behind nowcasting is the dynamic factor model, which condenses the information of numerous correlated ‘hard’ and ‘soft’ data series into a small number of ‘latent’ factors. A growth nowcast can be interpreted as the factor that is most correlated with a diverse representative set of growth-related data series. The state-space representation of the dynamic factor model formalizes how markets read economic data in real time. The related estimation technique (the ‘Kalman filter’) generates projections for all data series and, for each data release, estimates a model-based surprise, called ‘news’. In recent years, machine learning models, such as support vector machines, LASSO, elastic net and feed-forward artificial neural networks, have been deployed to improve the predictive power of nowcasts.
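
The logic of model-based ‘news’ can be illustrated with a stripped-down scalar Kalman filter, tracking a single latent factor through a single data series (a real dynamic factor model tracks many series at once); the noise parameters below are illustrative assumptions:

```python
def kalman_news(observations, q=0.02, r=0.1):
    """Scalar local-level model: the latent factor x follows a random walk
    with state noise variance q; each release y observes x with noise
    variance r. Returns the model-based surprise ('news') of each release
    and the updated factor estimates."""
    x, p = 0.0, 1.0                # prior mean and variance of the factor
    news, estimates = [], []
    for y in observations:
        p = p + q                  # predict: uncertainty grows between releases
        surprise = y - x           # 'news': release minus model projection
        k = p / (p + r)            # Kalman gain: weight placed on the news
        x = x + k * surprise       # update the factor estimate with weighted news
        p = (1.0 - k) * p
        news.append(surprise)
        estimates.append(x)
    return news, estimates
```

Fed a run of identical releases, the ‘news’ shrinks toward zero as the factor estimate converges, mirroring how repeated confirmation of a data trend stops surprising the market.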

(more…)

Predicting volatility with heterogeneous autoregressive models

Heterogeneous autoregressive models of realized volatility have become a popular standard in financial market research. They use high-frequency volatility measures and the assumption that traders with different time horizons perceive, react to, and cause different types of volatility components. A key hypothesis is that volatility over longer time intervals has a stronger impact on short-term volatility than vice versa. This leads to an additive volatility cascade and a simple model in autoregressive form that can be estimated with ordinary least squares regression. Natural extensions include weighted least-squares estimation, the inclusion of jump components and the consideration of index covariances. Research papers report significant improvements in volatility forecasting performance compared to other models, across equity, fixed income, and commodity markets.

(more…)

The predictive power score

The predictive power score is a summary metric for predictive relations between data series. Like correlation, it is suitable for quick data exploration. Unlike correlation, it can work with non-linear relations, categorical data, and asymmetric relations, where variable A informs on variable B more than variable B informs on variable A. Technically, the score is a measure of the success of a decision tree model in predicting a target variable with the help of a predictor variable, out-of-sample and relative to naïve approaches. For macro strategy development, predictive power score matrices can easily be created based on an existing Python module and can increase the efficiency of finding hidden patterns in the data and of selecting predictor variables.

(more…)

A statistical learning workflow for macro trading strategies

Statistical learning for macro trading involves model training, model validation and learning method testing. A simple workflow [1] determines form and parameters of trading models, [2] chooses the best of these models based on past out-of-sample performance, and [3] assesses the value of the deployed learning method based on further out-of-sample results. A convenient technology is the ‘list-column workflow’ based on the tidyverse packages in R. It stores all related objects in a single data table, including models and nested data sets, and implements statistical learning through functional programming on that table. Key steps are [1] the creation of point-in-time data sets that represent information available at a particular date in the past, [2] the estimation of different model types based on initial training sets prior to each point in time, [3] the evaluation of these different model types based on subsequent validation data just before each point in time, and [4] the testing of the overall learning method based on testing data at each point in time.
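
The four steps can be sketched in Python rather than R (so no list-columns, but the same point-in-time discipline): at each point in time, candidate models are compared on validation data just before that date, and the winner's error on the next unseen observation tests the overall learning method; the candidate forecast functions are illustrative:

```python
def walk_forward(series, candidates, val_size=10):
    """Walk-forward statistical learning: for each point in time t,
    [1] use only data available before t, [2] evaluate each candidate
    model's one-step forecasts on [3] a validation window just before t,
    and [4] record the selected model's error on the next unseen
    observation. Returns the average test error of the learning method."""
    test_errors = []
    for t in range(2 * val_size, len(series)):
        val = series[t - val_size:t]            # validation data just before t

        def val_error(model):
            # average absolute one-step error over the validation window
            errs = [abs(model(series[:t - val_size + i]) - actual)
                    for i, actual in enumerate(val)]
            return sum(errs) / len(errs)

        best = min(candidates, key=val_error)   # model selection at time t
        test_errors.append(abs(best(series[:t]) - series[t]))
    return sum(test_errors) / len(test_errors)
```

Candidates are simply functions mapping a history to a forecast, e.g. a random-walk forecast `lambda h: h[-1]` versus a moving average; the point is that selection at each date uses only information available at that date.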

(more…)

Reward-risk timing

Reward-risk timing refers to methods for allocating between a risky market index and a risk-free asset. It is a combination of reward timing, based on expected future risk asset returns, and volatility timing, based on recent price volatility. A new paper proposes to use machine learning with random forests for estimating both risk premia (return expectations) and optimal lookback windows for volatility estimates. This method allows for non-linear effects and interactions in prediction and averages forecasts across a range of simple valid prediction functions. In an empirical analysis with data going back to 1952, the random forest method for reward-risk timing outperformed other methods and earned significantly higher risk-adjusted returns than a buy-and-hold strategy.
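
A minimal sketch of the allocation rule itself (not the paper's random forest, which would estimate the return expectation and the lookback window): the weight on the risky asset rises with the expected return and falls with recent variance; the risk-aversion and leverage-cap parameters below are illustrative assumptions:

```python
def reward_risk_weight(returns, exp_return, lookback=60, gamma=4.0, cap=1.5):
    """Reward-risk timing as a mean-variance rule: allocate to the risky
    asset in proportion to its expected return and inversely to its recent
    variance, estimated over the chosen lookback window."""
    window = returns[-lookback:]
    mean = sum(window) / len(window)
    var = sum((r - mean) ** 2 for r in window) / (len(window) - 1)
    weight = exp_return / (gamma * var) if var > 0 else cap
    return max(0.0, min(cap, weight))   # no shorting, capped leverage
```

Holding the return expectation fixed, a doubling of recent volatility quarters the risky-asset weight, which is the volatility-timing half of the approach; the reward-timing half enters through `exp_return`.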

(more…)

Detecting market price distortions with neural networks

Detecting price deviations from fundamental value is challenging because the fundamental value itself is uncertain. A shortcut is to look at return time series alone and to detect “strict local martingales”, i.e. episodes when the risk-neutral return temporarily follows a random walk while medium-term return expectations decline with the length of the forward horizon. A test based on instantaneous volatility can identify such strict local martingales. The difficulty is to model the functional form of volatility, which may vary over time. A new approach is to use a recurrent neural network for this purpose, specifically a long short-term memory network. Based on simulated data, the neural network approach achieves much higher detection rates for strict local martingales than methods based on conventional volatility estimates.

(more…)

The predictive superiority of ensemble methods for CDS spreads

Through ‘R’ and ‘Python’ one can apply a wide range of methods for predicting financial market variables. Key concepts include penalized regression, such as Ridge and LASSO, support vector regression, neural networks, standard regression trees, bagging, random forest, and gradient boosting. The latter three are ensemble methods, i.e. machine learning techniques that combine several base models in order to produce one optimal prediction. According to a new paper, these ensemble methods scored a decisive win in the nowcasting and out-of-sample prediction of credit spreads. One apparent reason is the importance of non-linear relations in times of high volatility.
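
Bagging, the simplest of the three ensemble methods, can be sketched in a few lines: each base model (here a one-split regression ‘stump’, an illustrative stand-in for a full tree) is fit on a bootstrap resample of the training data, and forecasts are combined by averaging:

```python
import random

def bagged_predict(models, x):
    """Ensemble prediction: average the forecasts of the base models."""
    preds = [m(x) for m in models]
    return sum(preds) / len(preds)

def bag_stumps(xs, ys, n_models=25, seed=0):
    """Bagging: fit each base model (a one-split 'stump' splitting at the
    bootstrap sample's median x) on a bootstrap resample of the data."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(len(xs)) for _ in range(len(xs))]
        bx, by = [xs[i] for i in idx], [ys[i] for i in idx]
        split = sorted(bx)[len(bx) // 2]
        left = [y for x, y in zip(bx, by) if x <= split] or by
        right = [y for x, y in zip(bx, by) if x > split] or by
        lm, rm = sum(left) / len(left), sum(right) / len(right)
        # bind split and leaf means as defaults to avoid late-binding bugs
        models.append(lambda x, s=split, l=lm, r=rm: l if x <= s else r)
    return models
```

Averaging over resamples reduces the variance of the individual stumps, which is the mechanism behind the predictive gains of bagging, random forests and (with sequential reweighting) gradient boosting.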

(more…)

A method for de-trending asset prices

Financial market prices and return indices are non-stationary time series, even in logarithmic form. This means not only that they are drifting, but also that their distribution changes over time. The main purpose of de-trending is to mitigate the effects of non-stationarity on the estimated price or return distribution. De-trending can also support the design of trading strategies. The simplest basis for estimating trends is to subtract moving averages. The key challenge is to pick the appropriate averaging window, which must be long enough to detect a trend and short enough to make the de-trended data stationary. A neat method is to pick the window based on the kurtosis criterion, i.e. choosing the window length that brings the ‘fatness of tails’ of the de-trended data closest to what it would be under a normal distribution.

(more…)