Acessibilidade / Reportar erro

Bayesian inference for the log-symmetric autoregressive conditional duration model

Abstract

This paper adapts Hamiltonian Monte Carlo methods for application in log-symmetric autoregressive conditional duration models. These recent models are based on a class of log-symmetric distributions. In this class, it is possible to model both median and skewness of the duration time distribution. We use the Bayesian approach to estimate the model parameters of some log-symmetric autoregressive conditional duration models and evaluate their performance using a Monte Carlo simulation study. The usefulness of the estimation methodology is demonstrated by analyzing a high frequency financial data set from the German DAX of 2016.

Key words
ACD models; Bayesian inference; high frequency financial data; log-symmetric distributions

INTRODUCTION

Bibliographical review and preliminaries

The concept of log-symmetry appears when a random variable presents the same distribution as its reciprocal, or in terms of ordinary symmetry regarding the distribution of the logged random variable; see Jones 2008JONES MC. 2008. On reciprocal symmetry. J Stat Plan Inference 138: 3039-3043.. The class of distributions having this property is called log-symmetric and it has been used to describe the behavior of strictly positive data. The log-symmetric class encompasses bimodal distributions as special cases, and distributions that possess either lighter or heavier tails than the log-normal distribution, which is a particular case of this class; see e.g., Vanegas & Paula ( 2016bVANEGAS L & PAULA G. 2016b. Log-symmetric distributions: statistical properties and parameter estimation. Braz J Probab Stat 30: 196-220.). Some examples of log-symmetric distributions are: log-normal, log-Student-t, log-logistic, log-Laplace, log-Cauchy, log-power-exponential, log-slash, harmonic law, Birnbaum-Saunders, and Birnbaum-Saunders-t; see e.g., Crow & Shimizu (1988)CROW EL & SHIMIZU K. 1988. Lognormal Distributions: Theory and Applications. New York, US: Dekker., Birnbaum & Saunders (1969)BIRNBAUM ZW & SAUNDERS SC. 1969. A new family of life distributions. J Appl Probab 6: 319-327., Rieck & Nedelman (1991)RIECK J & NEDELMAN J. 1991. A log-linear model for the Birnbaum-Saunders distribution. Technometrics 3: 51-60., Johnson et al. (1994)JOHNSON N, KOTZ S & BALAKRISHNAN N. 1994. Continuous Univariate Distributions. Vol. 1. New York, US: Wiley., 1995JOHNSON N, KOTZ S & BALAKRISHNAN N. 1995. Continuous Univariate Distributions. Vol. 2. New York, US: Wiley., Díaz-García & Leiva (2005)DÍAZ-GARCÍA J & LEIVA V. 2005. A new family of life distributions based on elliptically contoured distributions. J Stat Plan Inference 128: 445-457., Marshall & Olkin (2007)MARSHALL A & OLKIN I. 2007. Life Distributions. New York, US: Springer., Jones (2008)JONES MC. 2008. On reciprocal symmetry. J Stat Plan Inference 138: 3039-3043., and Vanegas & Paula (2016b)VANEGAS L & PAULA G. 2016c. Log-symmetric regression models under the presence of non-informative left-or right-censored observations. Test, p. 1-24..

The class of log-symmetric distributions has been primarily used in the regression context. Vanegas & Paula ( 2016aVANEGAS L & PAULA G. 2016a. An extension of log-symmetric regression models: R codes and applications. J Stat Comput Simul 86: 1709-1735.) proposed log-symmetric regression models which allow both the median and skewness (or the relative dispersion) be described using an arbitrary number of non-parametric additive components. Vanegas & Paula ( 2016bVANEGAS L & PAULA G. 2016b. Log-symmetric distributions: statistical properties and parameter estimation. Braz J Probab Stat 30: 196-220.) studied some interesting properties of the log-symmetric class of distributions. Vanegas & Paula ( 2016cVANEGAS L & PAULA G. 2016c. Log-symmetric regression models under the presence of non-informative left-or right-censored observations. Test, p. 1-24.) proposed an extension to allow the presence of non-informative left or right-censored data in log-symmetric regression models. Medeiros & Ferrari 2017MEDEIROS FMC & FERRARI SLP. 2017. Small-sample testing inference in symmetric and log-symmetric linear regression models. Stat Neerl 71(3): 200-224. discussed the issue of testing hypothesis in symmetric and log-symmetric regression models. In special, the authors considered the Wald, likelihood ratio, score and gradient tests for this purpose. Finally, Ventura et al. 2019VENTURA M, SAULO H, LEIVA V & MONSUETO SE. 2019. Log-symmetric regression models: information criteria and application to movie business and industry data. Appl Stoch Models Bus Ind 35: 963-977. analyzed movie business data using log-symmetric regression models.

High frequency financial data on transactions have been modeled primarily by autoregressive conditional duration (ACD) models, which were proposed by Engle & Russell 1998ENGLE R & RUSSELL J. 1998. Autoregressive conditional duration: A new method for irregularly spaced transaction data. Econometrica 66: 1127-1162.. These models are commonly used to capture the clustering structure and they can be seen as the counterpart of GARCH models for modeling trade duration (TD) data; see Liu & Heyde 2008LIU S & HEYDE CC. 2008. On estimation in conditional heteroskedastic time series models under non-normal distributions. Stat Pap 49: 455-469.. We strongly recommend reading the works by Pacurar 2008PACURAR M. 2008. Autoregressive conditional durations models in finance: A survey of the theoretical and empirical literature. J Econ Surv 22: 711-751. and Bhogal & Variyam 2019BHOGAL SK & VARIYAM RT. 2019. Conditional duration models for high-frequency data: A review on recent developments. J Econ Surv 33(1): 252-273., which are literature reviews on ACD models. Some characteristics concerning TD data are: (C1) the irregular nature with respect to the way they are collected; (C2) the diurnal intra-day pattern; (C3) the large number of observations; (C4) the probability density function (PDF) with asymmetric shape; and (C5) the hazard rate (HR) with inverse bathtub shape (unimodal); see e.g., Leiva et al. 2014LEIVA V, SAULO H, LEÃO J & MARCHANT C. 2014. A family of autoregressive conditional duration models applied to financial data. CSDA 79: 175-191.. Extensions of the original ACD model proposed by Engle & Russell 1998ENGLE R & RUSSELL J. 1998. Autoregressive conditional duration: A new method for irregularly spaced transaction data. Econometrica 66: 1127-1162. are basically based on the following aspects: (A1) the distributional assumption in order to yield an asymmetric PDF and an unimodal HR; (A2) the linear form for the conditional mean or median dynamics; (A3) and the time series properties; see, for example, Bauwens & Giot 2000BAUWENS L & GIOT P. 2000. The logarithmic ACD model: An application to the bid-ask quote process of three NYSE stocks. Ann Econ Stat 60: 117-149., Grammig & Maurer 2000GRAMMIG J & MAURER K. 2000. Non-monotonic hazard functions and the autoregressive conditional duration model. J Econom 3: 16-38., Meitz & Terasvirta 2006MEITZ M & TERASVIRTA T. 2006. Evaluating models of autoregressive conditional duration. J Bus Econ Stat 24: 104-124., Chiang 2007CHIANG MH. 2007. A smooth transition autoregressive conditional duration model. Stud Nonlinear Dyn Econom 11: 108-144., Pacurar 2008PACURAR M. 2008. Autoregressive conditional durations models in finance: A survey of the theoretical and empirical literature. J Econ Surv 22: 711-751., Bhatti 2010BHATTI C. 2010. The Birnbaum-Saunders autoregressive conditional duration model. Math Comput Simul 80: 2062-2078., Leiva et al. 2014LEIVA V, SAULO H, LEÃO J & MARCHANT C. 2014. A family of autoregressive conditional duration models applied to financial data. CSDA 79: 175-191., Diana 2015DIANA T. 2015. Measuring the impact of traffic flow management on interarrival duration: An application of autoregressive conditional duration. J Air Transp Manag 42: 219-225., Dionne et al. 2015DIONNE G, PACURAR M & ZHOU X. 2015. Liquidity-adjusted intraday value at risk modeling and risk management: An application to data from Deutsche Börse. J Bank Finance 59: 202-219., Zheng et al. 2016ZHENG Y, LI Y & LI G. 2016. On Fréchet autoregressive conditional duration models. J Stat Plan Inference 175: 51-66., Saulo et al. 2019SAULO H, LEAO J, LEIVA V & AYKROYD RG. 2019. Birnbaum-Saunders autoregressive conditional duration models applied to high-frequency financial data. Stat Pap 60: 1605-1629., and Mishra & Ramanathan 2017MISHRA A & RAMANATHAN TV. 2017. Nonstationary autoregressive conditional duration models. Stud Nonlinear Dyn Econom 21(4): 1-22..

Recently, Saulo & Leao 2017SAULO H & LEAO. 2017. On log-symmetric duration models applied to high frequency financial data. Econ Bull 37: 1089-1097. proposed a family of ACD models based on the class of log-symmetric models. The log-symmetric ACD models encompass all the log-symmetric distributions cited at the beginning of this introduction as special cases, that is, they encompass highly competitive performance models in the literature. For example, the log-normal-ACD, Birnbaum-Saunders-ACD and Birnbaum-Saunders-t-ACD models; see Xu 2013XU Y. 2013. The lognormal autoregressive conditional duration (LNACD) model and a comparison with an alternative ACD models. Available at SSRN: https://ssrncom/abstract=2382159 doi:10.2139/ssrn.2382159.
https://ssrncom/abstract=2382159...
and Leiva et al. 2014LEIVA V, SAULO H, LEÃO J & MARCHANT C. 2014. A family of autoregressive conditional duration models applied to financial data. CSDA 79: 175-191.. The log-symmetric ACD models are written in terms of a conditional median duration rather than a conditional mean duration - the typical parameter used in the literature is the mean. The use of the median is more interesting because it is a measure of central tendency better than the mean, besides the median-based approach provides additional protection against outliers; see Saulo et al. 2019SAULO H, LEAO J, LEIVA V & AYKROYD RG. 2019. Birnbaum-Saunders autoregressive conditional duration models applied to high-frequency financial data. Stat Pap 60: 1605-1629.. On the other hand, the log-symmetric family provides a wide range of asymmetric distributions with HR with inverse bathtub shape. Therefore, characteristics (C4) and (C5) and aspects (A1) and (A2) are addressed by the log-symmetric ACD models.

The flexibility provided by the log-symmetric family makes its corresponding ACD models an important area to be explored in the literature. In this context, this paper deals with the problem of Bayesian inference for log-symmetric-ACD models. The estimation methodology is based on the Hamiltonian Monte Carlo (HMC) method, which generates chains both with little dependence and high probability of acceptance (Neal 2011NEAL RM. 2011. MCMC using Hamiltonian dynamics. In: Handbook of Markov Chain Monte Carlo. Vol. 2. p. 113-162. Boca Raton: Chapman and Hall-CRC Press.). The main advantage of log-symmetric ACD models is the robustness property of the median, namely, it is not affected by extremes or outliers. In terms of predictions, it implies that they will not be significantly affected by freak events; see Saulo et al. 2019SAULO H, LEAO J, LEIVA V & AYKROYD RG. 2019. Birnbaum-Saunders autoregressive conditional duration models applied to high-frequency financial data. Stat Pap 60: 1605-1629..

Log-symmetric distributions

A continuous and positive random variable X follows a log-symmetric distribution if its PDF is given by

fX(x;θ,ϕ,g())={1ϕxg(a2(x)),x>0,0,otherwise,(1)
where a(x)=log([x/θ]1/ϕ), θ>0 is a scale parameter, ϕ is a power parameter and g is a density generator with g(u)>0 for u>0 and 0u1/2g(u)du=1. Note that g may involve an extra parameter ξ or an extra parameter vector 𝛏. In this case, the notation XLS(θ,ϕ,g) is used. For example, in order to obtain a random variable X following a log-normal, log-Student-t (having ξ degrees of freedom), log-Laplace or log-slash distribution, we use, respectively, g(u)exp(12u), g(u)(1+uξ)ξ+12, g(u)exp(12u12), g(u)IGF(ξ+12,u2), where IGF(a,x)=1xa0xexp(t)ta1dt is the incomplete gamma function for a>0 and x0. The quantile function of the log-symmetric distribution is given by
tX(q;θ,ϕ,g())=θexp[ϕvξ(q)],(2)
where vξ(q) is the q×100th quantile of V=(Yμ)ϕS(μ=0,ϕ=1,g()), with the notation S referring to symmetrical distributions.

Some statistical properties related to a random variable X following a log-symmetric distribution, namely XLS(θ,ϕ,g), are: (P1) X=(Xθ)1/ϕLS(θ=1,ϕ=1,g) is standard log-symmetric distributed; (P2) cXLS(cθ,ϕ,g), with c>0; (P3) XcLS(θc,c2ϕ,g), with c0; (P4) θ is the the median of the distribution of X; and (P5) setting Y=log(X); (Vanegas & Paula 2015VANEGAS LH & PAULA GA. 2015. A semiparametric approach for joint modeling of median and skewness. Test 24(1): 110-135.). With this, we obtain a symmetric random variable whose distribution belongs to the symmetric class with PDF given by

fY(y;μ,ϕ,g())=1ϕg((yμ)2ϕ),y,(3)
where μ=log(θ) is a location parameter, ϕ>0 is a dispersion parameter and g is as in (2); see Fang et al. 1990FANG KT, KOTZ S & NG KW. 1990. Symmetric Multivariate and Related Distributions. London, UK: Chapman and Hall. and we write YS(μ,ϕ,g). The properties (P2) and (P3) say that the log-symmetric distribution is closed under scale and reciprocal transformations, respectively.

Organization of the paper

The rest of this paper proceeds as follows. The log-symmetric ACD models are formulated in Section LOG-SYMMETRIC ACD MODELS, with parameters estimated by the Bayesian approach using HMC algorithm. In Section NUMERICAL EVALUATION we present a numerical evaluation of the proposed model considering the (i) evaluation of this model via Monte Carlo (MC) simulations and; (ii) application of a real-world high-frequency financial data. Some concluding remarks and possible future research are mentioned in Section CONCLUDING REMARKS.

LOG-SYMMETRIC ACD MODELS

Consider a sequence of successive times T1,,Tn at which market events, or trades, occur. Then, the duration or time elapsed between Xi and Xi1, for i=1,,n, is given by Xi=TiTi1. The family of log-symmetric ACD models introduced by Saulo & Leao 2017SAULO H & LEAO. 2017. On log-symmetric duration models applied to high frequency financial data. Econ Bull 37: 1089-1097. considers a dynamic point process in terms of a conditional median duration

θi=tX(0.5;θi,ϕi,g),
where tX() is the quantile function (QF) of the log-symmetric distribution presented in (2) and Ωi1 is an information set including all information available until time Ti1. The log-symmetric ACD model is defined by
Xi=θiϵiϕi,i=1,,n,(4)
where {ϵi} are independent identically distributed (IID) random variables following a log-symmetric distribution with median and power equal to one, denoted by ϵ iIIDLS(1,1,g), and θi and ϕi are the median and skewness of the Xi distribution, respectively, as XiINDLS(θ i,ϕ i,g), then the Xis are independent (IND) not identically distributed. The linear form of (4) is given by
log(Xi)Yi=log(θi)μi+ϕilog(ϵi)εi,i=1,,n,(5)
where ε iINDS(0,1,g()), namely {εi}, are IID random variables following a standard symmetric distribution with PDF given (3). Then, we write YiINDS(μ i,ϕ i,g).

The component θi in (4) is defined in terms of autoregressive (AR) and moving average (MA) processes, of order p and q respectively, as

log(θi)=ϖ+j=1pαjlog(θij)+j=1qβj(Xijθij),(6)
where ϖ>0, αj0 and βj0. Then, the notation LSACD(p,q) is used. The order of lags for LSACD(p,q) models, in general, are set as p=1 and q=1, because a higher order does not improve the model fit; see Bhatti 2010BHATTI C. 2010. The Birnbaum-Saunders autoregressive conditional duration model. Math Comput Simul 80: 2062-2078.. Thereby, in the following, any LSACD(p=1,q=1) model is simply denoted as LSACD. Moreover, for simplicity’s sake, it is assumed that ϕi=ϕ, for i=1,,n.

Bayesian inference

The Bayesian inference is based on the Bayes theorem

π(θ,ϕiX)fX|θ,ϕ(xi|θi,ϕi;g())π(θ,ϕi),
where π(θ,ϕiX) is the posterior distribution and π(θ,ϕi) is the prior distribution. We will use the following estimator
θ̂=𝔼θy[θ],(7)
which minimizes the expected squared error of the estimate. The Equation (7) is analytically intractable. Therefore, we adopt HMC sampling strategies for obtaining samples from the joint posterior distributions and adopt the sample mean of the HMC simulation as an estimator of 𝔼θy[θ]. In the next subsection, we present the HMC methodology.

The HMC is a method which combines alternately Gibbs updates and Metropolis ones and avoids the random walk behavior. The main advantage of using HMC as opposed to other methods is to generate chains both with little dependence and high probability of acceptance (Neal 2011NEAL RM. 2011. MCMC using Hamiltonian dynamics. In: Handbook of Markov Chain Monte Carlo. Vol. 2. p. 113-162. Boca Raton: Chapman and Hall-CRC Press.), especially when compared to the Metropolis-Hastings algorithm. The HMC method is implemented in the R package rstan; see www.R-project.org and R-Team 2019R-TEAM. 2019. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Austria: Vienna. URL www.R-project.org.
www.R-project.org...
. Consider a random vector θk as position variables (parameters) and rrRk an independent auxiliary random vector, with rrNk(0,M). The joint PDF of (θ θ ,rr) is given by

π (θ θ ,rr)exp(H(θ θ ,rr)),(8)
where H(θ θ ,rr)=U(θ θ )+K(rr) is a Hamiltonian function, U(θ)=log[π(θy)π(θ)] is called the potential energy and K(rr)=rrM1rr is called the kinetic energy.

This method considers a candidate to (θ θ ,rr) which is generated in two stages before being subjected to a Metropolis acceptance step. These stages are presented in Algorithm 1

NUMERICAL EVALUATION

In this section, we carry out a simulation study to evaluate the performance of the Bayesian estimators of some log-symmetric ACD models. Then, we illustrate the proposed methodology by applying it to a real-world high-frequency financial data set. This data set refers to price durations of BASF-SE stock on 19th April 2016 downloaded from the Dukascopy site (www.dukascopy.com). We consider the ACD models based on the following log-symmetric distributions: log-normal (LNACD), log-Laplace (LLACD), log-Student-t (LtACD) and log-slash (LSACD). To estimate the parameters of the model, we used the HMC algorithm using rstan in the R software. In Stan’s programming language the convergence of a Markov chain to a stationary distribution by no-U-turn sampler (NUTS) which is even more efficient at exploring the posterior; see Hoffman & Gelman 2014HOFFMAN MD & GELMAN A. 2014. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J Mach Learn Res 15(1): 1593-1623.. The interested reader in Stan programming language and HMC algorithm is referred to Hoffman & Gelman 2014HOFFMAN MD & GELMAN A. 2014. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J Mach Learn Res 15(1): 1593-1623. and Carpenter et al. 2017CARPENTER B, GELMAN A, HOFFMAN MD, LEE D, GOODRICH B, BETANCOURT M, BRUBAKER M, GUO J, LI P & RIDDELL A. 2017. Stan: A probabilistic programming language. J Stat Softw 76(1)..

A simulation study

The scenario considers: sample size n{500,2000,5000}, vector of true parameters (ω,α,β,ϕ)={0.10, 0.90, 0.10,0.50} for LNACD and LLACD models; (ω,α,β,ϕ,ξ)={0.10, 0.90, 0.10,4.00} for LtACD and LSACD models, and the following independent prior distributions: ωN(0,100)𝕀(ω>0); αN(0,100)𝕀(0α1); βN(0,100)𝕀(0β1); μN(0,100); ξN(0,100)𝕀(ξ>2) if ϵtlog-tξ; and ξN(0,100)𝕀(ξ>1) if ϵtlog-slashξ.

For each value of the parameter and sample size, we report the empirical values for the mean, median, standard deviation (SD) and the percentage of data sets where the true parameter value was contained inside the Bayesian 90% and 95% credible intervals: this is the MC estimate of frequentist coverage for an interval estimator. The estimates computed by the Bayesian approach are presented by Tables I and II. As the sample size increases, the Bayesian estimators become more efficient. Therefore, in general, all of these results show the good performance of the Bayesian estimators of the corresponding parameters.

Table I
Summary statistics from simulated LNACD and LLACD data for the indicated estimators and 𝐧.
Table II
Summary statistics from simulated L𝐭ACD and LSACD data for the indicated estimators and 𝐧.

Analysis of high-frequency financial transaction data

We now illustrate the log-symmetric ACD models by analyzing the BASF-SE data set. A data adjustment is necessary due to the fact that these data often exhibit certain diurnal patterns; see Tsay 2002TSAY RS. 2002. Analysis of Financial Time Series. New Jersey, US: Wiley.. We used the R package ACDm see Belfrage 2015BELFRAGE M. 2015. R package ACDm: Tools for autoregressive conditional duration model. https://cran.r-project.org/web/packages/ACDm.
https://cran.r-project.org/web/packages/...
to perform the diurnal adjustment of the BASF-SE data. We simulated two chains with 1500 iterations and discarded the first 500 as burn-in. Table III presents some descriptive statistics for the BASF-SE data set, including central tendency statistics, standard deviation (SD), coefficient of variation (CV), skewness (CS) and kurtosis (CK). From Table III, we observe the right skewed nature and high kurtosis level of the data distribution. The skewness is ratified by the histogram showed by Figure 1(a).

Figure 1
Histogram (left), TTT plot (center) and boxplots (right) for the BASF-SE data.
Table III
Summary statistics for the BASF-SE data.

A tool to characterize the shape of a hazard rate is the scaled total time on test (TTT) function; see Aarset 1987AARSET M. 1987. How to identify a bathtub hazard rate. IEEE Transactions on Reliability 36: 106-108.. The hazard rate of a random variable X is defined by h(x)=f(x)/[1F(x)], where f and F are the PDF and cumulative distribution function (CDF) of X. The scaled TTT function is defined by W(u)=H1(u)/H1(1), for 0u1, where H1(u)=0F1(u)[1F(y)]dy, with F1 being the inverse function of the CDF of X. An approximation for W is obtained by plotting the points [k/n,Wn(k/n)], with

Wn(k/n)=i=1kx(i)+[nk]xki=1nx(i),k=1,,n,
and x(i) being the ith observed order statistic. Figure 1(b) suggests that the hazard rate for the BASF-SE data set is inverse-bathtub-shaped, as expected; see Leiva et al. 2014LEIVA V, SAULO H, LEÃO J & MARCHANT C. 2014. A family of autoregressive conditional duration models applied to financial data. CSDA 79: 175-191..

Figure 1(c) shows the usual and adjusted box-plots, where the latter is useful in cases when the data follow a skew distribution, since a significant number of observations can be classified as atypical when they are not; see Hubert & Vanderveeken 2008HUBERT M & VANDERVEEKEN S. 2008. Outlier detection for skewed data. J Chemom 22: 235-246.. From Figure 1(c), we note that potential outliers considered by the usual box-plot are not influential when we observe the adjusted box-plot.

Table IV reports the Bayesian estimates of the LNACD, LtACD, LLACD and LSACD model parameters (with estimated standard errors in parentheses). Furthermore, we report the Expected Bayesian Information Criterion (EBIC), Deviance Information Criterion (DIC), Watanabe-Akaike Information Criterion (WAIC) and Bayesian Information Criterion (BIC); see Akaike 1992AKAIKE H. 1992. Information theory and an extension of the maximum likelihood principle. In: Breakthroughs in statistics, Springer, p. 610-624., Schwarz et al. 1978SCHWARZ G ET AL. 1978. Estimating the dimension of a model. The annals of statistics 6(2): 461-464., Watanabe 2010WATANABE S. 2010. Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. J Mach Learn Res 11: 3571-3594. and Spiegelhalter et al. 2002SPIEGELHALTER DJ, BEST NG, CARLIN BP & VAN DER LINDE A. 2002. Bayesian measures of model complexity and fit. J R Stat Soc: Series B (Stat Methodol) 64(4): 583-639..

Table IV
Estimates (with SE in parentheses) for fit to the BASF-SE data.

From Table IV, note that the LNACD model provides the better adjustment compared to the other models based on the values of EBIC, DIC, WAIC and BIC. Table V reports the estimate of effective sample size and R statistic of Gelman et al. 1992GELMAN A, RUBIN DB ET AL. 1992. Inference from iterative simulation using multiple sequences. Stat Sci 7(4): 457-472., we observed that the generated chains were efficient. Figure 2 shows the sample autocorrelation function (ACF) and trace plots for the parameters from LNACD model. From this figure, we can notice the absence of autocorrelation and that the chains converge to their stationary distributions also observed by the statistic R of the Table V.


Trace plots and ACF for the LNACD model.
Table V
Estimate of effective sample size and R statistic.

CONCLUDING REMARKS

We have discussed Bayesian inference for log-symmetric autoregressive conditional duration models, which are based on the conditional median duration. We have employed Hamiltonian Monte Carlo sampling strategies to obtain the estimates of the model parameters. A Monte Carlo simulation study was carried out to evaluate the behavior of the Bayesian estimates of the corresponding parameters. We have applied the proposed models to a real-world data set of financial transactions from the German DAX stock exchange.

As part of future research, it would be of interest to propose an outlier detection procedure to detect and estimate outlier effects for these models; see Chiang & Wang 2012CHIANG MH & WANG LM. 2012. Additive outlier detection and estimation for the logarithmic autoregressive conditional duration model. Commun Stat: Simul Comput 41: 287-301.. Also, influence diagnostic tools can be extended to log-symmetric autoregressive conditional duration models; see Leiva et al. 2014LEIVA V, SAULO H, LEÃO J & MARCHANT C. 2014. A family of autoregressive conditional duration models applied to financial data. CSDA 79: 175-191.. Work on these issues is currently in progress and we hope to report some findings in a future paper.

ACKNOWLEDGMENTS

The research was partially supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) grants from the Brazilian government and by Fundação de Amparo `a Pesquisa do Estado do Amazonas (FAPEAM) grants from the government of the state of Amazonas, Brazil.

REFERENCES

  • AARSET M. 1987. How to identify a bathtub hazard rate. IEEE Transactions on Reliability 36: 106-108.
  • AKAIKE H. 1992. Information theory and an extension of the maximum likelihood principle. In: Breakthroughs in statistics, Springer, p. 610-624.
  • BAUWENS L & GIOT P. 2000. The logarithmic ACD model: An application to the bid-ask quote process of three NYSE stocks. Ann Econ Stat 60: 117-149.
  • BELFRAGE M. 2015. R package ACDm: Tools for autoregressive conditional duration model. https://cran.r-project.org/web/packages/ACDm
    » https://cran.r-project.org/web/packages/ACDm
  • BHATTI C. 2010. The Birnbaum-Saunders autoregressive conditional duration model. Math Comput Simul 80: 2062-2078.
  • BHOGAL SK & VARIYAM RT. 2019. Conditional duration models for high-frequency data: A review on recent developments. J Econ Surv 33(1): 252-273.
  • BIRNBAUM ZW & SAUNDERS SC. 1969. A new family of life distributions. J Appl Probab 6: 319-327.
  • CARPENTER B, GELMAN A, HOFFMAN MD, LEE D, GOODRICH B, BETANCOURT M, BRUBAKER M, GUO J, LI P & RIDDELL A. 2017. Stan: A probabilistic programming language. J Stat Softw 76(1).
  • CHIANG MH. 2007. A smooth transition autoregressive conditional duration model. Stud Nonlinear Dyn Econom 11: 108-144.
  • CHIANG MH & WANG LM. 2012. Additive outlier detection and estimation for the logarithmic autoregressive conditional duration model. Commun Stat: Simul Comput 41: 287-301.
  • CROW EL & SHIMIZU K. 1988. Lognormal Distributions: Theory and Applications. New York, US: Dekker.
  • DIANA T. 2015. Measuring the impact of traffic flow management on interarrival duration: An application of autoregressive conditional duration. J Air Transp Manag 42: 219-225.
  • DÍAZ-GARCÍA J & LEIVA V. 2005. A new family of life distributions based on elliptically contoured distributions. J Stat Plan Inference 128: 445-457.
  • DIONNE G, PACURAR M & ZHOU X. 2015. Liquidity-adjusted intraday value at risk modeling and risk management: An application to data from Deutsche Börse. J Bank Finance 59: 202-219.
  • ENGLE R & RUSSELL J. 1998. Autoregressive conditional duration: A new method for irregularly spaced transaction data. Econometrica 66: 1127-1162.
  • FANG KT, KOTZ S & NG KW. 1990. Symmetric Multivariate and Related Distributions. London, UK: Chapman and Hall.
  • GELMAN A, RUBIN DB ET AL. 1992. Inference from iterative simulation using multiple sequences. Stat Sci 7(4): 457-472.
  • GRAMMIG J & MAURER K. 2000. Non-monotonic hazard functions and the autoregressive conditional duration model. J Econom 3: 16-38.
  • HOFFMAN MD & GELMAN A. 2014. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J Mach Learn Res 15(1): 1593-1623.
  • HUBERT M & VANDERVEEKEN S. 2008. Outlier detection for skewed data. J Chemom 22: 235-246.
  • JOHNSON N, KOTZ S & BALAKRISHNAN N. 1994. Continuous Univariate Distributions. Vol. 1. New York, US: Wiley.
  • JOHNSON N, KOTZ S & BALAKRISHNAN N. 1995. Continuous Univariate Distributions. Vol. 2. New York, US: Wiley.
  • JONES MC. 2008. On reciprocal symmetry. J Stat Plan Inference 138: 3039-3043.
  • LEIVA V, SAULO H, LEÃO J & MARCHANT C. 2014. A family of autoregressive conditional duration models applied to financial data. CSDA 79: 175-191.
  • LIU S & HEYDE CC. 2008. On estimation in conditional heteroskedastic time series models under non-normal distributions. Stat Pap 49: 455-469.
  • MARSHALL A & OLKIN I. 2007. Life Distributions. New York, US: Springer.
  • MEDEIROS FMC & FERRARI SLP. 2017. Small-sample testing inference in symmetric and log-symmetric linear regression models. Stat Neerl 71(3): 200-224.
  • MEITZ M & TERASVIRTA T. 2006. Evaluating models of autoregressive conditional duration. J Bus Econ Stat 24: 104-124.
  • MISHRA A & RAMANATHAN TV. 2017. Nonstationary autoregressive conditional duration models. Stud Nonlinear Dyn Econom 21(4): 1-22.
  • NEAL RM. 2011. MCMC using Hamiltonian dynamics. In: Handbook of Markov Chain Monte Carlo. Vol. 2. p. 113-162. Boca Raton: Chapman and Hall-CRC Press.
  • PACURAR M. 2008. Autoregressive conditional durations models in finance: A survey of the theoretical and empirical literature. J Econ Surv 22: 711-751.
  • R-TEAM. 2019. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Austria: Vienna. URL www.R-project.org
    » www.R-project.org
  • RIECK J & NEDELMAN J. 1991. A log-linear model for the Birnbaum-Saunders distribution. Technometrics 3: 51-60.
  • SAULO H & LEAO. 2017. On log-symmetric duration models applied to high frequency financial data. Econ Bull 37: 1089-1097.
  • SAULO H, LEAO J, LEIVA V & AYKROYD RG. 2019. Birnbaum-Saunders autoregressive conditional duration models applied to high-frequency financial data. Stat Pap 60: 1605-1629.
  • SCHWARZ G ET AL. 1978. Estimating the dimension of a model. The annals of statistics 6(2): 461-464.
  • SPIEGELHALTER DJ, BEST NG, CARLIN BP & VAN DER LINDE A. 2002. Bayesian measures of model complexity and fit. J R Stat Soc: Series B (Stat Methodol) 64(4): 583-639.
  • TSAY RS. 2002. Analysis of Financial Time Series. New Jersey, US: Wiley.
  • VANEGAS L & PAULA G. 2016a. An extension of log-symmetric regression models: R codes and applications. J Stat Comput Simul 86: 1709-1735.
  • VANEGAS L & PAULA G. 2016b. Log-symmetric distributions: statistical properties and parameter estimation. Braz J Probab Stat 30: 196-220.
  • VANEGAS L & PAULA G. 2016c. Log-symmetric regression models under the presence of non-informative left-or right-censored observations. Test, p. 1-24.
  • VANEGAS LH & PAULA GA. 2015. A semiparametric approach for joint modeling of median and skewness. Test 24(1): 110-135.
  • VENTURA M, SAULO H, LEIVA V & MONSUETO SE. 2019. Log-symmetric regression models: information criteria and application to movie business and industry data. Appl Stoch Models Bus Ind 35: 963-977.
  • WATANABE S. 2010. Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. J Mach Learn Res 11: 3571-3594.
  • XU Y. 2013. The lognormal autoregressive conditional duration (LNACD) model and a comparison with an alternative ACD models. Available at SSRN: https://ssrncom/abstract=2382159 doi:10.2139/ssrn.2382159.
    » https://ssrncom/abstract=2382159
  • ZHENG Y, LI Y & LI G. 2016. On Fréchet autoregressive conditional duration models. J Stat Plan Inference 175: 51-66.

Publication Dates

  • Publication in this collection
    20 Oct 2021
  • Date of issue
    2021

History

  • Received
    19 Mar 2019
  • Accepted
    18 July 2019
Academia Brasileira de Ciências Rua Anfilófio de Carvalho, 29, 3º andar, 20030-060 Rio de Janeiro RJ Brasil, Tel: +55 21 3907-8100, CLOCKSS system has permission to ingest, preserve, and serve this Archival Unit - Rio de Janeiro - RJ - Brazil
E-mail: aabc@abc.org.br