THE PERFORMANCE OF INTEGER-VALUED AUTO-REGRESSIVE (INAR) MODEL IN ZERO INFLATED DATA
DOI:
https://doi.org/10.33003/fjs-2026-1001-4325Keywords:
Data, Fitting, Order, Valued, ZeroAbstract
Time series count data frequently exhibits zero inflation and even heavy-tailedness in practical applications. Many models have been proposed for modelling count data, but heavy-tailedness is less considered. The effect of excess zeros on time series count data cannot be disregarded. Thus, there is a need for a model that would cater for excess zeros in the time series data. The proposed model, a new integer-valued autoregressive process, is expected to capable of capturing these features. This study therefore investigates the effectiveness of Integer-Valued Autoregressive (INAR) models in handling time series count data at different proportions of excess zeros, determine the predictive ability of INAR models at different steps ahead and compare its performance with orders of model {INAR (1), INAR (2), INAR (3) and INAR (4)} being used for the data. The effects of sample sizes on the performance of the models were also studied through simulation. At every sample size, the best status of the orders p, where p = 1, 2, 3, 4 are respectively determined for 20%, 30% and 40% proportions of the excess zeros using information criteria AIC, BIC and HQIC. Forecast accuracy was assessed using the Thiel’s U statistic, where lower values indicate better performance. INAR (3) achieved the lowest AIC, BIC and HQIC values across most scenarios indicating a strong model fit and is suggested for use in fitting any time series of count to the underlying features given in this dissertation.
References
Akaike, H. (1973). Maximum likelihood identification of Gaussian auto-regressive moving-average models.” Biometrika, 60, 255–266
Al-Osh, M.A. and Alzaid, A.A. (1987). First-order integer-valued autoregressive INAR (1) process, of Time Series Analysis, 8, 261–275.
Cameron A.C. and Trivedi PK (2013) Regression analysis of count data.2nd ed. Cambridge University Press.
Harvey, A., & Fernandes, C. (1989). ‘’Time Series Models for Count or Qualitative Observations’’. Journal of Business & Economic Statistics, 7(4), 407-417.
Lambert, D. (1992). “Zero-inflated Poisson regression, with an application to defects in manufacturing”. Technometrics, 34(1),1–14.
Maria Eduarda Silva (2015). ‘’Modelling Time Series of Counts. ’An INAR Approach. Textos De Matematica, Dmuc, 47 (2015), 107-122.
Martin Ridout, Clarice G.B. Dem´etrio and John Hinde (1998). Models for count data with many zeros. International Biometric Conference Cape Town, December 1998.
Mckenzie, B., M, G., (1985). Bayesian predictions of low count time series. International Journal of Forecasting, 21, 315 – 330.
Ndwiga A. Macharia1, Oscar Ngesa, Anthony Wanjoya and Damaris FelistusMulwa (2019). “Comparison of Statistical Models in Modelling Over Dispersed Count Data with Excess Zeros. ’International Journal of Research and Innovation in Applied Science (IJRIAS) | Volume IV, Issue V, 80-90.
Phang, Y. N. and Loh E. F (2023). Zero Inflated Models for Overdispersed Count Data. International Journal of Mathematical, Computational, Physical and Quantum Engineering, 7:8
Saleh, I.M and. Nweze, N. O. (2021). Model Selection for Time Series Count Data with Over-Dispersion. Asian Journal of Probability and Statistics. 14(2): 60-73
Saleh I. M., Akeyede I., Abubakar A. M. and Sulaiman A. I. (2024): Exploring The INAR Model On Heavy Tailed Time Series Data with Outliers. FULafia Journal of Science & Technology, Vol. 8, No. 1
Schwarz G. (1978) “Estimating the Dimension of a Model. Annals of Statistics. 6(2), 461-464.
Hannan, E. J., & Quinn, B. G. (1979). “The Determination of the order of an Autoregression. Journal of the Royal Statistical Society: Series B (Methodological), 41(2), 190-195.
Silva, I.; Silva, M.E.; Pereira, I. and Silva, N. (2005). Replicated INAR (1) Process, Methodology and Computing in Applied Probability, 7, 517–542.
Steutel, F.W. and Van Harn, K. (1979). Discrete analogues of self-decomposability and stability, The Annals of Probability, 5, 893–899.
Silva, I.; Silva, M.E.; Pereira, I. and Silva, N. (2005). Replicated INAR (1) Process, Methodology and Computing in Applied Probability, 7, 517–542.
Tawiah (2021), ‘’Zero-Inflated Time Series Modelling of COVID-19 Deaths in Ghana.’’Hindawi Journal of Environmental and Public Health Volume 2021, Article ID 5543977, 9 pages https://doi.org/10.1155/2021/5543977
Theil, H. (1966). Applied Economic Forecasting. Chicago, Rand McNally.
Xekalaki E. (2014). On the distribution theory of over-dispersion. Article in Journal of Statistical Distributions and Applications • pg-2-22. DOI:10.1186/s40488-014-0019-z
Aleksandrov, B., & Weiss, C. H. (2024). Model selection for count time series: A focus on information criteria and residual analysis. Statistical Papers.
Chen, Y., Kang, J., & Wang, X. (2022). Bayesian inference for zero-inflated negative binomial INAR(1) models with application to infectious disease data. Journal of Applied Statistics.
Karlis, D., & Tsiamyrtzis, P. (2024). On the use of thinning-based models for integer-valued time series with excess zeros. Statistics and Computing.
Liboschik, T., Kerschke, P., & Fried, R. (2023). Modeling zero-inflated and overdispersed time series of counts: A new R package for generalized linear autoregressive models. Journal of Statistical Software.
Weiß, C. H. (2023). Flexible count time series models: A review and benchmark. Wiley Interdisciplinary Reviews: Computational Statistics
Downloads
Published
Issue
Section
Categories
License
Copyright (c) 2026 ibrahim Modu, Dr. H. A. Chamalwa, Prof. H. R. bakari, A. U. Abubakar

This work is licensed under a Creative Commons Attribution 4.0 International License.