November 2024 Generally Altered, Inflated, Truncated and Deflated Regression
Thomas W. Yee, Chenchen Ma
Author Affiliations +
Statist. Sci. 39(4): 568-588 (November 2024). DOI: 10.1214/24-STS925

Abstract

Models such as the zero-inflated and zero-altered Poisson and zero-truncated binomial are now well established. We review important ideas behind the three operators involved (as well as deflation and modification) and other incremental extensions. We propose a super mixture model that unifies alteration, inflation, truncation and deflation for counts, given a 1- or 2-parameter parent or base distribution. Since all of the operators except for truncation have both parametric and nonparametric variants, it is necessary to consider a total of seven different scenarios. Highlights of this paper include the following: (i) the mixture distribution is exceeding flexible, accommodating up to seven modes; (ii) under-, equi- and over-dispersion can be handled using a negative binomial (NB) parent, with underdispersion handled by a novel Generally-Truncated-Expansion method; (iii) under- and over-dispersion are studied holistically in terms of the the four operators previously referred to; (iv) while generally-altered regression explains why observations are there, generally-inflated regression accounts for why they are there in excess, and generally-deflated regression explains why observations are not there. The important application of heaped and seeped data from retrospective self-reported surveys is briefly mentioned. The GAITD-NB has potential to become a Swiss army knife for analyzing count responses.

Acknowledgments

We thank Rolf Turner for proof-reading, Paul Murrell and Simon Urbanek for help with the figures, Theodora Ge Jin for support, and delegates of the Multivariate Count Analysis workshop held at Besançon, France, in July 2018 for helpful feedback. CM was supported by a 2018 University of Auckland Northern Hemisphere Summer Research Scholarship while she was a student at Zhejiang University. Comments from two reviewers were very helpful for making improvements to an earlier draft.

Citation

Download Citation

Thomas W. Yee. Chenchen Ma. "Generally Altered, Inflated, Truncated and Deflated Regression." Statist. Sci. 39 (4) 568 - 588, November 2024. https://doi.org/10.1214/24-STS925

Information

Published: November 2024
First available in Project Euclid: 30 October 2024

Digital Object Identifier: 10.1214/24-STS925

Keywords: finite mixture distribution , multinomial logit model , overdispersion and underdispersion , vector generalized linear model

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.39 • No. 4 • November 2024
Back to Top