The Beta-Binomial Multivariate Model for Correlated Categorical Data

Nusar Hajarisman, Asep Saefuddin

Abstract


Over the past year, a significant amount of research has explored the logistic regression models for
analyzing correlated categorical data. In these models, it is assumed that the data occur in clusters,
where individuals within each cluster are correlated, but individuals from different clusters are
assumed independent. A commonly used in modeling correlated categorical univariate data is to
assume that individual counts are generated from a Binomial distribution, with probabilities vary
between individuals according to a Beta distribution. The marginal distribution of the counts is then
Beta-Binomial. In this paper, a generalization of the model is made allowing the number of
respondent m, to be random. Thus both the number units m, and the underlying probability vector
are allowed to vary. We proposed the model for correlated categorical data, which is generalized to
account for extra variation by allowing the vectors of proportions to vary according to a Dirichlet
distribution. The model is a mixture distribution of multinomial and Dirichlet distribution, and we
call the model as the beta-binomial multivariate model.



DOI: https://doi.org/10.29313/jstat.v8i1.976

Refbacks

  • There are currently no refbacks.


Copyright Notice

Creative Commons License
STATISTIKA is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License