On Designing a New Control Chart Using the Generalized Conway–Maxwell–Poisson Distribution to Monitor Count Data. (2024)

Link/Page Citation

Author(s): Fakhar Mustafa (corresponding author) [1,2,*]; Rehan Ahmad Khan Sherwani [1]; Muhammad Ali Raza [3]; Jumanah Ahmed Darwish [4]

1. Introduction

Regarding several processes related to engineering, healthcare, and manufacturing fields, researchers have theoretical questions involving count data as a response variable. Count data refers to data that consist of discrete values and shows how often an event happened in a certain period of time. Count data takes on positive or zero value only. Those processes that involve count data as a response variable require particular attention to monitor certain features that are associated with such processes. Usually, monitoring of such processes is conducted using a CC. A CC helps regulate, improve, and enhance the process’s efficiency. Salient features of particular interest relating to count data for researchers are equi-dispersion, under-dispersion (UD), over-dispersion (OD), and ZI. As a standard, Poisson distribution-based CCs are widely used to evaluate and monitor distinctive features of count data. Despite their wide use, these charts are not suitable for monitoring count data that have different levels of dispersion—UD or OD—because they rely on the Poisson distribution, which assumes equi-dispersion [1,2,3].

To overcome the problem associated with Poisson distribution, many researchers proposed different CCs to monitor UD, OD, and ZI in count data by utilizing other probability distributions. For example, Famoye [4] suggested control charts for the total and average number of events based on the shifted-generalized Poisson distribution, which can monitor UD or OD count data. Fang [5] applied the Katz family of distribution to monitor equi-dispersion, UD, and OD in the count data. He et al. [6] designed CCs using a generalized Poisson distribution for OD count data. Xie M [7] examined the usefulness of the zero-inflated Poisson (ZIP) distribution and provided different methods to compare it with the Poisson model. He also recommended using an upper-sided Shewhart chart with probability limits to monitor ZI processes. Chen et al. [8] introduced a new charting method using the generalized ZIP distribution. Sellers [9] proposed a generalized CC using the Conway–Maxwell–Poisson (COMP) distribution for UD and OD count data. Saghir et al. [10] applied probability limits and exact k-sigma limits instead of 3-sigma limits for COMP CCs. Using the COMP distribution, Saghir and Lin [11] proposed three different CUSUM CCs. These CCs have the ability to detect shifts in the dispersion rate or in both parameters of COMP processes. Alevizakos and Koukouvinos [12] introduced a PM chart for the COMP distribution (CMP-PM) to monitor equi-dispersed, UD, and OD count data. Due to its ability and efficiency to model UD and OD count data, many researchers proposed CCs using COMP distribution using a different monitoring scheme, including Refs. [13,14,15,16,17]. Rakitzis and Castagliola [18] investigated the performance of different Shewhart-type CCs to monitor ZIP and ZI binomial processes. Ho et al. [19] explored the applicability of Touchard distribution through a Shewhart chart for monitoring different features of count data, including UD, OD, and ZI. Bourguignon et al. [20] studied the BerG distribution and proposed a CC for the monitoring process mean of count data based on BerG distribution. BerG distribution, a sum of Bernoulli and geometric random variables has been used to model both UD and OD count data. Boaventura et al. [21] used Bell distribution to monitor OD count data. Several studies have been conducted regarding monitoring different features of count data through CC, including Refs. [14,15,22,23,24,25]. However, all these mentioned CCs are based on such distributions in which the behavior of the count data is exponentially bounded (shorter-tailed (ST)).

With the recent developments and use of the latest technology in the fields of engineering, medicine, and manufacturing there are many processes where random outcomes that could be summarized as count data are not exponentially bounded (longer-tailed (LT)). So, it is imperative to propose an efficient CC that could monitor different vital features associated with count data distribution considering its tail behavior. Furthermore, in the field of Statistical Process Control (SPC), no widely accepted general model is available for monitoring LT count data. Motivated by the work of Sellers [9], Mustafa et al. [26], and Mustafa et al. [27], this study proposed a new CC to monitor count data considering the tail behavior using the GCOMP distribution as proposed by Imoto [28]. The GCOMP distribution is a three-parameter extension of conventional COMP distribution and can model the ST and LT behavior in the count data. Moreover, the GCOMP distribution models the ZI data without using the ZI property. In this research, considerable attention has been given to vital features of count data such as UD, OD, and ZI as provided by the GCOMP model.

The organization of this article is as follows: in Section 2 a brief description of the GCOMP distribution and design structures for the proposed CC to monitor the count data is provided. In Section 3, a simulation study to evaluate the performance of the proposed CC is conducted. Moreover, UD, OD, and ZI cases are considered while conducting simulation studies. Section 4 presents numerical and real-life examples that establish the effectiveness of the proposed CC for monitoring count data. Section 5 reviews the main results of the research study.

2. Materials and Methods

2.1. Generalized Conway-Maxwell-Poisson Distribution

It is imperative to model the observed counts using an appropriate distribution in statistical analysis. Reliance on the equi-dispersion assumption restricts the applicability of the Poisson distribution, which is frequently employed in count data modeling. As a result, Poisson distribution underperforms while modeling dispersed count data. Understanding dispersion is helpful in the selection of the pertinent distribution to model count data. When a dataset of counts exhibits more variability than would be predicted by a given statistical model, this is referred to as having OD in count data. The UD in count data refers to a situation in which the observed variability in a dataset of counts is lower than what would be anticipated based on a certain statistical model. The utility of the COMP distribution to model UD and OD count data has also been explored, making it as the preferred choice compared to conventional models [9,10,11,23,29,30]. In many experiential studies, understanding the behavior of the tail of the under-study probability distribution is fundamental. Generally, in distributions ST and LT behaviors are observed due to the short or long infinite decreasing parts of distributions, respectively. One significant deficiency associated with the COMP model is its inefficacy in considering the LT model. Imoto [28] proposed a new GCOMP distribution incorporating the negative binomial (NB) distribution as a distinct case to counter this. The GCOMP distribution became an LT model when the NB distribution was added, in contrast to the COMP distribution. Additionally, the ST behavior in the count data tends to be modeled by the GCOMP distribution. Furthermore, a bimodal distribution with a single mode at zero can be created from the GCOMP distribution.

Assume X to be a random variable and assume its distribution to be the GCOMP distribution, which has the following function and parameters:(1)P(X=x)=Gv+xrµx/x!Cr,v,µ,wherex=0,1,2,3,…., where v is the control parameter that controls the length of the tail of the distribution, r is a dispersion parameter, µ is a location parameter, and C(r,v,µ) is the normalizing constant. The distribution is defined for r<1,?>0 and µ > 0 or r=1,?>0 and 0<µ<1.

Also, the normalizing constant can be computed as:(2)C(r,v,µ)=?[sub.z=0][sup.8]Gv+zr/z!µ[sup.z], and converges at r<1 or r=0 and |µ|<1.

The GCOMP distribution extends the NB distribution when r equals 1. Likewise, when parameters µ,1-r, and ? approach 1, the GCOMP distribution simplifies to the COMP distribution, which encompasses Poisson, NB, and geometric distributions as specific cases. The range 0<r<1 characterizes OD, while r<0 indicates UD in count data. Comparatively, for ? greater than 1 and 0<r<1, the GCOMP distribution is more LT than the COMP distribution, whereas for ? greater than 1 and r less than 0, it demonstrates ST behavior. Figure 1 illustrates the GCOMP distribution’s behavior across parametric configurations.

The calculation of the normalizing constant is linked to the moments of the GCOMP distribution. Through the asymptotic approximation of C(r,v,µ), valid for r<1, Imoto [28] has obtained the approximate formula for moments. This process yields expressions for the expected value and variance as follows:(3)E(X)=µ? log Cr,?,µ/?µ˜µ[sup.11-r]+2?-1r/21-r, (4)Var(X)=µ? EX/?µ˜µ11-r/1-r.

The maximum likelihood estimation approach can be used to estimate parameters of the GCOMP distribution by utilizing the log likelihood function in the following manner:(5)l(r,v,µ|X)=log(?[sub.i=0][sup.n]G?+xµxi/xi! Cr,?,µ), where x[sub.i] is the observed frequency of the i events. Fisher’s scoring method can be used to solve the expression presented in (5). The numerical computation of log likelihood expression in (5) could be challenging due to statistical complexities such as precision issues with extremely small or large probabilities, high dimensionality, local optima, and numerical integration.

It can be observed that the GCOMP distribution depicts the behavior of unimodal distribution when a space of (2) is set as r<0 or r<1 and v>1. The GCOMP distribution also becomes a bimodal distribution when 0<v<1,0<r<1 and µv[sup.r]<1. One of the modes for bimodal distributions locates at zero. Because of this characteristic, the GCOMP distribution is more suited to simulate ZI count data. The ZI behavior of the GCOMP distribution is shown in Figure 2. Since it represents the ZI count data without using the ZI property, as other traditional ZI models do, the GCOMP distribution is noteworthy. The GCOMP distribution was shown to be more versatile than the COMP distribution due to its adaptability to dispersion, tail length, and significance to ZI.

2.2. Proposed G-Chart to Monitor Count Data

Assume that a univariate process at consecutive time points {X[sub.1],X[sub.2],X[sub.3],…,X[sub.n]} originates n independent observations, particularly quality measurements or surveillance characteristics from the GCOMP distribution. Also, assume that these observations have standard in-control GCOMP distribution with the process mean µ[sub.0]? log Cr,?,µ0/?µ0 and follow the GCOMP distribution with the shifted process mean µ[sub.1]? log Cr,?,µ1/?µ1 when the process becomes out of control. Our focus lies in monitoring the stability of the total number of counts under the GCOMP distribution in this study. For this purpose, Ls control limits are obtained using the established Shewhart’s methodology [31]. Control limits are set considering G=?i=1nX[sub.i] for the total number of counts at subgroup size n. Hence, the mean and variance of statistic G are provided as:(6)E(G)=n(µ? log Cr,?,µ/?µ);V(G)=n(µ? EX/?µ)

In Table 1, lower, central, and upper control limits (LCL, CL, UCL) for the proposed control chart are provided.

The control limits of the G-chart provided in Table 1 are approximately equal to the control limits of Q-chart (total number of counts chart) based on the COMP distribution proposed by Sellers [9] at µ,1-r and when ??1. Moreover, it also encompasses the Poisson distribution-based c-chart at v=1, and the geometric distribution-based g-chart at v=0 and µ<1.

3. Findings and Discussion

In this section, the results of a thorough simulation analysis are presented to assess the performance of the proposed chart. The primary goal is to assist practitioners in monitoring UD, OD, and ZI count data by assessing the influence of the CC using the GCOMP model. Average run length (ARL), which is one of the widely used measurements, is utilized to assess the effectiveness of the proposed CC. Before initiating simulations, it is imperative to determine the values of the chart multiplier (L) associated with the control limits of any chart. Usually, practitioners prefer a value of 3 for the chart multiplier. In order to ensure that control limits yield the pre-specified false alarm probability (a) for any given sample size, the value of L can be chosen carefully [10]. Therefore, in this study, L control limits are determined to achieve a=0.0027 through a Monte Carlo simulation using 10,000 iterations in R-language (version 4.1.1). The value of a=0.0027 is selected to yield a desired observation of in-control ARL (ARL[sub.0]), which is approximately equal to 370. The procedural flow chart describing the computation of CC multiplier L for the G-chart to achieve ARL[sub.0]˜370 is presented in Figure 3. Corresponding values of CC multiplier L for the G-chart to obtain ARL[sub.0]˜370 at different sample sizes and various parametric values are provided in Table 2. This study explored generating control limits and evaluating the performance of the proposed chart at v>1, which is relevant to note when examining tail behavior in UD and OD count data. The ZI in the count data is considered following the parametric setting of the GCOMP distribution mentioned in Section 2.

3.1. UD and OD Cases Considering Tail Behavior

The proposed G-chart performance is evaluated using out-of-control ARL (ARL[sub.1]) profiles at different values of µ,r, and v. The efficacy of the proposed chart is assessed on the smaller values of ARL[sub.1], which indicate the average number of samples needed to identify out-of-control signals that are well thought out due to the fact that unusual cause µ[sub.0] of the process may shift (d) to µ[sub.1]=(µ[sub.0]±d). To approximate the ARL[sub.1] profile, we again used the Monte Carlo approach and performed 10,000 iterations in R-language using sample sizes of n={3,15,50,100,300,1000} simulated from the GCOMP distribution. Over-dispersed and longer-tailed (OD-LT) (0<r<1,&v>1) as well as under-dispersed and shorter-tailed (UD-ST) (r<0,&v>1) cases are considered while reporting results [32]. As per the requirement for the shifted process, shifts in the µ[sub.0] are introduced as d={0.1,0.2,0.3,0.5} for the G-chart. The proposed G-chart is expected to perform better than the Q-chart based on the COMP distribution [10]. Therefore, the performance evaluation of the Q-chart is also conducted for comparison purposes. For a rationale comparison, the maximum likelihood estimated (MLE) values of the parameters (µ and r) of the COMP distribution are computed for simulated data. Furthermore, the Monte Carlo simulation study is conducted to determine the values of L for the Q-chart to achieve ARL[sub.0]˜370, and then the ARL[sub.1] profiles are obtained by introducing shifts in µ[sub.0]. In Table 3 and Table 4, the results of the ARL[sub.1] profiles for the G and Q charts, along with the values of L and valid MLEs for the Q-chart, are presented.

Table 3 shows that the G-chart performed efficiently in identifying out-of-control signals than the competitive Q-chart when the upward shift in the process is considered at different sample size n for the UD-ST model. However, the G-chart performs comparatively poorly for the downward shift compared to the Q-chart.

Table 4 shows that the G-chart performed efficiently in monitoring out-of-control signals than the Q-chart at different sample sizes n for the OD-LT model at both upward and downward shifts at most of the shifts.

The above discussion indicates that the proposed chart efficiently detects shifts in the processes when the tail behavior of the count data is considered. As expected, it is also observed in Table 3 and Table 4 that with an increase in n, the G-chart becomes more effective and sensitive in detecting out-of-control signals.

3.2. Zero-Inflation Case

Zero-inflation phenomena are observed in manufacturing, health care, and high-yield processes. The concept of zero inflation emerges frequently in the context of count data analysis. It refers to a phenomenon in which the observed data contain more zeros than a conventional count distribution would predict. Many probability distributions such as ZIP and ZINB utilize and embed ZI property to model ZI count data. The GCOMP distribution tends to model the ZI count data. It is essential to mention that the GCOMP distribution did not invoke the ZI property compared to other ZI probability distributions that model ZI data. Additionally, even though they are adept at modeling and observing ZI data, ZI models may not consistently offer a practical solution [33].

We studied the feature of the GCOMP distribution to model the ZI count data through the proposed chart and conducted a comparison with charts based on ZIP, ZINB, and ZICOMP distributions. We have simulated random samples of size n under the excessive zeros generating a parametric setting (0<r<1,0<v<1) of the GCOMP distribution. The ARL[sub.1] profiles for the G-chart are obtained at n[sub.1]=100,µ[sub.0]=1,v=0.05,r=0.3 (Data-I) and at n[sub.2]=100,µ[sub.0]=1.5,v=0.1,r=0.5 (Data-II). The ZIP, ZINB, and ZICOMP charts are modified considering the total number of counts (ZIP[sub.C], ZINB[sub.C], ZICOMP[sub.C]). It is important to note that the ZIP[sub.C] chart is based on the ZIP (µ,p[sub.zi]). The ZINB[sub.C] chart is based on the ZINB (µ, r, p[sub.zi]). And the ZICOMP[sub.C] is based on the ZICOMP (µ, r, p[sub.zi]). The ZI parameter, p[sub.zi], in the ZIP, ZINB, and ZICOMP distributions provides the additional probability thrust to the value 0. The ARL[sub.1] profiles of the ZIP[sub.C], ZINB[sub.C], and ZICOMP[sub.C] charts are computed following the MLE values of the parameters of the ZIP, ZINB, and ZICOMP distribution for Data-I and Data-II. For Data-I and Data-II, the Monte Carlo simulation study is conducted to determine the values of L for the G, ZIP[sub.C], ZINB[sub.C], and ZICOMP[sub.C] charts to achieve ARL[sub.0]˜370.

Here, upward shifts (µ[sub.0]+d) are monitored in the ZI count process, where d={0.1,0.3,0.5,0.7,1}. The results of all charts for detecting shifts in the ZI count data along with values of L and the estimated parameters for the ZIP[sub.C], ZINB[sub.C], and ZICOMP[sub.C] charts are reported in Table 5 and Table 6. It is evident from Table 5 and Table 6 that the GCOMP-based CC is more efficient in detecting out-of-control signals as compared to competitive CCs. Hence, this emphasizes the effectiveness of monitoring ZI data without using ZI property through the chart based on the GCOMP distribution.

4. Illustrative Examples

This section is based on the real-life and numerical applications containing some level of dispersion for the proposed G-chart.

4.1. OD-LT Count Data: Early Detection in COVID-19 Mortality Cases

In public health surveillance and healthcare monitoring, CCs are widely used [34] to help health and public authorities make vital decisions. During the ongoing COVID-19 pandemic, health authorities are keenly observing the trend of cases, mortalities, and recoveries to advise the public authorities in implementing appropriate interventions for the safety of the masses. Understanding the variation and behavior of pandemics can be facilitated by using CCs when the trend of cases and mortalities is shifting. The proposed CC was employed to monitor the counts of total number of deaths in a day due to COVID-19 in El Salvador during 8 June 2020, and 17 May 2021. The data are available on https://covid19.who.int/ (accessed on 1 August 2021) and are provided in Table 7. The data show OD with x¯=6.22 and s=11.15.

Furthermore, an assessment of serial correlation has been conducted to analyze the dependency among the occurrences of death over a given time span, a factor that is deemed unfavorable for the implementation of our suggested chart scheme. As a result, the autocorrelation function (ACF) in R-language was employed to evaluate the serial correlation of the COVID-19 death count data in El Salvador. Figure 4 presents the ACF values for the dataset with a one-day interval (lag). From the figure, it can be deduced that consecutive observations show negligible correlation, as the ACF values are close to zero, indicating minimal impact of lags.

Additionally, monitoring is conducted using the existing Q-chart [10] for comparative purposes. Table 8 provides the MLEs of the model parameters for the GCOMP and COMP distribution for targeted data along with negative log likelihood (LL), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) for fitting and comparison purposes. Table 8 makes it clear that, in comparison to the COMP distribution, the GCOMP distribution offers a better fit for the data based on the all-criterion values. The estimated parametric values of the GCOMP distribution in Table 7 confirm that the data also exhibit longer-tail behavior (v>1). In Figure 5 and Figure 6, we presented the G-chart and Q-chart for monitoring the total number of daily deaths in El Salvador to achieve ARL[sub.0]˜370. In Table 8, the control limits for G-chart and Q-chart are set considering the parametric values mentioned. It is observed that the proposed chart trigger signals (red dots) on the 40th, 59th, 61st, 209th, and 272nd days during the phase while the competitive chart trigger signals only on the 209th and 272nd days. As expected, the proposed chart can identify process variation more quickly than the existing chart when El Salvador’s death toll begins to rise (that could turn the situation out of control). This could help the relevant authorities to take appropriate and quick action, especially during the pandemic. Consequently, it is evident that, in comparison to the existing CC, the proposed chart encourages prompt action when specific cause variation signals a new phase more quickly when the data are OD-LT.

4.2. UD-ST Count Data: A Simulated Data

The GCOMP distribution tends to model the UD count data. It is important to mention that the GCOMP distribution displays ST behavior for the parametric setting, which is calibrated to achieve UD, as discussed in previous sections. In the second application, a simulated dataset is utilized, where the total number of events are generated through the GCOMP distribution. We have simulated n[sub.1]=100 observations with a parametric setting of µ[sub.0]=1,v=1.5, and r=-0.5, which will be utilized for Phase-I monitoring, and then simulated n[sub.2]=50 observations with v=1.5, r=-0.5, and µ[sub.1]=?µ[sub.0], where ?=(µ[sub.0]+0.5) for Phase-II monitoring. Table 9 displays both of the simulated datasets.

The G-chart is presented in Figure 7 for both Phase-I and Phase-II monitoring of the simulated data set (Phase-I for 1–100 observations and Phase-II for 101–150 observations, separated by dotted lines in the Figure) to yield ARL[sub.0]˜370. The control limits for G-chart are set at µ[sub.0]=1,v=1.5, r=-0.5, and L=3.200. When used for Phase-II monitoring, the proposed chart trigger alarms, but not for Phase-I monitoring, as would be expected.

4.3. ZI Count Data: Monitoring Number of Defective LEDs

In the third application, we consider the data set used by He et al. [35] to monitor the zero-inflation in the total number of defectives in LEDs within a batch. The dataset contains 100 observations each for Phase-I and Phase-II monitoring purposes. The datasets are accessible in Table 10. Due to some assignable causes, some values (in bold) are removed [36]. To determine if both datasets follow the GCOMP distribution, the score statistic test is utilized [37]. For this statistic, the null hypothesis yields an asymptotic ?[sup.2] distribution with one degree of freedom. The score statistic used is:(7)Z=(nz-nilo)2/nilo1-lo-nix¯(lo)2 where n[sub.z] is the total number of zero-value observations in the data; n[sub.i] is the total number of observations, l[sub.o]=e[sup.-p^], in which p^ is the estimated Poisson parameter under the null hypothesis; and x¯ is the average of the observations. For the Phase-I dataset, n[sub.i]=96, n[sub.z]=84, and p^=x¯=0.69; for the Phase-II dataset,n[sub.i]=100, n[sub.z]=79, and p^=x¯=1.37. The ?[sup.2]-value for the Phase-I dataset at 1 degree of freedom is 177.16 with a near-zero p-value; for the Phase-II dataset, it is 284.12 at 1 degree of freedom, and also with a near-zero p-value. It can be inferred that the GCOMP distribution is followed by both Phase-I and Phase-II datasets. The valid MLEs of the GCOMP distribution for the LED dataset are µ^[sub.0]=0.4679,v^=0.000045, and r^=2.2705. Figure 8 shows the monitoring of the defective LED data through the G-chart to yield ARL[sub.0]˜370. The control limits for the G-chart are set at L=3.00 and at valid MLE values of the GCOMP distribution for the LED dataset. During Phase-I monitoring, it was found that all 96 observations are in control. Our proposed chart detects a few out-of-control batches (red dots) during Phase-II monitoring.

5. Conclusive Remarks

Poisson distribution-based CCs are employed to monitor count data. However, despite their widespread usage, these CCs are not suitable for tracking UD and OD within count data because Poisson distribution relies on an equi-distribution assumption. The tail behavior of count data is also overlooked during its monitoring. In this study, a CC based on the GCOMP distribution is proposed to monitor the total number of UD, OD, and ZI counts. During monitoring, the count data’s tail behavior is taken into consideration. Both longer- and shorter-tail behaviors are anticipated. When compared to the existing CCs for the OD-LT count data, the proposed chart has been shown to be more effective at identifying both upward and downward shifts in the process. It is also noticed that proposed the CCs outperform existing CCs in detecting upward shifts for the UD-ST count data. Compared to conventional ZI models, the GCOMP distribution effectively monitors the ZI count data without utilizing the ZI property. Researchers studying this area may find great value in the GCOMP distribution’s adaptability in monitoring several critical features of the count data. Applying the Shewhart technique, researchers might investigate the GCOMP distribution’s suitability for monitoring dispersed count data while taking tail behavior into consideration for the average number of counts. Considering the variation in the count data-generating process, the feasibility of probability control limits instead of Ls control limits could also be explored for the GCOMP process. Furthermore, to improve the performance of Ls limits, the asymmetrical structure of Ls limits can also be considered in the future.

Author Contributions

Conceptualization, F.M. and R.A.K.S.; methodology, F.M., M.A.R. and R.A.K.S.; software, F.M.; validation, R.A.K.S., M.A.R. and J.A.D.; formal analysis, F.M.; investigation, M.A.R.; resources, J.A.D.; data curation, F.M.; writing—original draft preparation, F.M. and M.A.R.; writing—review and editing, M.A.R. and J.A.D.; visualization, F.M. and M.A.R.; supervision, R.A.K.S. and M.A.R. All authors have read and agreed to the published version of the manuscript.

Data Availability Statement

The data are provided in this paper.

Conflicts of Interest

The authors declare that they have no conflicts of interest to report regarding this present study.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

1. D.J. Spiegelhalter Handling over-dispersion of performance indicators., 2005, 14,pp. 347-351. DOI: https://doi.org/10.1136/qshc.2005.013755.

2. M.A. Mohammed; D. Laney Overdispersion in health care performance data: Laney’s approach., 2006, 15,pp. 383-384. DOI: https://doi.org/10.1136/qshc.2006.017830. PMID: https://www.ncbi.nlm.nih.gov/pubmed/17074879.

3. W. Albers Control charts for health care monitoring under overdispersion., 2011, 74,pp. 67-83. DOI: https://doi.org/10.1007/s00184-009-0290-z.

4. F. Famoye Statistical control charts for shifted generalized poisson distribution., 1994, 3,pp. 339-354. DOI: https://doi.org/10.1007/BF02589023.

5. Y. Fang c-charts, X-charts, and the Katz family of distributions., 2003, 35,pp. 104-114. DOI: https://doi.org/10.1080/00224065.2003.11980195.

6. B. He; M. Xie; T.N. Goh; K.L. Tsui On Control Charts Based on the Generalized Poisson Model., 2006, 3,pp. 383-400. DOI: https://doi.org/10.1080/16843703.2006.11673122.

7. M. Xie; B. He; T.N. Goh Zero-inflated poisson model in statistical process control., 2001, 38,pp. 191-201. DOI: https://doi.org/10.1016/S0167-9473(01)00033-0.

8. N. Chen; S. Zhou; T.S. Chang; H. Huang Attribute control charts using generalized zero-inflated poisson distribution., 2008, 24,pp. 793-806. DOI: https://doi.org/10.1002/qre.928.

9. K.F. Sellers A generalized statistical control chart for over- or under-dispersed data., 2012, 28,pp. 59-65. DOI: https://doi.org/10.1002/qre.1215.

10. A. Saghir; Z. Lin; S.A. Abbasi; S. Ahmad The use of probability limits of COM-poisson charts and their applications., 2013, 29,pp. 759-770. DOI: https://doi.org/10.1002/qre.1426.

11. A. Saghir; Z. Lin Cumulative sum charts for monitoring the COM-Poisson processes., 2014, 68,pp. 65-77. DOI: https://doi.org/10.1016/j.cie.2013.12.004.

12. V. Alevizakos; C. Koukouvinos A progressive mean control chart for COM-Poisson distribution., 2020, 51,pp. 849-867. DOI: https://doi.org/10.1080/03610918.2019.1659361.

13. J.-H. Chen A Double Generally Weighted Moving Average Chart for Monitoring the COM-Poisson Processes., 2020, 12, 1014. DOI: https://doi.org/10.3390/sym12061014.

14. M. Aslam; L. Ahmad; C.H. Jun; O.H. Arif A Control Chart for COM–Poisson Distribution Using Multiple Dependent State Sampling., 2016, 32,pp. 2803-2812. DOI: https://doi.org/10.1002/qre.1965.

15. M. Aslam; A.H. Al-Marshadi Design of a Control Chart Based on COM-Poisson Distribution for the Uncertainty Environment., 2019, 2019,p. 8178067. DOI: https://doi.org/10.1155/2019/8178067.

16. O.A. Adeoti; J.C. Malela-Majika; S.C. Shongwe; M. Aslam A hom*ogeneously weighted moving average control chart for Conway–Maxwell Poisson distribution., 2022, 49,pp. 3090-3119. DOI: https://doi.org/10.1080/02664763.2021.1937582. PMID: https://www.ncbi.nlm.nih.gov/pubmed/36035607.

17. G.S. Rao; M. Aslam; U. Rasheed; C.-H. Jun Mixed EWMA–CUSUM chart for COM-Poisson distribution., 2020, 23,pp. 511-527. DOI: https://doi.org/10.1080/09720510.2019.1639947.

18. A.C. Rakitzis; P.E. Maravelakis; P. Castagliola CUSUM Control Charts for the Monitoring of Zero-inflated Binomial Processes., 2016, 32,pp. 465-483. DOI: https://doi.org/10.1002/qre.1764.

19. L.L. Ho; B. Andrade; M. Bourguignon; F.H. Fernandes Monitoring count data with Shewhart control charts based on the Touchard model., 2021, 37,pp. 1875-1893. DOI: https://doi.org/10.1002/qre.2833.

20. M. Bourguignon; R.M.R. Medeiros; F.H. Fernandes; L. Lee Ho Simple and useful statistical control charts for monitoring count data., 2021, 37,pp. 541-566. DOI: https://doi.org/10.1002/qre.2748.

21. L.L. Boaventura; P.H. Ferreira; R.L. Fiaccone New statistical process control charts for overdispersed count data based on the Bell distribution., 2023, 95,p. e20200246. DOI: https://doi.org/10.1590/0001-3765202320200246.

22. M.A. Raza; M. Aslam Design of control charts for multivariate Poisson distribution using generalized multiple dependent state sampling., 2019, 16,pp. 629-650. DOI: https://doi.org/10.1080/16843703.2018.1497935.

23. M. Aslam; A. Saghir; L. Ahmad; C.H. Jun; J. Hussain A control chart for COM-Poisson distribution using a modified EWMA statistic., 2017, 87,pp. 3491-3502. DOI: https://doi.org/10.1080/00949655.2017.1373114.

24. P. Urbieta; H.O.L. Lee; A. Alencar CUSUM and EWMA Control Charts for Negative Binomial Distribution., 2017, 33,pp. 793-801. DOI: https://doi.org/10.1002/qre.2057.

25. V. Alevizakos; C. Koukouvinos Monitoring of zero-inflated binomial processes with a DEWMA control chart., 2021, 48,pp. 1319-1338. DOI: https://doi.org/10.1080/02664763.2020.1761950. PMID: https://www.ncbi.nlm.nih.gov/pubmed/35706893.

26. F. Mustafa; R.A.K. Sherwani; M.A. Raza A new exponentially weighted moving average control chart to monitor count data with applications in healthcare and manufacturing., 2023, 93,pp. 3308-3328. DOI: https://doi.org/10.1080/00949655.2023.2220859.

27. F. Mustafa; R.A.K. Sherwani; M.A. Raza On designing a cumulative sum control chart using generalized Conway-Maxwell-Poisson distribution for monitoring the count data., 2024, 18,pp. 637-668. DOI: https://doi.org/10.1504/EJIE.2024.10057052.

28. T. Imoto A generalized Conway-Maxwell-Poisson distribution which includes the negative binomial distribution., 2014, 247,pp. 824-834. DOI: https://doi.org/10.1016/j.amc.2014.09.052.

29. A. Saghir; Z. Lin A flexible and generalized exponentially weighted moving average control chart for count data., 2014, 30,pp. 1427-1443. DOI: https://doi.org/10.1002/qre.1564.

30. K.F. Sellers; G. Shmueli A flexible regression model for count data., 2010, 4,pp. 943-961. DOI: https://doi.org/10.1214/09-AOAS306.

31. D.C. Montgomery, Wiley: Hoboken, NJ, USA, 2013,

32. F. Mustafa; R.A. Khan Sherwani; M.A. Raza A progressive mean control chart for dispersed count data considering tail behavior., 2023,pp. 1-20. DOI: https://doi.org/10.1080/16843703.2023.2246768.

33. H. Campbell The consequences of checking for zero-inflation and overdispersion in the analysis of count data., 2021, 12,pp. 665-680. DOI: https://doi.org/10.1111/2041-210X.13559.

34. W.H. Woodall The use of control charts in health-care and public-health surveillance., 2006, 38,pp. 89-104. DOI: https://doi.org/10.1080/00224065.2006.11918593.

35. S. He; W. Huang; W.H. Woodall CUSUM charts for monitoring a zero-inflated poisson process., 2012, 28,pp. 181-192. DOI: https://doi.org/10.1002/qre.1228.

36. V. Alevizakos; C. Koukouvinos Monitoring of zero-inflated Poisson processes with EWMA and DEWMA control charts., 2020, 36,pp. 88-111. DOI: https://doi.org/10.1002/qre.2561.

37. J. Van den Broek A score test for zero inflation in a Poisson distribution., 1995, 51,pp. 738-743. DOI: https://doi.org/10.2307/2532959.

Figures and Tables

Figure 1: Behavior of GCOMP distribution at different parametric settings. Panel (a) depicts under-dispersed and shorter-tail behavior of the GCOMP distribution at different parametric values, and Panel (b,c) depicts over-dispersed and longer-tail behavior of the GCOMP distribution at different parametric values. [Please download the PDF to view the image]

Figure 2: ZI behavior of the GCOMP distribution at different parametric values. [Please download the PDF to view the image]

Figure 3: Flow chart for computation of CC multiplier L for G-chart to achieve ARL[sub.0]˜370. [Please download the PDF to view the image]

Figure 4: ACF plot for El Salvador COVID-19 mortality data. [Please download the PDF to view the image]

Figure 5: Monitoring the total number of daily deaths due to COVID-19 in El Salvador through the G-chart. [Please download the PDF to view the image]

Figure 6: Monitoring the total number of daily deaths due to COVID-19 in El Salvador through the Q-chart. [Please download the PDF to view the image]

Figure 7: Phase-I and Phase-II monitoring of UD-ST count data through the G-chart. [Please download the PDF to view the image]

Figure 8: Phase-I and Phase-II monitoring of the total number of defective LEDs produced within a batch through the G-chart. [Please download the PDF to view the image]

Table 1: Control limits for G-Chart.

	G-Chart
LCL	n(µ? l o g C r, ?, µ/? µ)-L[square root of n(µ? E X/? µ)]
CL	n(µ? l o g C r, ?, µ/? µ)
UCL	n(µ? l o g C r, ?, µ/? µ)+L[square root of n(µ? E X/? µ)]

Table 2: L-coefficient values to achieve ARL[sub.0]˜370 for G-chart.

	n
r	v	µ	3	15	50	100	300	1000
-0.5	1.5	1	3.510	3.156	3.120	3.200	3.440	4.005
-0.5	3	2	3.256	3.693	4.46	5.261	7.073	10.69
-1.5	1.5	1	2.800	2.861	3.063	3.144	3.303	3.967
-1	2	2	3.401	3.371	3.521	4.002	4.903	6.751
0.3	2	2.5	3.010	2.95	3.050	3.100	3.302	3.708
0.3	1.5	1	3.245	3.124	3.248	3.551	4.090	5.250
0.5	1.5	1	3.366	3.468	4.099	4.667	6.100	8.550
0.5	2	2	2.965	3.067	3.251	3.553	4.105	5.202
0.7	2	2	3.101	3.053	3.180	3.342	3.796	4.622

Table 3: ARL[sub.1] profile of G-chart and Q-chart for the total sum of UD-ST count data.

	ARL1
n	3	15	50	100	300	1000
d	G-Chart µ=1 v=1.5 r=-1.5 L=2.80	Q-Chart µ=0.746 r=2.100 L=2.85	G-Chart µ=1 v=1.5 r=-1.5 L=2.86	Q-Chart µ=0.74 r=2.10 L=2.87	G-Chart µ=1 v=1.5 r=-1.5 L=3.06	Q-Chart µ=0.74 r=2.10 L=3.51	G-Chart µ=1 v=1.5 r=-1.5 L=3.14	Q-Chart µ=0.74 r=2.10 L=3.60	G-Chart µ=1 v=1.5 r=-1.5 L=3.30	Q-Chart µ=0.74 r=2.10 L=4.38	G-Chart µ=1 v=1.5 r=-1.5 L=3.96	Q-Chart µ=0.74 r=2.10 L=5.74
-0.5	3350.1	9599.01	9990.12	6.45	25.9	1.08	4.1	1	1	1	1	1
-0.3	1001.6	6186.23	6468	48.74	423	4.01	91.3	1.72	11.1	1.20	1.8	1
-0.2	731.7	3000.10	3008.7	136.36	1620	13.91	617.2	5.58	184.0	1.98	59.4	1
-0.1	481.3	1250.18	1209.8	344.84	1389.6	61.99	1853.2	35.76	4481.8	9.72	8021.7	2.17
0.1	213.3	374.25	219.4	256.01	101.12	601.89	64.7	2201.89	23.5	6789.1	5.9	9999
0.2	156.2	236.12	96.5	105.34	39.7	180.33	19.1	267.23	4.7	520.13	1.3	1717.35
0.3	117.3	162.83	47	54.84	18	46.55	7.6	41.23	1.9	29.13	1.0	5.12
0.5	71.8	84.55	11.1	17.68	5.5	9.67	2.3	5	1.01	1.56	1	1.01

Table 4: ARL[sub.1] profile of G-chart and Q-chart for the total sum of OD-LT count data.

	ARL1
n	3	15	50	100	300	1000
d	G-Chart µ=1 v=1.5 r=0.5 L=3.36	Q-Chart µ=1.41 r=0.60 L=3.36	G-Chartµ=1v=1.5r=0.5L=3.46	Q-Chart µ=1.41 r=0.60 L=3.10	G-Chart µ=1 v=1.5 r=0.5 L=4.09	Q-Chart µ=1.41 r=0.60 L=3.14	G-Chart µ=1 v=1.5 r=0.5 L=4.66	Q-Chart µ=1.41 r=0.60 L=3.16	G-Chart µ=1 v=1.5 r=0.5 L=6.10	Q-Chart µ=1.41 r=0.60 L=3.23	G-Chart µ=1 v=1.5 r=0.5 L=8.55	Q-Chart µ=1.41 r=0.60 L=3.56
-0.5	9900.12	9653.2	1.71	22.84	1	2.01	1	1.01	1	1	1	1
-0.3	6998.37	5023.21	7.64	193.08	1.4	9	1.03	3.69	1	1.01	1	1
-0.2	4134.81	2468.23	22.24	758.31	3.65	39.23	1.6	11.23	1.01	2	1	1.01
-0.1	1256.88	1052.71	84.79	1065.23	22.13	314.23	10.75	123.23	2.25	29.1	1.01	6.23
0.1	155.13	220.41	98.23	88.43	37.11	35.36	19.89	20.87	5	6.23	2	3.23
0.2	68.55	112.44	32.63	27.32	8.77	6.76	3.91	4.81	1.01	1.99	1	1.01
0.3	33.15	59.69	10.55	12.1	3	3.12	1.09	1.52	1	1.01	1	1
0.5	10.34	23.23	2.41	4.12	1.07	1.19	1	1	1	1	1	1

Table 5: ARL[sub.1] profiles of G, ZIP[sub.C], and ZINB[sub.C] CCs for Data-I.

	G-Chart	ZIP[sub.C]-Chart	ZINB[sub.C]-Chart	ZICOMP[sub.C]-Chart
d	µ[sub.0]=1, v=0.05, r=0.3 L=2.280	µ ^[sub.0]=1.39, P[sub.z i]=0.71, L=4.305	µ ^[sub.0]=1.17, r^=0.37, P[sub.z i]=0.76, L=5.310	µ^[sub.0]=1.09,r^=0.72,P[sub.zi]=0.41,L=3.7378
0.1	3.21	50.99	4.95	144.66
0.3	1.36	5.45	2.17	56.90
0.5	1	2.03	1.55	25.37
0.7	1	1.08	1.01	14.35
1	1	1	1	7.24

Table 6: ARL[sub.1] profiles of G, ZIP[sub.C], and ZINB[sub.C] CCs for Data-II.

	G-Chart	ZIP[sub.C]-Chart	ZINB[sub.C]-Chart	ZICOMP[sub.C]-Chart
d	µ[sub.0]=1.5,v=0.1,r=0.5,L=3.270	µ ^[sub.0]=1.91, P[sub.z i]=0.63 L=8.830	µ ^[sub.0]=1.67, r^=0.61, P[sub.z i]=0.65 L=6.450	µ^[sub.0]=1.62,r^=0.58,P[sub.zi]=0.35,L=3.501
0.1	23.33	24.21	29.63	194.56
0.3	1.75	2.81	2.01	64.91
0.5	1.01	1.84	1.71	26.99
0.7	1	1.09	1.03	13.50
1	1	1	1	6.34

Table 7: Counts of total number of deaths in a day due to COVID-19 in El Salvador from 8 June 2020 to 17 May 2021.

3,4,4,4,0,4,2,0,2,3,3,4,7,5,9,6,6,7,7,10,9,12,10,8,9,11,8,7,6,6,6,8,6,5,6,7,11,8,12,11,15,11,9,8,11,9,7,11,10,8,9,13,9,9,11,8,10,9,12,15,7,16,13,14,7,7,7,11,8,9,6,7,8,7,6,8,7,8,9,9,7,8,6,5,4,7,7,8,5,8,7,5,1,5,4,3,5,3,3,4,4,5,3,4,3,1,2,5,4,3,0,0,5,8,4,5,5,4,6,2,4,4,4,4,6,3,4,5,5,4,4,5,5,4,3,4,3,4,4,5,4,4,5,5,4,4,4,4,4,5,5,5,4,4,4,6,4,4,5,6,5,3,5,4,5,3,6,5,6,5,0,12,4,5,4,3,6,9,5,8,11,6,5,4,6,6,6,7,7,5,7,7,8,7,8,8,7,8,9,9,6,8,8,8,0,14,0,0,0,31,6,9,9,8,8,10,11,9,9,10,12,10,10,8,11,11,12,9,10,11,10,11,11,6,10,5,10,9,9,6,8,7,9,11,8,11,9,8,10,8,7,8,8,8,9,9,9,7,7,8,8,9,9,6,7,7,8,9,9,7,0,17,8,8,6,6,5,5,5,4,4,4,0,9,4,4,4,3,4,0,6,2,2,3,3,0,0,11,4,4,5,0,7,3,4,4,3,3,3,3,4,4,4,3,3,4,4,3,4,5,5,3,5,6,3,4,0,8,2,3,4,2,3,4,4,4,6,4,5,5,4,5,4

Table 8: Estimated values of parameters, ?[sup.2], LL, AIC, and BIC for the GCOMP and COMP distribution.

	Parameters	Criterion
Distribution(s)	µ^	r^	v^	?[sup.2]	-LL	AIC	BIC
GCOMP	2.7363	0.3895	1.3528	495.43	-880.979	1767.959	1779.472
COMP	2.4636	0.5156		566.65	-885.979	1774.189	1781.865

Table 9: Simulated data under the UD-ST parametric setting of the GCOMP distribution.

Phase-I Data	Phase-II Data
0,1,0,2,2,0,1,2,1,0,2,0,1,1,0,2,0,0,0,2,2,1,1,3,1,1,1,1,0,0,2,2,1,1,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,2,0,0,1,0,1,0,0,1,2,0,1,0,0,0,1,0,1,1,1,0,1,1,1,0,1,0,0,1, 0,0,0,1,0,1,0,0,3,2,2,0,0,1,0,1,0,0,1,0,0,1	1,0,1,0,0,1,2,0,1,0,0,0,1,0,1,1,1,0,1,2,1,2,3,1,0,2,0,0,3,0,0,0,1,2,1,4,1,3,2,0,0,4,0,0,0,0,1,5,3,2,0,2,1,0,2,1,1,1,1,0,0,1,5,1,1,0,1,2

Table 10: Number of defective LEDs within a batch.

Phase-I Data	Phase-II Data
0,0,19,0,0,0,0,5,0,0,0,6,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,12,0,9,0,0,0,8,0,0,0,16,0,0,0,6,3,0,0,0,0,0,0,0,0,2,0,0,0,7,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,8,0,0,0,18,9,0,0,0,0,2,0,0,0,0,0,0,0,0,0	0,0,0,19,0,0,0,0,5,0,0,6,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,12,0,10,0,0,0,8,0,0,0,0,0,16,0,6,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,5,0,2,0,8,4,0,0,0,0,0,0,0,0,0,0,1,4,0,0,6,0,2,0,9,0,0,0,0,0,0,0,0,5,0,0,0,0,4,0,0

Author Affiliation(s):

[1] College of Statistical Sciences, University of the Punjab, Lahore 54590, Pakistan; [emailprotected]

[2] Department of Computer Science, COMSATS University Islamabad, Sahiwal Campus, Sahiwal 57000, Pakistan

[3] Department of Statistics, Government College University Faisalabad, Faisalabad 38000, Pakistan; [emailprotected]

[4] Department of Mathematics and Statistics, College of Science, University of Jeddah, Jeddah 21589, Saudi Arabia; [emailprotected]

Author Note(s):

[*] Correspondence: [emailprotected]

DOI: 10.3390/pr12040688

COPYRIGHT 2024 MDPI AG
No portion of this article can be reproduced without the express written permission from the copyright holder.