Non-Homogeneous Data with Subset IDs Example

Discovering Subpopulations Using Warranty Return Montoring Example

The SPC (warranty monitoring) methodology explained in this section can also be used to detect different subpopulations. The different subpopulations can reflect different use conditions, different material, etc. In this methodology, one can use different IDs to differentiate between subpopulations, and obtain models that are distinct to each subpopulation. The following example illustrates this concept. A manufacturer collected the following sales and return data.

$$\begin{matrix} {} & Oct 05 & Nov 05 & Dec 05 & Jan 06 & Feb 06 & Mar 06 & Apr 06 & May 06 \\ Sep 05 & \text{2} & \text{4} & \text{5} & \text{7} & \text{12} & \text{13} & \text{16} & \text{17} \\ Oct 05 & \text{-} & \text{3} & \text{4} & \text{5} & \text{3} & \text{8} & \text{11} & \text{14} \\ Nov 05 & \text{-} & \text{-} & \text{2} & \text{3} & \text{5} & \text{7} & \text{23} & \text{13} \\ Dec 05 & \text{-} & \text{-} & \text{-} & \text{2} & \text{3} & \text{4} & \text{6} & \text{7} \\ Jan 06 & \text{-} & \text{-} & \text{-} & \text{-} & \text{2} & \text{3} & \text{3} & \text{4} \\ Feb 06 & \text{-} & \text{-} & \text{-} & \text{-} & \text{-} & \text{2} & \text{3} & \text{3} \\ Mar 06 & \text{-} & \text{-} & \text{-} & \text{-} & \text{-} & \text{-} & \text{2} & \text{12} \\ Apr 06 & \text{-} & \text{-} & \text{-} & \text{-} & \text{-} & \text{-} & \text{-} & \text{2} \\ \end{matrix}$$

The data were analyzed using the two-parameter Weibull distribution and the MLE analysis method. The parameters are estimated to be:


 * $$\begin{align}

& \beta = & 2.31 \\ & \eta = & 25.07 \end{align}$$

The SPC's $$\alpha $$  value are set at 0.01 for the Critical Value and 0.1 for the Caution Value. When analyzed and color coded in Weibull++ the following window is obtained:



Here the Nov. 05 and Mar 06 sales periods are colored in yellow indicating that they are `outlier' sales periods, while the rest are green. One suspected reason for the variation may be the material used in production in this period. Further analysis confirmed that for these periods the material was acquired from a different supplier. This then implies that the units are not homogenous, and that there are different subpopulations present in the field populations.

Based on this, the data is re-analyzed after categorizing the different shipments (using the ID column) based on their material supplier. The data as entered are shown next.



The new models that describe the data are (assuming a two-parameter Weibull distribution and using MLE as the analysis method for both sub-populations):

$$\begin{matrix} Supplier 1 & Supplier 2 \\ \begin{matrix} \beta =2.38 \\ \eta =25.39 \\ \end{matrix} & \begin{matrix} \beta =2.32 \\ \eta =21.28 \\ \end{matrix} \\ \end{matrix}$$

This analysis helped in uncovering different subpopulations as well as allowing us to compute different distributions for each subpopulation. Note that if the analysis were performed on the failure and suspension times in a regular Standard Folio, using the mixed Weibull distribution, one would not be able to detect which units fall into which subpopulation.