Competing Failure Modes Analysis

Often, a group of products will fail due to more than one failure mode. One can take the view that the products could have failed due to any one of the possible failure modes, but since an item cannot fail more than one time, there can only be one failure mode for each failed product. In this view, the failure modes compete as to which causes the failure for each particular item. This can be viewed as a series system reliability model, with each failure mode composing a block of the series system. Competing failure modes analysis segregates the analyses of failure modes and then combines the results to provide an overall model for the product in question.

In order to begin analyzing data sets with more than one competing failure mode, one must perform a separate analysis for each failure mode. During each of these analyses, the failure times for all other failure modes not being analyzed are considered to be suspensions. This is because the units under test would have failed at some time in the future due to the failure mode being analyzed, had the unrelated (not analyzed) mode not occurred. Thus, in this case, the information available is that the mode under consideration did not occur and the unit under consideration accumulated test time without a failure due to the mode under consideration (or a suspension due to that mode).

Once the analysis for each separate failure mode has been completed (using the same principles as before), the resulting reliability equation for all modes is the product of the reliability equation for each mode, or:

$$R(t)={{R}_{1}}(t)\cdot {{R}_{2}}(t)\cdot ...\cdot {{R}_{n}}(t)$$

where $$n$$  is the total number of failure modes considered. This is the product rule for the reliability of series systems with statistically independent components, which states that the reliability for a series system is equal to the product of the reliability values of the components comprising the system. Do note that in Eqn. (sysrel)    is the reliability function based on any assumed life distribution. In Weibull++ this life distribution can be either the 2-parameter Weibull, lognormal, normal or the 1-parameter exponential.

Example 13
From Meeker & Escobar [27], the following table gives failure times for an electric component that has two failure modes.

One failure mode is due to random voltage spikes which cause failure by overloading the system (denoted as a $$V$$  in the table). The other failure mode is due to wear-out failures which usually happen only after the system has run for many cycles (this failure mode is denoted as a $$W$$  in the table).

Considering that these are competing failure modes, determine the overall reliability for the component at 100,000 cycles.

$$\begin{matrix} Number & Failure & Failure & Number & Failure & Failure \\ in State & Time* & Mode & in State & Time* & Mode \\ \text{1} & \text{2} & \text{V} & \text{1} & \text{147} & \text{W} \\ \text{1} & \text{10} & \text{V} & \text{1} & \text{173} & \text{V} \\ \text{1} & \text{13} & \text{V} & \text{1} & \text{181} & \text{W} \\ \text{2} & \text{23} & \text{V} & \text{1} & \text{212} & \text{W} \\ \text{1} & \text{28} & \text{V} & \text{1} & \text{245} & \text{W} \\ \text{1} & \text{30} & \text{V} & \text{1} & \text{247} & \text{V} \\ \text{1} & \text{65} & \text{V} & \text{1} & \text{261} & \text{V} \\ \text{1} & \text{80} & \text{V} & \text{1} & \text{266} & \text{W} \\ \text{1} & \text{88} & \text{V} & \text{1} & \text{275} & \text{W} \\ \text{1} & \text{106} & \text{V} & \text{1} & \text{293} & \text{W} \\ \text{1} & \text{143} & \text{V} & \text{8} & \text{300} & \text{suspended} \\ \end{matrix}$$


 * Failure times given are in thousands of cycles.

Solution to Example 13
We will begin by performing a Weibull analysis of the voltage spike ( $$V$$ ) failure mode. In order to do this, we must consider all of the failures for the wear-out mode to be suspensions. The input data for the analysis are shown next:

Analyzing this data set using the maximum likelihood method (recommended due to the number of suspensions in the data) and assuming a Weibull distribution, we obtain the parameters $${{\beta }_{V}}=0.6711$$  and  $${{\eta }_{V}}=449.4$$. The reliability for this failure mode at $$t=100$$  is  $${{R}_{V}}(100)=0.694$$.

We follow an identical procedure for the wear-out failure mode, counting only the $$W$$  entries as failures and assuming the  $$V$$  entries are suspensions. This is shown next.

Once again, analyzing with a Weibull distribution with maximum likelihood estimators, we obtain the parameters $${{\beta }_{W}}=4.337$$  and  $${{\eta }_{W}}=340.4$$. The reliability for this failure mode at $$t=100$$  is  $${{R}_{W}}(100)=0.995$$.

We can now use Eqn. (sysrel) to determine the overall system reliability at 100,000 cycles:

$$\begin{align} & {{R}_{sys}}(100)= & {{R}_{V}}(100)\cdot {{R}_{W}}(100) \\ & = & 0.694\cdot 0.995 \\ & = & 0.69053 \end{align}$$

Or the reliability of the unit (or system) under both modes is $${{R}_{sys}}(100)=69.053%.$$

Note that Weibull++ can perform this analysis for you automatically. To accomplish this, the data would be entered in a single data sheet and competing failure modes chosen as the analysis method. This is shown in the next graphic.

Plotting Competing Failure Modes

$$$$

When plotting competing failure modes in Weibull++, your plots can contain the combined mode line as well as the individual mode lines. The User's Guide describes how these options can be turned on or off.

Confidence Bounds for Competing Failure Modes
The method available in Weibull++ for estimating the different types of confidence bounds, for competing failure modes analysis, is the Fisher matrix method, and is presented in this section.

Variance/Covariance Matrix
The variances and covariances of the parameters are estimated from the inverse local Fisher matrix, as follows:

$$\begin{align} & & \left( \begin{matrix}   Var\left( {{\widehat{a}}_{1}} \right) & Cov\left( {{\widehat{a}}_{1}},{{\widehat{b}}_{1}} \right) & 0 & 0 & 0 & 0 & 0  \\   Cov\left( {{\widehat{a}}_{1}},{{\widehat{b}}_{1}} \right) & Var\left( {{\widehat{b}}_{1}} \right) & 0 & 0 & 0 & 0 & 0  \\   0 & 0 & . & 0 & 0 & 0 & 0  \\   0 & 0 & 0 & . & 0 & 0 & 0  \\   0 & 0 & 0 & 0 & . & 0 & 0  \\   0 & 0 & 0 & 0 & 0 & Var\left( {{\widehat{a}}_{n}} \right) & Cov\left( {{\widehat{a}}_{n}},{{\widehat{b}}_{n}} \right)  \\   0 & 0 & 0 & 0 & 0 & Cov\left( {{\widehat{a}}_{n}},{{\widehat{b}}_{n}} \right) & Var\left( {{\widehat{b}}_{n}} \right)  \\ \end{matrix} \right) \\ & &  \\  & = & {{\left( \begin{matrix}   -\tfrac{{{\partial }^{2}}\Lambda }{\partial a_{1}^{2}} & -\tfrac{{{\partial }^{2}}\Lambda }{\partial {{a}_{1}}\partial {{b}_{1}}} & 0 & 0 & 0 & 0 & 0  \\   -\tfrac{{{\partial }^{2}}\Lambda }{\partial {{a}_{1}}\partial {{b}_{1}}} & -\tfrac{{{\partial }^{2}}\Lambda }{\partial a_{1}^{2}} & 0 & 0 & 0 & 0 & 0  \\   0 & 0 & . & 0 & 0 & 0 & 0  \\   0 & 0 & 0 & . & 0 & 0 & 0  \\   0 & 0 & 0 & 0 & . & 0 & 0  \\   0 & 0 & 0 & 0 & 0 & -\tfrac{{{\partial }^{2}}\Lambda }{\partial a_{n}^{2}} & -\tfrac{{{\partial }^{2}}\Lambda }{\partial {{a}_{n}}\partial {{b}_{n}}}  \\   0 & 0 & 0 & 0 & 0 & -\tfrac{{{\partial }^{2}}\Lambda }{\partial {{a}_{n}}\partial {{b}_{n}}} & -\tfrac{{{\partial }^{2}}\Lambda }{\partial a_{n}^{2}}  \\ \end{matrix} \right)}^{-1}} \end{align}$$

where $$\Lambda $$  is the log-likelihood function of the failure distribution, described in Chapter 5.

Bounds on Reliability
The competing failure modes reliability function is given by:

$$\widehat{R}=\underset{i=1}{\overset{n}{\mathop \prod }}\,{{\hat{R}}_{i}}$$

where: •	 $${{R}_{i}}$$ is the reliability of the  $${{i}^{th}}$$  mode, •	 $$n$$ is the number of failure modes. The upper and lower bounds on reliability are estimated using the logit transformation:

$$\begin{align} & {{R}_{U}}= & \frac{\widehat{R}}{\widehat{R}+(1-\widehat{R}){{e}^{-\tfrac{{{K}_{\alpha }}\sqrt{Var(\widehat{R})}}{\widehat{R}(1-\widehat{R})}}}} \\ & {{R}_{L}}= & \frac{\widehat{R}}{\widehat{R}+(1-\widehat{R}){{e}^{\tfrac{{{K}_{\alpha }}\sqrt{Var(\widehat{R})}}{\widehat{R}(1-\widehat{R})}}}} \end{align}$$

where $$\widehat{R}$$  is calculated using Eqn. (CFMReliability). $${{K}_{\alpha }}$$ is defined by:

$$\alpha =\frac{1}{\sqrt{2\pi }}\underset{\overset{\infty }{\mathop \int }}\,{{e}^{-\tfrac{2}}}dt=1-\Phi ({{K}_{\alpha }})$$

(If $$\delta $$  is the confidence level, then  $$\alpha =\tfrac{1-\delta }{2}$$  for the two-sided bounds, and  $$\alpha =1-\delta $$  for the one-sided bounds.)

The variance of $$\widehat{R}$$  is estimated by:

$$Var(\widehat{R})=\underset{i=1}{\overset{n}{\mathop \sum }}\,{{\left( \frac{\partial R}{\partial {{R}_{i}}} \right)}^{2}}Var({{\hat{R}}_{i}})$$

$$\frac{\partial R}{\partial {{R}_{i}}}=\underset{j=1,j\ne i}{\overset{n}{\mathop \prod }}\,\widehat$$

Thus:

$$Var(\widehat{R})=\underset{i=1}{\overset{n}{\mathop \sum }}\,\left( \underset{j=1,j\ne i}{\overset{n}{\mathop \prod }}\,\widehat{R}_{j}^{2} \right)Var({{\hat{R}}_{i}})$$

$$Var({{\hat{R}}_{i}})=\underset{i=1}{\overset{n}{\mathop \sum }}\,{{\left( \frac{\partial {{R}_{i}}}{\partial {{a}_{i}}} \right)}^{2}}Var({{\hat{a}}_{i}})$$

where $$\widehat$$  is an element of the model parameter vector. Therefore, the value of $$Var({{\hat{R}}_{i}})$$  is dependent on the underlying distribution.

For the Weibull distribution:

$$Var({{\hat{R}}_{i}})={{\left( {{{\hat{R}}}_{i}}{{e}^{{{{\hat{u}}}_{i}}}} \right)}^{2}}Var({{\hat{u}}_{i}})$$

where:

$${{\hat{u}}_{i}}={{\hat{\beta }}_{i}}(\ln (t-{{\hat{\gamma }}_{i}})-\ln {{\hat{\eta }}_{i}})$$

and $$Var(\widehat)$$  is given in Chapter 6. For the exponential distribution:

$$Var({{\hat{R}}_{i}})={{\left( {{{\hat{R}}}_{i}}(t-{{{\hat{\gamma }}}_{i}}) \right)}^{2}}Var({{\hat{\lambda }}_{i}})$$

where $$Var(\widehat)$$  is given in Chapter 7. For the normal distribution:

$$Var({{\hat{R}}_{i}})={{\left( f({{{\hat{z}}}_{i}})\hat{\sigma } \right)}^{2}}Var({{\hat{z}}_{i}})$$

$${{\hat{z}}_{i}}=\frac{t-{{{\hat{\mu }}}_{i}}}$$

where $$Var(\widehat)$$  is given in Chapter 8. For the lognormal distribution:

$$Var({{\hat{R}}_{i}})={{\left( f({{{\hat{z}}}_{i}})\cdot {{{\hat{\sigma }}}^{\prime }} \right)}^{2}}Var({{\hat{z}}_{i}})$$

$${{\hat{z}}_{i}}=\frac{\ln \text{(}t)-\hat{\mu }_{i}^{\prime }}{\hat{\sigma }_{i}^{\prime }}$$

where $$Var(\widehat)$$  is given in Chapter 9.

Bounds on Time
The bounds on time are estimate by solving the reliability equation with respect to time. From Eqn. (CFMReliability) we have that:

$$\hat{t}=\varphi (R,{{\hat{a}}_{i}},{{\hat{b}}_{i}})$$

$$i=1,...,n$$

where: •	 $$\varphi $$ is inverse function for Eqn. (CFMReliability)

•	for the Weibull distribution $${{\hat{a}}_{i}}$$  is  $${{\hat{\beta }}_{i}}$$, and  $${{\hat{b}}_{i}}$$  is  $${{\hat{\eta }}_{i}}$$

•	for the exponential distribution $${{\hat{a}}_{i}}$$  is  $${{\hat{\lambda }}_{i}}$$, and  $${{\hat{b}}_{i}}$$  =0

•	for the normal distribution $${{\hat{a}}_{i}}$$  is  $${{\hat{\mu }}_{i}}$$, and  $${{\hat{b}}_{i}}$$  is  $${{\hat{\sigma }}_{i}}$$ , and

•	for the lognormal distribution $${{\hat{a}}_{i}}$$  is  $$\hat{\mu }_{i}^{\prime }$$, and  $${{\hat{b}}_{i}}$$  is  $$\hat{\sigma }_{i}^{\prime }$$

Set:

$$u=\ln (t)$$

The bounds on $$u$$  are estimated from:

$${{u}_{U}}=\widehat{u}+{{K}_{\alpha }}\sqrt{Var(\widehat{u})}$$

and:

$${{u}_{L}}=\widehat{u}-{{K}_{\alpha }}\sqrt{Var(\widehat{u})}$$

Then the upper and lower bounds on time are found by using the equations

$${{t}_{U}}={{e}^}$$

and:

$${{t}_{L}}={{e}^}$$

$${{K}_{\alpha }}$$  is calculated using Eqn. (ka) and $$Var(\widehat{u})$$  is computed as:

$$Var(\widehat{u})=\underset{i=1}{\overset{n}{\mathop \sum }}\,\left( {{\left( \frac{\partial u}{\partial {{a}_{i}}} \right)}^{2}}Var(\widehat)+{{\left( \frac{\partial u}{\partial {{b}_{i}}} \right)}^{2}}Var(\widehat)+2\frac{\partial u}{\partial {{a}_{i}}}\frac{\partial u}{\partial {{b}_{i}}}Cov(\widehat,\widehat) \right)$$

Complex Competing Failure Modes
In addition to being viewed as a series system, the relationship between the different competing failures modes can be more complex. After performing separate analysis for each failure mode, a diagram that describes how each failure mode can result in a product failure can be used to perform analysis for the item in question. Such diagrams are usually referred to as Reliability Block Diagrams (RBD) (for more on RBDs see ReliaSoft's System Analysis Reference and ReliaSoft's BlockSim software).

A reliability block diagram is made of blocks that represent the failure modes and arrows and connects the blocks in different configurations. Note that the blocks can also be used to represent different components or subsystems that make up the product. Weibull ++ provides the capability to use a diagram to model, series, parallel, k-out-of-n configurations in addition to any complex combinations of these configurations.

In this analysis, the failure modes are assumed to be statistically independent. (Note: In the context of this reference, statistically independent implies that failure information for one failure mode provides no information about, i.e. does not affect, other failure mode). Analysis of dependent modes is more complex. Advanced RBD software such as ReliaSoft's BlockSim can handle and analyze such dependencies, as well as provide more advanced constructs and analyses (see http://www.reliasoft.com/BlockSim).

Series Configuration
The basic competing failure modes configuration, which has already been discussed, is a series configuration. In a series configuration, the occurrence of any failure mode results in failure for the product.

The equation that describes series configuration is:

$$R(t)={{R}_{1}}(t)\cdot {{R}_{2}}(t)\cdot ...\cdot {{R}_{n}}(t)$$

where $$n$$  is the total number of failure modes considered.

Parallel
In a simple parallel configuration, at least one of the failure modes must not occur for the product to continue operation.

$$$$

The equation that describes parallel configuration is:

$$R(t)=1-\underset{i=1}{\overset{n}{\mathop \prod }}\,(1-{{R}_{i}}(t))$$

where $$n$$  is the total number of failure modes considered.

Combination of Series and Parallel
While many smaller products can be accurately represented by either a simple series or parallel configuration, there may be larger products that involve both series and parallel configurations in the overall model of the product. Such products can be analyzed by calculating the reliabilities for the individual series and parallel sections and then combining them in the appropriate manner.

$$$$

k-out-of-n Parallel Configuration
The k-out-of-n configuration is a special case of parallel redundancy. This type of configuration requires that at least $$k$$  failure modes do not happen out of the total  $$n$$  parallel failure modes for the product to succeed. The simplest case of a k-out-of-n configuration is when the failure modes are independent and identical and have the same failure distribution and uncertainties about the parameters (in other words they are derived from the same test data). In this case, the reliability of the product with such a configuration can be evaluated using the binomial distribution, or:

$$R(t)=\overset{n}{\mathop{\underset{r=k}{\mathop{\underset{}{\overset{}{\mathop \sum }}\,}}\,}}\,\left( \underset{k}{\mathop{\overset{n}{\mathop – }\,}}\, \right){{R}^{r}}(t){{(1-R(t))}^{n-r}}$$

In the case where the k-out-of-n failure modes are not identical, other approaches for calculating the reliability must be used (e.g. the event space method). Discussion of these is beyond the scope of this reference. Interested readers can consult ReliaSoft's System Reliability Reference.

Complex Systems
In many cases, it is not easy to recognize which components are in series and which are in parallel in a complex system.

$$$$

The previous configuration cannot be broken down into a group of series and parallel configurations. This is primarily due to the fact that failure mode C has two paths leading away from it, whereas B and D have only one. Several methods exist for obtaining the reliability of a complex configuration including the decomposition method, the event space method and the path-tracing method. Discussion of these is beyond the scope of this reference. Interested readers can consult ReliaSoft's System Reliability Reference.

Example 14
Assume that a product has five failure modes: A, B, C, D and F. Furthermore, assume that failure of the product will occur if mode A occurs, modes B and C occur simultaneously or if either modes C and D, C and F or D and F occur simultaneously. Times-to-failure for each mode is given in the next table.

$$\begin{matrix} \text{TTF for A} & \text{TTF for B} & \text{TTF for C} & \text{TTF for D} & \text{TTF for F}  \\ \text{276} & \text{23} & \text{499} & \text{467} & \text{67} \\ \text{320} & \text{36} & \text{545} & \text{540} & \text{72} \\ \text{323} & \text{57} & \text{661} & \text{716} & \text{81} \\ \text{558} & \text{89} & \text{738} & \text{737} & \text{108} \\ \text{674} & \text{99} & \text{987} & \text{761} & \text{110} \\ \text{829} & \text{154} & \text{1165} & \text{1093} & \text{127} \\ \text{878} & \text{200} & \text{1337} & \text{1283} & \text{148} \\ \end{matrix}$$

The RBD that describes this configuration is shown next.

$$$$

One folio with multiple data sheets was created in Weibull++ for each of the data sets and each file was analyzed using the two-parameter Weibull distribution, MLE as the analysis method and Fisher Matrix as the confidence bounds method. A diagram is created by choosing Add Diagram from the Project menu. The failure modes can be inserted into the diagram by dragging them from the template. The nodes are inserted by clicking choosing Add Node from the Diagram menu.

$$$$

The number of required paths can be specified by double clicking the node and entering the appropriate number (1 in the first node and 2 in the second node).

$$$$

Using the Quick Calculation Pad (QCP), the estimated R( $$100$$ hours $$)$$  and the 90% two-sided confidence bounds are:

$$\begin{matrix} {{{\hat{R}}}_{U}}(100)=0.991 \\ \hat{R}(100)=0.9905 \\ {{{\hat{R}}}_{L}}(100)=0.9080 \\ \end{matrix}$$