Life Distributions: Difference between revisions

From ReliaWiki
Jump to navigation Jump to search
No edit summary
Line 163: Line 163:
::<math>\sum_{i=1}^{s}p_{i}=1</math>  
::<math>\sum_{i=1}^{s}p_{i}=1</math>  


The mixed Weibull distribution and its characteristics are presented in more detail in Chapter 10.
The mixed Weibull distribution and its characteristics are presented in more detail in Chapter 11.


=== The Generalized Gamma Distribution  ===
=== The Generalized Gamma Distribution  ===

Revision as of 20:36, 16 September 2011

New format available! This reference is now available in a new format that offers faster page load, improved display for calculations and images, more targeted search and the latest content available as a PDF. As of September 2023, this Reliawiki page will not continue to be updated. Please update all links and bookmarks to the latest reference at help.reliasoft.com/reference/life_data_analysis

Chapter 3: Life Distributions


Weibullbox.png

Chapter 3  
Life Distributions  

Synthesis-icon.png

Available Software:
Weibull++

Examples icon.png

More Resources:
Weibull++ Examples Collection


We use the term Life Distributions to describe the collection of statististical probability distribution that we use in Reliability Engineering and Life Data Analysis.

Distributions

A statistical distribution is fully described by its pdf (or probability density function). In the previous sections, we used the definition of the pdf to show how all other functions most commonly used in reliability engineering and life data analysis can be derived, namely the reliability function, failure rate function, mean time function and median life function, etc. All of these can be determined directly from the pdf definition, or f(t). Different distributions exist, such as the normal, exponential, etc., and each one of them has a predefined form of f(t). These distribution definitions can be found in many references. In fact, entire texts have been dedicated to defining families of statistical distributions. These distributions were formulated by statisticians, mathematicians and engineers to mathematically model or represent certain behavior. For example, the Weibull distribution was formulated by Walloddi Weibull and thus it bears his name. Some distributions tend to better represent life data and are most commonly called lifetime distributions. One of the simplest and most commonly used distributions (and often erroneously overused due to its simplicity), is the exponential distribution. The pdf of the exponential distribution is mathematically defined as:

[math]\displaystyle{ f(t)=\lambda e^{-\lambda t} }[/math]

In this definition, note that [math]\displaystyle{ t }[/math] is our random variable which represents time and the Greek letter [math]\displaystyle{ \lambda }[/math] (lambda) represents what is commonly referred to as the parameter of the distribution. Depending on the value of [math]\displaystyle{ \lambda , }[/math] [math]\displaystyle{ f(t) }[/math] will be scaled differently. For any distribution, the parameter or parameters of the distribution are estimated from the data. For example, the most well-known distribution, the normal (or Gaussian) distribution, is given by:

[math]\displaystyle{ f(t)=\frac{1}{\sigma \sqrt{2\pi }}{e}^{-\frac{1}{2}(\frac{t-\mu}{\sigma})^2} }[/math]


[math]\displaystyle{ \mu }[/math], the mean, and [math]\displaystyle{ \sigma , }[/math] the standard deviation, are its parameters. Both of these parameters are estimated from the data, i.e. the mean and standard deviation of the data. Once these parameters have been estimated, our function [math]\displaystyle{ f(t) }[/math] is fully defined and we can obtain any value for [math]\displaystyle{ f(t) }[/math] given any value of [math]\displaystyle{ t }[/math].
Given the mathematical representation of a distribution, we can also derive all of the functions needed for life data analysis, which again will depend only on the value of [math]\displaystyle{ t }[/math] after the value of the distribution parameter or parameters have been estimated from data. For example, we know that the exponential distribution [math]\displaystyle{ pdf }[/math] is given by:

[math]\displaystyle{ f(t)=\lambda e^{-\lambda t} }[/math]


Thus, the exponential reliability function can be derived to be:

[math]\displaystyle{ \begin{align} R(t)= & 1-\int_{0}^{t}\lambda {{e}^{-\lambda s}}ds \\ = & 1-[ 1-{{e}^{-\lambda \cdot t}}] \\ = & {{e}^{-\lambda \cdot t}} \\ \end{align} }[/math]

The exponential failure rate function is:

[math]\displaystyle{ \begin{align} \lambda (t) =& \frac{f(t)}{R(t)} \\ =& \frac{\lambda {e}^{-\lambda t}}{e^{-\lambda t}} \\ =& \lambda \end{align} }[/math]

The exponential Mean-Time-To-Failure (MTTF) is given by:

[math]\displaystyle{ \begin{align} \mu = & \int_{0}^{\infty} t\cdot f(t)dt \\ = & \int_{0}^{\infty}{t \cdot {\lambda} \cdot e^{-\lambda t}}dt \\ = & \frac{1}{\lambda } \end{align} }[/math]

This exact same methodology can be applied to any distribution given its [math]\displaystyle{ pdf }[/math], with various degrees of difficulty depending on the complexity of [math]\displaystyle{ f(t) }[/math].

Parameter Types

Distributions can have any number of parameters. Do note that as the number of parameters increases, so does the amount of data required for a proper fit. In general, most distributions used for reliability and life data analysis, the lifetime distributions, usually are limited to a maximum of three parameters. These three parameters are usually known as the scale parameter, the shape parameter and the location parameter.

Scale Parameter

The scale parameter is the most common type of parameter. All distributions in this reference have a scale parameter. In the case of one-parameter distributions, the sole parameter is the scale parameter. The scale parameter defines where the bulk of the distribution lies, or how stretched out the distribution is. In the case of the normal distribution, the scale parameter is the standard deviation.

Shape Parameter

The shape parameter, as the name implies, helps define the shape of a distribution. Some distributions, such as the exponential or normal, do not have a shape parameter since they have a predefined shape that does not change. In the case of the normal distribution, the shape is always the familiar bell shape. The effect of the shape parameter on a distribution is reflected in the shapes of the [math]\displaystyle{ pdf }[/math], the reliability function and the failure rate function.

Location Parameter

The location parameter is used to shift a distribution in one direction or another. The location parameter, usually denoted as [math]\displaystyle{ \gamma }[/math], defines the location of the origin of a distribution and can be either positive or negative. In terms of lifetime distributions, the location parameter represents a time shift.

Graphical view of the location parameter

This means that the inclusion of a location parameter for a distribution whose domain is normally [math]\displaystyle{ [0,\infty] }[/math] will change the domain to [math]\displaystyle{ [\gamma ,\infty] }[/math] , where [math]\displaystyle{ \gamma }[/math] can be either positive or negative. This can have some profound effects in terms of reliability. For a positive location parameter, this indicates that the reliability for that particular distribution is always 100% up to that point. In other words, a failure cannot occur before this time [math]\displaystyle{ \gamma }[/math]. Many engineers feel uncomfortable in saying that a failure will absolutely not happen before any given time. On the other hand, the argument can be made that almost all life distributions have a location parameter, although many of them may be negligibly small. Similarly, many people are uncomfortable with the concept of a negative location parameter, which states that failures theoretically occur before time zero. Realistically, the calculation of a negative location parameter is indicative of quiescent failures (failures that occur before a product is used for the first time) or of problems with the manufacturing, packaging or shipping processes. More attention will be given to the concept of the location parameter in subsequent discussions of the exponential and Weibull distributions, which are the lifetime distributions that most frequently employ the location parameter.

Most Commonly Used Distributions

There are many different lifetime distributions that can be used to model reliability data. Leemis [22] and others present a good overview of many of these distributions. In this reference, we will concentrate on the most commonly used and most widely applicable distributions for life data analysis, as outlined in the following sections.

The Weibull Distribution

The Weibull distribution is a general purpose reliability distribution used to model material strength, times-to-failure of electronic and mechanical components, equipment or systems. In its most general case, the three-parameter Weibull [math]\displaystyle{ pdf }[/math] is defined by:

[math]\displaystyle{ f(t)=\frac{\beta}{\eta }( \frac{t-\gamma }{\eta } )^{\beta -1}{e}^{-(\tfrac{t-\gamma }{\eta }) ^{\beta}} }[/math]


with three parameters [math]\displaystyle{ \beta }[/math] , [math]\displaystyle{ \eta }[/math] and [math]\displaystyle{ \gamma , }[/math] where [math]\displaystyle{ \beta = }[/math] shape parameter, [math]\displaystyle{ \eta = }[/math] scale parameter and location parameter.
If the location parameter, [math]\displaystyle{ \gamma }[/math] , is assumed to be zero, the distribution then becomes the two-parameter Weibull or:

[math]\displaystyle{ f(t)=\frac{\beta}{\eta }( \frac{t }{\eta } )^{\beta -1}{e}^{-(\tfrac{t }{\eta }) ^{\beta}} }[/math]

One additional form is the one-parameter Weibull distribution, which assumes that the location parameter, [math]\displaystyle{ \gamma , }[/math] is zero, and the shape parameter is a known constant, or [math]\displaystyle{ \beta = }[/math] constant [math]\displaystyle{ =C }[/math], so:

[math]\displaystyle{ f(t)=\frac{C}{\eta}(\frac{t}{\eta})^{C-1}e^{-(\frac{t}{\eta})^C} }[/math]

Chapter 6 of this reference fully details the Weibull distribution and presents many examples of its use in Weibull++.


The Weibull-Bayesian Distribution

Another approach is the Weibull-Bayesian model which assumes that the analyst has some prior knowledge about the distribution of the shape parameter ( [math]\displaystyle{ \beta ) }[/math] of the Weibull distribution. There are many practical applications for this model, particularly when dealing with small sample sizes and/or some prior knowledge for the shape parameter is available. For example, when a test is performed, there is often a good understanding about the behavior of the failure mode under investigation, primarily through historical data or physics-of-failure.
Note that this is not the same as the so called WeiBayes model. The so called WeiBayes model is really a one-parameter Weibull distribution. It assumes a fixed value (constant) for the shape parameter and solves for the scale parameter. The Weibull-Bayesian model in Weibull++ 7 is actually a true WeiBayes model and offers an alternative to the one-parameter Weibull by including the variation and uncertainty that is present in the prior estimation of the shape parameter.
The Weibull-Bayesian distribution and its characteristics are presented in more detail in Chapter 6.

The Exponential Distribution

The exponential distribution is commonly used for components or systems exhibiting a constant failure rate and is defined in its most general case by:

[math]\displaystyle{ f(t)=\lambda {e}^{-\lambda(t-\gamma )} }[/math]

(also known as the two-parameter exponential in this form) with two parameters, namely [math]\displaystyle{ \lambda }[/math] and [math]\displaystyle{ \gamma . }[/math] If the location parameter, [math]\displaystyle{ \gamma }[/math], is assumed to be zero, the distribution then becomes the one-parameter exponential or,

[math]\displaystyle{ f(t)=\lambda {{e}^{-\lambda t}} }[/math]

The exponential distribution and its characteristics are presented in more detail in Chapter 7.


The Normal Distribution


The normal distribution is commonly used for general reliability analysis, times-to-failure of simple electronic and mechanical components, equipment or systems. The [math]\displaystyle{ pdf }[/math] of the normal distribution is given by:

[math]\displaystyle{ \begin{align} f(t)= \frac{1}{\sigma \sqrt{2\pi }}{e^{-\tfrac{1}{2}(\tfrac{t-\mu }{\sigma })^2}} \end{align} }[/math]


where,


[math]\displaystyle{ \begin{align} \mu = & \text{mean of the normal times to failure} \\ \sigma = & \text{standard deviation of the times to failure} \end{align} }[/math]

The normal distribution and its characteristics are presented in more detail in Chapter 8.

The Lognormal Distribution

The lognormal distribution is commonly used for general reliability analysis, cycles-to-failure in fatigue, material strengths and loading variables in probabilistic design. When the natural logarithms of the times-to-failure are normally distributed, then we say that the data follow the lognormal distribution.
The [math]\displaystyle{ pdf }[/math] of the lognormal distribution is given by:

[math]\displaystyle{ \begin{align} & f(t)=\frac{1}{t{\sigma}_{T'}\sqrt{2\pi}}e^{-\tfrac{1}{2}(\tfrac{T'-{\mu'}}{\sigma_{T'}})^2}\\ & f(t)\ge 0,t\gt 0,{{\sigma }_{T'}}\gt 0 \\ & {T'}= \ln (t) \end{align} }[/math]


Where,
[math]\displaystyle{ \begin{align} & {\mu'}= \text{mean of the natural logarithms of the times-to-failure} \\ & {\sigma_{T'}}= \text{standard deviation of the natural logarithms of the times to failure} \end{align} }[/math]

The lognormal distribution and its characteristics are presented in more detail in Chapter 9.


Other Distributions

In addition to the distributions mentioned, the following additional distributions, even though not as frequently used in Life Data Analysis, have a variety of applications and can be found in many statistical references. They are included in Weibull++ as well as discussed in this reference.


The Mixed Weibull Distribution

The mixed Weibull distribution is commonly used for modeling the behavior of components or systems exhibiting multiple failure modes (mixed populations). It gives the global picture of the life of a product by mixing different Weibull distributions for different stages of the product’s life and is defined by:

[math]\displaystyle{ f_{S}(t)=\sum_{i=1}^{S}p_{i}\frac{\beta_{i}}{\eta_{i}}(\frac{t}{\eta_{i}})^{\beta_{i}-1}e^{-(\frac{t}{\eta_{i}})^{\beta_{i}}} }[/math]

where the value of [math]\displaystyle{ S }[/math] is equal to the number of subpopulations. Note that this results in a total of [math]\displaystyle{ (3\cdot S-1) }[/math] parameters. In other words, each population has a portion or mixing weight for the [math]\displaystyle{ {{i}^{th}} }[/math] population, a [math]\displaystyle{ \beta_{i} }[/math] , or shape parameter for the [math]\displaystyle{ {{i}^{th}} }[/math] population and or scale parameter [math]\displaystyle{ \eta_{i} }[/math] for [math]\displaystyle{ {{i}^{th}} }[/math] population. Note that the parameters are reduced to [math]\displaystyle{ (3\cdot S-1) }[/math], given the fact that the following condition can also be used:

[math]\displaystyle{ \sum_{i=1}^{s}p_{i}=1 }[/math]

The mixed Weibull distribution and its characteristics are presented in more detail in Chapter 11.

The Generalized Gamma Distribution

While not as frequently used for modeling life data as the distributions discussed previously, the generalized gamma distribution does have the ability to mimic the attributes of other distributions, such as the Weibull or lognormal, based on the values of the distribution’s parameters and also offers a compromise between two lifetime distributions. The generalized gamma function is a three-parameter distribution with parameters μ , [math]\displaystyle{ \sigma }[/math] and λ . The pdf of the distribution is given by,

[math]\displaystyle{ f(x)=\begin{cases} \frac{|\lambda|}{\sigma \cdot t}\cdot \tfrac{1}{\Gamma( \tfrac{1}{\lambda}^2)}\cdot {e^{\tfrac{\lambda \cdot{\tfrac{\ln(t)-\mu}{\sigma}}+\ln( \tfrac{1}{{\lambda}^2})-e^{\lambda \cdot {\tfrac{\ln(t)-\mu}{\sigma}}}}{{\lambda}^2}}}, &\text{if} \lambda \ne 0 \\ \frac{1}{t\cdot \sigma \sqrt{2\pi }} e^{-\tfrac{1}{2}{(\tfrac{\ln(t)-\mu}{\sigma })^2}}, & \text{if} \lambda =0 \end{cases} }[/math]


where Γ(x) is the gamma function, defined by:

[math]\displaystyle{ \Gamma (x)=\int_{0}^{\infty}{s}^{x-1}{e^{-s}}ds }[/math]


This distribution behaves as do other distributions based on the values of the parameters. For example, if λ = 1, the distribution is identical to the Weibull distribution. If both λ = 1 and σ = 1, the distribution is identical to the exponential distribution and for λ = 0, it is identical to the lognormal distribution. While the generalized gamma distribution is not often used to model life data by itself, its ability to behave like other more commonly-used life distributions is sometimes used to determine which of those life distributions should be used to model a particular set of data.


The Gamma Distribution

The gamma distribution is a flexible distribution that may offer a good fit to some sets of life data. Sometimes called the Erlang distribution, gamma distribution has applications in Bayesian analysis as a prior distribution and is also commonly used in queuing theory. The [math]\displaystyle{ pdf }[/math] of the gamma distribution is given by:

[math]\displaystyle{ \begin{align} f(t)= & \frac{e^{kz-{e^{z}}}}{t\Gamma(k)} \\ z= & \ln{t}-\mu \end{align} }[/math]


where:


[math]\displaystyle{ \begin{align} \mu = & \text{scale parameter} \\ k= & \text{shape parameter} \end{align} }[/math]


where 0 [math]\displaystyle{ \lt t\lt \infty }[/math] , [math]\displaystyle{ -\infty \lt \mu \lt \infty }[/math] and [math]\displaystyle{ k\gt 0 }[/math].

The gamma distribution and its characteristics are presented in more detail in Chapter 10.

The Logistic Distribution

The logistic distribution has a shape very similar to the normal distribution (i.e. bell shaped), but with heavier tails. Since the logistic distribution has closed form solutions for the reliability, [math]\displaystyle{ cdf }[/math] and failure rate functions, it is sometimes preferred over the normal distribution, where these functions can only be obtained numerically. The [math]\displaystyle{ pdf }[/math] of the logistic distribution is given by:

[math]\displaystyle{ \begin{align} f(t)= & \frac{e^z}{\sigma {(1+{e^z})^{2}}} \\ z= & \frac{t-\mu }{\sigma } \\ \sigma \gt & 0 \end{align} }[/math]


where:


[math]\displaystyle{ \mu = \text{location parameter,also denoted as } }[/math] [math]\displaystyle{ \overline{T} }[/math]


[math]\displaystyle{ \sigma=\text{scale parameter} }[/math]


The logistic distribution and its characteristics are presented in more detail in Chapter 10.


The Loglogistic Distribution

As may be summarized from the name, the loglogistic distribution is similar to the logistic distribution. Specifically, the data follows a loglogistic distribution when the natural logarithms of the times-to-failure follow a logistic distribution. Accordingly, the loglogistic and lognormal distributions also share many similarities.
The [math]\displaystyle{ pdf }[/math] of the loglogistic distribution is given by:

[math]\displaystyle{ \begin{align} f(t)= & \frac{e^z}{\sigma t{(1+{e^z})^2}} \\ z= & \frac{T'-{\mu }'}{\sigma } \\ f(t)\ge & 0,t\gt 0,{{\sigma }_{T'}}\gt 0, \\ {T}'= & ln(t) \end{align} }[/math]


where,


[math]\displaystyle{ \begin{align} \mu'= & \text{scale parameter} \\ \sigma_{T}=& \text{shape parameter} \end{align} }[/math]


The loglogistic distribution and its characteristics are presented in more detail in Chapter 10.

The Gumbel Distribution

The Gumbel distribution is also referred to as the Smallest Extreme Value (SEV) distribution or the Smallest Extreme Value (Type 1) distribution. The Gumbel distribution is appropriate for modeling strength, which is sometimes skewed to the left (few weak units fail under low stress, while the rest fail at higher stresses). The Gumbel distribution could also be appropriate for modeling the life of products that experience very quick wear out after reaching a certain age.

The [math]\displaystyle{ pdf }[/math] of the Gumbel distribution is given by:

[math]\displaystyle{ \begin{align} f(t)= & \frac{1}{\sigma }{{e}^{z-{e^z}}} \\ z= &\frac{t-\mu }{\sigma } \\ f(T)\ge & 0,\sigma \gt 0 \end{align} }[/math]


where,
[math]\displaystyle{ \begin{align} \mu = & \text{location parameter} \\ \sigma = & \text{scale parameter} \end{align} }[/math]

The Gumbel distribution and its characteristics are presented in more detail in Chapter 10.