# Multivariable Relationships: General Log-Linear and Proportional Hazards

### From ReliaWiki

So far in this reference the life-stress relationships presented have been either single stress relationships or two stress relationships. In most practical applications, however, life is a function of more than one or two variables (stress types). In addition, there are many applications where the life of a product as a function of stress and of some engineering variable other than stress is sought. In this chapter, the general log-linear relationship and the proportional hazards model are presented for the analysis of such cases where more than two accelerated stresses (or variables) need to be considered.

# General Log-Linear Relationship

When a test involves multiple accelerating stresses or requires the inclusion of an engineering variable, a general multivariable relationship is needed. Such a relationship is the general log-linear relationship, which describes a life characteristic as a function of a vector of stresses, or ALTA includes this relationship and allows up to eight stresses. Mathematically the relationship is given by:

where:

- and are model parameters.

- is a vector of stresses.

This relationship can be further modified through the use of transformations and can be reduced to the relationships discussed previously, if so desired. As an example, consider a single stress application of this relationship and an inverse transformation on such that or:

It can be easily seen that the generalized log-linear relationship with a single stress and an inverse transformation has been reduced to the Arrhenius relationship, where:

or:

Similarly, when one chooses to apply a logarithmic transformation on such that , the relationship would reduce to the Inverse Power Law relationship. Furthermore, if more than one stress is present, one could choose to apply a different transformation to each stress to create combination relationships similar to the Temperature-Humidity and the Temperature-Non Thermal. ALTA has three built-in transformation options, namely:

None | X=V | Exponential LSR |

Reciprocal | Arrhenius LSR | |

Logarithmic | Power LSR |

The power of the relationship and this formulation becomes evident once one realizes that 6,561 unique life-stress relationships are possible (when allowing a maximum of eight stresses). When combined with the life distributions available in ALTA, almost 20,000 models can be created.

## Using the GLL Model

Like the previous relationships, the general log-linear relationship can be combined with any of the available life distributions by expressing a life characteristic from that distribution with the GLL relationship. A brief overview of the GLL-distribution models available in ALTA is presented next.

### GLL Exponential

The GLL-exponential model can be derived by setting in the exponential *pdf*, yielding the following GLL-exponential *pdf*:

The total number of unknowns to solve for in this model is (i.e.,

### GLL Weibull

The GLL-Weibull model can be derived by setting in Weibull *pdf*, yielding the following GLL-Weibull *pdf*:

The total number of unknowns to solve for in this model is (i.e.,

### GLL Lognormal

The GLL-lognormal model can be derived by setting
in the lognormal *pdf*, yielding the following GLL-lognormal *pdf*:

The total number of unknowns to solve for in this model is (i.e.,

### GLL Likelihood Function

The maximum likelihood estimation method can be used to determine the parameters for the GLL relationship and the selected life distribution. For each distribution, the likelihood function can be derived, and the parameters of model (the distribution parameters and the GLL parameters) can be obtained by maximizing the log-likelihood function. For example, the log-likelihood function for the Weibull distribution is given by:

where:

and:

- is the number of groups of exact times-to-failure data points.

- is the number of times-to-failure in the time-to-failure data group.

- is the failure rate parameter (unknown).

- is the exact failure time of the group.

- is the number of groups of suspension data points.

- is the number of suspensions in the group of suspension data points.

- is the running time of the suspension data group.

- is the number of interval data groups.

- is the number of intervals in the group of data intervals.

- is the beginning of the interval.

- is the ending of the interval.

## GLL Example

Consider the data summarized in the following tables. These data illustrate a typical three-stress type accelerated test.

The data in the second table are analyzed assuming a Weibull distribution, an Arrhenius life-stress relationship for temperature and an inverse power life-stress relationship for voltage. No transformation is performed on the operation type. The operation type variable is treated as an indicator variable that takes a discrete value of 0 for an on/off operation and 1 for a continuous operation. The following figure shows the stress types and their transformations in ALTA.

The GLL relationship then becomes:

The resulting relationship after performing these transformations is:

Therefore, the parameter of the Arrhenius relationship is equal to the log-linear coefficient , and the parameter of the inverse power relationship is equal to ( ). Therefore can also be written as:

The activation energy of the Arrhenius relationship can be calculated by multiplying B with Boltzmann's constant.

The best fit values for the parameters in this case are:

Once the parameters are estimated, further analysis on the data can be performed. First, using ALTA, a Weibull probability plot of the data can be obtained, as shown next.

Several types of information about the model as well as the data can be obtained from a probability plot. For example, the choice of an underlying distribution and the assumption of a common slope (shape parameter) can be examined. In this example, the linearity of the data supports the use of the Weibull distribution. In addition, the data appear parallel on this plot, therefore reinforcing the assumption of a common beta. Further statistical analysis can and should be performed for these purposes as well.

The Life vs. Stress plot is a very common plot for the analysis of accelerated data. Life vs. Stress plots can be very useful in assessing the effect of each stress on a product's failure. In this case, since the life is a function of three stresses, three different plots can be created. Such plots are created by holding two of the stresses constant at the desired use level, and varying the remaining one. The use stress levels for this example are 328K for temperature and 10V for voltage. For the operation type, a decision has to be made by the engineers as to whether they implement an on/off or continuous operation. The next two figures display the effects of temperature and voltage on the life of the product.

The effects of the two different operation types on life can be observed in the next figure. It can be seen that the on/off cycling has a greater effect on the life of the product in terms of accelerating failure than the continuous operation. In other words, a higher reliability can be achieved by running the product continuously.

# Proportional Hazards Model

Introduced by D. R. Cox, the Proportional Hazards (PH) model was developed in order to estimate the effects of different covariates influencing the times-to-failure of a system. The model has been widely used in the biomedical field, as discussed in Leemis [22], and recently there has been an increasing interest in its application in reliability engineering. In its original form, the model is non-parametric, (i.e., no assumptions are made about the nature or shape of the underlying failure distribution). In this reference, the original non-parametric formulation as well as a parametric form of the model will be considered utilizing a Weibull life distribution. In ALTA, the proportional hazards model is included in its parametric form and can be used to analyze data with up to eight variables. The GLL-Weibull and GLL-exponential models are actually special cases of the proportional hazards model. However, when using the proportional hazards in ALTA, no transformation on the covariates (or stresses) can be performed.

## Non-Parametric Model Formulation

According to the PH model, the failure rate of a system is affected not only by its operation time, but also by the covariates under which it operates. For example, a unit may have been tested under a combination of different accelerated stresses such as humidity, temperature, voltage, etc. It is clear then that such factors affect the failure rate of a unit.

The instantaneous failure rate (or hazard rate) of a unit is given by:

where:

- is the probability density function.

- is the reliability function.

Note that for the case of the failure rate of a unit being dependent not only on time but also on other covariates, the above equation must be modified in order to be a function of time and of the covariates. The proportional hazards model assumes that the failure rate (hazard rate) of a unit is the product of:

- an arbitrary and unspecified baseline failure rate, which is a function of time only.

- a positive function , independent of time, which incorporates the effects of a number of covariates such as humidity, temperature, pressure, voltage, etc.

The failure rate of a unit is then given by:

where:

- is a row vector consisting of the covariates:

- is a column vector consisting of the unknown parameters (also called regression parameters) of the model:

- where:

- = number of stress related variates (time-independent).

It can be assumed that the form of is known and is unspecified. Different forms of can be used.

However, the exponential form is mostly used due to its simplicity and is given by:

The failure rate can then be written as:

## Parametric Model Formulation

A parametric form of the proportional hazards model can be obtained by assuming an underlying distribution. In ALTA, the Weibull and exponential distributions are available. In this section we will consider the Weibull distribution to formulate the parametric proportional hazards model. In other words, it is assumed that the baseline failure rate is parametric and given by the Weibull distribution. In this case, the baseline failure rate is given by:

The PH failure rate then becomes:

It is often more convenient to define an additional covariate, , in order to allow the Weibull scale parameter raised to the beta (shape parameter) to be included in the vector of regression coefficients. The PH failure rate can then be written as:

The PH reliability function is given by:

The *pdf* can be obtained by taking the partial derivative of the reliability function with respect to time. The PH *pdf* is:

The total number of unknowns to solve for in this model is (i.e., ).

The maximum likelihood estimation method can be used to determine these parameters. The log-likelihood function for this case is given by:

where:

Solving for the parameters that maximize the log-likelihood function will yield the parameters for the PH-Weibull model. Note that for , the log-likelihood function becomes the log-likelihood function for the PH-exponential model, which is similar to the original form of the proportional hazards model proposed by Cox and Oakes [39].

Note that the likelihood function of the GLL model is very similar to the likelihood function for the proportional hazards-Weibull model. In particular, the shape parameter of the Weibull distribution can be included in the regression coefficients as follows:

where:

- are the parameters of the PH model.

- are the parameters of the general log-linear model.

In this case, the likelihood functions are identical. Therefore, if no transformation on the covariates is performed, the parameter values that maximize the likelihood function of the GLL model also maximize the likelihood function for the proportional hazards-Weibull (PHW) model. Note that for (exponential life distribution), the two likelihood functions are identical, and

# Indicator Variables

Another advantage of the multivariable relationships included in ALTA is that they allow for simultaneous analysis of continuous and categorical variables. Categorical variables are variables that take on discrete values such as the lot designation for products from different manufacturing lots. In this example, lot is a categorical variable, and it can be expressed in terms of indicator variables. Indicator variables only take a value of 1 or 0. For example, consider a sample of test units. A number of these units were obtained from Lot 1, others from Lot 2, and the rest from Lot 3. These three lots can be represented with the use of indicator variables, as follows:

- Define two indicator variables, and

- For the units from Lot 1, and

- For the units from Lot 2, and

- For the units from Lot 3, and

Assume that an accelerated test was performed with these units, and temperature was the accelerated stress. In this case, the GLL relationship can be used to analyze the data. From the GLL relationship we get:

where:

- and are the indicator variables, as defined above.

- where is the temperature.

The data can now be entered in ALTA and, with the assumption of an underlying life distribution and using MLE, the parameters of this model can be obtained.