Chapter 4

 

Home
Index
Chapter 1
Chapter 2
Chapter 3
Chapter 4
Chapter 5
Chapter 6
References
Appendix 13
Appendix 14
Appendix 15
Appendix 16
Appendix 17
Contact

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Chapter 4 – Analysis of the relationship between the Focus 2002 survey data and significant- injury frequency rates

4.1 - Introduction  

The purpose of this chapter is to describe the methodology that was employed to address research questions 8 to 13 inclusive listed in Table 1.1.  

4.2 – Method 

Using SIMCA P+ (Version 10.0.4.0), eleven separate PLS models were built.  The default mean centering and scaling option was selected within SIMCA P+.  Details of the X block and Y block information entered into each model are summarised in Table 4.1.  Section 2.9.3.2 explained the iterative nature of PLS modelling.  All of the models were run at least thirty times.  The purpose of the re-runs was to improve the predictive ability of the models.  The purpose of building the models was only to answer the research questions.  The models were not fully optimised.  Further optimisation and refinement of the models would have been possible to better represent the data however, the time it would have taken to fully optimise the models would be disproportional to the benefits obtained.  When a PLS model was capable of being built, the resultant models were validated using response permutation validation option within SIMCA P+ with the number of random shuffles set to the maximum value of 999. 

PLS Model Name

Nation

PLS Model ‘Y Block Descriptor

PLS Model X block Descriptor

PLS Model X block - Focus 2002 Questions

PLS-UK1

UK

The 2002 significant-injury frequency rates for all UK sites.

The average site response to those UK Focus 2002 questions that correlated with UK AstraZeneca SIFR at the 95% confidence level.

7, 8a, 8b, 8c, 10f, 12, 16, 20a, 20c, 20d, 20e, 22, 23, 24, 25, 27b, 37, 38a, 38b, 40, 47, 53a, 53b, 53c, 55a.

PLS-UK2

UK

The 2002 significant-injury frequency rates for all UK sites.

UK Site average responses to all Focus 2002 questions.

All UK Focus 2002 question responses.

PLS-UK3

UK

The 2002 significant-injury frequency rates for all UK sites.

UK Site standard deviations for all Focus 2002 question responses.

All UK Focus 2002 question responses.

PLS-SE1

SE

The 2002 significant-injury frequency rates for all SE sites.

The average site response to those SE Focus 2002 questions that correlated with SE AstraZeneca SIFR at the 95% confidence level.

5, 14, 17, 21, 23, 24, 25, 26, 28, 33, 34, 35, 44, 48, 55a, 55b, 58,

PLS-SE2

SE

The 2002 significant-injury frequency rates for all SE sites.

SE Site average responses to all Focus 2002 questions.

All SE Focus 2002 question responses.

PLS-SE3

SE

The 2002 significant-injury frequency rates for all SE sites.

SE Site standard deviations for all Focus 2002 question responses.

All SE Focus 2002 question responses.

 

Table 4.1 – Details of the PLS model inputs

 

PLS Model Name

Nation

PLS Model ‘Y Block Descriptor

PLS Model X block Descriptor

PLS Model X block - Focus 2002 Questions

PLS-US1

US

The 2002 significant-injury frequency rates for all US sites.

The average site response to those US Focus 2002 questions that correlated with US AstraZeneca SIFR at the 95% confidence level.

3, 4, 11, 20a, 20e, 22, 24, 30b, 38a, 38b, 38c, 53a, 53b, 53c, 55a, 57.

PLS-US2

US

The 2002 significant-injury frequency rates for all US sites.

US Site average responses to all Focus 2002 questions.

All US Focus 2002 question responses.

PLS-US3

US

The 2002 significant-injury frequency rates for all US sites.

US Site standard deviations for all Focus 2002 question responses.

All US Focus 2002 question responses.

PLS-UK-SE-US-1

UK, SE, US

The 2002 significant-injury frequency rates for all UK, SE and US sites.

X block information from Models PLS-UK1, PLS-SE1, and PLS-US1.

7, 8a, 8b, 8c, 10f, 12, 16, 20a, 20c, 20d, 20e, 22, 23, 24, 25, 27b, 37, 38a, 38b, 40, 47, 53a, 53b, 53c, 55a.

 

5, 14, 17, 21, 23, 24, 25, 26, 28, 33, 34, 35, 44, 48, 55a, 55b, 58,

 

3, 4, 11, 20a, 20e, 22, 24, 30b, 38a, 38b, 38c, 53a, 53b, 53c, 55a, 57.

PLS-UK-SE-US-2

UK, SE, US

The 2002 significant-injury frequency rates for all UK, SE and US sites.

The average site responses to all of the Focus 2002 questions.

All Focus 2002 questions.

Table 4.1 (Continued) – Details of PLS model inputs  

 

4.3 – Results 

The results of the modelling are summarised in Table 4.2.  A description of the columns within Table 4.2 is as follows:  

Column 1:  The PLS model name.

Column 2:  The number of model principal components.

Column 3:  The R2X value for the 1st principal component.

Column 4:  The R2X cumulative value for all of the model principal components.

Column 5:  The Q2 value for the 1st principal component

Column 6:  The Q2 cumulative value for all of the model principal components.

Column 7:  The R2Y value for the 1st principal component.

Column 8:  The R2Y cumulative value for all of the model principal components.

Column 9:  The R2 value for the best-fit line through actual versus predicted SIFR plot.

Column 10:  The R2 ordinate intercept value of the response permutation validation plot.

Column 11:  The Q2 ordinate intercept value of the response permutation validation plot.

Column 12:  The Focus 2002 question responses retained in the final model. 

The detailed graphical outputs of the PLS models are reproduced in Appendix 13.

 

PLS Model Name

No Of Components

R2X

(1st component)

R2X(cum)

 

Q2

(1st component)

Q2 (cum)

R2Y (1st Component)

R2Y (cum)

R2 Of Actual v

Predicted

R2 Intercept

Q2

Intercept

Question Responses Retained Within The Model

PLS -UK1

2

0.730

0.798

0.733

0.869

0.818

0.960

0.992

0.645

0.042

7, 8a, 8b, 8c, 10f, 12, 16, 20a, 20c, 20d, 20e, 22, 23, 24, 25, 27b, 37, 38a, 38b, 40, 47, 53a, 53b, 53c, 55a.

PLS- UK2

2

0.762

0.874

0.892

0.918

0.912

0.958

0.993

0.478

-0.156

2, 8a, 8c, 20a, 22, 23, 24, 25, 30b, 38a, 38b, 40, 53a, 53c.

PLS -UK3

2

0.730

0.885

0.594

0.861

0.726

0.935

0.983

0.468

-0.186

8c, 9a, 9b, 10b, 10c, 10g, 20a, 22, 32, 38a, 41, 43, 50, 53a,

PLS-SE1

1

0.814

0.814

0.578

0.578

0.622

0.622

0.941

0.071

-0.237

5, 14, 17, 21, 23, 24, 25, 26, 28, 33, 34, 35, 44, 48, 55a, 55b, 58,

PLS-SE2

2

0.823

0.894

0.356

0.710

0.499

0.883

0.883

0.048

-0.072

5, 6, 7, 8a, 8b, 9b, 10a, 10c, 10d, 16, 17, 18, 19, 20e, 21, 23, 25, 26, 28, 32, 33, 35, 40, 56, 58, 60

PLS-SE3

2

0.430

0.757

0.189

0.738

0.626

0.960

0.966

0.563

-0.125

1, 3, 5, 6, 9a, 10b, 10c, 10f, 11, 20c, 22, 27c, 31, 33, 35, 37, 45,46, 48, 50, 53a, 53b, 59.

PLS–US1

1

0.911

0.911

0.964

0.964

0.975

0.975

0.975

-0.553

-0.422

3, 4, 11, 20a, 20e, 22, 24, 30b, 38a, 38b, 38c, 53a, 53b, 53c, 55a, 57,

PLS-US2

1

0.718

0.718

0.895

0.895

0.921

0.921

0.994

0.373

-0.323

1, 2, 3, 4, 5, 7, 10d, 10f, 11, 13, 14, 15, 16, 20a, 20c, 20d, 20e, 21, 22, 24, 25, 26, 30a, 30b, 31, 33, 34, 35, 37, 38a, 38b, 38c, 39, 40, 43, 44, 46, 47, 52, 53a, 53b, 53c, 55a, 55b, 56, 57, 58.

PLS-US3

1

0.867

0.867

0.979

0.979

0.982

0.982

0.982

-0.036

-0.440

2, 4, 15, 20a, 20c, 20e, 22, 24, 26, 30b, 31, 35, 37, 38a, 38b, 38c, 47, 53a, 53b, 53c, 57.

PLS-UK-SE-US-1

na

na

na

na

na

na

na

na

na

na

na

PLS-UK-SE-US-2

na

na

na

na

na

na

na

na

na

na

na

Table 4.2 – Summary of the PLS modelling results


Table 4.2 – Note 1

SIMCA P+ was unable to model the data in models PLS-UK-SE-US-1 and PLS-UK-SE-US-2 (as indicated by the inability to produce principal components).

 Table 4.2 – Note 2.

The AstraZeneca SIFR distributions were logarithmically transformed within SIMCA P+ by application of Equation 6.  The transformation was performed to reduce the skewness of the SIFR Y block data.  Reducing the skewness made the data more normally distributed, hence increasing the ability of SIMCA P+ to model the data.

                                      Log transformed SIFR =  log10 (SIFR+1)                                           (6)

 

4.4 – Conclusions

Section 4.4 records the conclusions of the work performed in this chapter.   

4.4.1 – Validity of the PLS models  

Section 2.9.3.2 stated that for a PLS model to be valid the Q2 ordinate intercept of the cross permutation validation plot should be no greater than 0.05.  All of the cross permutation validation plots for all models produced were found to have Q2 ordinate intercepts less than 0.05.  Apart from PLS-UK1, all Q2 intercepts were found to be negative.  All models produced were therefore proven to have some validity i.e. they predict SIFR performance significantly better than chance.  Examination of the response permutation validation plots shows that the R2 and Q2 values of some of the permuted models are greater than the original model.  This increase is expected.  The expected increase in R2 and Q2 results from the permuted models being only slightly perturbed from the original model. 

4.4.2 - Predictive ability- national level   

The R2 value of the best-fit line of actual versus predicted SIFR was greater than 0.88 for all models produced.  Based upon the R2 values and having regard to the number of degrees of freedom, the ability of the PLS models to predict SIFR performance may be described as good in all cases.  This answers research question 8 listed in Table 1.1.  Inspection of the actual versus predicted SIFR prediction plots in Appendix 13 indicates that all of the models produced are able to discriminate between those sites with good SIFR performance from those that have poorer SIFR performance.  The ability of the PLS models to discriminate between sites with similar SIFR performance may be described as moderate.  The ability of PLS-SE2 to model SIFR performance appears to be poorer than PLS-UK2 and PLS-US2.  There may be several possible explanations for this difference that include:   

·        There may be inconsistencies in the threshold of SIFR reporting across the SE sites.

·        Factors other than organisational culture dominate or significantly affect SIFR performance at the SE sites. 

·        The SIFR performance at the SE sites is driven by a population whose organisational culture is not represented by the mean Focus 2002 question responses.

·        The way in which the Focus 2002 questions were translated from English to Swedish introduced degrees of interpretive freedom that in turn gave rise to a greater response variance.

The experimental work done in this project is unable to identify which, if any, of the above factors gave rise to the poorer predictive ability of PLS-SE2 compared to PLS-UK2 and PLS-US2.   Appendix 11 details Pearson correlation coefficients between the Focus 2002 question responses and SIFR performance for the UK, SE and US sites.  The R2 values of the best-fit line through the actual versus predicted SIFR for the UK and SE models are greater than the correlation between any single Focus 2002 question response and SIFR performance.  This finding answers research question 10 listed in Table 1.1.

4.4.3 - Predictive ability – international level

It was shown that it was not possible to produce a combined UK, SE, US PLS model (indicated by SIMCA P+ being unable to produce principal components for models PLS-UK-SE-US1 and PLS-UK-SE-US2).  Research question 11 asks:  Can a single model of organisational culture be used to predict SHE performance in more than one nation?  The inability to produce a combined UK, SE, US PLS model based upon the Focus 2002 responses does not necessarily preclude the possibility of creating a multi-nation model.  Subject to the identification of a suitable set of questions, it may be possible to create a PLS model that is able to predict SIFR performance in more than one nation.  It is recommended that further research should be undertaken to identify a modified suite of questions that correlates with SIFR performance in more than one nation.   The inability to produce a single model for all of the three nations is unsurprising.  Appendix 11 details the (Pearson) correlation matrix of the arithmetic mean Focus 2002 question responses and SIFR data for each of the UK, SE and US sites.  The matrix indicated that only one Focus 2002 question response is correlated with SIFR performance above the threshold of significance in all three nations.  It also indicated that a particular Focus 2002 question response may be positively correlated with SIFR in one nation and negatively correlated in another.  A possible explanation for this observation may be that the accident causation models may be different in different nations.  A possible explanation of the inability to identify a set of questions that correlate within all three nations may be the dominance of national culture over organisational culture.  If national culture dominates over organisational culture, the likelihood of finding a set of questions that correlate with safety performance in several nations is decreased.  This decreased likelihood will be a result of the question responses being influenced by national cultural differences rather than how sites in any one nation perceived a particular issue.  The modelling work within this chapter is unable to identify the relative dominance of national versus organisational climate.

 

4.4.4 – The use of the Focus 2002 question response standard deviation 

The literature review showed that previous research attempts to correlate organisational culture with SHE outcomes had exclusively used the arithmetic mean responses.  Models PLS-UK3, PLS-SE3 and PLS-US3 all used the standard deviation of the Focus 2002 question responses.  All three of the models were shown to be able to model SIFR performance.  The predictive ability of these models is comparable with those models based upon the arithmetic mean responses.   The ability to model SIFR performance based upon the standard deviation of Focus 2002 responses suggests that the distribution of the responses is related to SIFR performance.  Information regarding the relationship of question response arithmetic means and standard deviations can be obtained by inspection of the following model pairs:

·        PLS-UK2 and PLS-UK3.

·        PLS-SE2 and PLS-SE3.

·        PLS-US2 and PLS-US3. 

All of the question responses retained in PLS-US3 are present within PLS-US2.  5 of the questions in PLS-UK3 are also found in PLS-UK2.  5 of the questions retained in PLS-S3 are found in PLS-SE2.  The above comparisons suggest that, for those question responses retained in both the arithmetic mean and standard deviation models, the mean as well as the distribution around the mean is an important metric that is an indicator of SIFR performance.    

4.4.5 - PLS model structure

In Section 3.2 it was noted that the Focus 2002 survey contained questions that addressed all five of Flin et al’s [60] organisational safety culture factors.   PCA models PCA-UK2, PCA-SE2 and PCA-US2 are represented by 4, 4 and 2 principal components respectively.   Table 4.2 shows that all of the resultant PLS models were found to have a maximum number of 2 principal components, rather than an anticipated maximum of 4.  Table 4.2 also indicates that all of the models starting with all of the Focus 2002 question responses are able to account for at least 71% of the SIFR variation (as indicated by Q2 cum).  The amount of SIFR variation accounted for by the 1st component in the 2- component models varies from 0.499 (PLS-SE2) to 0.912 (PLS-UK2).  The average Q2 variation accounted for by the first component in all of the PLS models is about:   

0.8 for the UK;
0.6 for SE;
0.9 for the US.

 One can therefore conclude that:

One organisational cultural construct dominates SIFR performance at the AstraZeneca UK and US sites;
Two organisational cultural constructs govern SIFR performance at AstraZeneca SE sites;
The Focus 2002 survey is able to measure the organisational cultural constructs that dominate SIFR performance at AstraZeneca UK, SE and US sites separately.  

The above conclusions answer research question 12 listed in Table 1.1.   The following explanations may account for the low Q2 value for the first component in model PLS-SE2 compared to the other PLS models.   

·        The poorer SE models may be due to there being biases or inconsistencies in the way in which the SE sites report SIFR incidents.  Another possible explanation is that there may be greater cultural diversity in SE compared to the UK and US sites.

·        More than one organisational cultural construct is related to SIFR performance in SE.

·        Biases may have been introduced into the Swedish version of the Focus 2002 questionnaire when it was translated from English.

The Focus 2002 questions responses, the number of responses and the relative loadings of those responses retained in the final PLS models are dissimilar for the UK, SE and US nations.  Several possible explanations for the dissimilarities are possible and include:   

Each of the nations may have different accident causation models, i.e. different factors are associated with the events leading to an accident.
The Focus 2002 questions may be interpreted differently in each of the nations.  Reasons for the interpretation differences may be due to cultural issues or the way in which the Focus 2002 question set was translated.

4.4.6 – Focus 2002 question responses and their association with desirable SHE performance

The majority of the questions retained in the final PLS models (detailed in Table 4.2) are not directly related to safety.  This observation answers research question 9 listed in Table 1.1.  During the literature review it was found that it is usual to group those questions loading on a particular cultural construct.  Once grouped, it is common for researchers to examine the questions with a view to labelling the construct that best describes the grouped questions.  The usefulness of this practice in industry is questionable.  Grouping several questions together under a single construct label potentially causes useful information to be lost.  Rather than grouping of the questions together under a single construct it is suggested that industry would benefit more from inspection of those questions remaining in the PLS models with a view to identification of specific targeted remedial action to address the undesired state.  An example of this practical approach follows.   Appendix 13 reproduces the graphical outputs of the PLS modelling.  The actual versus predicted SIFR plot for model PLS-UK2 (Appendix 13.7) shows that sites UK2 and UK6 have significantly higher SIFR than the other UK sites.  Simultaneous inspection of the corresponding score weightings scatter plots (Appendix 13.9 and 13.10) illustrates that sites UK2 and UK6 tend to give a higher than average response to the following questions: 

22

:

My performance targets are clear.

38a

:

I am sufficiently informed about the performance of:  My team.

40

:

My job performance is evaluated fairly.

53a

:

I have a clear understanding of the performance targets of: My team.

All the other UK sites tended to give lower than average response to these questions.  Section 2.8.2 explained that the above questions were responded to on a five point scale ranging from 1= Agree to 5= Disagree.  Higher than average responses therefore tend toward the ‘disagree’ end of the scale.  Lower than average responses tend toward the ‘agree’ end of the scale.  The observation of the relationship between the above question and desirable safety performance therefore falls in line with expectations i.e. sites that have personnel who are well informed and have clear targets achieve better SIFR performance than those sites whose personnel do not have clear performance targets and are less well informed.  Knowing that sites with undesirable SIFR performance are characterised by responses that are ‘tending to disagree’ with the above questions, managers of the sites can go about instigating remedial action to shift attitudes and satisfactions toward the more desirable end of the scale, and in doing so, perhaps improve site SIFR performance. 

4.4.7 – The use of organisational culture as a proactive safety indicator 

Research question 13 asks: Does the measurement of organisational climate metrics offer a robust tool for industry to predict SHE performance and perform relative risk ranking? The results detailed in this chapter have shown that PLS modelling of the Focus 2002 question response is able to model SIFR performance at the national level within AstraZeneca.  PLS modelling at the multi-national level may also become possible subject to the identification of a set of organisational culture question responses that correlate with safety outcomes in all nations being studied.   The practical application and usefulness of PLS modelling by industry is dependent upon the dynamics of organisational culture.  Chapter 2 established that some researchers have found organisational culture remains stable over a period of several years [35] , whereas others have found that it is more dynamic [31] .  Subject to confirmation that i) organisational culture is stable with time and ii) the correlation between organisational culture and its relationship with SIFR is stable with time, the usefulness of PLS modelling as a proactive safety performance metric looks promising.  This project is unable to determine the stability of organisational culture and its relationship with SHE outcomes with time.  It is recommended that further research be carried out to examine the stability of organisational culture with time.  As PLS is able to identify those questions that are related to desirable and undesirable SIFR outcomes, the use of organisational culture surveys will remain of significant value to industry even if further research into organisational culture discovers that its relationship to SHE outcomes changes over a period of 12 months or more.   

4.4.8 - Summary and conclusions 

By the application of PLS to the Focus 2002 survey responses and GSHE SIFR data, a statistically significant correlation between organisational culture and SIFR performance at AstraZeneca UK, SE and US sites has been established.  Section 2.6 established that previous research attempts have concentrated upon correlation of single dimensions of organisational culture with SHE outcomes.  The results summarised in this chapter represent the first time that several organisational culture factors have been simultaneously correlated with an organisational SHE outcome above the level of statistical significance.   In Section 3.3.2 it was noted that the UK, SE and US AstraZeneca sites support a range of AstraZeneca activities.  The sites therefore represent a range of risk profiles.  The ability to model SIFR performance without consideration of the site relative risk profile suggests that, for AstraZeneca UK, SE and US sites, SIFR performance is not significantly affected by the levels of inherent risk.  This observation puts into question Heinrich’s [73] classic model that specifies that accidents happen as a probabilistic function of the joint occurrence of unsafe acts and unsafe conditions.  It is unlikely that Heinrich’s model is incorrect.  More likely is that industrial risks are already well controlled by engineering and procedural means.  The observed relationship between organisational culture and SIFR performance is most likely to be related to the residual risk remaining after establishment of efficient engineering and procedural controls.   Examination of the structure of the resultant PLS models indicates that one organisational culture construct in the UK and US and two constructs in SE dominate SIFR performance.  Dissimilar accident causation models in each nation may explain the difference in the number of constructs.  Further work in this area is recommended to investigate why the number of constructs differs at the  UK, US and SE sites.   The inability of PLS to resolve sites of similar SIFR performance is of little concern.  It is an established fact amongst SHE professionals that, despite all measures taken to control them, accident frequency rates tend to vary year on year.  Because of this variation, it is proposed that there is little value in attempting to predict the precise SIFR of similar SIFR performing sites.  Of greater practical value to industry is the ability to identify those organisational attributes that discriminate sites with good safety performance from those with significantly poorer safety performance.    Based upon the Focus 2002 survey, it was shown that it was not possible to produce a single PLS model that was capable of predicting SIFR performance across the UK, SE and US sites.  It was suggested that the production of a single PLS UK, SE and US model may be possible after identification of a set of questions that correlate with SIFR performance in each of the nations.