The Anatomy of a ‘Failure’

Updated: January 5, 2022

Where does ‘failure’ come from? Why do some things not go as we planned? When bad things happen, the moment can be chaotic and appear very complex. Often, in hindsight (when the urgency has faded), we find that good people made bad decisions. Most of the time such failures were not complicated, and we find that anyone else put in the same position would most likely have made the same decision. So why do things not always go as planned?

We are going to take a very macro view of this together, applying a systems thinking process to the effective application of Root Cause Analysis (RCA). I will let you fill in the blanks with situations you have encountered where this model just seems to fit. If it doesn’t, please ‘Comment’ about your experience(s) and the deviations you observe.

Figure 1: Germination of a Failure

The preview above (Figure 1) shows the general pathway a ‘failure’ follows, from the point where it begins to form, through the steps it progresses through, until we must face the music and deal with its consequences. Let’s break that path down in very simplistic terms using Figure 2.

Figure 2: Cultural Norms & Organizational Systems

Most failures originate in the form of flawed systems. These systems can be inadequate, insufficient and oftentimes just non-existent (there were no rules to follow, so we relied on our knowledge to ‘wing it’). I have listed just some examples of these cultural norms and systems; in effect, all of them feed the reasoning behind our decisions.

Such organizational systems are put into place to help those who use them make better decisions. When flaws exist in such systems, they feed less than adequate information to the decision-maker (see Figure 3). We refer to these ‘systems’ failures as Latent Root Causes. The name is appropriate because they are latent, or dormant; they are always there, but by themselves they cannot hurt anyone because they are essentially just paper (or electronic files). One of the key factors determining the success of an ‘RCA’ is leadership’s ability to face the facts and acknowledge that their actions could have contributed to the bad outcome being analyzed. This is because one of leadership’s roles is to set up such organizational systems to aid effective decision making.

Figure 3: Impact of Organizational Systems on Decision Making

So less than adequate information is fed to a well-intentioned decision-maker, and at this point they have to make a decision. This decision will normally come in the form of taking an action (potential error of commission) or choosing NOT to take an action (potential error of omission). The decision error itself becomes the ‘active’ error, or what activates the latent errors. Remember, at this point, all of the reasoning going on is between the ears of the decision-maker. As outsiders, we can’t see anything.

This is an important point because the decision itself will trigger a visible consequence. It may be a turn of a switch, a check on a checklist, the changing of a setting, the implementation of a procedure or a host of other possibilities…but once the decision is acted on, its consequences become visible (see Figure 4).

Figure 4

At this point we can now view the cause-and-effect chain of consequences which we will call the Physical Root Causes. If people in our workplaces do not recognize/identify this chain of consequences forming and take actions to stop the chain, then eventually a bad outcome will occur that will have to be addressed.

We often hear the term ‘situational awareness’ used to describe this sensory awareness of our surroundings. In low-morale environments, such awareness is often dulled as employees become human robots and do nothing more, or less, than what is expected. They operate as if they have blinders on and see only their own workspaces. By contrast, High Reliability Organizations (HROs) go out of their way to train their staff to recognize these error chains and to take action to stop them from progressing. This is also emphasized from the managerial oversight perspective to prevent normalization of deviance (when our norms or practices slowly drift away from our standards).

Figure 5: If We Don’t Break the ‘Error Chains’, We End Up With Undesirable Outcomes

If we are astute enough to identify the chain and break it, we will likely not suffer the full consequences that could have happened. We call these ‘near misses’…I call them ‘we got lucky!’ Regardless, stopping the full consequences can likely save lives and prevent catastrophic damages.

If we are not able to stop the chain, then it will progress through what I call ‘the threshold of pain’. This means a regulatory trigger will be met and a full-blown investigation will be required. At this point the suits show up and we will have no choice but to analyze the failure due to purely reactionary forces (See Figure 5).

So to recap, flawed systems influence decision reasoning. As a result, decision errors are made that trigger visible consequences. If the chain of consequences is not stopped, then bad outcomes will likely occur and have to be dealt with. Because this blog is a macro view, it’s oversimplified in the sense that it portrays failure as linear. Unfortunately, that is not the case in the real world, because failure is NOT linear.
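For readers who think in code, the recap can be sketched as a tiny Python model. This is only an illustration of the macro view, not part of any RCA tool: the names `Stage`, `Link` and `run_chain`, the `broken_at` parameter, and the example chain are all invented here, and the sketch is deliberately linear even though real failures are not.

```python
from dataclasses import dataclass
from enum import Enum


class Stage(Enum):
    """The three labels used in this macro view (names are hypothetical)."""
    LATENT = "latent root cause (flawed system)"
    DECISION = "decision error (active error)"
    PHYSICAL = "physical root cause (visible consequence)"


@dataclass
class Link:
    stage: Stage
    description: str


def run_chain(links, broken_at=None):
    """Walk the chain in order and report how it ends.

    broken_at is the index of the first link that never happens because
    someone recognized the chain and stopped it (None means nobody did).
    Returns the outcome label and the links that did occur.
    """
    occurred = []
    for index, link in enumerate(links):
        if index == broken_at:
            return "near miss", occurred
        occurred.append(link)
    return "undesirable outcome", occurred


# An invented example chain, for illustration only.
chain = [
    Link(Stage.LATENT, "no written procedure exists for the task"),
    Link(Stage.DECISION, "operator relies on memory and skips a check"),
    Link(Stage.PHYSICAL, "a setting is left in the wrong position"),
    Link(Stage.PHYSICAL, "equipment runs outside its normal range"),
]

unbroken_outcome, unbroken_links = run_chain(chain)
caught_outcome, caught_links = run_chain(chain, broken_at=3)

print(unbroken_outcome, len(unbroken_links))
print(caught_outcome, len(caught_links))
```

In this toy model, breaking the chain before its last link turns the result into a near miss, while the latent cause and the decision error that started the chain are still there to be found.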

Figure 6: Applying ‘Root’ Labels to The Germination of a Failure

When you reflect on this macro view, you will find that it is not industry specific; it applies to any undesirable outcome. No matter where humans work, this process is likely at play. Can you cite some examples from your experience?


About the Author
Robert (Bob) J. Latino is former CEO of Reliability Center, Inc., a company that helps teams and companies do RCAs with excellence. Bob has been facilitating RCA and FMEA analyses with his clientele around the world for over 35 years and has taught over 10,000 students in the PROACT® methodology.

Bob is co-author of numerous articles, has led seminars and workshops on FMEA, Opportunity Analysis and RCA, and is co-designer of the award-winning PROACT® Investigation Management Software solution. He has authored or co-authored six (6) books related to RCA and Reliability in both manufacturing and healthcare and is a frequent speaker on the topic at domestic and international trade conferences.

Bob has applied the PROACT® methodology to a diverse set of problems and industries, including a published paper in the field of Counter Terrorism entitled, “The Application of PROACT® RCA to Terrorism/Counter Terrorism Related Events.”
