REPLY Definition of root cause (SD6789)

SDMAIL Ulrey, Michael L michael.l.ulrey at boeing.com
Thu Mar 6 06:19:43 CST 2008


Posted by  "Ulrey, Michael L" <michael.l.ulrey at boeing.com>

This thread is of particular interest to me because "root cause" is a term 
which is very popular in the fields of system engineering in general, and 
reliability and safety engineering in particular. (And among other things, 
I work in system safety engineering at Boeing.)

Within the the reliability and safety organizations of the aerospace industry, 
there is a long tradition of processes and methods for identifying root causes 
and either eliminating them or mitigating against their operational 
consequences if elimination is not possible. Of course, other industries also 
have similar practices, most notably the nuclear and chemical industries. This 
kind of approach works well for systems or processes which have a certain (low) 
level of complexity. However, for much more complex socio-technical systems 
(such as air traffic management), such an approach may come up short. Ever 
since the Uberlingen tragedy (see 
http://www.dcs.gla.ac.uk/~johnson/Eurocontrol/Ueberlingen/Ueberlingen_Final_Report.PDF 
or a variety of other discussions on the Internet), the safety community is 
starting to realize that, as one prominent accident expert said, "the root 
cause is simply the last contributing factor that was found before the 
resources ran out". 

In short, there is a movement away from traditional static, linear models such 
as the domino model or the Swiss cheese model. See the first link below for 
the revised thinking on the Swiss cheese model as a result of the Uberlingen 
accident. Instead, more complex systemic models are needed that reveal how 
accidents can arise naturally from the internal dynamics of the system, not
 necessarily from outside causes, or even from internal failures, as is 
 traditionally supposed. This idea is well-expressed in the second link below, 
 in which the "functional resonance accident model" (FRAM) is described. 


http://www.eurocontrol.int/eec/gallery/content/public/documents/conferences/2006_Barcelona/Hollnagel%20(FRAM_Barcelona_Arial).pdf

http://www.eurocontrol.int/eec/gallery/content/public/documents/EEC_notes/2006/EEC_note_2006_13.pdf 


Finally, I would offer up the recent book by Hollnagel, Woods, and Leveson, called 
"Resilience Engineering", which contains a series of essays on these topics. The 
view is that some systems that affect our lives and safety must be designed to be 
resilient, that is, able to recover from inevitable surprising upsets, rather than 
attempting to make them totally free of defects, which is practically impossible. 
I think this is an area that may be unknown to many SD theorists, and provides a 
rich area for research. In fact, one of the authors, Nancy Leveson of MIT, has used 
SD to model the NASA organizational structure and processes to try and understand 
the Challenger and Columbia shuttle accidents.

http://books.google.com/books?id=S8VbgW9pZGUC&dq=resilience+engineering+hollnagel&pg=PP1&ots=qgx6J-0G-Q&sig=hHFqRd8Q8OwZv_s4CisXxO6l17o&hl=en&prev=http://www.google.com/search?hl=en&q=resilience+engineering+hollnagel&btnG=Search&sa=X&oi=print&ct=title&cad=one-book-with-thumbnail

On a personal note, I have recently learned how to model in Vensim, and have already 
produced a simple model having to do with air traffic management, and how to increase 
capacity while preserving an appropriate level of safety. I am hoping to continue with 
this work, having proposed a research project within my organization at Boeing. I 
think SD can make a real contribution in this important arena.

Mike


Dr. Michael L. Ulrey
Associate Technical Fellow
Air Traffic Management  
The Boeing Co.
Posted by  "Ulrey, Michael L" <michael.l.ulrey at boeing.com>
posting date  Wed, 5 Mar 2008 10:01:35 -0800


More information about the SDMail mailing list