Featured Product
This Week in Quality Digest Live
Six Sigma Features
Richard Harpster
Good news? You are probably already doing it.
Donald J. Wheeler
Does your approach do what you need?
James J. Kline
Quality professional organizations need to adjust their body of knowledge to include an understanding of big data
Donald J. Wheeler
In spite of what everyone says to the contrary
Brittney McIver
Every CAPA should begin with investigation

More Features

Six Sigma News
Sept. 28–29, 2022, at the MassMutual Center in Springfield, MA
Elsmar Cove is a leading forum for quality and standards compliance
Is the future of quality management actually business management?
Too often process enhancements occur in silos where there is little positive impact on the big picture
Collect measurements, visual defect information, simple Go/No-Go situations from any online device
Good quality is adding an average of 11 percent to organizations’ revenue growth
Floor symbols and decals create a SMART floor environment, adding visual organization to any environment
A guide for practitioners and managers
Making lean Six Sigma easier and adaptable to current workplaces

More News

Donald J. Wheeler

Six Sigma

When Can We Trust the Limits On a Process Behavior Chart?

Don’t the outliers distort the limits?

Published: Tuesday, May 26, 2009 - 13:55

Last month we showed the X chart in figure 1. The four lowest values and the three highest values were seen to be “outliers” when we looked at the histogram. When we fitted a bell-shaped curve to the histogram, the outliers corrupted the model and resulted in a poor fit. Yet we used all the data to compute the limits seen in figure 1. How can the outliers corrupt one computation but not another?

The answer lies in how we compute limits for the X chart. The central line is commonly taken to be the average value. Now, while it’s true that the average may be influenced by extreme values, this effect is generally smaller than you might expect. In this case, deleting the seven outliers would only change the average from 595.4 to 595.6. The average value is a very robust measure of location. However, in cases where we think the average may have been unduly influenced by extreme values, we may always resort to using the median value instead. In this case the median is 596. Thus, one way or another, we’re going to have a reasonable estimate of location regardless of the outliers.

Figure 1. X Chart for NB10 Values

So what about dispersion? When working with process behavior charts we always use within-subgroup measures of dispersion. With an X chart, these within-subgroup measures are the two-point moving ranges (also known as the differences between successive values).

Once these values have been found, they are usually summarized by the average moving range. Because the operation of averaging provides a robust summary, the average moving range turns out to be a robust summary for dispersion. When the average moving range is divided by the bias correction factor of 1.128 for the pseudo-subgroups of size two, it becomes a measure of dispersion known as sigma(X).

Whenever we think that the average moving range may have been inflated by some very large moving ranges, we can always shift over to use the median moving range instead. When this summary is divided by its bias correction factor of 0.954, we again get a measure of dispersion known as sigma(X).

The limits on the X chart are computed according to average ±3 sigma(X). For the data in figure 1, the median moving range is 4, so sigma(X) is 4.2, and the limits are 12.6 units on either side of the average.

It’s instructive to compare the value of sigma(X) above with the standard deviation statistics computed in the usual way. For all 100 data, the global standard deviation statistic is 6.47 units. Using only the central 93 values, the global standard deviation statistic is 3.74 units. The sigma(X) value of 4.2 compares favorably with the value computed after the outliers had been removed. This robustness that’s built into the computations for the process behavior charts removes the need to polish the data prior to computing the limits. The computations work even in the presence of outliers and signals of exceptional variation. They allow you to get good limits from bad data.

Thus, the computations are robust. The key element in using an X chart is to make sure that successive values are logically comparable. This requires some knowledge of the context for your data. But since the only reason to collect data is to take action, you should know the context well enough to make this judgment call.


About The Author

Donald J. Wheeler’s picture

Donald J. Wheeler

Find out about Dr. Wheeler’s virtual seminars for 2022 at www.spcpress.com. Dr. Wheeler is a fellow of both the American Statistical Association and the American Society for Quality who has taught more than 1,000 seminars in 17 countries on six continents. He welcomes your questions; you can contact him at djwheeler@spcpress.com.