The second principle for understanding data is that some data contain signals; however, all data contain noise. Therefore, before you can detect the signals you will have to filter out the noise. This act of filtration is the essence of all data analysis techniques. It is the foundation for our use of data and all the predictions we make based on those data. In this column we will look at the mechanism used by all modern data analysis techniques to filter out the noise.
Given a collection of data it is common to begin with the computation of some summary statistics for location and dispersion. Averages and medians are used to characterize location, while either the range statistic or the standard deviation statistic is used to characterize dispersion. This much is taught in every introductory class. However, what is usually not taught is that the structures within our data will often create alternate ways of computing these measures of dispersion. Understanding the roles of these different methods of computation is essential for anyone who wishes to analyze data.
Perhaps the most common type of structure for a data set is to have k subgroups of size n where the n values within each subgroup were collected under the same set of conditions. This structure is found in virtually all types of experimental data and in most types of data coming from a production process. To illustrate the alternate ways of computing measures of dispersion we shall use a simple data set consisting of k = 3 subgroups of size n = 8 as shown in figure 1.
Figure 1:
Data set one with method one
The first method of computing a measure of dispersion is the method taught in introductory classes in statistics. All of the data from the k subgroups of size n are collected into one large group of size nk and a single dispersion statistic is found using all nk values. This dispersion statistic is then used to estimate a dispersion parameter such as the standard deviation for the distribution of X, SD(X).
As shown in figure 3, the range of all 24 values is 6. The bias correction factor for ranges of 24 values is 3.895. Dividing 6 by 3.895 yields an unbiased estimate of the standard deviation of the distribution of X of 1.540.
The global standard deviation statistic is 1.551. The bias correction factor for this statistic when it is based on 24 values is 0.9892. Dividing 1.551 by 0.9892 yields an unbiased estimate of the standard deviation of the distribution of X of 1.568.
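The individual values are not reproduced in the figures here, so the sketch below uses a hypothetical data set constructed to match the summary statistics quoted in the text (subgroup averages of 5.0, 4.0, and 5.0; subgroup ranges of 5, 5, and 3; global range of 6; global standard deviation of 1.551). The bias correction factors of 3.895 and 0.9892 for n = 24 are those given in the text.

```python
import statistics

# Hypothetical data set one: k = 3 subgroups of size n = 8, chosen to
# reproduce the summary statistics quoted in the text (not the author's
# original data).
data_set_one = [
    [3, 3, 4, 5, 5, 6, 6, 8],   # subgroup 1: average 5.0, range 5
    [2, 2, 3, 4, 4, 5, 5, 7],   # subgroup 2: average 4.0, range 5
    [3, 4, 4, 5, 6, 6, 6, 6],   # subgroup 3: average 5.0, range 3
]

# Method one: pool all nk = 24 values and compute global dispersion statistics.
pooled = [x for subgroup in data_set_one for x in subgroup]

global_range = max(pooled) - min(pooled)    # 6
global_stdev = statistics.stdev(pooled)     # about 1.551

# Divide by the bias correction factors for n = 24 to obtain unbiased
# estimates of SD(X).
est_from_range = global_range / 3.895       # about 1.540
est_from_stdev = global_stdev / 0.9892      # about 1.568
```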
Figure 3:
Since the original data are given to the nearest whole number, there is no practical difference between the two estimates of SD(X) shown in figure 3. Whether we use the range or the standard deviation statistic will not substantially affect our analysis.
Data set one with method two
While method one ignores the subgroups, method two respects the subgroup structure within the data. Here we calculate a dispersion statistic for each subgroup. These separate dispersion statistics are then averaged, and the average dispersion statistic is used to form an unbiased estimate for the standard deviation parameter of the distribution of X.
Figure 4:
Using data set one, we compute a dispersion statistic for each of the three subgroups. Because the subgroups are all the same size we can average the statistics prior to dividing by the common bias correction factor.
As shown in figure 5, the subgroup ranges are respectively 5, 5, and 3. The average range is 4.333, and the bias correction factor for ranges of eight data is 2.847. Dividing 4.333 by 2.847 we estimate the standard deviation for the distribution of X to be 1.522.
The subgroup standard deviation statistics are respectively 1.690, 1.690, and 1.195. The average standard deviation statistic is 1.525, and the bias correction factor is 0.9650. Dividing 1.525 by 0.9650 we estimate the standard deviation for the distribution of X to be 1.580.
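Method two can be sketched the same way, again using a hypothetical data set constructed to match the subgroup statistics quoted in the text; the bias correction factors of 2.847 and 0.9650 for subgroups of size eight are those given above.

```python
import statistics

# Hypothetical data set one (matches the summary statistics in the text).
data_set_one = [
    [3, 3, 4, 5, 5, 6, 6, 8],   # subgroup 1: average 5.0, range 5
    [2, 2, 3, 4, 4, 5, 5, 7],   # subgroup 2: average 4.0, range 5
    [3, 4, 4, 5, 6, 6, 6, 6],   # subgroup 3: average 5.0, range 3
]

# Method two: compute a dispersion statistic within each subgroup,
# then average those k statistics.
subgroup_ranges = [max(sg) - min(sg) for sg in data_set_one]     # [5, 5, 3]
subgroup_stdevs = [statistics.stdev(sg) for sg in data_set_one]  # about [1.690, 1.690, 1.195]

avg_range = sum(subgroup_ranges) / 3    # about 4.333
avg_stdev = sum(subgroup_stdevs) / 3    # about 1.525

# Divide the averaged statistics by the bias correction factors for n = 8.
est_from_ranges = avg_range / 2.847     # about 1.522
est_from_stdevs = avg_stdev / 0.9650    # about 1.580
```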
Figure 5:
As before, there is no practical difference between the two estimates shown in figure 5. Neither is there any practical difference between the estimates in figure 3 and those in figure 5. The four estimates obtained using the two different measures of dispersion and the two different methods are all very similar.
Data set one with method three
The third method will probably seem rather strange. It is certainly indirect. Instead of working with the individual values as the first two methods do, the third method works with the subgroup averages. These subgroup averages are used to obtain a dispersion statistic, and this dispersion statistic is then used to estimate the standard deviation parameter of the distribution of X.
Figure 6:
For data set one the subgroup averages are respectively 5.0, 4.0, and 5.0. The range of these three averages is 1.00. The bias correction factor for the range of three values is 1.693. Since each of these averages represents eight original data, we will have to multiply by the square root of 8 and divide by the bias correction factor to estimate the standard deviation parameter for the distribution of X. When we do this with the values above we obtain an estimate of SD(X) of 1.671.
The standard deviation statistic for the three subgroup averages is 0.5774. Dividing by the bias correction factor of 0.8862 and multiplying by the square root of 8 we obtain an unbiased estimate of the standard deviation of the distribution of X of 1.843.
Figure 7:
Once again, there is no practical difference between using the range and using the standard deviation statistic. Here the two estimates are slightly larger than before, but not by any appreciable amount.
As summarized in figure 8, we have just obtained six unbiased estimates for the standard deviation parameter for the distribution of X using three different methods and two different statistics. These six values are listed along with their coefficients of variation. The first four unbiased estimates are all quite similar because they all have similar coefficients of variation. The last two unbiased estimates do not cluster as tightly as the first four because they have much larger coefficients of variation and therefore carry more uncertainty.
Figure 8:
Before we attempt to draw any lesson from this example we need to know that data set one has a very special property. When we place data set one on an average and range chart we end up with figure 9. There we see no evidence of any differences between the three subgroups. Data set one contains no signals. It is pure noise.
Figure 9:
Therefore, at this point we can reasonably conclude that when the data are homogeneous and contain no signals the three methods will yield similar values for unbiased estimates of SD(X) regardless of whether we use the range or the standard deviation statistic.
Data set two
But what happens in the presence of signals? After all, the objective is to filter out the noise so we can detect any signals that may be present. To see how signals affect our estimates of SD(X) we shall modify data set one by inserting two signals. Specifically we shall shift subgroup two down by two units while we shift subgroup three up by four units. This will result in data set two which is shown in figure 10. As may be seen in the average and range chart in figure 11, these changes have introduced two distinct signals.
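The construction of data set two amounts to two simple shifts. Using a hypothetical version of data set one constructed to match the summary statistics quoted in the text, it looks like this:

```python
# Hypothetical data set one (matches the summary statistics in the text).
data_set_one = [
    [3, 3, 4, 5, 5, 6, 6, 8],   # subgroup 1: average 5.0
    [2, 2, 3, 4, 4, 5, 5, 7],   # subgroup 2: average 4.0
    [3, 4, 4, 5, 6, 6, 6, 6],   # subgroup 3: average 5.0
]

# Shift subgroup two down by two units and subgroup three up by four units.
data_set_two = [
    data_set_one[0],
    [x - 2 for x in data_set_one[1]],
    [x + 4 for x in data_set_one[2]],
]

# Only the subgroup averages move; the dispersion within each subgroup
# is untouched.
new_averages = [sum(sg) / 8 for sg in data_set_two]    # [5.0, 2.0, 9.0]
```

Because the shifts add a constant to every value in a subgroup, the within-subgroup ranges and standard deviations are exactly the same as before.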
Figure 10:
Figure 11:
Method one with data set two
Method one uses all 24 values in data set two to compute global measures of dispersion. As shown in figure 12, the global range is 10.0, which results in an unbiased estimate of the standard deviation parameter of 2.567. The global standard deviation statistic is 3.279, which gives an unbiased estimate of the standard deviation parameter of 3.315.
Figure 12:
These estimates of SD(X) are roughly twice the size of those found in figure 3. Thus, the signals introduced by shifting the subgroup averages have inflated both of the method one estimates by an appreciable amount.
Method two with data set two
Using method two, we compute a dispersion statistic for each of the three subgroups. Since the subgroups are all the same size we can average the statistics prior to dividing by the common bias correction factor. As shown in figure 13, the average range is 4.333 and the bias correction factor for ranges of eight data is 2.847. Dividing 4.333 by 2.847 we estimate the standard deviation for the distribution of X to be 1.522.
The average standard deviation statistic is 1.525 and the bias correction factor is 0.9650. Dividing 1.525 by 0.9650 we estimate the standard deviation for the distribution of X to be 1.580.
Figure 13:
The method two estimates of SD(X) for data set two are exactly the same as those obtained for data set one in figure 5. Thus, the method two estimates are not affected by the signals introduced by shifting the subgroup averages.
Method three with data set two
For data set two the subgroup averages are respectively 5.0, 2.0, and 9.0. The range of these three averages is 7.00. The bias correction factor for the range of three values is 1.693. Since each of these averages represents eight original data, we will have to multiply by the square root of 8 and divide by the bias correction factor to estimate the standard deviation parameter for the distribution of X. When we do this with the values above we obtain an estimate of SD(X) of 11.693.
The standard deviation statistic for the three subgroup averages is 3.512. Dividing by the bias correction factor of 0.8862 and multiplying by the square root of 8 we obtain an unbiased estimate of the standard deviation of the distribution of X of 11.209.
Figure 14:
These method three estimates of SD(X) are roughly seven times larger than the values found in figure 7. Thus, the signals introduced by shifting the subgroup averages have severely inflated both of the estimates obtained using method three.
When we summarize the results of the three methods with data set two we get the table in figure 15. We have obtained six unbiased estimates of SD(X) using three different methods and two different statistics, yet these six values differ by almost an order of magnitude!
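The whole table in figure 15 can be sketched in a few lines. The data below are a hypothetical version of data set two constructed to match the summary statistics quoted in the text, and the bias correction factors are those given above.

```python
import math
import statistics

# Hypothetical data set two (matches the summary statistics in the text).
data_set_two = [
    [3, 3, 4, 5, 5, 6, 6, 8],      # subgroup 1: average 5.0
    [0, 0, 1, 2, 2, 3, 3, 5],      # subgroup 2, shifted down two: average 2.0
    [7, 8, 8, 9, 10, 10, 10, 10],  # subgroup 3, shifted up four: average 9.0
]
pooled = [x for sg in data_set_two for x in sg]
averages = [sum(sg) / 8 for sg in data_set_two]

# Method one: global statistics over all 24 values (factors 3.895 and 0.9892).
m1_range = (max(pooled) - min(pooled)) / 3.895                       # about 2.567
m1_stdev = statistics.stdev(pooled) / 0.9892                         # about 3.315

# Method two: averaged within-subgroup statistics (factors 2.847 and 0.9650).
m2_range = sum(max(sg) - min(sg) for sg in data_set_two) / 3 / 2.847      # about 1.522
m2_stdev = sum(statistics.stdev(sg) for sg in data_set_two) / 3 / 0.9650  # about 1.580

# Method three: between-subgroup statistics (factors 1.693 and 0.8862).
m3_range = (max(averages) - min(averages)) / 1.693 * math.sqrt(8)    # about 11.693
m3_stdev = statistics.stdev(averages) / 0.8862 * math.sqrt(8)        # about 11.209
```

The method two estimates are unchanged from data set one, while the method one and method three estimates are inflated by the two signals.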
Figure 15:
The differences left to right in figure 15 show the effects of using the different dispersion statistics. The differences top to bottom reveal the differences due to using the different methods. Clearly, the differences left to right pale in comparison with those top to bottom. The key to filtering out the noise so we can detect the signals does not depend upon whether we use the standard deviation statistic or the range, but rather upon which method we employ to compute that dispersion statistic.
Method one estimates of dispersion are commonly known as the total variation or the overall variation. Method one is used for description. It implicitly assumes that the data are globally homogeneous. When the data are not globally homogeneous this method will be inflated by the signals contained within the data and the value obtained will no longer estimate SD(X).
Figure 16:
Method two estimates of dispersion are commonly known as the within-subgroup variation. Method two is used for analysis. Whenever we seek to filter out the noise in order to detect signals we use method two to establish the filter. Method two implicitly assumes that the data are homogeneous within the subgroups, but it places no requirement of homogeneity upon the different subgroups. Thus, even when the subgroups differ, method two will provide a useful estimate of SD(X).
Figure 17:
Method three estimates of dispersion are commonly known as the between-subgroup variation. Method three is used for comparison purposes. It assumes that the subgroup averages are globally homogeneous. When method three is computed it is generally compared with method two; the idea being that any signals present in the data will affect method three more than they affect method two. When the subgroups differ, method three will not provide an estimate of SD(X).
Figure 18:
Separating the signals from the noise
The essence of every statistical analysis is the separation of the signals from the noise. We want to find the signals so that we can use this knowledge constructively. We want to ignore the noise where there is nothing to be learned. To this end we begin by filtering out the noise. And for the past 100 years the standard technique for filtering out the noise has been method two. To illustrate this point figure 19 shows the average chart for data set two with limits computed using each of the three methods. Only method two correctly identifies the two signals we deliberately buried in data set two.
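The effect shown in figure 19 can be sketched numerically with three-sigma limits for subgroup averages, computed as the grand average plus or minus three times the estimate of SD(X) divided by the square root of the subgroup size. The sketch below plugs in the standard-deviation-based estimates of SD(X) from the three methods for data set two and counts how many of the three subgroup averages fall outside the resulting limits.

```python
import math

averages = [5.0, 2.0, 9.0]            # subgroup averages for data set two
grand_average = sum(averages) / 3     # about 5.333
n = 8                                 # subgroup size

def signals_detected(sd_estimate):
    """Count the subgroup averages falling outside three-sigma limits
    computed from the given estimate of SD(X)."""
    half_width = 3 * sd_estimate / math.sqrt(n)
    lower, upper = grand_average - half_width, grand_average + half_width
    return sum(1 for a in averages if a < lower or a > upper)

signals_detected(3.315)     # method one estimate: misses one of the two signals
signals_detected(1.580)     # method two estimate: detects both signals
signals_detected(11.209)    # method three estimate: detects no signals at all
```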
So when it comes to filtering out the noise you have a choice between method two, method two, or method two. Any method is right as long as it is method two!
Method one is inappropriate for filtering out the noise because it gets inflated by the signals. Method one has always been wrong for analysis, and it will always be wrong. Trying to use method one for analysis is so wrong that it has a name. It is known as Quetelet's Fallacy and it is the reason there was so little progress in statistical analysis in the 19th century.
Figure 19:
Method three is completely inappropriate for filtering out the noise because it will be severely inflated in the presence of signals. If you use method three to filter out the noise you will have to wait a very long time before you detect a signal. Although there are analysis techniques that make use of the method three (between-subgroup) estimate of dispersion, they do so only in order to compare it with a method two (within-subgroup) estimate of dispersion.
Thus, the foundation of all modern data analysis techniques is the use of method two to filter out the noise. This is the foundation for the analysis of variance. This is the foundation for the analysis of means. And this is the foundation for Shewhart's process behavior charts. Ignore this foundation and you will undermine your whole analysis.
Many analysis techniques from the 19th century, such as Benjamin Peirce's test for outliers, are built on the use of method one to filter out the noise. As may be seen in figure 19, this approach will let you occasionally detect a signal, but it will cause you to miss other signals.
In fact, many techniques developed in the 20th century also suffer from Quetelet's Fallacy; among these are Grubbs' test for outliers, the Levey-Jennings control chart, and the Tukey control chart. Moreover, virtually every piece of statistical software available today allows the user to choose method one for creating control charts and performing various other statistical tests. Nevertheless, this error on the part of naïve programmers does not make it right or even acceptable to use method one for analysis.
There are proper uses of method one and method three, however, they are never appropriate for filtering out the noise. The only correct method for filtering out the noise is method two. Understanding this point is the beginning of competence for every data analyst.
You now know the difference between modern data analysis techniques and naïve analysis techniques. Naïve techniques use method one or method three to filter out the noise. Today all sorts of new naïve techniques are being created by those who know no better. Let the user beware.
To help with this problem of identifying naïve techniques, figure 20 contains a listing of 27 of the more commonly encountered within-subgroup estimators of both the standard deviation parameter and the variance parameter. There we see the hallmark of the within-subgroup approach: Each estimator is based on either the average or the median of a collection of k within-subgroup measures of dispersion. Method one and method three each use a single measure of dispersion. Now you know the importance of using the right method, and you know what the right method will look like in practice. This may be more than you ever wanted to know about statistics, but it is essential knowledge for all who seek to understand their data.
Figure 20:
This article is based on material found in my book, Advanced Topics in Statistical Process Control, Second Edition (SPC Press, 2004). Used with permission.