An Explanation of Statistical Tools from DocumentingExcellence.com
A consulting practice focusing on working with colleges', organizations', and individuals' utilization of quantitative and qualitative assessment tools to analyze and document their quality outcomes through providing staff development, research design and analysis, and psychometric evaluations.

Measurement Processes

Measurement is the process by which the fundamental quantities and qualities of members of a system, together with their relationships are derived and documented. The clearest example of this process is measurement of physical properties such as counts - how many, mass, weights, distance.  In these examples the processes are well know. This is the most accurate example of measurement. Sometimes one uses the same numbers (1,2,3,etc.) to given order. From the statistical perspective these two processes require different methods for proper analysis.

Issues of how measurement processes focus on the nature of phenomenon are key criteria for understanding measurement processes and appropriately interpreting results.  Based on the nature of the focus of measurement, four classifications are presented.

bullet Observations and counts
bullet As the description identifies, this process of measurement is based on counting.  This implies focus on observable phenomena.  These may be objects, behaviors, events, etc.  In many ways this is a simple form of criterion-reference measurement described below. 
bullet Example:
bullet How many...?
bullet Constraints and caveats:
bullet Not all measurement focuses on observable things.
 
bullet Opinion and belief
bullet When ask, a respondent can provide use information about a wide variety of their actions and beliefs.  These responses may, or may not, be accurate and/or truthful, but in many situations a research accepts them as "truth."  This acceptance is very appropriate in many settings.  In addition to basic statements of belief we can also ask about strength of value and belief. 
bullet Example:  Frequently this type of measurement user a Likert scale in which a respondent indicates a strength of response.
bullet Voting is an important action for every citizen.
___Strongly Agree   ___Agree   ___Disagree   ___Strongly Disagree
bullet Constraints and caveats:
bullet The most obvious challenge is getting people to report "truthfully."  While there are many questions that individuals will answer truthfully, there are others that are much hard to collect.
 
bullet Ipsative measurements
bullet The term "ipsative" (sometimes seen as ipsitive [sic.] ) is based on "ipso facto" which means "by the very fact itself, the very fact."   In this usage, ipsative measurement means measurement against itself.  While different usages focus on different traits of this type of measurement, two critical characteristics emerge. 
bullet First, these measurements are self-reported evaluations.  This then gets into the issues of how self-aware an individual is (see Johari window), and how situational demands bias a self-report.
bullet Second, these measurements utilize a forced choice ranking of two or more indicators.  Even when some strength of responses is added to these rankings, the forced choices used to build scales indicate relative, not quantitative results.  This then effects the statistically tools and interpretation of these findings. (see Barton, 1996)  One of the key issues is that the sum of the scores build on such forced choice items tend to sum to the same total or a very narrow range of possible scores.  This then means that the scales tend to be significantly correlated with each other.  Such that any one scale can be determined from knowing the sum of the other scales.

This form of measurement frequently focuses on hidden, or latent traits that are not "known" to the individual such as those in personality instruments.  In this form of measurement the respondent chooses, ranks, or associates themselves with some indicators, and disassociates themselves from other indicators.  These associations are then aggregated into multiple scales.
 

bullet Example:  (Each of the five items is an indicator of a trait.  The items are not a single dimension of theoretical ideas, but a list, with each item indicating a different trait.  Notice how the respondent is forced to order and select or reject items.)

Indicate which of the following items are most to least like you
Use 1 = Most like you  down to 5 = Least like you. 

___ Young at heart,     ___ Introspective,     ___ Pessimistic,    
___  Persuasive,           ___ Obedient

bullet Constraints and caveats:
bullet Scale scores are relative to the individual, not to comparison among individuals.  One does not know how a score of x for responder A relates to that same score for a different responder.
bullet Scores are relative to each other, not measures of a quantity.  One knows that the individual reports more association with the indicators of one trait rather than another, but self-reporting twice as many indicators does not mean that the level of a trait is twice as high. 
bullet Scores can be highly influenced by situational expectations.  Individuals may see themselves one way in a work setting and a different way in a social setting.  Thus, depending on the instrument scores may vary depending on in what context an instrument is utilized.
bullet Norm-referenced measurements
bullet This measurement process focuses on relative performances of individuals compared with that of the others in a set of data.  It is the form of measurement most frequently used in school.  A test is given, students produce a range of results from top performers to lowest.  If the test was very hard a teacher might curve it, or not.  In any case any one student's performance can be compared with the collective performances of all the other students.  With norm-references measurement, when a standard is set for acceptable performance it is specified in terms specifying minimum acceptable levels given the history of other performances.  In some more sophisticated systems relative performance may be translated from raw scores into "standardized" scores in which the characteristics of the standardized scores are known.  Two examples of these standardized scores are Z-scores, or T-scores.
bullet Example:
bullet Scores from ACT, College Boards, may teachers' tests
bullet Constraints and caveats:
bullet Scores may not be be consistent across instruments, one test may be a "killer" and another "easy."
 
bullet Criterion-reference measurements
bullet In many ways this is just a count form of measurement.  In this process some criterion is specified, and then some person, or event is compared with that criterion, and an evaluation made that the criterion is meet or not.  The measurement is a yes/no result.
bullet Example: 
bullet Disability is the inability of an individual to complete three or more of the following actions.
bullet Dress themselves
bullet Bath themselves
bullet Feed themselves
bullet and the list would continue etc.
bullet Constraints and caveats:
bullet Just as with some of the other measurement processes criterion-reference assessments are not always, easily defined in a yes/no judgment.  In the case of many learning and behavior settings there may be many levels of competence.

Johari window: An exploration of what we know and what others know about ourselves.

Barton, Helen. 1996. "Strengths and Limitations of Ipsative Measurement," Journal of Occupational and Organizational Psychology, 69, 49-56.



Send mail to with questions or comments about this web site.
Copyright © 2012 by Peter T. Klassen, Ph.D. Principal, www.DocumentingExcellence.com
2 March, 2012