## Floor Effect, Ceiling Effect and Computing Internal Consistency Reliability at Post-test

Posted on: May 16, 2018

Let's talk about floor and ceiling effects for a minute.

A floor effect is when most of your subjects score near the bottom. There is very little variance because the floor of your test is too high. In layperson terms, your questions are too hard for the group you are testing. This is even more of a problem with multiple-choice tests. With other types, if the subject doesn't know, they aren't likely to guess that the answer is, say, (a+b)(a-b), and so they get it wrong. With a multiple-choice test with four choices, they will randomly get it correct 25% of the time. If there are a bunch of questions that are too hard, you have a bunch of people randomly getting each one right just by chance. Combine low variance with a lot of random error and your internal consistency reliability is going to be in the toilet. So, let's say you have exactly that on your pre-test. Then you test again after some time and your control group, having had no training in the meantime, is equally low: the problems are still too hard, and you still have random guessing and low variance.

A ceiling effect is the opposite: all of your subjects score near the top. There is very little variance because the ceiling of your test is too low. In layperson terms, your questions are too easy for the group you are testing. Here you don't have the problem of random guessing, but you do have low variance. Think back to Statistics 101: restriction of range attenuates correlations. Again, in layperson terms, if you correlate height and weight of NBA players, for example, you find almost no relationship between height and weight because they are ALL very tall and ALL very heavy. If you make the questions on your pre-test easier, that may give you better internal consistency reliability at pre-test, but since a good percentage of your subjects knew the answers at the beginning, by the end of your training maybe nearly all of them will, and then you run into a ceiling effect.
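To see restriction of range at work, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the simulated "heights" and "weights" are standardized random numbers with a true correlation of about 0.7, not real NBA data, and the helper names (`pearson_r`, `simulate_height_weight`, `tallest_fraction`) are made up for this example. It computes the correlation for everyone and then again for only the tallest 5%.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def simulate_height_weight(n, seed=101):
    """Hypothetical population: weight = 0.7 * height + independent noise."""
    rng = random.Random(seed)
    noise_sd = math.sqrt(1 - 0.7 ** 2)  # keeps weight at unit variance
    heights = [rng.gauss(0.0, 1.0) for _ in range(n)]
    weights = [0.7 * h + rng.gauss(0.0, noise_sd) for h in heights]
    return heights, weights


def tallest_fraction(heights, weights, fraction):
    """Keep only the tallest `fraction` of people (the restricted range)."""
    pairs = sorted(zip(heights, weights), key=lambda p: p[0], reverse=True)
    kept = pairs[: max(2, int(len(pairs) * fraction))]
    return [h for h, _ in kept], [w for _, w in kept]


heights, weights = simulate_height_weight(5000)
r_all = pearson_r(heights, weights)

top_h, top_w = tallest_fraction(heights, weights, 0.05)
r_top = pearson_r(top_h, top_w)

print(f"everyone: r = {r_all:.2f}; tallest 5% only: r = {r_top:.2f}")
```

Under these assumed numbers the correlation in the restricted group should come out well below the one for the full sample, even though the underlying relationship is identical. The same thing happens to inter-item correlations when everyone piles up at the floor or the ceiling of a test.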

My suggestion is to compute internal consistency reliability at the beginning of your study for the whole group, and at post-test for the control and intervention groups separately. You may find that, having successfully avoided both floor and ceiling effects for the post-test intervention group, you get good internal consistency reliability for them.
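Here is a rough sketch of that suggestion in Python, using Cronbach's alpha as the internal consistency statistic. The data are simulated under assumptions I am making up for illustration: 20 four-choice items, a subject either knows an item (with a logistic probability based on ability minus difficulty) or guesses with a 25% chance of being right, the control group sits far below the test's difficulty, and the intervention group matches it. The function names and all the numbers are hypothetical, not taken from any real study.

```python
import math
import random
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha; `rows` is a list of subjects, each a list of item scores."""
    k = len(rows[0])
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    if total_var == 0:
        return float("nan")  # everyone has the same total score: alpha is undefined
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def simulate_group(rng, n_subjects, n_items, mean_ability, difficulty, guess=0.25):
    """Simulate 0/1 answers where a subject either knows an item or guesses."""
    rows = []
    for _ in range(n_subjects):
        ability = rng.gauss(mean_ability, 1.0)
        p_know = 1 / (1 + math.exp(-(ability - difficulty)))
        p_correct = p_know + (1 - p_know) * guess  # guessing adds random error
        rows.append([1 if rng.random() < p_correct else 0 for _ in range(n_items)])
    return rows


rng = random.Random(2018)

# Assumed post-test scenario: control group still at the floor,
# intervention group now matched to the difficulty of the items.
control = simulate_group(rng, 300, 20, mean_ability=-4.0, difficulty=0.0)
intervention = simulate_group(rng, 300, 20, mean_ability=0.0, difficulty=0.0)

for name, rows in [("control", control), ("intervention", intervention)]:
    print(f"post-test alpha, {name}: {cronbach_alpha(rows):.2f}")
```

With real data you would replace the two simulated lists with the item-level responses for each group. The point of reporting the groups separately is that a low alpha in the control group may reflect guessing at the floor, not a flaw in the items themselves.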

Can the same test suffer from both floor and ceiling effects? Is that possible? Please explain.

Makes good sense! One more question:

4 Responses to Floor Effect, Ceiling Effect and Computing Internal Consistency Reliability at Post-test

What is the significance of floor and ceiling effects?

Very often, researchers (including me) use multiple-choice tests to collect data to determine whether or not an intervention has worked. Does the Dance Your Way to Math curriculum really result in higher test scores? Does Lollipop Spelling reduce the number of spelling errors? And on and on.

I remember being told that statistics to be generalized to the population, like internal consistency reliability or test-retest reliability, should be computed either using only the pre-test scores (in the case of internal consistency) or using only the control group (in the case of both test-retest correlations and post-test internal consistency reliability). The reason, we are told, is that something has been done to the intervention group, which means that they are no longer representative of the population. While I agree with that reasoning in the case of test-retest correlation, I am not so convinced in the case of internal consistency.

The same test could not have both floor and ceiling effects for the same subjects. Most of the subjects could not score near the top and near the bottom. It could have floor effects for, say, 4th-graders and a ceiling effect for college students.