Calculator Inputs
Example Data Table
| Respondent | Half A | Half B | Total |
|---|---|---|---|
| 1 | 12 | 11 | 23 |
| 2 | 14 | 13 | 27 |
| 3 | 15 | 14 | 29 |
| 4 | 9 | 8 | 17 |
| 5 | 11 | 10 | 21 |
| 6 | 13 | 12 | 25 |
| 7 | 16 | 15 | 31 |
| 8 | 10 | 9 | 19 |
| 9 | 14 | 13 | 27 |
| 10 | 12 | 11 | 23 |
Formula Used
The corrected coefficient estimates whole-test reliability after correlating two shorter halves. Guttman provides a useful comparison when the halves differ slightly in spread.
How to Use This Calculator
- Split the assessment into two comparable halves, such as odd-even items.
- Enter one score per respondent for Half A and Half B.
- Keep the respondent order identical in both score lists.
- Press Calculate Reliability to show the result block above the form.
- Review the half correlation, corrected reliability, Guttman value, and SEM.
- Use CSV or PDF export when sharing results with colleagues or attaching evidence to reports.
Interpretation Notes
A higher Spearman-Brown value suggests the full assessment produces more consistent scores across equivalent halves. Values below 0.70 often signal weak internal consistency, unstable content balance, or scoring differences between the two sections.
If the corrected reliability is high but Guttman is lower, inspect whether one half is easier or more variable. Large SEM values indicate wider uncertainty around observed total scores.
Frequently Asked Questions
1. What does split-half reliability measure?
It measures internal consistency by comparing scores from two equivalent halves of the same test. Strong agreement suggests the full test is reliably measuring one construct.
2. Why is Spearman-Brown correction needed?
The two halves are shorter than the full assessment. Spearman-Brown adjusts the half-test correlation upward to estimate reliability for the complete instrument.
3. When should I use an odd-even split?
Use odd-even splits when items are ordered similarly in difficulty and content. It usually creates balanced halves without manually redesigning the assessment.
4. Can unequal half lengths be used?
This page assumes paired half scores from comparable sections. If halves differ greatly in length, reliability estimates can become harder to interpret correctly.
5. What is a good reliability value?
Many applied settings prefer 0.70 or higher. High-stakes testing often targets 0.80 to 0.90, depending on consequences and score precision needs.
6. Why might reliability become negative?
Negative reliability usually means the halves do not move together. Check reverse-coded items, scoring mistakes, mismatched respondent order, or poor split design.
7. What does SEM add beyond reliability?
SEM converts reliability into score uncertainty. It helps you estimate how much an observed total score may vary because of measurement error.