Ever wonder how a cutscore is set on a certification/licensure test? This is a very brief intro to the topic of standard setting, that is, how cutscores (passing points) are set on credentialing exams using scientifically-backed research and rigorous psychometrics. Some approaches include the modified-Angoff, Bookmark, and Contrasting Groups. Visit www.assess.com to learn more.
2. Why are cutscores necessary?
ī¨ As Glaser (1963) pointed out, the reason for
the existence of many tests is to make
decisions about people
ī¤ Mastery: Pass/Fail educational content
ī¤ Credentialing: Award/not professional credential
(certification, certificate, license)
ī¤ Pre-employment: Hire/not for job (or eligible as
candidate)
ī¤ University selection: Admission/not to university
or program
4. Why are cutscores necessary?
ī¨ What does that mean?
ī¨ That most of what we want to measure is in a
continuum (knowledge, intelligence) and not
naturally in âstatesâ (e.g., male/female)
ī¨ So we need to set a cutscore (or cutscores) on
the continuum to sort examinees into groups
that reflect interpretations and meanings that
are useful to us
ī¤ Pass is âqualifiedâ and Fail is âunqualifiedâ
5. How do we set a cutscore?
ī¨ As the Livingston excerpt notes, all cutscores
involve a level of subjectivity or arbitrariness
ī¨ The higher the stakes of the exam, the more
we need to reduce the arbitrariness
ī¨ Standard setting methods differ in their level of
objectivity
ī¨ A more objective method provides an anchor
to validity and defensibility
6. How do we set a cutscore?
Approach Example Arbitrarine
ss
Arbitrary round
number
70% of items MOST
Quota Whatever passes 85%
of people (z=-1.0)
MOST
Examinee-based Borderline,
Contrasting Groups
LEAST
Content-based Angoff, Bookmark LEAST
7. Examinee-based methods
ī¨ Borderline Method
ī¤ Experts familiar with content AND all examinees
identify those examinees they consider
âborderlineâ
ī¤ The mean or median score for those examinees
is the cutscore
ī¨ Contrasting Groups Method
ī¤ Experts familiar with content AND all examinees
sort examinees into Pass and Fail Groups (or
external criterion is used)
ī¤ The point where the two score distributions cross
is the cutscore
8. Examinee-based methods
ī¨ Are conceptually appealing but have two large
disadvantages:
ī¤ Require examinees to take the test first, so
pass/fail decisions cannot be made after they
finish the test
ī¤ Require a way to assign examinees into groups
WITHOUT test scores â either experts that are
familiar with all examinees or some sort of âgold
standardâ
īŽ Example: For a practice test, results on the real test
can be used as a gold standard to set cutscore
9. Content-based methods
ī¨ The Angoff and Bookmark methods require
experts to look at items rather than candidates
ī¨ Bookmark: pilot all items, analyze difficulty
statistics, order the items by difficulty in a
booklet, and ask experts to place a bookmark
ī¨ Angoff: All experts provide a rating 0 to 100 for
each item, average serves as cutscore
10. Content-based methods
ī¨ The Angoff method is the most commonly
used approach in certification testing and
therefore quite legally defensible
ī¨ Biggest advantage: does not require test to be
administered for data
ī¨ Can use data too, with Beuk Compromise, to
incorporate examinee-based aspects
ī¨ The drawback is that it requires a group of
subject matter experts to rate all items, which
can take time
11. Content-based methods
ī¨ The Bookmark method has the advantage that
a rating is not required for every item from
every expert (which takes a lot of time)
ī¨ The drawback is that it requires all items to be
delivered to a decent-sized sample in order to
obtain item difficulty statistics (might not be
feasible)