Slides accompanying the paper: Simon Knight, Andrea Leigh, Yvonne C. Davila, Leigh J. Martin & Daniel W. Krix, Assessment and Evaluation in Higher Education. https://doi.org/10.1080/02602938.2019.1570483
In calibration tasks students assess exemplar texts using criteria against which their own work will be assessed. Typically these tasks are used in the context of training for peer assessment. Little research has been conducted on the benefits of calibration tasks, such as benchmarking, as learning opportunities in their own right. This paper examines a dataset from a long-running benchmarking task (~500 students per semester, for four semesters). We investigate the relationship of benchmarking performance to other student outcomes, including the ability to self-assess accurately. We show that students who complete the benchmarking perform better, that there is a relationship between benchmarking performance and self-assessment performance, and that students appreciate the support for learning that benchmarking tasks provide. We discuss implications for teaching and learning, flagging the potential of calibration tasks as an under-explored tool.
4. How do we give feedback at scale?
• Assess infrequently
• Make assessments that are easy to give feedback on (quizzes, etc.)
• Use peer and self-assessment
• Ensure teams of tutors (and students) can give quality feedback
• Provide opportunities for whole-cohort practice
6. Benchmarking: how do we give feedback at scale?
Benchmarking tasks require students to give feedback on previously marked exemplars, typically of varying quality.
Why?
1. Students engage with criteria & their use
2. Students critically assess exemplars
3. Students & academics see how well students apply the criteria (how they're calibrated) – feedback opportunity
8. How do we measure impact?
Measuring the impact of teaching innovation is hard.
Typically there are lots of semester-to-semester changes (including the students!).
Impact is often measured via 'happy sheets' and a few enthusiastic learners.
9. Our context: how do we measure impact?
~500 students in first-year Biocomplexity.
Since 2012 they have done a benchmarking task via SPARKPlus + self-assessment.
We analysed the 2012–15 data.
11. Q1: How accurate are students in their self-assessments, and what is the relationship of this to their grade?
[Figure: histogram comparing the distribution of staff marks to student self-assessments]
Students over-estimate their grades; those who over-estimate do worse.
There is a strong correlation, i.e. a relationship between over-estimating and having a lower mark (and vice versa); r(2012) = .68, p < .0001.
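A minimal sketch of this kind of distance-and-correlation check; the DataFrame, column names, and values below are hypothetical illustrations, not the study's data or code:

```python
import pandas as pd
from scipy import stats

# Hypothetical marks; the real analysis used the full cohort data.
marks = pd.DataFrame({
    "staff_mark": [72, 65, 80, 58, 90, 61, 77],
    "self_mark":  [78, 75, 82, 70, 88, 74, 75],
})

# Distance score: actual (staff) mark minus self-assessed mark, so
# over-estimators get negative distances and under-estimators positive ones.
marks["distance"] = marks["staff_mark"] - marks["self_mark"]

# Correlate the distance with the staff mark; the slide reports
# r(2012) = .68, p < .0001 for the real 2012 cohort.
r, p = stats.pearsonr(marks["distance"], marks["staff_mark"])
print(f"r = {r:.2f}, p = {p:.4f}")
```

The same pearsonr call covers the Q5 analysis further on, where benchmarking distance scores are correlated with self-assessment distances instead of final marks.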
12. Q2: Do students who complete the benchmarking perform better in their assessment than those who do not?
Students who do not complete the benchmarking perform worse (ignoring students who dropped out):
Did the task: M = 74.14 (SD = 9.28, N = 1979)
Did not do the task: M = 68.24 (SD = 12.16, N = 129)
t(137.88) = 5.41, p < .0001; d = 0.62 (medium effect).
Students who didn't do the task also varied more in their criterion-level marks.
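A minimal sketch of this kind of two-group comparison, assuming the marks sit in two arrays; the data below are synthetic draws matching the reported group statistics, and none of this is the authors' actual code:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# Synthetic marks drawn to match the reported group means and SDs.
did_task = rng.normal(74.14, 9.28, size=1979)
did_not  = rng.normal(68.24, 12.16, size=129)

# Welch's t-test (unequal variances), consistent with the fractional
# degrees of freedom reported on the slide, t(137.88).
t, p = stats.ttest_ind(did_task, did_not, equal_var=False)

# Cohen's d using the pooled standard deviation; this formulation
# reproduces the reported d = 0.62 from the slide's means and SDs.
def cohens_d(a, b):
    na, nb = len(a), len(b)
    pooled = np.sqrt(((na - 1) * a.std(ddof=1) ** 2 +
                      (nb - 1) * b.std(ddof=1) ** 2) / (na + nb - 2))
    return (a.mean() - b.mean()) / pooled

print(f"t = {t:.2f}, p = {p:.4g}, d = {cohens_d(did_task, did_not):.2f}")
```

The same Welch comparison applies to the Q4 result below, where the groups' self-assessment distance scores are compared instead of their overall marks.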
13. Q3: Is accuracy on the benchmarking predictive of final mark?
No evidence of a link between benchmarking accuracy and final mark.
14. Q4: Are students who complete the benchmarking significantly more accurate in their self-assessment?
Students who benchmark are better self-assessors. Comparing distances (i.e. the mark they gave themselves, subtracted from their actual mark):
Did the task: M = 1.15 (SD = 16.13, N = 1979)
Did not do the task: M = 8.13 (SD = 27.47, N = 129)
t(133.84) = 3.00, p = .0032; a medium effect (d = .62).
15. Q5: Is accuracy on the benchmarking related to self-assessment accuracy?
Yes: students who are more accurate at benchmarking are more accurate self-assessors. There is a small significant relationship between the benchmarking distance scores and student self-assessment distances; r(1887) = .10, p < .0001.
16. Q6: What are student perceptions of feedback structures to support their assignment completion?
Students think the task is valuable.
In 2012 & 2013, 430 students (~45% of the cohort) completed a feedback survey. The feedback from these cohorts was generally positive (>75% agree or strongly agree on all questions):
• The SPARK benchmarking process (week 4) helped me to engage early with the report assessment criteria
• The report assessment criteria helped me to understand what was expected in my report
• I followed the assessment criteria closely when writing my report
• I understood how each assessment criterion contributed to a particular Graduate Attribute
• Self-assessing my report helped me to critically evaluate my own academic performance in this task
• I have a better understanding of why scientific writing skills are important for a scientific career
• Overall I was satisfied with the report-writing learning process
17. For example…
"Benchmarking helped me to understand what level of writing was expected for each grade. The feedback and re-submission really helped me to better my writing and to understand how I could improve."
"That it forced me to be familiar with the marking criteria BEFORE writing the assignment. Usually I look at the criteria after writing the assignment and seeing whether it met the criteria, but with this method I made sure to incorporate the points whilst writing."
"Having previous reports to look at and gain understanding how to write and what the markers are looking for."
20. How do we give feedback at scale?
• Is the quality of students' written comments in benchmarking related to their other learning outcomes? (ongoing analysis)
• If students learn from giving feedback, how do we build that capacity?
• How do we support students to understand the feedback they receive, and to make sure they get consistently good feedback from tutors?
• Created a tutor & student feedback guide
23. Feedback on the "References" criterion from Example B (examples 1–5)
“It has got 6 references which is good number of credible references.”
"In this report, the person did demonstrate a well knowledge about referencing."
“Harvard style referencing was well used. Next time include volumes/editions/page
number to specify what section of the book/journal information was obtained. The
quality of paraphrasing is really only of a credit standard - it’s evident you’ve used
some secondary sources, but the in-text referencing style needs a little work; to
improve, have a look at the rubric given, and familiarise yourself with the resources
provided on “in-text” referencing.”
"The citation was well presented and done correctly however the reference list was poorly set out. Out of the four resources on the reference list only one contained authors. The others were scientific journals and books and therefore needed authors on this list. The second resource lacks volume and page numbers whilst the third lacks a sub heading reference that the second has. Lack of consistency is found throughout this reference list and needs more work. Lastly only four references is not enough to validate the argument. Very little citation of these references are found in the discussion and therefore isn't linking the work of the valid resources to the reasons within the experiment. Great improvement needed."
“Referencing was okay but infrequent and some references were ancient.”
Rank these pieces of feedback in order of “most useful” (1) to “least useful” (5)
25. Thank you
Simon Knight, Andy Leigh, Yvonne Davila, Leigh Martin
@sjgknight
Thanks to Dan Krix and Alex Thompson for their work on the benchmarking project, and to other academics and students who have contributed to the benchmarking development.
Thanks to Shirley Alexander for VCLT funding in support of this project.
Draft paper available on request.
https://tinyurl.com/BenchmarkingGuide
26. 2012–15 data analysed
Data from 2012–15 of this innovation was analysed to investigate the relationship between the accuracy of student self-assessments and learning outcomes, and to understand the features of quality feedback in these tasks. Analysis indicates that:
• students who complete the benchmarking task perform better
• students who are more accurate self-assessors perform better
• students who are more accurate in the benchmarking task are also more accurate in the self-assessment task
• students are overwhelmingly positive about the task, and are able to articulate its key intended learning outcomes
Editor's Notes
One approach – assess infrequently…the worst of the approaches.
Easy auto-grading quizzes can be quite a good method to introduce practice opportunities, but have limited scope.
So peer and self-assessment is interesting, and there is good evidence that it leads to positive outcomes.
students who overestimate their mark (i.e., have negative distances) are more likely to have lower overall marks, while students who underestimate their mark significantly (i.e. have positive distances), are more likely to have higher overall marks
There was a significant difference in the overall marks of students such that those who completed the benchmarking task scored higher (M = 74.14, SD = 9.28, N = 1979), compared to those who did not (M = 68.24, SD = 12.16, N = 129); t(137.88) = 5.41, p < .0001. d = 0.62. In addition, students who completed the benchmarking task had significantly lower mark variability among criteria (computed by calculating the standard deviation of their marks across the criteria) (M = 5.54, SD = 3.28, N = 1972), than those who did not (M = 6.73, SD = 5.59, N = 129); t(133.82) = 2.41, p = .01734, d = 0.20. That is, students who did not complete the benchmarking task performed significantly poorer overall, and achieved less consistent marks across the criteria, implying a poorer ability to calibrate against these criteria. d (or Cohen’s d) is an effect size measure representing the difference between the two group means divided by the average of their standard deviations, thus a d of 1 represents that the two groups differ by 1 SD, .5 by half an SD, etc., with .2 considered small, .5 medium, and .8 large
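As a worked formula: the note above describes d loosely as the mean difference over the "average" of the SDs, but the pooled-SD formulation below is the standard one, and it reproduces the reported d = 0.62 from the group statistics given:

$$d = \frac{M_1 - M_2}{s_p}, \qquad s_p = \sqrt{\frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}$$

$$s_p = \sqrt{\frac{1978 \cdot 9.28^2 + 128 \cdot 12.16^2}{2106}} \approx 9.48, \qquad d = \frac{74.14 - 68.24}{9.48} \approx 0.62$$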
Next phase is to look at the written comments
That is, (in)accuracy in the benchmarking task and (in)accuracy in self-assessment are related, such that those who were more inaccurate in the benchmarking were also more inaccurate in their self-assessment.