SLP Grading Metric

Go up to the main UVa page (md)

This course is somewhat of a challenge to grade, as there is very little graded work in the traditional sense. Nonetheless, here is how everything is graded for the fall 2017 semester. The details herein follow the grading guidelines described in the fall 2017 course introduction, with two slight modifications, described herein (search for 'difference' in this document if you want to find them quickly).

Promises

First, there were two promises that were made to the class:

Both of these were strictly adhered to.

Work-done and the grading factor

Two of the scores were project-based: the iteration evaluations and the final group project rating. The reason is that a student who does little work in an otherwise great group should not get as much credit for the quality of the system as somebody who does great work. Likewise, a great student in a weaker group should get more credit than students who do less work.

To determine this, all students are assigned a "work-done" rating, which is based on how much work we feel the student has done during the semester. This was originally based on a Likert scale (integer values from 1 to 5), but we realized that values between these were necessary as well. Thus, the scale is as follows.

rating as text rating as a number factor
well below average 1 0.35
a bit worse than below average 1.5 0.55
below average 2 0.75
slightly below average 2.5 0.875
ever so slightly below average 2.75 0.9375
average 3 1
ever so slightly above average 3.25 1.0375
slightly above average 3.5 1.075
above average 4 1.15
a bit more than above average 4.5 1.225
well above average 5 1.3

Yes, the names need a bit of work. And keep in mind that an 'average' rating is on track to get an 'A'.

The last column is the factor, which is used to scale the group scores. Thus, if a group receives a 80% on something, a student with an 'average' rating would receive the full 80%. A student with a 'below average' rating would receive 0.75*80% = 60% for that part. Likewise, a student with a 'well above average' would receive 1.3*80 = 104% for that part.

Note that a 'well below average' grade is an automatic failure for the course, regardless of any other grades. One can still fail with a score other than 'well below average'. It has always been the case, however, that students with a 'well below average' were clearly in the failure category anyway. Historically, it has been very rare for that to be issued a 'well below average' rating in the spring, as those students typically do not pass the fall semester.

Iteration evaluations

The progress done on the iteration evaluations were graded out of 4 points: 0 was no progress, 1 was poor, 2 was fair, 3 was good, 4 was excellent. Note that a 0 (none) has never (yet) been issued. Also note that the last iteration in a semester typically does not have an iteration evaluation rating, as that is encapsulated into the overall project evaluation.

The iteration evaluations were averaged to yield the evaluation score, and that average is shown in the grading metrics. An average of 1.0 (all poor ratings) would be 0% for this part, and an average of 3.0 (all average ratings) would be 93% (an A); this scale is linear. Thus, the conversion formula is:

percent = (average - 1) x 93.0/2.0 = (average - 1) x 46.5

Thus, all 'good' evaluations would yield (3.0-1) x 46.5 = 93%, which is an 'A'. An average evaluation score of 2.5 (half 'good' and half 'fair' evaluations) would result in (2.5-1) x 46.5 = 69.75%. As this is a group score, it is multiplied by the factor from above.

Leadership score

The leadership score is graded out of 10 points, and was based on the results of one's leadership iterations; these are shown in the evaluations tab of the course grading tool. One could be the secretary, lead, both, or neither in a given semester. Note that the 'neither' situation could only occur in the fall, and only for groups where there were more than six members.

Note that taking really poor notes would count as no notes taken for that occurrence.

Grade components

The spring grades were computed as follows:

One of the differences from the fall 2017 course introduction was that instead of counting leadership for 10%, we counted it for 5% and the journal entries for 5%. Note that only the first two items were group work. The last four items were were individual work, and thus are not multiplied by the factor.

The result of the above is called the base score.

Penalties

There were a few things that would deduct from one's grade. With one exception, these were all clearly stated throughout the semester, as well as the course syllabus (md) and the grades section of the fall course introduction.

The 'base score' (from above), minus the penalties, is the final course average.

Final course grade

This course used the standard decade scale. The final course average was rounded up (if appropriate), and the following table was used to compute the final course grade.

minimum score letter grade
-∞ F
60 D-
63 D
67 D+
70 C-
73 C
77 C+
80 B-
83 B
87 B+
90 A-
93 A
??? A+

Some notes on this:

Analysis

One may claim that this metric is too harsh. Perhaps, but it does take into account, in a formulaic way, all the factors that contribute to the grade. And students had chance to challenge every component that is used in the grade determination.

For the fall 2017 semester, these were the grade distributions:

Those percentages are higher than a typical 4th year course, so the distribution of grades is reasonable in comparison to other 4th year courses.

Also, it turned out this semester that the average work-done rating was exactly 3.0. We weren't aiming for that, it just turned out that way.

Done!

Whew! And you should see the Excel (actually LibreOffice) formulas used in the grade book worksheet...