I wonder if, with a big enough sample (and perhaps added details and controls) you could attempt to use the distribution of the survey to uncover inconsistencies in grading. For instance, if there's an unseemly number of of 7B's purporting to be 7C, then adjusting for that would create a decent normal distribution.