Content quality | Factuality/accuracy | Is the response factually accurate? | Major error(s) | The response contains significant factual inaccuracies or false information. |
| | | Minor error(s) | The response contains minor factual inaccuracies that do not significantly affect the overall message. The response does not correct factual errors found in the prompt. |
| | | Perfect | No factual inaccuracies in the response. If applicable, the response corrects any factual errors found in the prompt. |
| Relevance | Is the prompt relevant to the topic? Is the response relevant to the prompt? | Major error(s) | The response is completely irrelevant to the prompt or the prompt is entirely off-topic. |
| | | Minor error(s) | The response partially addresses the prompt, or the prompt is only somewhat relevant to the topic. |
| | | Perfect | The response is fully relevant to the prompt and addresses it appropriately, and the prompt is completely relevant to the topic. |
| Completeness | Does the response comprehensively address all parts of the prompt? | Major error(s) | The response does not address several major parts of the prompt, either by omission or incompleteness. |
| | | Minor error(s) | The response addresses all major parts of the prompt but may miss small details. |
| | | Perfect | The response comprehensively addresses all parts of the prompt, with no omissions. |
Linguistic quality | Spelling, grammar, and punctuation | Are the prompt and response in the correct language? Do both the prompt and response follow spelling, grammar, and punctuation rules of that language? | Major error(s) | Incorrect language in the prompt or response. Major spelling, grammatical, or punctuation errors in either the prompt or response. |
| | | Minor error(s) | Minor errors in spelling, grammar, or punctuation; for example, using the wrong English variant (British vs. American, e.g. colour vs. color) or omitting the Oxford comma. Error(s) do not significantly hinder understanding. |
| | | Perfect | Both prompt and response in the correct language, with no spelling, grammatical, or punctuation errors. |
| Clarity | Is the response easy to understand? | Major error(s) | The response is unclear and difficult to understand due to poor language use. |
| | | Minor error(s) | The response is mostly clear but may contain some awkward phrasing or minor clarity issues. |
| | | Perfect | The response is very clear and easy to understand for a general audience. |
| Conciseness | Are the prompt and response both presented concisely without unnecessary details? | Major error(s) | The prompt or response contains many unnecessary details or is overly verbose. |
| | | Minor error(s) | The prompt and response are somewhat concise but could be more succinct. |
| | | Perfect | The prompt and response are concise and to the point, without unnecessary details. |
Adherence to guidelines | Formatting | Do the prompt and response both follow the formatting rules specified in the guidelines? | Major error(s) | The prompt or response does not follow the specified formatting rules at all. |
| | | Minor error(s) | The prompt or response mostly follows the formatting rules but contains some minor deviations. |
| | | Perfect | The prompt and response strictly follow all specified formatting rules. |
| Style/tone | Is the language of both the prompt and response consistent with the writing style and tone defined in the guidelines/style guide? | Major error(s) | The language of the prompt or response is completely inconsistent with the defined writing style and tone. |
| | | Minor error(s) | The language of the prompt or response is mostly consistent with the defined writing style and tone but contains some minor deviations. |
| | | Perfect | The language of both the prompt and response is fully consistent with the defined writing style and tone. |
| Content boundaries | Are the prompt and response free from biased or inappropriate content? | Major error(s) | The prompt or response contains biased, inappropriate, or offensive content. |
| | | Minor error(s) | The prompt and response contain no clearly biased, inappropriate, or offensive content, but some content could be interpreted as biased, inappropriate, or offensive. |
| | | Perfect | The prompt and response are entirely free from biased, inappropriate, or offensive content. |