# Evaluating large language models for criterion-based grading from agreement to consistency

> Research article (npj Science of Learning, 2024) · cited 21× · AI/ML

**Wikidata**: [openalex:W4405916612](https://www.wikidata.org/wiki/openalex:W4405916612)  
**Source**: https://4ort.xyz/entity/evaluating-large-language-models-for-criterion-based-grading-from-agreement-to-consistency
