Rater variability and reliability of constructed response questions in New York state high-stakes tests of English language arts and mathematics: implications for educational assessment policy
J. Huang and P. B. Whipple
Explore these studies to deepen your understanding
Adjacent work that informs or extends this paper's methodology and findings.
Design and validation of a scale for the assessment of educational competencies in traditional musical games
C. F. Amat, F. J. Zarza-alzugaray, et al.
Development of prediction models for screening depression and anxiety using smartphone and wearable-based digital phenotyping: protocol for the Smartphone and Wearable Assessment for Real-Time Screening of Depression and Anxiety (SWARTS-DA) observational study in Korea
Y. Shin, A. Y. Kim, et al.
A new scheme for low-carbon recycling of urban and rural organic waste based on carbon footprint assessment: A case study in China
K. Zhou, Y. Li, et al.
How Does ChatGPT Perform on the United States Medical Licensing Examination? The Implications of Large Language Models for Medical Education and Knowledge Assessment
A. Gilson, C. W. Safranek, et al.

