Schematic overview of the study design. The authors asked large language models (LLMs) to rate approximately 700 English words that are typically acquired early in childhood according to 21 ...
The rise of generative artificial intelligence has prompted claims that large language models (LLMs) can substitute for human participants, particularly in moral judgment tasks where correlations ...
To assess the similarity of model word ratings to human word ratings across each dimension, we calculated the Spearman rank correlation between model-generated and human-generated ratings at both the ...