LLM Creativity Tests Under Scrutiny in New Study

ai-technology · 2026-05-14

A new study on arXiv (2605.13450) takes a close look at how well we can measure human creativity when compared to large language models (LLMs). It zeroes in on three main aspects: creative writing, divergent thinking, and scientific ideation. The results show that the Divergent Association Task (DAT) and Conditional DAT are the best measures for creative writing and divergent thinking. However, the study also points out issues with how useful these assessments are for judging machine creativity, noting that they have limited validity even for measuring human creativity. This is the first detailed evaluation of its kind.

Key facts

Study from arXiv (2605.13450) assesses LLM creativity tests
Evaluates creative writing, divergent thinking, and scientific ideation
DAT and Conditional DAT are best predictors for creative writing and divergent thinking
Questions validity of human creativity tests for machines
First large-scale systematic study on this topic

LLM Creativity Tests Under Scrutiny in New Study

Key facts

Entities

Institutions

Sources