ARTFEED — Contemporary Art Intelligence

GPTs Show Promise but Remain Unreliable for Spreadsheet Modeling

other · 2026-04-30

A recent study published on arXiv (2604.25689) examines GPT-based applications for creating analytical spreadsheet models. Researchers evaluated five GPT extensions and chose Excel AI from pulsrai.com for in-depth analysis. They conducted structured experiments using straightforward problem statements to measure Excel AI against the ERFR criteria (inputs in each cell; formulas; absence of hardcoded numbers; labels; precision). Findings indicate that while Excel AI can generate well-organized models, it often lacks consistency and reproducibility. The study highlights two main obstacles: "the problem of confidence" and "the problem of workflow," underscoring the necessity for proficient users to validate and modify GPT-generated spreadsheets. Despite the potential of GPTs to create draft models that could expedite development and reduce skill demands, the current tools are deemed unreliable for professional applications. The paper wraps up with suggestions for future research on prompting techniques.

Key facts

  • Paper investigates GPT-based tools for building reusable analytical spreadsheet models.
  • Five GPT extensions were screened; Excel AI by pulsrai.com was selected for detailed testing.
  • Experiments conducted on simple problem statements.
  • Excel AI assessed against ERFR criteria: each input in a cell; cell formulas; no hardwired numbers; labels; accurate.
  • Excel AI produces well-structured models but is inconsistent and often non-reproducible.
  • Two central challenges identified: 'the problem of confidence' and 'the problem of workflow'.
  • Skilled users are needed to verify and adapt GPT-generated spreadsheets.
  • Current GPT tools remain unreliable for professional use.
  • Recommendations for future research into prompting strategies are provided.

Entities

Institutions

  • arXiv
  • pulsrai.com

Sources