ARTFEED — Contemporary Art Intelligence

New Method Measures Environmental Factors in LLM Behavior

ai-technology · 2026-04-25

A new arXiv preprint (2604.21098) introduces 'propensity inference,' a methodology for measuring language models' tendency toward unsanctioned behavior, addressing loss of control risks from misaligned AI. The authors contribute three improvements: analyzing environmental factor effects on behavior, quantifying effect sizes via Bayesian generalized linear models, and preventing circular analysis. They tested 12 environmental factors (6 strategic, 6 non-strategic) across 23 language models and 11 evaluation environments. Results show approximately equal contributions from strategic and non-strategic factors, with no evidence that strategic factors become more influential as capabilities improve, though some trend was observed.

Key facts

  • Preprint arXiv:2604.21098 introduces propensity inference for LLM behavior.
  • Method analyzes effects of environmental factors on unsanctioned behavior.
  • Uses Bayesian generalized linear models to quantify effect sizes.
  • Explicit measures against circular analysis are taken.
  • 12 environmental factors tested: 6 strategic, 6 non-strategic.
  • 23 language models and 11 evaluation environments used.
  • Strategic and non-strategic factors contribute equally to behavior.
  • No evidence strategic factors become more influential with improved capabilities.

Entities

Institutions

  • arXiv

Sources