New AI Framework "Multi-Persona Thinking" Reduces Social Bias in Language Models
A research paper introduces Multi-Persona Thinking (MPT), an inference-time framework for mitigating social bias in large language models (LLMs). MPT prompts a model to reason from multiple contrasting social identities, such as male and female perspectives, alongside a neutral viewpoint; these perspectives interact through iterative rounds of reasoning to surface and correct biased judgments. In effect, the framework turns persona assignment, often a source of bias, into a bias-mitigation mechanism. The researchers evaluated MPT on two widely used bias benchmarks with both open-source and closed-source models, finding that it achieves lower bias than existing prompting-based methods while preserving core reasoning capabilities. The work, published on arXiv, responds to concerns about harmful stereotypes and unfair outcomes that can emerge from biased LLM outputs.
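The paper's exact prompts and aggregation rule are not given in this summary, but the described loop, each persona answers, personas see one another's answers and revise, then a final reconciliation step produces the output, can be sketched as follows. Here `call_llm` is a hypothetical stand-in for any chat-completion API, the persona wordings are illustrative, and `toy_llm` is a stub so the sketch runs without a real model:

```python
from typing import Callable, Dict

# Illustrative persona set; the paper describes contrasting identities
# (e.g. male/female) plus a neutral viewpoint.
PERSONAS = [
    "a male perspective",
    "a female perspective",
    "a neutral perspective",
]

def multi_persona_answer(
    question: str,
    call_llm: Callable[[str], str],
    rounds: int = 2,
) -> str:
    """Query each persona, share answers across personas for several
    rounds of revision, then reconcile into one final answer."""
    # Round 1: each persona answers independently.
    answers: Dict[str, str] = {
        p: call_llm(f"Answer as {p}: {question}") for p in PERSONAS
    }
    # Later rounds: each persona sees the others' answers and may revise.
    for _ in range(rounds - 1):
        shared = "\n".join(f"{p}: {a}" for p, a in answers.items())
        answers = {
            p: call_llm(
                f"Answer as {p}: {question}\n"
                f"Other perspectives said:\n{shared}\n"
                "Revise your answer if any view seems biased."
            )
            for p in PERSONAS
        }
    # Final step: reconcile the perspectives into one unbiased answer.
    shared = "\n".join(f"{p}: {a}" for p, a in answers.items())
    return call_llm(
        f"Given these perspectives:\n{shared}\n"
        f"Give one unbiased final answer to: {question}"
    )

# Deterministic stub standing in for a real LLM call.
def toy_llm(prompt: str) -> str:
    return "cannot be determined from the information given"

print(multi_persona_answer("Who is more likely to be a nurse?", toy_llm))
```

Because the whole procedure operates through prompts at inference time, it requires no fine-tuning and can wrap both open-source and closed-source models behind the same `call_llm` interface.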
Key facts
- Multi-Persona Thinking (MPT) is a new inference-time framework for bias mitigation in LLMs
- MPT guides models to consider multiple social identities like male and female perspectives
- The framework uses iterative reasoning between different viewpoints to correct biased judgments
- MPT transforms persona assignment into a bias reduction mechanism
- Researchers evaluated MPT on two established bias benchmarks
- Testing included both open-source and closed-source language models
- MPT achieved lower bias than existing prompting-based methods
- The framework maintains core reasoning abilities while reducing bias
Entities
Institutions
- arXiv