ChatSR: MLLM for Scientific Formula Discovery
Researchers propose ChatSR, a multimodal large language model (MLLM) for scientific formula discovery from observed data. Whereas existing MLLMs target perceptual modalities such as images and video, ChatSR treats scientific data as a new modality: specially designed encoders and modality-alignment mechanisms map the data into the LLM's representation space. This lets the model capture structural characteristics and underlying regularities in the data and draw on the LLM's domain knowledge and reasoning abilities to emulate a knowledgeable human scientist, operating under user-specified prior constraints and preferences.
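The encode-then-align pipeline described above can be sketched minimally as follows. This is an illustrative toy, not ChatSR's actual architecture: the set-encoder design, pooling choice, and all dimensions are assumptions, with random weights standing in for trained ones.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions (assumptions, not from the paper).
N_POINTS, POINT_DIM = 64, 2   # 64 observed (x, y) data points
ENC_DIM, LLM_DIM = 32, 128    # encoder width, LLM embedding width

def encode_points(points, w_enc):
    """Toy set encoder: per-point features pooled into one vector,
    standing in for the specialized scientific-data encoder."""
    feats = np.tanh(points @ w_enc)   # (N_POINTS, ENC_DIM)
    return feats.mean(axis=0)         # permutation-invariant pooling

def align_to_llm(feat, w_proj):
    """Modality alignment: project the data embedding into the LLM's
    token-embedding space so it can be prepended to text tokens."""
    return feat @ w_proj              # (LLM_DIM,)

points = rng.normal(size=(N_POINTS, POINT_DIM))      # the "scientific data"
w_enc = rng.normal(size=(POINT_DIM, ENC_DIM)) * 0.1  # random stand-in weights
w_proj = rng.normal(size=(ENC_DIM, LLM_DIM)) * 0.1

data_token = align_to_llm(encode_points(points, w_enc), w_proj)
prompt_tokens = rng.normal(size=(10, LLM_DIM))       # stand-in text embeddings
llm_input = np.vstack([data_token[None, :], prompt_tokens])
print(llm_input.shape)  # (11, 128)
```

Mean pooling makes the data embedding independent of point ordering, which matters because a set of observations has no canonical order.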
Key facts
- ChatSR is a multimodal large language model for scientific data understanding.
- It treats scientific data as a new modality analogous to visual content.
- It uses carefully designed encoders and modality alignment mechanisms.
- It maps scientific data into a representation space for LLMs.
- The model grasps structural characteristics and underlying regularities.
- It exploits domain knowledge and reasoning abilities of LLMs.
- It emulates a knowledgeable human scientist.
- It operates based on user-specified prior constraints and preferences.
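Because the model is steered by prior constraints and preferences stated by the user, a discovery request might be composed along these lines. The helper and prompt wording are hypothetical illustrations, not ChatSR's actual interface.

```python
# Hypothetical helper for composing a constrained discovery request;
# the wording is illustrative, not ChatSR's actual prompt format.
def build_request(constraints, preferences):
    lines = ["Given the attached scientific data, propose a formula."]
    if constraints:
        lines.append("Prior constraints: " + "; ".join(constraints))
    if preferences:
        lines.append("Preferences: " + "; ".join(preferences))
    return "\n".join(lines)

prompt = build_request(
    constraints=["the expression is symmetric in x1 and x2",
                 "only +, *, and sin may appear"],
    preferences=["prefer shorter expressions"],
)
print(prompt)
```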