FormalScience: Human-in-the-Loop Autoformalisation for Scientific Proofs in Lean

ai-technology · 2026-04-29

Researchers have introduced FormalScience, a domain-agnostic human-in-the-loop agentic pipeline designed to autoformalise informal mathematical reasoning into formally verifiable code using the Lean4 theorem prover. The system targets scientific fields like physics, where domain-specific notation (e.g., Dirac notation, vector calculus) poses additional formalisation challenges that current LLMs and agentic approaches have not addressed. FormalScience enables a single domain expert without deep formal language experience to produce syntactically correct and semantically aligned formal proofs at low economic cost. Applied to physics, the pipeline generated FormalPhysics, a dataset of 200 university-level LaTeX physics problems and solutions—primarily in quantum mechanics and electromagnetism—along with their Lean4 formal representations. This work addresses the scalability of autoformalisation in scientific domains.

Key facts

FormalScience is a human-in-the-loop agentic pipeline for autoformalisation.
It targets scientific fields such as physics with domain-specific notation.
The system uses Lean4 for formal verification.
A single domain expert without deep formal language experience can use it.
FormalPhysics dataset contains 200 university-level physics problems.
Problems are in quantum mechanics and electromagnetism.
The pipeline produces syntactically correct and semantically aligned proofs.
It aims to reduce economic cost of formalisation.

FormalScience: Human-in-the-Loop Autoformalisation for Scientific Proofs in Lean

Key facts

Entities

Institutions

Sources