ARTFEED — Contemporary Art Intelligence

FormalScience: Human-in-the-Loop Autoformalisation for Scientific Proofs in Lean

ai-technology · 2026-04-29

Researchers have introduced FormalScience, a domain-agnostic human-in-the-loop agentic pipeline designed to autoformalise informal mathematical reasoning into formally verifiable code using the Lean4 theorem prover. The system targets scientific fields like physics, where domain-specific notation (e.g., Dirac notation, vector calculus) poses additional formalisation challenges that current LLMs and agentic approaches have not addressed. FormalScience enables a single domain expert without deep formal language experience to produce syntactically correct and semantically aligned formal proofs at low economic cost. Applied to physics, the pipeline generated FormalPhysics, a dataset of 200 university-level LaTeX physics problems and solutions—primarily in quantum mechanics and electromagnetism—along with their Lean4 formal representations. This work addresses the scalability of autoformalisation in scientific domains.

Key facts

  • FormalScience is a human-in-the-loop agentic pipeline for autoformalisation.
  • It targets scientific fields such as physics with domain-specific notation.
  • The system uses Lean4 for formal verification.
  • A single domain expert without deep formal language experience can use it.
  • FormalPhysics dataset contains 200 university-level physics problems.
  • Problems are in quantum mechanics and electromagnetism.
  • The pipeline produces syntactically correct and semantically aligned proofs.
  • It aims to reduce economic cost of formalisation.

Entities

Institutions

  • arXiv

Sources