FormalScience: Human-in-the-Loop Autoformalisation for Scientific Proofs in Lean
Researchers have introduced FormalScience, a domain-agnostic human-in-the-loop agentic pipeline designed to autoformalise informal mathematical reasoning into formally verifiable code using the Lean4 theorem prover. The system targets scientific fields like physics, where domain-specific notation (e.g., Dirac notation, vector calculus) poses additional formalisation challenges that current LLMs and agentic approaches have not addressed. FormalScience enables a single domain expert without deep formal language experience to produce syntactically correct and semantically aligned formal proofs at low economic cost. Applied to physics, the pipeline generated FormalPhysics, a dataset of 200 university-level LaTeX physics problems and solutions—primarily in quantum mechanics and electromagnetism—along with their Lean4 formal representations. This work addresses the scalability of autoformalisation in scientific domains.
Key facts
- FormalScience is a human-in-the-loop agentic pipeline for autoformalisation.
- It targets scientific fields such as physics with domain-specific notation.
- The system uses Lean4 for formal verification.
- A single domain expert without deep formal language experience can use it.
- FormalPhysics dataset contains 200 university-level physics problems.
- Problems are in quantum mechanics and electromagnetism.
- The pipeline produces syntactically correct and semantically aligned proofs.
- It aims to reduce economic cost of formalisation.
Entities
Institutions
- arXiv