BODHI: LLM-Based OS Kernel Specification Inference
A new method called BODHI has been introduced by researchers to enhance the automated creation of formal specifications for operating system kernels through large language models. This approach enriches few-shot prompts with a detailed C-to-Python translation guide that encompasses 15 categories of domain-specific patterns. Drawing inspiration from Structured Chain-of-Thought prompting, it distinguishes between the extraction of pre-conditions and the generation of post-conditions. BODHI was tested on nine models from six different providers—Anthropic, Mistral, Amazon, DeepSeek, Meta, and Alibaba—addressing the OSV-Bench benchmark, which includes 245 specification generation tasks from the Hyperkernel OS kernel, achieving a highest reported Pass@1 rate of 55.10%.
Key facts
- BODHI is a domain knowledge prompting method for LLM-based OS kernel specification inference.
- It augments few-shot prompts with a structured C-to-Python translation guide covering 15 categories.
- The method is inspired by Structured Chain-of-Thought (SCoT) prompting.
- It separates pre-condition extraction and post-condition generation as distinct categories.
- Evaluated on nine models from six providers: Anthropic, Mistral, Amazon, DeepSeek, Meta, Alibaba.
- OSV-Bench benchmark includes 245 specification generation tasks from Hyperkernel OS kernel.
- Best reported Pass@1 on OSV-Bench is 55.10%.
- The work is published on arXiv with ID 2605.23931.
Entities
Institutions
- Anthropic
- Mistral
- Amazon
- DeepSeek
- Meta
- Alibaba