BODHI: LLM-Based OS Kernel Specification Inference

ai-technology · 2026-05-26

A new method called BODHI has been introduced by researchers to enhance the automated creation of formal specifications for operating system kernels through large language models. This approach enriches few-shot prompts with a detailed C-to-Python translation guide that encompasses 15 categories of domain-specific patterns. Drawing inspiration from Structured Chain-of-Thought prompting, it distinguishes between the extraction of pre-conditions and the generation of post-conditions. BODHI was tested on nine models from six different providers—Anthropic, Mistral, Amazon, DeepSeek, Meta, and Alibaba—addressing the OSV-Bench benchmark, which includes 245 specification generation tasks from the Hyperkernel OS kernel, achieving a highest reported Pass@1 rate of 55.10%.

Key facts

BODHI is a domain knowledge prompting method for LLM-based OS kernel specification inference.
It augments few-shot prompts with a structured C-to-Python translation guide covering 15 categories.
The method is inspired by Structured Chain-of-Thought (SCoT) prompting.
It separates pre-condition extraction and post-condition generation as distinct categories.
Evaluated on nine models from six providers: Anthropic, Mistral, Amazon, DeepSeek, Meta, Alibaba.
OSV-Bench benchmark includes 245 specification generation tasks from Hyperkernel OS kernel.
Best reported Pass@1 on OSV-Bench is 55.10%.
The work is published on arXiv with ID 2605.23931.

Entities

Institutions

Anthropic
Mistral
Amazon
DeepSeek
Meta
Alibaba

Sources

arXiv cs.AI — 2026-05-26