White-Basilisk: Efficient 200M-Parameter AI for Code Vulnerability Detection
White-Basilisk is a recently introduced AI model that sets a new benchmark for software vulnerability detection with only 200 million parameters, challenging conventional assumptions about model scaling. Its architecture combines Mamba layers, linear self-attention, and a Mixture of Experts, which lets it analyze long code sequences in a single pass and sidestep the context-window limits of existing large language models. It also performs robustly on the imbalanced datasets typical of real-world code while remaining computationally efficient enough for practical deployment.
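To make the architectural claim concrete, below is a minimal PyTorch sketch of two of the three ingredients: a kernelized linear self-attention, whose O(n) cost in sequence length is what permits single-pass analysis of long files, and a top-1 Mixture-of-Experts feed-forward, which keeps only a fraction of the parameters active per token. The Mamba layers are omitted for brevity (a full implementation would typically pull them from a state-space-model package such as mamba_ssm). All class names and hyperparameters here are illustrative assumptions, not White-Basilisk's actual configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class LinearAttention(nn.Module):
    """Kernelized self-attention: O(n) in sequence length, which is the
    property that lets a long code file be analyzed in one pass."""

    def __init__(self, dim: int, heads: int = 8):
        super().__init__()
        self.heads = heads
        self.to_qkv = nn.Linear(dim, dim * 3, bias=False)
        self.to_out = nn.Linear(dim, dim)

    def forward(self, x):
        b, n, d = x.shape
        q, k, v = self.to_qkv(x).chunk(3, dim=-1)
        q, k, v = (t.view(b, n, self.heads, -1).transpose(1, 2) for t in (q, k, v))
        # Positive feature map phi(x) = elu(x) + 1 stands in for softmax.
        q, k = F.elu(q) + 1, F.elu(k) + 1
        # Associativity: q @ (k^T @ v) costs O(n * d^2) instead of O(n^2 * d).
        kv = torch.einsum("bhnd,bhne->bhde", k, v)
        z = 1.0 / (torch.einsum("bhnd,bhd->bhn", q, k.sum(dim=2)) + 1e-6)
        out = torch.einsum("bhnd,bhde,bhn->bhne", q, kv, z)
        return self.to_out(out.transpose(1, 2).reshape(b, n, d))


class MoEFeedForward(nn.Module):
    """Top-1 Mixture of Experts: each token is routed to a single expert
    FFN, so only a fraction of the parameters is active per token."""

    def __init__(self, dim: int, num_experts: int = 4, hidden: int = 512):
        super().__init__()
        self.gate = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, hidden), nn.GELU(), nn.Linear(hidden, dim))
            for _ in range(num_experts)
        )

    def forward(self, x):
        weights, chosen = self.gate(x).softmax(dim=-1).max(dim=-1)  # top-1 routing
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = chosen == e
            if mask.any():
                out[mask] = expert(x[mask]) * weights[mask].unsqueeze(-1)
        return out


class HybridBlock(nn.Module):
    """One residual block pairing linear attention with an MoE FFN.
    (The interleaved Mamba layers are omitted; see the note above.)"""

    def __init__(self, dim: int):
        super().__init__()
        self.norm1, self.norm2 = nn.LayerNorm(dim), nn.LayerNorm(dim)
        self.attn = LinearAttention(dim)
        self.moe = MoEFeedForward(dim)

    def forward(self, x):
        x = x + self.attn(self.norm1(x))
        return x + self.moe(self.norm2(x))


# A long "code file" of 4096 embedded tokens fits in one forward pass.
block = HybridBlock(dim=256)
tokens = torch.randn(1, 4096, 256)
print(block(tokens).shape)  # torch.Size([1, 4096, 256])
```

Because the attention cost grows linearly rather than quadratically, doubling the sequence length here roughly doubles compute instead of quadrupling it, which is the efficiency argument the summary makes.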
Key facts
- White-Basilisk uses 200M parameters
- Integrates Mamba layers, linear self-attention, and Mixture of Experts
- Processes entire long code files in a single pass
- Avoids the context-window limits of current LLMs
- Robust on imbalanced real-world datasets
- Computationally efficient for deployment
- Challenges prevailing assumptions in AI model scaling