EngiAI: Multi-Agent Framework for LLM-Driven Engineering Design

ai-technology · 2026-05-20

A new benchmark suite and multi-agent system called EngiAI has been introduced for evaluating LLM-driven engineering design. The benchmark comprises three dimensions: a workflow benchmark with seven prompt styles targeting cognitive demands like direct tool use and semantic disambiguation; a Retrieval-Augmented Generation (RAG) benchmark with gated scoring for parameter selection; and an HPC benchmark for end-to-end ML training orchestration on a SLURM cluster. EngiAI is a reference implementation built on LangGraph, coordinating seven specialized agents through a supervisor. The framework addresses the lack of evaluation standards for multi-agent systems combining simulation, retrieval, and manufacturing preparation.

Key facts

EngiAI is a multi-agent system for LLM-driven engineering design.
The benchmark suite has three dimensions: workflow, RAG, and HPC.
Workflow benchmark includes seven prompt styles.
RAG benchmark uses gated scoring for retrieval contributions.
HPC benchmark evaluates ML training orchestration on SLURM.
EngiAI is built on LangGraph.
Seven specialized agents are coordinated by a supervisor.
The framework targets simulation, retrieval, and manufacturing preparation.

EngiAI: Multi-Agent Framework for LLM-Driven Engineering Design

Key facts

Entities

Institutions

Sources