EdgeFlow: Edge-Map Augmented VLM for Flowchart Processing

other · 2026-05-27

EdgeFlow is a method that enhances Vision Language Models (VLMs) for converting static flowchart images into machine-readable Mermaid code, targeting industrial requirements engineering. By augmenting VLM input with a deterministically extracted Canny edge map as a structural prior, it improves topology-critical detail capture without requiring annotated training data or fine-tuning. Evaluated on the IndusReqFlow dataset from real-world requirements, EdgeFlow achieves node-level F1 improvement of 17.39 percentage points, edge-level F1 improvement of 16.94 percentage points, and path-level F1 improvement of 11.06 percentage points over off-the-shelf VLMs. The approach addresses failures in direct VLM application to flowchart conversion, enabling better support for model-based requirements engineering activities.

Key facts

EdgeFlow augments VLM input with a Canny edge map as a structural prior.
No annotated training data or domain-specific fine-tuning required.
Evaluated on IndusReqFlow dataset sourced from real-world requirements.
Node-level F1 improved by 17.39 percentage points.
Edge-level F1 improved by 16.94 percentage points.
Path-level F1 improved by 11.06 percentage points.
Targets conversion of static flowchart images to machine-readable Mermaid code.
Addresses topology-critical visual detail failures in direct VLM application.

Entities

—

Sources

arXiv cs.AI — 2026-05-27