EdgeFlow: Edge-Map Augmented VLM for Flowchart Processing
EdgeFlow is a method that enhances Vision Language Models (VLMs) for converting static flowchart images into machine-readable Mermaid code, targeting industrial requirements engineering. By augmenting VLM input with a deterministically extracted Canny edge map as a structural prior, it improves topology-critical detail capture without requiring annotated training data or fine-tuning. Evaluated on the IndusReqFlow dataset from real-world requirements, EdgeFlow achieves node-level F1 improvement of 17.39 percentage points, edge-level F1 improvement of 16.94 percentage points, and path-level F1 improvement of 11.06 percentage points over off-the-shelf VLMs. The approach addresses failures in direct VLM application to flowchart conversion, enabling better support for model-based requirements engineering activities.
Key facts
- EdgeFlow augments VLM input with a Canny edge map as a structural prior.
- No annotated training data or domain-specific fine-tuning required.
- Evaluated on IndusReqFlow dataset sourced from real-world requirements.
- Node-level F1 improved by 17.39 percentage points.
- Edge-level F1 improved by 16.94 percentage points.
- Path-level F1 improved by 11.06 percentage points.
- Targets conversion of static flowchart images to machine-readable Mermaid code.
- Addresses topology-critical visual detail failures in direct VLM application.
Entities
—