ARTFEED — Contemporary Art Intelligence

Task Aggregation Strategies for Ultrasound Foundation Models

other · 2026-05-25

A recent investigation published on arXiv (2603.18123) explores the reasons behind the underperformance of unified ultrasound foundation models compared to task-specific benchmarks. The researchers suggest that this decline in performance may result from task aggregation methods that overlook the interplay between the diversity of tasks and the scale of training data. They present M2DINO, a framework designed for multiple organs and tasks, which utilizes DINOv3 and incorporates task-conditioned Mixture-of-Experts blocks for flexible capacity management. The framework was tested across 27 ultrasound tasks, including segmentation, classification, detection, and regression, under three different paradigms: task-specific, clinically-grouped, and all-task. The findings provide guidelines for effectively learning heterogeneous ultrasound tasks together without sacrificing performance.

Key facts

  • arXiv paper 2603.18123
  • Title: Understanding Task Aggregation for Generalizable Ultrasound Foundation Models
  • Announce Type: replace-cross
  • Hypothesis: degradation from task aggregation strategies
  • Introduces M2DINO framework
  • Built on DINOv3 with task-conditioned Mixture-of-Experts
  • Evaluated on 27 ultrasound tasks
  • Tasks include segmentation, classification, detection, regression

Entities

Institutions

  • arXiv

Sources