ARTFEED — Contemporary Art Intelligence

SkyPart: Prototype-Based Semantic Part Discovery for Weather-Robust Cross-View Geo-Localization

ai-technology · 2026-05-13

A new method called SkyPart improves cross-view geo-localization (CVGL) for autonomous drone navigation when GNSS signals are unavailable. CVGL matches oblique drone views to geo-referenced satellite tiles, but existing approaches suffer from compressing patch grids into single vectors without separating layout from texture, retaining altitude-related scale variation, and relying on hand-tuned scalars for multi-objective training. SkyPart is a lightweight swappable head for patch-based vision transformers that institutes explicit part grouping over the patch grid. It features four components: learnable prototypes competing for patch tokens via single-pass cosine assignment, altitude-conditioned linear modulation, and other theory-grounded mechanisms. The method is designed to be robust to weather conditions, addressing a key limitation in prior work. The paper is available on arXiv under identifier 2605.11654.

Key facts

  • SkyPart is proposed for cross-view geo-localization (CVGL).
  • CVGL matches oblique drone views to geo-referenced satellite tiles.
  • It is an alternative for autonomous drone navigation when GNSS signals are jammed or unavailable.
  • Existing methods have three limitations: global-descriptor designs, altitude-related scale variation, and hand-tuned scalars.
  • SkyPart is a lightweight swappable head for patch-based vision transformers.
  • It uses learnable prototypes competing for patch tokens via single-pass cosine assignment.
  • It includes altitude-conditioned linear modulation.
  • The paper is on arXiv with ID 2605.11654.

Entities

Institutions

  • arXiv

Sources