SkyPart: Prototype-Based Semantic Part Discovery for Weather-Robust Cross-View Geo-Localization
A new method called SkyPart improves cross-view geo-localization (CVGL) for autonomous drone navigation when GNSS signals are unavailable. CVGL matches oblique drone views to geo-referenced satellite tiles, but existing approaches suffer from compressing patch grids into single vectors without separating layout from texture, retaining altitude-related scale variation, and relying on hand-tuned scalars for multi-objective training. SkyPart is a lightweight swappable head for patch-based vision transformers that institutes explicit part grouping over the patch grid. It features four components: learnable prototypes competing for patch tokens via single-pass cosine assignment, altitude-conditioned linear modulation, and other theory-grounded mechanisms. The method is designed to be robust to weather conditions, addressing a key limitation in prior work. The paper is available on arXiv under identifier 2605.11654.
Key facts
- SkyPart is proposed for cross-view geo-localization (CVGL).
- CVGL matches oblique drone views to geo-referenced satellite tiles.
- It is an alternative for autonomous drone navigation when GNSS signals are jammed or unavailable.
- Existing methods have three limitations: global-descriptor designs, altitude-related scale variation, and hand-tuned scalars.
- SkyPart is a lightweight swappable head for patch-based vision transformers.
- It uses learnable prototypes competing for patch tokens via single-pass cosine assignment.
- It includes altitude-conditioned linear modulation.
- The paper is on arXiv with ID 2605.11654.
Entities
Institutions
- arXiv