RS-Paper-Hub — UAV Papers

RS-Paper-Hub — UAV Papers https://rspaper.top/output/feed_uav.xml 2026-04-20T10:19:45Z Latest remote sensing papers (last 7 days) — 1 entries RS-Paper-Hub https://rspaper.top PixDLM: A Dual-Path Multimodal Language Model for UAV Reasoning Segmentation http://arxiv.org/abs/2604.15670v1 2026-04-17T00:00:00Z 2026-04-17T00:00:00Z Shuyan Ke Yifan Mei Changli Wu Yonghan Zheng Jiayi Ji Liujuan Cao Rongrong Ji

Reasoning segmentation has recently expanded from ground-level scenes to remote-sensing imagery, yet UAV data poses distinct challenges, including oblique viewpoints, ultra-high resolutions, and extreme scale variations. To address these issues, we formally define the UAV Reasoning Segmentation task and organize its semantic requirements into three dimensions: Spatial, Attribute, and Scene-level reasoning. Based on this formulation, we construct DRSeg, a large-scale benchmark for UAV reasoning segmentation, containing 10k high-resolution aerial images paired with Chain-of-Thought QA supervision across all three reasoning types. As a benchmark companion, we introduce PixDLM, a simple yet effective pixel-level multimodal language model that serves as a unified baseline for this task. Experiments on DRSeg establish strong baseline results and highlight the unique challenges of UAV reasoning segmentation, providing a solid foundation for future research.

Publication: CVPR 2026 Category: Method