<?xml version='1.0' encoding='utf-8'?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>RS-Paper-Hub — UAV Papers</title>
  <id>https://rspaper.top/output/feed_uav.xml</id>
  <link href="https://rspaper.top/output/feed_uav.xml" rel="self" type="application/atom+xml" />
  <link href="https://rspaper.top" rel="alternate" type="text/html" />
  <updated>2026-04-20T10:19:45Z</updated>
  <subtitle>Latest remote sensing papers (last 7 days) — 1 entries</subtitle>
  <author>
    <name>RS-Paper-Hub</name>
    <uri>https://rspaper.top</uri>
  </author>
  <entry>
    <title>PixDLM: A Dual-Path Multimodal Language Model for UAV Reasoning Segmentation</title>
    <link href="http://arxiv.org/abs/2604.15670v1" rel="alternate" type="text/html" />
    <id>http://arxiv.org/abs/2604.15670v1</id>
    <published>2026-04-17T00:00:00Z</published>
    <updated>2026-04-17T00:00:00Z</updated>
    <author>
      <name>Shuyan Ke</name>
    </author>
    <author>
      <name>Yifan Mei</name>
    </author>
    <author>
      <name>Changli Wu</name>
    </author>
    <author>
      <name>Yonghan Zheng</name>
    </author>
    <author>
      <name>Jiayi Ji</name>
    </author>
    <author>
      <name>Liujuan Cao</name>
    </author>
    <author>
      <name>Rongrong Ji</name>
    </author>
    <summary type="text">Reasoning segmentation has recently expanded from ground-level scenes to remote-sensing imagery, yet UAV data poses distinct challenges, including oblique viewpoints, ultra-high resolutions, and extreme scale variations. To address these issues, we formally define the UAV Reasoning Segmentation task and organize its semantic requirements into three dimensions: Spatial, Attribute, and Scene-level reasoning. Based on this formulation, we construct DRSeg, a large-scale benchmark for UAV reasoning segmentation, containing 10k high-resolution aerial images paired with Chain-of-Thought QA supervision across all three reasoning types. As a benchmark companion, we introduce PixDLM, a simple yet effective pixel-level multimodal language model that serves as a unified baseline for this task. Experiments on DRSeg establish strong baseline results and highlight the unique challenges of UAV reasoning segmentation, providing a solid foundation for future research.</summary>
    <content type="html">&lt;p&gt;&lt;strong&gt;Publication:&lt;/strong&gt; CVPR 2026&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Category:&lt;/strong&gt; Method&lt;/p&gt;</content>
    <category term="Computer Vision" />
  </entry>
</feed>