<?xml version='1.0' encoding='utf-8'?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>RS-Paper-Hub — UAV Papers</title>
  <id>https://rspaper.top/output/feed_uav.xml</id>
  <link href="https://rspaper.top/output/feed_uav.xml" rel="self" type="application/atom+xml" />
  <link href="https://rspaper.top" rel="alternate" type="text/html" />
  <updated>2026-05-18T02:08:55Z</updated>
  <subtitle>Latest remote sensing papers (last 7 days) — 2 entries</subtitle>
  <author>
    <name>RS-Paper-Hub</name>
    <uri>https://rspaper.top</uri>
  </author>
  <entry>
    <title>ELDOR: A Dataset and Benchmark for Illegal Gold Mining in the Amazon Rainforest</title>
    <link href="http://arxiv.org/abs/2605.15397v1" rel="alternate" type="text/html" />
    <id>http://arxiv.org/abs/2605.15397v1</id>
    <published>2026-05-14T00:00:00Z</published>
    <updated>2026-05-14T00:00:00Z</updated>
    <author>
      <name>Kangning Cui</name>
    </author>
    <author>
      <name>Surendra Bohara</name>
    </author>
    <author>
      <name>Suraj Prasai</name>
    </author>
    <author>
      <name>Zishan Shao</name>
    </author>
    <author>
      <name>Wei Tang</name>
    </author>
    <author>
      <name>Martin Pillaca</name>
    </author>
    <author>
      <name>Edwin Flores</name>
    </author>
    <author>
      <name>Zhen Yang</name>
    </author>
    <author>
      <name>Gregory Larsen</name>
    </author>
    <author>
      <name>Evan Dethier</name>
    </author>
    <author>
      <name>David Lutz</name>
    </author>
    <author>
      <name>Jean-Michel Morel</name>
    </author>
    <author>
      <name>Miles Silman</name>
    </author>
    <author>
      <name>Victor Pauca</name>
    </author>
    <author>
      <name>Fan Yang</name>
    </author>
    <summary type="text">Illegal gold mining in the Amazon rainforest causes deforestation, water contamination, and long-term ecosystem disruption, yet remains difficult to monitor at fine spatial scales. Satellite imagery supports large-scale observation, but often misses small mining-related structures and subtle land-cover transitions, especially under frequent cloud cover. We introduce ELDOR, a large-scale UAV benchmark for monitoring environmental and landscape disturbance from illegal gold mining in the rainforest. ELDOR contains manually annotated orthomosaic imagery covering over 2,500 hectares, with pixel-level semantic labels for both mining-related activities and surrounding ecological structures. With this unified annotation source, we establish four benchmark tasks: semantic segmentation, segmentation-derived recognition, direct multi-label classification, and class-presence recognition with vision-language models. Across these tasks, we compare generic and remote-sensing-specific segmentation models, vision foundation model-related segmentation methods, direct multi-label classification methods, and vision-language models under a controlled closed-set protocol. Results show that current methods still struggle with rare small-scale mining structures and fine-grained recovery classes, suggesting the need for context-aware and multimodal modeling. To support domain analysis and practical use, we further build an interactive explorer for domain experts that provides a unified interface for data exploration and model inference.</summary>
    <content type="html">&lt;p&gt;&lt;strong&gt;Publication:&lt;/strong&gt; 70 pages, 35 figures&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Category:&lt;/strong&gt; Dataset&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tasks:&lt;/strong&gt; CLS;SEG&lt;/p&gt;</content>
    <category term="Computer Vision" />
  </entry>
  <entry>
    <title>Multimodal Object Detection Under Sparse Forest-Canopy Occlusion</title>
    <link href="http://arxiv.org/abs/2605.15326v1" rel="alternate" type="text/html" />
    <id>http://arxiv.org/abs/2605.15326v1</id>
    <published>2026-05-14T00:00:00Z</published>
    <updated>2026-05-14T00:00:00Z</updated>
    <author>
      <name>Nitik Jain</name>
    </author>
    <author>
      <name>Mangal Kothari</name>
    </author>
    <summary type="text">Reliable detection of humans beneath forest canopy remains a difficult remote-sensing challenge due to sparse, structured, and viewpoint-dependent occlusion. This paper presents a multimodal proof-of-concept pipeline that integrates three complementary approaches: (i) experimental evaluation of LiDAR returns through vegetation to assess the feasibility of active sensing, (ii) visible--thermal image fusion using a multi-scale transform and sparse-representation framework to enhance human saliency, and (iii) synthetic-aperture image formation via Airborne Optical Sectioning (AOS) to suppress canopy clutter. A YOLOv5 detector is fine-tuned on the Teledyne FLIR thermal dataset and evaluated on thermal and fused imagery. Results show that the tested terrestrial LiDAR configuration provides limited penetration for object-level detection, while visible--thermal fusion improves target visibility in low-contrast scenes and AOS enhances ground-plane detection in synthetic forest imagery. The fine-tuned YOLOv5 achieves a mean average precision of $\sim$0.83 on the top three FLIR classes. These findings establish an initial baseline for UAV-deployable search-and-rescue and surveillance systems operating in forested environments, and motivate future work on dedicated forest datasets and real-time multimodal integration.</summary>
    <content type="html">&lt;p&gt;&lt;strong&gt;Category:&lt;/strong&gt; Method&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tasks:&lt;/strong&gt; OD&lt;/p&gt;</content>
    <category term="Computer Vision" />
  </entry>
</feed>