Overview
RS Paper Hub is an automated system that collects, filters, and categorizes remote sensing papers from arXiv. It helps researchers stay updated with the latest developments in remote sensing, vision-language models, and AI agents.
The project provides multiple ways to access papers: a web interface, RSS feeds, and direct JSON downloads. All papers are automatically tagged with categories, tasks, and VLM-related keywords.
Features
Daily Automated Updates
Papers are fetched and processed automatically every day, ensuring you always have access to the latest research.
Task Tagging (11 types)
Each paper is automatically tagged with one or more task types: Classification, Retrieval, Geolocation, VQA, Captioning, Change Detection, Object Detection, Visual Grounding, Segmentation, Super-Resolution, and 3D Reconstruction.
VLM Filtering
Papers related to Vision-Language Models are automatically detected and can be filtered separately.
Agent Filtering
Papers related to AI Agents in remote sensing are automatically identified and can be filtered separately.
Paper Classification
Papers are classified into Method, Survey, or Dataset categories using rule-based patterns.
Paper Collection & BibTeX Export
Collect favorite papers and export them as BibTeX for easy reference management.
RSS Feeds & Zotero
Subscribe to RSS feeds for all papers, VLM papers, or Agent papers. Integrate with Zotero for automatic bibliography management.
Clickable Tag Filtering
Click on any tag (date, type, category, task, VLM, etc.) on a paper card to instantly filter by that value. Multiple tags can be stacked for progressive filtering, and clicking the same tag again removes the filter.
Group Export & Sharing
Export selected papers as a Group JSON file with a custom name. Groups appear in the "All Groups" dropdown for quick filtering. Submit your group via Pull Request to share it with the community.
Web Viewer Guide
The web interface provides powerful tools for exploring and filtering papers:
Four-Tab Data Source Switching
Use the tabs at the top to switch between: All Papers, UAV papers, Vision-Language Model papers, and Agent papers. The selection is preserved in the URL for sharing.
Search & Filtering
- Search by title, author, or abstract with exact phrase or fuzzy (split-word) matching
- Filter by year, month, paper type, category, VLM keywords, and task tags
- Quick filters for "Today" and "This Week" to see recently published papers
- Toggle "With Code" to show only papers with available code
Clickable Tag Filtering
- Click any tag on a paper card (date, type, category, task, VLM keyword, etc.) to filter all papers by that value
- Click different tags to stack filters progressively — e.g., click "Computer Vision" then "Classification" to find papers matching both
- Click the same tag again to remove that filter; active tags are highlighted with a border
Charts & Visualization
Click "Charts" to view distribution visualizations. Click on bars to filter by that category. Multiple selections are supported.
Paper Actions
- Click on paper title to open the arXiv page
- Click on abstract to expand/collapse
- Collect papers to your personal collection
- Export filtered results or collection as BibTeX
- View BibTeX citation or open in Google Scholar
Recent Papers Panel
Click the "NEW PAPERS" button on the right side to view papers published today or this week.
RSS & Zotero
RS Paper Hub provides RSS feeds that can be used with any RSS reader or integrated with Zotero:
Available RSS Feeds
- All Papers:
output/feed.xml - VLM Papers:
output/feed_vlm.xml - Agent Papers:
output/feed_agent.xml - UAV Papers:
output/feed_uav.xml
Using with Zotero
In Zotero, go to File → New Feed, and paste one of the RSS URLs above. Zotero will automatically fetch new papers as they are added.
Direct JSON Download
For programmatic access, JSON files are also available:
output/papers.json- All papersoutput/papers_vlm.json- VLM papersoutput/papers_agent.json- Agent papers
Pipeline
The data processing pipeline consists of 11 steps:
- Fetch arXiv: Query the arXiv API for recent submissions in relevant categories
- Filter RS: Apply keyword-based filtering to identify remote sensing papers
- Extract Metadata: Parse paper titles, abstracts, authors, and dates
- VLM Detection: Use regex patterns to identify vision-language model related papers
- Agent Detection: Identify AI agent related papers in remote sensing
- Task Tagging: Classify papers into 11 task categories based on content
- Classification: Categorize as Method, Survey, or Dataset
- Generate Feeds: Create RSS feeds and JSON outputs
- Deploy: Update the website with new data
CLI Reference
Run the pipeline using Python:
python main.py [--category CAT] [--max-results N] [--output DIR]
Arguments
| Argument | Description | Default |
|---|---|---|
--category, -c | arXiv category to search (e.g., cs.CV, eess.IV) | cs.CV |
--max-results, -m | Maximum number of papers to fetch | 300 |
--output, -o | Output directory for JSON files | output/ |
--keywords, -k | Additional keywords for filtering (comma-separated) | remote sensing |
--vlm-keywords | JSON file with VLM keyword patterns | config/vlm_keywords.json |
--task-tags | JSON file with task tag patterns | config/task_tags.json |
--dry-run | Show what would be done without making changes | False |
--verbose, -v | Enable verbose logging | False |
Examples
# Fetch latest papers
python main.py
# Fetch with custom category and limit
python main.py --category eess.IV --max-results 500
# Dry run to see what would be fetched
python main.py --dry-run
# Verbose output
python main.py --verbose
Output Schema
Each paper entry in the JSON output contains the following fields:
| Field | Type | Description |
|---|---|---|
Paper_link | string | URL to the arXiv paper page |
Title | string | Paper title |
Authors | string | Comma-separated list of authors |
Abstract | string | Paper abstract (may contain LaTeX) |
Date | string | Submission date (YYYY-MM-DD) |
Year | integer | Publication year |
Month | integer | Publication month (1-12) |
Type | string | Paper type (e.g., cs.CV, eess.IV) |
Subtype | string | Paper subtype or arXiv ID |
Category | string | Classification: Method, Survey, or Dataset |
Publication | string | Conference or journal name if published |
code | string | URL to code repository if available |
BibTex | string | BibTeX citation entry |
_is_vlm | boolean | Whether paper is VLM-related |
_vlm_keywords | string | Semicolon-separated VLM keywords matched |
_is_agent | boolean | Whether paper is Agent-related |
_tasks | string | Semicolon-separated task tags |
Submit Group
You can create a custom paper group to curate and share a reading list with the community. Groups appear in the "All Groups" dropdown on the main page.
1. Prepare your paper list
Create a JSON file containing an array of arXiv paper links. You can use the "Export Group" button in the Export dialog (By author). Or create it manually — each link should be a full arXiv abstract URL:
[
"http://arxiv.org/abs/2401.12345v1",
"http://arxiv.org/abs/2401.67890v2"
]2. Register your group
Add an entry to groups/index.json with your group's meta
[
{
"key": "my-group",
"label": "My Research Group",
"label_zh": "我的研究组",
"file": "my-group.json"
}
]| Field | Description |
|---|---|
key | Unique identifier for your group (use lowercase and hyphens) |
label | Display name in English |
label_zh | Display name in Chinese (optional) |
file | Filename of your paper list JSON (placed in the groups/ directory) |
3. Submit a Pull Request
Fork the repository, add your JSON file to the groups/ directory, update groups/index.json, and submit a Pull Request. Once merged, your group will appear in the dropdown for all users.
