NTIRE2026 Infrared Super-Resolution Models
Pre-trained models for the NTIRE2026 Infrared Image Super-Resolution (x4) Challenge.
Competition
| Item | Detail |
|---|---|
| Task | Single-image super-resolution (x4) for infrared remote sensing |
| Metric | Score = PSNR + 20 x SSIM (intensity channel, 4px border shave) |
| Dataset | 919 train, 52 val, 222 test infrared images |
Available Models
| Model | File | Score | PSNR | SSIM | Params | Architecture |
|---|---|---|---|---|---|---|
| RFRSR v10 (split) | rfrsr_v10_split_iter46k.pth |
51.57 | 33.88 | 0.8822 | 2.05M | Recurrent Feature Refinement |
| RFRSR v2 | rfrsr_v2_iter250.pth |
27.59 | 15.89 | 0.5806 | 2.05M | Recurrent Feature Refinement |
| MambaOutRS v12 | mambaoutrs_v12_iter500.pth |
27.52 | 16.03 | 0.5811 | 2.96M | Gated CNN + Fourier Filter Gate |
| MambaOutRS v10 | mambaoutrs_v10_iter500.pth |
25.64 | 14.81 | 0.5372 | 2.96M | Gated CNN + Fourier Filter Gate |
| HSRMamba | hsrmamba_iter20k.pth |
25.56 | 14.74 | 0.5365 | 2.4M | Context-SSM + Spectral Reordering |
Usage
import torch
from PIL import Image
import numpy as np
# Load model (example with RFRSR)
checkpoint = torch.load("rfrsr_v10_split_iter46k.pth", map_location="cpu")
# For full inference pipeline with TTA and post-processing:
# See https://github.com/danghoangnhan/NTIRE2026
Full Inference with Submission Creator
git clone https://github.com/danghoangnhan/NTIRE2026.git
cd NTIRE2026
pip install -e .
python src/create_submission.py \
--input-dir data/test_LR_X4/X4 \
--output-dir output/ \
--weights-path rfrsr_v10_split_iter46k.pth \
--arch-config src/options/train/train_rfr_sr_x4_v10_split.yml \
--tta
Architecture Details
RFRSR (Best)
- 3-iteration recurrent feature refinement loop
- 6 residual blocks, 48 channels per stage
- PixelShuffle 4x upsampling
- Only 2.05M parameters
MambaOutRS
- Gated CNN with Fourier Filter Gate (no SSM)
- 4-stage [6,6,6,6] block design, 48 embedding dims
- 2.96M parameters
HSRMamba
- Context Selective State Space Model
- Global Spectral Reordering for token reorganization
- Uniquely stable (improves past 20k iterations)
- 2.4M parameters
Key Findings
- Smaller models win: 2.05M RFRSR > 9.5M MiM-ISTD
- SSIM in loss is critical: Prevents catastrophic training collapse
- Models peak early (250-1000 iters), then degrade
- TTA adds +0.1-0.3 dB for free at test time
Training
All models trained with:
- Loss: Charbonnier + SSIM + Gradient + FFT + SWT (IRSRCombinedLoss)
- Optimizer: Adam
- Framework: BasicSR
- Input: Grayscale infrared images (
force_gray: true)
License
MIT
Citation
@misc{ntire2026_infraredsr,
title={NTIRE2026 Infrared Super-Resolution: A Study of Architectures and Training Strategies},
author={Daniel Ho},
year={2026},
url={https://github.com/danghoangnhan/NTIRE2026}
}