Benchmarking Deep Learning Models for 10m to 2.5m Upsampling
2026-04-04
The problem
The opportunity
Core idea: Learn a mapping \(f: I_{LR} \rightarrow I_{SR}\) from low-resolution to high-resolution images.
Three main paradigms:
LR Image (10m)
│
Feature Extraction
│
├─ CNN ──────── L1/L2 loss
│ │
├─ GAN ──────── Adversarial loss
│ │
├─ Diffusion ── Score matching
│ │
▼ ▼
SR Output (2.5m)
Used by: SEN2SRLite, EvoLand, DiffFuSR, OpenSR, L1BSR
Used by: Satlas (8 images), WorldStrat (8 images)
Multi-image SR has a theoretical advantage: information from multiple views can resolve ambiguities that no single image can.
| Model | Type | Bands | Input | Output | Architecture |
|---|---|---|---|---|---|
| SEN2SRLite | CNN | RGBN | 1 image | 2.5m | Lightweight CNN via mlstac |
| EvoLand | CNN (ONNX) | RGBN | 1 image | 2.5m | ONNX-deployed spatial SR |
| L1BSR | CNN | RGBN | 1 image | 5m | Registration-aware SR |
| DiffFuSR | Diffusion | RGB | 1 image | 2.5m | Score-based diffusion |
| OpenSR | Diffusion | RGBN | 1 image | 2.5m | Latent diffusion (LDSR) |
| Satlas | GAN (ESRGAN) | RGB | 8 images | 2.5m | Multi-temporal ESRGAN |
Setup:
S2A_MSIL2A_20241016T072901






Input (10m → upscaled)

SEN2SRLite (2.5m)

OpenSR (2.5m)

Both resolve building edges and road network well; SEN2SRLite appears slightly smoother while OpenSR adds finer texture detail.
Input (10m → upscaled)

SEN2SRLite (2.5m)

EvoLand (2.5m)

Very similar sharpness; EvoLand has slightly darker shadows and higher contrast compared to SEN2SRLite’s more neutral tone.
Input (10m → upscaled)

DiffFuSR (2.5m)

OpenSR (2.5m)

Both diffusion models produce sharp results; DiffFuSR has slightly warmer tones, while OpenSR preserves a cooler radiometry closer to the input.
Input (10m → upscaled)

DiffFuSR (2.5m)

Satlas (2.5m, 8-image)

Satlas aggregates 8 dates producing a noticeably different color tone; DiffFuSR is sharper on single-date detail while Satlas resolves road markings better from temporal fusion.
Input (10m → upscaled)

L1BSR (5m)

SEN2SRLite (2.5m)

L1BSR outputs 5m (2x) rather than 2.5m (4x), so it appears slightly softer; building outlines are resolved but fine urban texture is less detailed.
| Model | Observation |
|---|---|
| SEN2SRLite | Clean sharpening with neutral colors; best all-round edge definition for buildings and roads |
| OpenSR | Fine texture detail from diffusion; slightly cooler tone than input |
| EvoLand | Comparable to SEN2SRLite in sharpness; darker shadows, higher contrast |
| DiffFuSR | Natural-looking textures with warm tones; RGB-only limits multispectral use |
| L1BSR | Good for 5m but noticeably softer than the 2.5m models on fine urban structure |
| Satlas | Multi-date fusion produces distinct color tone; resolves some details others miss but introduces temporal averaging |
| Model | Runtime | GPU Mem | CPU Mem | Output |
|---|---|---|---|---|
| SEN2SRLite | 9.1 s | 0.44 GB | 1.12 GB | 2.5m, 4-band |
| L1BSR | 9.0 s | 1.20 GB | – | 5m, 4-band |
| EvoLand | 29.6 s | 0 GB | 1.76 GB | 2.5m, 4-band |
| Satlas | 34.8 s | 0.59 GB | 1.03 GB | 2.5m, 3-band |
| DiffFuSR | 83.2 s | 9.75 GB | 4.03 GB | 2.5m, 3-band |
| OpenSR | 136.8 s | 4.25 GB | 2.47 GB | 2.5m, 4-band |
Key takeaway: CNN models are 10–15x faster than diffusion models with fraction of GPU memory.
| Model | Complexity |
|---|---|
| EvoLand | Low – ONNX runtime only |
| SEN2SRLite | Low – pip install |
| L1BSR | Medium – repo clone |
| Satlas | High – preprocessing pipeline |
| DiffFuSR | High – manual checkpoint wiring |
| OpenSR | High – custom low-mem wrapper |
| Use Case | Best Model | Why |
|---|---|---|
| Default baseline | SEN2SRLite | Fast (9s), low memory, 4-band output |
| Operational deployment | EvoLand | ONNX-based, no GPU required, deterministic |
| Multi-temporal analysis | Satlas | Only working multi-image model |
| Highest perceptual quality | DiffFuSR | Best textures, but RGB-only & GPU-heavy |
| Modern diffusion pipeline | OpenSR | Geospatial-aware tiling, but slowest |
| 5m is sufficient | L1BSR | Fastest, lightweight |
For urban regions, SEN2SRLite offers the best balance of sharpness, speed, and band coverage.
Aman Bagrecha
All code, data, and benchmarks available in this repository.

amanbagrecha.com