100 × Resolution Enhancement
We enhance the spatial resolution of predicted DEMs by a factor of 100, from 30 m to 30 cm, surpassing previous attempts by an order of magnitude.
1Ben-Gurion University of the Negev
*Corresponding author: osher@bgu.ac.il
High-resolution elevation estimations are essential to understand catchment and hillslope hydrology, study urban morphology and dynamics, and monitor the growth, decline, and mortality of terrestrial ecosystems. Various deep learning approaches (e.g., super-resolution techniques, monocular depth estimation) have been developed to create high-resolution Digital Elevation Models (DEMs). However, super-resolution techniques are limited by the upscaling factor, and monocular depth estimation lacks global elevation context, making its conversion to a seamless DEM restricted. The recently introduced technique of prompt-based monocular depth estimation has opened new opportunities to extract estimates of absolute elevation in a global context. We present here a framework for the estimation of high-resolution DEMs as a new paradigm for absolute global elevation mapping. It is exemplified using low-resolution Shuttle Radar Topography Mission (SRTM) elevation data as prompts and high-resolution RGB imagery from the National Agriculture Imagery Program (NAIP). The approach fine-tunes a vision transformer encoder with LiDAR-derived DEMs and employs a versatile prompting strategy, enabling tasks such as DEM estimation, void filling, and updating. Our framework achieves a 100x resolution gain (from 30-m to 30-cm), surpassing prior methods by an order of magnitude. Evaluations across three diverse U.S. landscapes show robust generalization, capturing urban structures and fine-scale terrain features with < 5 m MAE relative to LiDAR, improving over SRTM by up to 18%. Hydrological analysis confirms suitability for hazard and environmental studies. We demonstrate scalability by applying the framework to large regions in the U.S. and Israel.
We enhance the spatial resolution of predicted DEMs by a factor of 100, from 30 m to 30 cm, surpassing previous attempts by an order of magnitude.
We leverage freely available SRTM DEMs as absolute-height prompts, ensuring a globally consistent elevation context.
We blend patch-wise Vision Transformer predictions into seamless mosaics that are ready for slope, aspect, and flow-routing analyses.
Processing ≈ 150 km² h-1 on a single GPU; and achieving up to an 18% improvement in vertical accuracy compared with the original SRTM dataset.
RGB
Elevation
Aspect
Hillshade
Slope
RGB
Elevation
Aspect
Hillshade
Slope
RGB
Elevation
Aspect
Hillshade
Slope
We thank the Ministry of Agriculture Chief Scientist (grant 16-17-0005, 2022) and the Negev Scholarship of the Kreitman School, Ben-Gurion University of the Negev, for supporting Osher Rafaeli’s PhD studies.
@misc{rafaeli2025prompt2demhighresolutiondemsurban,
title={Prompt2DEM: High-Resolution DEMs for Urban and Open Environments from Global Prompts Using a Monocular Foundation Model},
author={Osher Rafaeli and Tal Svoray and Ariel Nahlieli},
year={2025},
eprint={2507.09681},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2507.09681},
}
This page was built using the Academic Project Page Template, which was adopted from the Nerfies project page.