Computer Science > Computers and Society

arXiv:2508.21738 (cs)

[Submitted on 29 Aug 2025 (v1), last revised 3 Nov 2025 (this version, v2)]

Title:From Drone Imagery to Livability Mapping: AI-powered Environment Perception in Rural China

Authors:Weihuan Deng, Yaofu Huang, Luan Chen, Xun Li, Yu Gu, Yao Yao

View PDF

Abstract:The high cost of acquiring rural street view images has constrained comprehensive environmental perception in rural areas. Drone photographs, with their advantages of easy acquisition, broad coverage, and high spatial resolution, offer a viable approach for large-scale rural environmental perception. However, a systematic methodology for identifying key environmental elements from drone photographs and quantifying their impact on environmental perception remains lacking. To address this gap, a Vision-Language Contrastive Ranking Framework (VLCR) is designed for rural livability assessment in China. The framework employs chain-of-thought prompting strategies to guide multimodal large language models (MLLMs) in identifying visual features related to quality of life and ecological habitability from drone photographs. Subsequently, to address the instability in pairwise village comparison, a text description-constrained drone photograph comparison strategy is proposed. Finally, to overcome the efficiency bottleneck in nationwide pairwise village comparisons, an innovation ranking algorithm based on binary search interpolation is developed, which reduces the number of comparisons through automated selection of comparison targets. The proposed framework achieves superior performance with a Spearman Footrule distance of 0.74, outperforming mainstream commercial MLLMs by approximately 0.1. Moreover, the mechanism of concurrent comparison and ranking demonstrates a threefold enhancement in computational efficiency. Our framework has achieved data innovation and methodological breakthroughs in village livability assessment, providing strong support for large-scale village livability analysis.
Keywords: Drone photographs, Environmental perception, Rural livability assessment, Multimodal large language models, Chain-of-thought prompting.

Subjects:	Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2508.21738 [cs.CY]
	(or arXiv:2508.21738v2 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2508.21738

Submission history

From: Weihuan Deng [view email]
[v1] Fri, 29 Aug 2025 16:04:06 UTC (4,328 KB)
[v2] Mon, 3 Nov 2025 03:53:42 UTC (4,126 KB)

Computer Science > Computers and Society

Title:From Drone Imagery to Livability Mapping: AI-powered Environment Perception in Rural China

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:From Drone Imagery to Livability Mapping: AI-powered Environment Perception in Rural China

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators