Advanced Search

Indexed by SCI、CA、РЖ、PA、CSA、ZR、etc .

Turn off MathJax
Article Contents
Weitao Chen, Zi Li, Haoyi Wang, Wenxi He, Zhengchao Chen, Jun Li. HTransNet: A Hierarchical Transformer Network with Dual Attention for Large-Scale Mining Scene Classification Using Multi-Category HighResolution Remote Sensing Dataset. Journal of Earth Science. doi: 10.1007/s12583-025-0335-x
Citation: Weitao Chen, Zi Li, Haoyi Wang, Wenxi He, Zhengchao Chen, Jun Li. HTransNet: A Hierarchical Transformer Network with Dual Attention for Large-Scale Mining Scene Classification Using Multi-Category HighResolution Remote Sensing Dataset. Journal of Earth Science. doi: 10.1007/s12583-025-0335-x

HTransNet: A Hierarchical Transformer Network with Dual Attention for Large-Scale Mining Scene Classification Using Multi-Category HighResolution Remote Sensing Dataset

doi: 10.1007/s12583-025-0335-x
Funds:

in part by the Opening Fund of Key Laboratory of Geological Survey and Evaluation of Ministry of Education under Grant No. GLAB2024ZR01.

supported by the Fundamental Research Funds for the Natural Science Foundation of China under Grant No. U1803117, No. T2225019

the China Geological Survey's geological survey project under Grant NO. DD20230104

supported by the High-performance GPU Server (TX321203) Computing Centre of the National Education Field Equipment Renewal and Renovation Loan Financial Subsidy Project of China University of Geosciences, Wuhan

  • Available Online: 20 Aug 2025
  • Remote sensing-based mining scene classification plays a pivotal role in monitoring mineral extraction activities and evaluating sustainable development practices. Nevertheless, this critical task confronts two principal challenges: (1) the scarcity of diversified remote sensing datasets encompassing multiple mining types, and (2) the limitations of conventional models in processing complex spatial characteristics inherent to mining landscapes, including sparse target distribution, multi-scale variations, high inter-class similarity, and heterogeneous feature mixing. To address these limitations, this study presents CUG_MA, a novel large-scale mining scene dataset characterized by multi-category coverage and high spatial resolution (0.5-2 m). Comprising 2,649 annotated samples across nine representative mining types from Hubei, Jiangxi, and Heilongjiang provinces, this dataset systematically captures diverse geological and geographical characteristics of Chinese mining regions. We further propose a hierarchical Transformer architecture (HTransNet) incorporating dual attention mechanisms for enhanced mining scene recognition. The model employs multi-head self-attention to establish global contextual relationships across varying scales while utilizing cross-level attention fusion to hierarchically integrate multi-stage features, effectively addressing scale discrepancies and enhancing discriminative feature representation. Comprehensive experiments demonstrate that HTransNet achieves superior performance with 69.4% mean accuracy on CUG_MA, outperforming eight state-of-the-art models by a 5.0% margin. Notably, all benchmark models attained acceptable classification performance, substantiating the dataset's robustness and generalization capability. This research provides both a valuable benchmark dataset and an advanced framework for intelligent monitoring of mining ecosystems.

     

  • loading
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views(129) PDF downloads(5) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return