基于可解释性机器学习的流纹岩构造背景智能判别
DOI:
作者:
作者单位:

1.南华大学;2.湖南省国土空间调查监测所;3.浙江省核工业二六二大队

作者简介:

通讯作者:

中图分类号:

基金项目:

湖南省自然科学基金面上项目(编号2023JJ30507)、湖州市地质资源与地质工程重点实验室开放基金(编号2024KLAB01)


Interpretable Machine learning-Based intelligent discrimination of Rhyolite Tectonic Settings
Author:
Affiliation:

1.University of South China;2.Hunan Provincial Land and Space Survey and Monitoring Institute;3.Zhejiang Province Nuclear Industry 262 Brigade

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    流纹岩构造背景的判别在地球动力学研究中具有重要意义。然而,传统图解法受限于低维参数空间和后期蚀变影响,难以有效解析复杂的岩浆成因机制,构造判别结果存在不确定性。近年来,机器学习技术在地球科学领域得到广泛应用,依托多维地球化学特征分析与模式挖掘对构造背景进行智能识别,已成为该领域的重要研究方向。本文基于全球流纹岩地球化学数据库(GEOROC),清洗并筛选了5874条高质量流纹岩数据,涵盖主量、微量、稀土元素及其比值等46项指标,建立了系统的数据集。研究构建了九种机器学习分类器,包括集成树模型(XGBoost、LightGBM、CatBoost等)、支持向量机(SVM)及深度学习模型(MLP、TabPFN),通过Optuna算法优化超参数以提升模型性能。实验结果表明,在未调优的状态下,基于Transformer架构的TabPFN模型在无需调优的条件下表现最佳,测试集准确率达88.37%,AUC值为98.39%,充分体现其在地球化学数据学习中的优势;经优化的LightGBM模型准确率提升至87.98%,验证了超参数调优的有效性。进一步结合SHAP与LIME可解释性方法,揭示了Cs、Hf、Th/Ta等关键判别因子,其贡献度与俯冲流体活动、地幔部分熔融及壳-幔相互作用等地球动力学过程密切相关;同时,传统研究中被忽视的微量元素(Ga、Co等)在模型中展现出显著分类能力,表明多元素协同分析对构造背景判别的潜在价值。研究表明,融合高维地球化学数据与可解释机器学习模型可显著提升构造背景识别精度,为长英质岩浆岩构造归属判别及区域地球动力学演化研究提供了新方法。

    Abstract:

    The discrimination of tectonic settings for rhyolites holds significant importance in geodynamic research. However, traditional graphical methods, constrained by low-dimensional parameter spaces and the influence of post-emplacement alteration, struggle to effectively resolve complex magmatic genetic mechanisms, leading to uncertainties in tectonic discrimination results. In recent years, machine learning (ML) techniques have been widely applied in Earth sciences, with intelligent identification of tectonic settings based on multi-dimensional geochemical feature analysis and pattern mining emerging as a critical research direction. This study utilizes the global rhyolite geochemical database (GEOROC) to establish a systematic dataset comprising 5,874 high-quality rhyolite samples, incorporating 46 parameters including major, trace, and rare earth element (REE) concentrations and ratios. Nine ML classifiers were developed, encompassing ensemble tree models (XGBoost, LightGBM, CatBoost), support vector machines (SVM), and deep learning architectures (MLP, TabPFN), with hyperparameters optimized via the Optuna algorithm to enhance model performance. Experimental results demonstrate that the Transformer-based TabPFN model achieved optimal performance without hyperparameter tuning, attaining a test-set accuracy of 88.37% and AUC value of 98.39%, highlighting its superiority in geochemical data learning. The optimized LightGBM model exhibited an improved average accuracy of 87.98%, confirming the efficacy of hyperparameter optimization. Interpretability analyses using SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) identified critical discriminators including Cs, Hf, and Th/Ta ratios, whose contributions correlate closely with geodynamic processes such as subduction-related fluid activity, mantle partial melting, and crust-mantle interactions. Meanwhile, trace elements traditionally overlooked in conventional studies (e.g., Ga, Co) exhibited significant classification power, suggesting the potential value of multi-element synergy analysis in tectonic discrimination. This research demonstrates that integrating high-dimensional geochemical data with interpretable ML models substantially enhances tectonic setting identification accuracy, providing a novel methodology for determining the tectonic affinity of felsic magmatic rocks and advancing regional geodynamic evolution studies.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-04-10
  • 最后修改日期:2025-09-20
  • 录用日期:2025-11-19
  • 在线发布日期:
  • 出版日期: