基于XGBoost和SHAP的可解释性滑坡位移预测模型

    AN INTERPRETABLE LANDSLIDE DISPLACEMENT PREDICTION MODEL BASED ON XGBOOST AND SHAP

    • 摘要: 滑坡位移预测是监测预警工作的重要一环。机器学习算法具有较高的预测精度,广泛应用于滑坡位移预测,但其具有可解释性差的不足。为此,本文提出一种融合极限梯度提升(Extreme Gradient Boosting,XGBoost)算法和可解释机器学习模型(Shapley Additive Explanations,SHAP)的可解释滑坡位移预测模型(XGB-SHAP)。以四川雅安多营滑坡为例,将滑坡位移时间曲线分解为趋势项和周期项,选取前期降雨量和前期位移等因子作为模型的输入特征,通过XGBoost和差分自回归滑动平均模型(Autoregressive Integrated Moving Average model,ARIMA)分别对其周期项和趋势项位移进行预测,最后对该预测模型进行解释。结果表明:XGB-SHAP预测总位移的平均绝对误差和均方误差分别为2.46和3.38,相比于其他模型具有更高的精度;模型解释结果表明当前期30 d累计位移超过20 mm或7 d最大降雨超过19 mm时,位移增长明显。XGB-SHAP模型不仅具有较高的预测精度,同时能解释致灾因子与滑坡位移的关系,可为滑坡预警阈值及模型建立等工作提供参考。

       

      Abstract: Landslide displacement prediction is an important part of monitoring and warning work. Machine learning algorithm has high prediction accuracy and is widely used in landslide displacement prediction,but its interpretability is poor. For this reason,this paper proposes an interpretable landslide displacement prediction model(XGB-SHAP)that combines the Extreme Gradient Boosting(XGBoost) algorithm and the Shapley Additive Expansions(SHAP). Taking the Duoying landslide in Ya ′an,Sichuan Province as an example,the landslide displacement time curve was decomposed into trend term and periodic term,and factors such as previous rainfall and previous displacement were selected as input features of the model. Through XGBoost and ARIMA(Autoregressive Integrated Moving Average model),the displacement of its cycle term and trend term are predicted respectively,and finally the prediction model is explained. The results show that the mean absolute error and mean square error of XGB-SHAP are 2.46 and 3.38 respectively,which is more accurate than other models. The results show that the mean absolute error and mean square error of XGB-SHAP are 2.46 and 3.38 respectively,which is more accurate than other models. The results of model interpretation show that the displacement increases obviously when the 30-day cumulative displacement exceeds 20 mm or the 7-day maximum rainfall exceeds 19 mm. The model interpretation results show that the displacement increases significantly when the cumulative displacement in the current period exceeds 20 mm in 30 days or the maximum rainfall in 7 days exceeds 19 mm.

       

    /

    返回文章
    返回