面向热带橡胶林LAI遥感反演的多模型比较与最优方案筛选研究

    Multi-model Comparison and Optimal Scheme Selection for LAI Remote Sensing Retrieval of Tropical Rubber Plantations

    • 摘要:
      目的 叶面积指数(LAI)是表征植被冠层结构与生理功能的关键参数,对橡胶林生长监测和生态功能评估具有重要意义。现有橡胶林LAI遥感研究缺乏在统一框架下对多类反演方法的横向比较,不同建模方案的精度差异与误差结构尚不明确。本研究旨在筛选面向热带橡胶林LAI遥感反演的最优建模方案。
      方法 以海南岛橡胶林为对象,基于2017年Landsat 8逐月影像及551条有效地面实测LAI记录,在相同数据集、相同输入特征与相同随机10折交叉验证框架下,构建并比较一元线性回归(SLR)、多元线性回归(MLR)、随机森林(RF)、支持向量机(SVM)和XGBoost共5类LAI反演模型,输入特征包括5个植被指数、1个衍生差值特征及2个月份周期编码时间变量,共8维。
      结果 XGBoost综合精度最优,R2为0.9072;RF次之,R2为0.9026,两者均显著优于SVM和线性模型。全局特征重要性分析显示,月份周期编码时间变量的增益贡献高于各植被指数。残差诊断表明,5类模型在高LAI区间均存在不同程度的低估倾向,与光学遥感在高郁闭度冠层下的信号饱和特性相关,非线性模型的偏差幅度明显低于线性模型。基于最优模型生成的月尺度LAI产品在空间格局、地形分异和季节变化3个维度上均与已有认知高度吻合。
      结论 在统一比较框架下,XGBoost是橡胶林LAI遥感反演的最优方案,高LAI区间的低估是基于光学遥感反演LAI的固有局限,后续研究可通过补充消融实验定量评估时间变量贡献,并结合多源数据缓解信号饱和问题。

       

      Abstract:
      Objective Leaf Area Index (LAI) is a key parameter for characterizing vegetation canopy structure and physiological function, with important implications for rubber plantations growth monitoring and ecological assessment. Existing remote sensing studies on rubber plantations LAI lack systematic cross-method comparisons under a unified framework. This study aimed to identify the optimal modelling approach for tropical rubber plantations LAI remote sensing retrieval.
      Method Focusing on rubber plantations on Hainan Island, five LAI retrieval models were systematically constructed and compared, including Simple Linear Regression (SLR), Multiple Linear Regression (MLR), Random Forest (RF), Support Vector Machine (SVM), and XGBoost, based on 2017 monthly Landsat 8 imagery, and 551 valid ground-measured LAI records, under the same dataset, identical input features, and a fixed random 10-fold cross-validation framework. The input feature set comprised five vegetation indices, one derived difference feature, and two cyclically-encoded monthly temporal variables.
      Result XGBoost achieved the best overall accuracy with an R2 of 0.9072, closely followed by RF with an R2 of 0.9026, both significantly outperforming SVM and linear models. Global feature importance analysis indicated that cyclically-encoded temporal variables contributed more gain than all vegetation indices. Residual diagnostics revealed a systematic underestimation tendency in the high-LAI range across all five models, associated with optical signal saturation under highly closed canopies, with nonlinear models exhibiting substantially smaller bias than linear models. Monthly LAI products generated by the optimal model is highly consistent with existing knowledge in terms of spatial pattern, topographic differentiation, and seasonal variation.
      Conclusion Under the unified comparison framework, XGBoost was the optimal approach for rubber plantations LAI retrieval. Systematic underestimation in the high-LAI range represents an inherent limitation of optical remote sensing-based LAI inversion. Future work should quantify the contribution of temporal variables through ablation experiments and explore multi-source data integration to alleviate signal saturation.