基于融合可见光与红外热成像的母猪发情识别方法研究

    A Sow Estrus Detection Method Based on Feature-level Fusion of Visible Light and Infrared Thermal Imaging

    • 摘要:
      目的 母猪发情识别影响授精时机与繁殖效率,针对传统人工查情主观性强、连续监测能力不足及单一模态在复杂猪舍环境下鲁棒性有限的问题,提出可见光与红外热成像特征层融合的非接触式母猪智能查情方法。
      方法 以断奶经产大白母猪为对象,采集其阴户区域可见光图像与红外热成像数据,依据时间戳构建可见光-红外成对样本。采用双分支网络,其中可见光分支采用保留主干网络的ResNet-50提取图像和空间特征,并以BiLSTM与时间注意力机制进行时序建模;而红外分支则提取外阴温度统计特征并经全连接编码;双分支高层语义特征经线性投影至统一低维度后拼接融合,由MLP学习模态间互补关系,末端Sigmoid分类器输出结果。
      结果 试验对比单模态模型、不同融合策略及消融组件的性能,提出特征层融合模型在测试集上取得准确率93.1%、召回率94.5%、AUC 96.2%,相比可见光单模态准确率提升1.5个百分点,相比红外单模态准确率提升4.6个百分点、精确率提升8.3个百分点。相比输入层融合和决策层融合,特征层融合准确率分别高出1.0和0.3个百分点。消融实验中,逐步引入LSTM、BiLSTM和时间注意力机制后,准确率从89.8%依次提升至91.4%、92.2%和93.1%。
      结论 可见光外阴表型信息与红外热生理信息具有较强互补性,特征层融合可有效规避原始数据层异构差异,在高语义层次实现跨模态特征交互,显著提升母猪发情识别精度。

       

      Abstract:
      Objective Sow estrus recognition is essential for determining insemination timing and improving reproductive efficiency. Aiming at the problems of strong subjectivity and poor continuous monitoring capability of traditional manual estrus detection, as well as the limited robustness of single-modal methods in complex pig house environments, this study explored the effect of feature-level fusion of visible light and infrared thermal imaging on estrus recognition, and proposed a non-contact intelligent estrus detection method.
      Method Weaned multiparous Large White sows were selected as the research objects. Visible light images and infrared thermal imaging data of their vulva regions were collected twice daily continuously. Paired visible light-infrared samples were constructed based on timestamps. A two-branch structure was adopted for feature extraction. The visible light branch employed ResNet-50 with the backbone network retained to extract image spatial features, and adopted BiLSTM combined with temporal attention mechanism for time-series modeling. The infrared branch extracted statistical temperature features of the vulva and performed full connection encoding. The high-level semantic features of the two branches were linearly projected to a unified low-dimensional space for concatenated feature-level fusion. MLP was used to learn the complementary relationship between modalities, and the Sigmoid classifier at the output end generated the final recognition results.
      Result By comparing the performance of single-modal models, different fusion strategies and ablation components, the feature-level fusion model achieved an accuracy of 93.1%, a recall rate of 94.5%, and an AUC of 96.2% on the test set. Compared with the single visible light modality, the accuracy increased by 1.5 percentage; compared with the single infrared modality, the accuracy increased by 4.6 percentage and the precision increased by 8.3 percentage. In addition, its accuracy was 1.0 and 0.3 percentage points higher than that of input-level fusion and decision-level fusion respectively. In ablation experiments, with the gradual introduction of LSTM, BiLSTM and temporal attention mechanism, the accuracy increased sequentially from 89.8% to 91.4%, 92.2% and 93.1%.
      Conclusion Visible light vulvar phenotypic information and infrared thermal physiological information present strong complementarity. Feature-level fusion can effectively eliminate heterogeneous differences in the raw data layer, realize cross-modal feature interaction at a high semantic level, and significantly improve the accuracy of sow estrus recognition.