Multimodal Data Fusion in Smart Agriculture: Integrating Soil, Climate, Imaging, and Biological Signals Using Machine Learning: Scoping Review

Authors

R.N.I. Basnayake

Department of Computer and Data Science, NSBM Green University (Sri Lanka)

Article Information

DOI: 10.51584/IJRIAS.2026.111500010

Subject Category: Biology

Volume/Issue: 11/15 | Page No: 101-107

Publication Timeline

Submitted: 2026-04-16

Accepted: 2026-04-22

Published: 2026-05-14

Abstract

The evolution of smart agriculture has expedited the adoption of machine learning-based solutions for crop monitoring, disease detection, yield estimation, and environmental management. However, most of the prevailing machine learning-based solutions utilize unimodal data, such as image, soil, or climatic-based information. These solutions face the limitation of effectively handling the complex interactions between the agricultural ecosystems. Multimodal data fusion-based solutions have been recognized as a powerful tool for effectively utilizing the strength of heterogeneous data sources, including soil-based information, climatic-based information, image-based information, and plant physiological-based information. This systematic review focuses on the machine learning-based multimodal data fusion solutions developed using research published between 2020 and 2026. A structured review-based methodology was followed to identify the relevant research. A review of the relevant literature was conducted to determine the effectiveness of the machine learning-based multimodal data fusion solutions. It was found that the effectiveness of the machine learning-based solutions, including early, intermediate, and late fusion-based solutions, attention-based solutions, transformer-based solutions, graph learning-based solutions, and deep ensemble-based solutions, was better compared to unimodal-based solutions. The applications of the machine learning-based multimodal data fusion solutions were found to be effective for applications such as crop disease detection, soil moisture estimation, yield estimation, plant stress detection, and intelligent decision-making. The challenges associated with the machine learning-based multimodal data fusion solutions were identified to be associated with issues such as heterogeneity, synchronization, the unavailability of standardized multimodal datasets, and the limitations associated with real-time applications. The research direction of the machine learning-based multimodal data fusion solutions was identified to be associated with cross-modal transformer-based solutions, explainable machine learning-based solutions, federated learning-based solutions, the utilization of biological-based information with remote sensing-based information, and the utilization of graph learning-based solutions.

Keywords

Multimodal learning; Deep learning; Information fusion; Crop monitoring

Downloads

References

1. Choi, J. W., Cho, S. B., Hidayat, M. S., Hwang, W.-H., Cho, Y.-S., Lee, H., Cho, B.-K., & Kim, G. (2025). Application of multimodal data fusion and explainable AI for classifying water stress in sweet potatoes. Frontiers in Plant Science, 16. [Google Scholar] [Crossref]

2. Katharria, A., Rajwar, K., Pant, M., Velásquez, J. D., Snášel, V., & Deep, K. (2025). Information fusion in smart agriculture: Machine learning applications and future research directions. arXiv preprint arXiv:2405.17465. [Google Scholar] [Crossref]

3. Khoirudin, P. T., Pungkasanti, P. T., & Hidayati, N. (2024). A systematic review on data fusion techniques for agricultural yield prediction: Integrating satellite imagery with climatic data. Systematic Literature Review Journal, 1(4). [Google Scholar] [Crossref]

4. Lamichhane, M., Mehan, S., & Mankin, K. R. (2025a). Multimodal machine learning approaches for soil moisture estimation in agricultural systems. [Google Scholar] [Crossref]

5. Lamichhane, M., Mehan, S., & Mankin, K. R. (2025b). Surface soil moisture prediction using multimodal remote sensing data fusion and machine learning algorithms in semi-arid agricultural regions. [Google Scholar] [Crossref]

6. Liu, R., Chang, C., Zhong, R., & Lu, S. (2025). Soil moisture monitoring method and data products: Current research status and future development trends. Remote Sensing, 17, 3945. [Google Scholar] [Crossref]

7. Lu, Y., Lu, X., Zheng, L., Sun, M., Chen, S., Chen, B., Wang, T., Yang, J., & Lv, C. (2024a). Application of multimodal transformer model in intelligent agricultural disease detection and question-answering systems. Plants, 13, 972. [Google Scholar] [Crossref]

8. Lu, Y., et al. (2024b). Multimodal transformer-based intelligent systems for agricultural decision support. Plants. [Google Scholar] [Crossref]

9. Njoroge, T. K., Sindu, K. M., & Kibuku, R. (2025). Edge-optimized multimodal cross-fusion model with statistical validation for multi-crop disease detection. International Journal of Advances in Intelligent Informatics, 11(3). [Google Scholar] [Crossref]

10. Shoaib, M., Khan, S. U., AbdelHameed, H., & Qahmash, A. (2026a). Plant stress detection using multimodal imaging and machine learning: From leaf spectra to smartphone applications. Frontiers in Plant Science. [Google Scholar] [Crossref]

11. Shoaib, M., et al. (2026b). Machine learning-driven multimodal imaging techniques for plant stress analysis. Frontiers in Plant Science. [Google Scholar] [Crossref]

12. Wang, X., Yan, F., Li, B., Yu, B., Zhou, X., Tang, X., Jia, T., & Lv, C. (2025). A multimodal data fusion and embedding attention mechanism-based method for eggplant disease detection. Plants, 14, 786. [Google Scholar] [Crossref]

13. Wittstruck, L., Waske, B., & Jarmer, T. (2025). Multi-modal vision transformer for high-resolution soil texture prediction using remote sensing imagery. Remote Sensing of Environment. [Google Scholar] [Crossref]

14. Wu, X., Zhang, J., Zou, Z., Chen, C., Yu, Y., Yu, P., Xiao, Y., Wang, Q., & Hao, G. (2025a). PlantIF: Multimodal semantic interactive fusion via graph learning for plant disease diagnosis. [Google Scholar] [Crossref]

15. Wu, X., et al. (2025b). Graph-based multimodal learning for intelligent plant disease diagnosis systems. [Google Scholar] [Crossref]

16. Yang, Z.-X., Li, Y., Wang, R.-F., Hu, P., & Su, W.-H. (2025a). Deep learning in multimodal fusion for sustainable plant care: A comprehensive review. Sustainability, 17(12), 5255. [Google Scholar] [Crossref]

17. Yang, Z.-X., et al. (2025b). Advances in multimodal deep learning frameworks for sustainable agriculture applications. Sustainability. [Google Scholar] [Crossref]

18. Yewle, A. D., Mirzayeva, L., & Karakuş, O. (2025). Multi-modal data fusion and deep ensemble learning for accurate crop yield prediction. arXiv preprint. [Google Scholar] [Crossref]

19. Yeom, M.-S., Lee, Y., Oh, H., Lee, E., & Oh, M.-M. (2025). Applying machine learning for the classification of environmental conditions using plant electrical signals. Horticulture, Environment, and Biotechnology. [Google Scholar] [Crossref]

20. Zhou, Y., Yan, H., Ding, K., Cai, T., & Zhang, Y. (2024). Few-shot image classification of crop diseases based on vision–language models. Sensors, 24, 6109. [Google Scholar] [Crossref]

Metrics

Views & Downloads

Similar Articles