International Journal of Emerging Research in Science, Engineering, and Management
Vol. 2, Issue 3, pp. 342-350, March 2026.
This work is licensed under a Creative Commons Attribution 4.0 International License.
A Transformer-Oriented Deep Learning Approach for Intelligent Plant Disease Diagnosis
G Ravi Kumar, Saikumar Panchireddi, Gownalla Sirisha, Putluru Sarath Kumar Reddy, Nagaram Sai Teja Chary
Department of CSE, Siddartha Institute of Science and Technology, Puttur, AP, India.
Abstract: This research addresses the critical challenge of global food security by introducing a Transformer-oriented deep learning framework for the automated and intelligent diagnosis of plant diseases. Recognizing the economic devastation and yield loss caused by agricultural pathogens, the Paper moves beyond the limitations of traditional Convolutional Neural Networks (CNNs) by implementing a Vision Transformer (ViT) architecture. The system leverages the power of self-attention mechanisms, allowing the model to process leaf images as sequences of patches. This approach enables the capture of both intricate local lesion details and global structural context more effectively than localized filters. To ensure high diagnostic performance, the framework utilizes high-resolution leaf imagery, which undergoes rigorous preprocessing and data augmentation to simulate diverse environmental conditions. By learning visual representations directly from image patches, the ViT model eliminates the need for manual feature engineering and demonstrates significant robustness against real-world complexities, such as inconsistent lighting, cluttered backgrounds, and varying leaf orientations. The experimental results validate that this transformer-based approach provides a highly accurate, scalable, and automated solution for crop health monitoring. By integrating state-of-the-art computer vision with agricultural science, this Paper offers a powerful tool for timely intervention, empowering farmers with actionable insights to mitigate disease spread and ensure sustainable crop management.
Keywords: Plant Disease Diagnosis, Vision Transformer, Self-Attention, Deep Learning, Digital Agriculture.
References:
- G S. Aboelenin, F. A. Elbasheer, M. M. Eltoukhy, W. M. El-Hady, and K. M. Hosny, “A hybrid Framework for plant leaf disease detection and classification using convolutional neural networks and vision transformer,” Complex & Intelligent Systems, vol. 11, no. 2, Jan. 2025, doi: 10.1007/s40747-024-01764-x.
- S. M. Alhammad, D. S. Khafaga, W. M. El-Hady, F. M. Samy, and K. M. Hosny, “Deep learning and explainable AI for classification of potato leaf diseases,” Frontiers in Artificial Intelligence, vol. 7, p. 1449329, Feb. 2025, doi: 10.3389/frai.2024.1449329.
- Z. Salman, A. Muhammad, and D. Han, “Plant disease classification in the wild using vision transformers and mixture of experts,” Frontiers in Plant Science, vol. 16, p. 1522985, Jun. 2025, doi: 10.3389/fpls.2025.1522985.
- J. Chen, S. Wang, J. Guo, F. Chen, Y. Li, and H. Qiu, “Improved FasterViT model for citrus disease diagnosis,” Heliyon, vol. 10, no. 16, p. e36092, Aug. 2024, doi: 10.1016/j.heliyon.2024.e36092.
- P. Christakakis, N. Giakoumoglou, D. Kapetas, D. Tzovaras, and E.-M. Pechlivani, “Vision Transformers in Optimization of AI-Based Early Detection of Botrytis cinerea,” AI, vol. 5, no. 3, pp. 1301–1323, Aug. 2024, doi: 10.3390/ai5030063.
- J. Feng, W. E. Ong, W. C. Teh and R. Zhang, “Enhanced Crop Disease Detection With EfficientNet Convolutional Group-Wise Transformer,” in IEEE Access, vol. 12, pp. 44147-44162, 2024, doi: 10.1109/ACCESS.2024.3379303.
- R. Karthik, A. Ajay, A. Singh Bisht, T. Illakiya and K. Suganthi, “A Deep Learning Approach for Crop Disease and Pest Classification Using Swin Transformer and Dual-Attention Multi-Scale Fusion Network,” in IEEE Access, vol. 12, pp. 152639-152655, 2024, doi: 10.1109/ACCESS.2024.3481675.
- R. Sudhakar, K. Nithya, C. R. Dhivyaa, and C. Sharmila, “High-performance parallel multi-scale attention network with explainable AI for intelligent diagnosis of leaf diseases in agricultural systems,” Scientific Reports, vol. 15, no. 1, p. 42060, Nov. 2025, doi: 10.1038/s41598-025-26144-4.
- H. Gunasekaran, W. B. N. R, N. V. Watson, H. B, A. G. Jesudoss, and A. C. G, “SWIN-HVIT: a Hybrid Transformer approach for accurate Early-Stage crop Disease diagnosis,” Research Square, Jan. 2026, doi: 10.21203/rs.3.rs-8588269/v1.
- Y. Maqsood, S. M. Usman, M. Alhussein, K. Aurangzeb, S. Khalid, and M. Zubair, “Model agnostic Meta-Learning (MAML)-Based ensemble model for accurate detection of wheat diseases using vision transformer and graph neural networks,” Computers, Materials & Continua/Computers, Materials & Continua (Print), vol. 79, no. 2, pp. 2795–2811, Jan. 2024, doi: 10.32604/cmc.2024.049410.
- B. N. E. Nyarko, W. B. Wu, Z. J. Zhou, and M. M. J. Mohd, “Improved YOLOv5m model based on Swin Transformer, K-means++, and Efficient Intersection over Union (EIoU) loss function for cocoa tree (Theobroma cacao) disease detection,” Journal of Plant Protection Research, pp. 265–274, Sep. 2024, doi: 10.24425/jppr.2024.151253.
- E. Kapase, P. Bhandari, A. Bodake, and U. Chaudhari, “Revolutionizing plant Disease Detection: A review of deep learning and machine learning algorithms,” International Journal of Scientific Research in Science Engineering and Technology, vol. 11, no. 2, pp. 204–210, Apr. 2024, doi: 10.32628/ijsrset2411227.
- L. J. Galarza, M. Realpe, M. S. Viñán-Ludeña, M. F. Calderón, and S. Jaramillo, “AgriTransformer: A Transformer-Based Model with Attention Mechanisms for Enhanced Multimodal Crop Yield Prediction,” Electronics, vol. 14, no. 12, p. 2466, Jun. 2025, doi: 10.3390/electronics14122466.
- S. Sennan, R. Somula, Y. Cho, and S. Sennan, “A hybrid vision transformer and ResNet18 based model for biotic rice leaf disease detection,” Frontiers in Plant Science, vol. 16, p. 1711700, Nov. 2025, doi: 10.3389/fpls.2025.1711700.
