| Frontiers in Plant Science | |
| PMVT: a lightweight vision transformer for plant disease identification on mobile devices | |
| Plant Science | |
| Peiyan Yuan1  Baofang Chang1  Yuchao Wang1  Guoqiang Li2  Qing Zhao2  | |
| [1] College of Computer and Information Engineering, Henan Normal University, Xinxiang, Henan, China;Key Laboratory of Artificial Intelligence and Personalized Learning in Education of Henan Province, Xinxiang, Henan, China;Institute of Agricultural Economics and Information, Henan Academy of Agricultural Sciences, Zhengzhou, Henan, China; | |
| 关键词: plant disease identification; vision transformer; lightweight model; attention module; APP; | |
| DOI : 10.3389/fpls.2023.1256773 | |
| received in 2023-07-11, accepted in 2023-09-08, 发布年份 2023 | |
| 来源: Frontiers | |
PDF
|
|
【 摘 要 】
Due to the constraints of agricultural computing resources and the diversity of plant diseases, it is challenging to achieve the desired accuracy rate while keeping the network lightweight. In this paper, we proposed a computationally efficient deep learning architecture based on the mobile vision transformer (MobileViT) for real-time detection of plant diseases, which we called plant-based MobileViT (PMVT). Our proposed model was designed to be highly accurate and low-cost, making it suitable for deployment on mobile devices with limited resources. Specifically, we replaced the convolution block in MobileViT with an inverted residual structure that employs a 7×7 convolution kernel to effectively model long-distance dependencies between different leaves in plant disease images. Furthermore, inspired by the concept of multi-level attention in computer vision tasks, we integrated a convolutional block attention module (CBAM) into the standard ViT encoder. This integration allows the network to effectively avoid irrelevant information and focus on essential features. The PMVT network achieves reduced parameter counts compared to alternative networks on various mobile devices while maintaining high accuracy across different vision tasks. Extensive experiments on multiple agricultural datasets, including wheat, coffee, and rice, demonstrate that the proposed method outperforms the current best lightweight and heavyweight models. On the wheat dataset, PMVT achieves the highest accuracy of 93.6% using approximately 0.98 million (M) parameters. This accuracy is 1.6% higher than that of MobileNetV3. Under the same parameters, PMVT achieved an accuracy of 85.4% on the coffee dataset, surpassing SqueezeNet by 2.3%. Furthermore, out method achieved an accuracy of 93.1% on the rice dataset, surpassing MobileNetV3 by 3.4%. Additionally, we developed a plant disease diagnosis app and successfully used the trained PMVT model to identify plant disease in different scenarios.
【 授权许可】
Unknown
Copyright © 2023 Li, Wang, Zhao, Yuan and Chang
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO202310129946614ZK.pdf | 5445KB |
PDF