Dynamic detection method for falling ears of maize harvester based on improved YOLO-V4

Ang Gao, Aijun Geng, Zhilong Zhang, Ji Zhang, Xiaolong Hu, Ke Li

Abstract


Traditional maize ear harvesters mainly rely on manual identification of fallen maize ears, which cannot realize real-time detection of ear falling. The improved You Only Look Once-V4 (YOLO-V4) algorithm was combined with the channel pruning algorithm to detect the dropped ears of maize harvesters. K-means clustering algorithm was used to obtain a prior box matching the size of the dropped ears, which improves the Intersection Over Union (IOU). Compare the effect of different activation functions on the accuracy of the YOLO-V4 model, and use the Mish activation function as the activation function of this model. Improve the calculation of the regression positioning loss function, and use the CEIOU loss function to balance the accuracy of each category. Use improved Adam optimization function and multi-stage learning optimization technology to improve the accuracy of the YOLO-V4 model. The channel pruning algorithm was used to compress the model and distillation technology was used in the fine-tuning of the model. The final model size was only 10.77% before compression, and the test set mean Average Precision (mAP) was 93.14%. The detection speed was 112 fps, which can meet the need for real-time detection of maize harvester ears in the field. This study can provide a technical reference for the detection of the ear loss rate of intelligent maize harvesters.
Keywords: maize ear detection, YOLO-V4, channel pruning algorithm, real-time detection
DOI: 10.25165/j.ijabe.20221503.6660

Citation: Gao A, Geng A J, Zhang Z L, Zhang J, Hu X L, Li K. Dynamic detection method for falling ears of maize harvester based on improved YOLO-V4. Int J Agric & Biol Eng, 2022; 15(3): 22–32.

Keywords


maize ear detection, YOLO-V4, channel pruning algorithm, real-time detection

Full Text:

PDF

References


Jin X B, Yu X H, Wang X Y, Bai Y T, Su T L, Kong J L. Deep learning predictor for sustainable precision agriculture based on internet of things system. Sustainability, 2020; 12(4): 1–18.

Tian Y, Yang G, Wang Z, Wang H, Li E, Liang Z. Apple detection during different growth stages in orchards using the improved YOLO-V3 model. Computers and Electronics in Agriculture, 2019; 157: 417–426.

Lyu S X, Noguchi N, Ospina R, Kishima Y. Development of phenotyping system using low altitude UAV imagery and deep learning. Int J Agric & Biol Eng, 2021; 14(1): 207–215.

Yang Z K, Li W Y, Li M, Yang X T. Automatic greenhouse pest recognition based on multiple color space features. Int J Agric & Biol Eng, 2021; 14(2): 188–195.

Pang Y, Shi Y, Gao S, Jiang F, Veeranampalayam-Sivakumar A N, Thompson L, et al. Improved crop row detection with deep neural network for early-season maize stand count in UAV imagery. Computers and Electronics in Agriculture, 2020; 178: 105766. doi: 10.1016/ j.compag.2020.105766.

Monhollen N S, Shinners K J, Friede J C, Rocha E M, Luck B L. In-field machine vision system for identifying corn kernel losses. Computers and Electronics in Agriculture, 2020; 174: 105496. doi: 10.1016/j.compag.2020.105496.

Ni C, Wang D, Vinson R, Holmes M, Tao Y. Automatic inspection machine for maize kernels based on deep convolutional neural networks. Biosystems Engineering, 2019; 178: 131–144.

Yeom S K, Seegerer P, Lapuschkin S, Binder A, Samek W. Pruning by explaining: a novel criterion for deep neural network pruning. Pattern Recognition, 2021; 12: 107899. doi: 10.1016/j.patcog.2021.107899.

Wu D, Lyu S, Jiang M, Song H. Using channel pruning-based YOLO v4 deep learning algorithm for the real-time and accurate detection of apple flowers in natural environments. Computers and Electronics in Agriculture, 2020; 178(4): 105742. doi: 10.1016/j.compag.2020.105742.

Run S, Li, T X, Yamaguchi Y. An attribution-based pruning method for real-time mango detection with YOLO network. Computers and Electronics in Agriculture, 2020; 169: 105214. doi: 10.1016/ j.compag.2020.105214.

Fountsop A N, Fendji, J, Atemkeng M. Deep learning models compression for agricultural plants. Applied Sciences, 2020; 10(19): 6866. doi: 10.3390/app10196866.

Peng H Y, Yu S Q. A systematic IOU-related method: Beyond simplified regression for better localization. IEEE Transactions on Image Processing, 2021; 30: 5032–5044.

Kingma D, Ba J. Adam: a method for stochastic optimization. Computer Science. 3rd International Conference for Learning Representations, San Diego, 2015; arXiv. doi: 10.48550/arXiv.1412.6980.

Ketkar N. Stochastic gradient descent. In Deep learning with Python. Apress, Berkeley, CA, 2017; pp.113–132.

Bochkovskiy A, Wang C Y, Liao H Y M. YOLOv4: Optimal speed and accuracy of object detection. arXiv, 2020. doi: 10.48550/ arXiv.2004.10934.

Tian Y, Yang G, Wang Z, Wang H, Li E, Liang Z. Apple detection during different growth stages in orchards using the improved YOLO-V3 model. Computers and Electronics in Agriculture, 2019; 157: 417–426.

Wang C-Y, Liao H-Y, Wu Y-H, Chen P-Y, Yeh I-H. CSPNet: A new backbone that can enhance learning capability of CNN. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2020; pp.1571–1580. doi: 10.1109/CVPRW50498.2020. 00203.

He K, Zhang X, Ren S, Sun J. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2015; 37(19): 4–16.

Shu L, Lu Q, Haifang Q, Jianping S, Jiaya J. Path aggregation network for instance segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018; pp.8759–8768. doi: 10.1109/CVPR.2018.00913.

Zhang X, Zou Y, Shi W. Dilated convolution neural network with LeakyReLU for environmental sound classification. In: 2017 22nd International Conference on Digital Signal Processing (DSP). IEEE, 2017; pp.1–5. doi: 10.1109/ICDSP.2017.8096153.

Misra D. Mish: A self regularized non-monotonic neural activation function. 2019. arXiv: 1908.08681, 4.

Liu S, Qi L, Qin H, Shi J, Jia J. Path aggregation network for instance segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: pp.8759–8768. doi: 10.1109/ CVPR.2018.00913.

Long X, Deng K, Wang G, Zhang Y, Dang Q, Gao Y, et al. PP-YOLO: An effective and efficient implementation of object detector. arXiv, 2020. doi: 10.48550/arXiv:2007.12099.

Zheng Z, Wang P, Liu W, Li J, Ye R, Ren D. Distance-IOU loss: Faster and better learning for bounding box regression. In Proceedings of the AAAI Conference on Artificial Intelligence, 2020; 34(7): 12993–13000.

Dhanachandra N, Manglem K, Chanu Y J. Image segmentation using K-means clustering algorithm and subtractive clustering algorithm. Procedia Computer Science, 2015; 54: 764–771.

Ramachandran P, Zoph B, Le Q V. Searching for activation functions. arXiv, 2017. doi: 10.48550/arXiv.1710.05941.

Peng H, Yu S. A Systematic IOU-Related Method: Beyond Simplified Regression for Better Localization. IEEE Transactions on Image Processing, 2021; 30: 5032–5044.

Kim K, Choi Y. HyAdam C: A new Adam-based hybrid optimization algorithm for Convolution Neural Networks. Sensors, 2021; 21(12): 4054. doi: 10.3390/s21124054.

Iiboudo W, Kobayashi T, Sugimoto K. Robust stochastic gradient descent with student-t distribution based first-order momentum. IEEE Transactions on Neural Networks and Learning Systems, 2020; 99: 1–14.

Wu Xu, Qi Z, Wang L J, Yang J, Xia X. Apple detection method based on light-YOLOV3 convolutional neural network. Transactions of the CSAM, 2020; 51(8): 17–25. (in Chinese)

Chen G, Choi W, Yu X, Han T, Chandraker M. Learning efficient object detection models with knowledge distillation. In Proceedings of the 31st International Conference on Neural Information Processing Systems, 2017; pp.742–751.

Ren S, He K, Girshick, R, Sun J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017; 39(6): 1137–1149.

He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; pp.770–778.




Copyright (c) 2022 International Journal of Agricultural and Biological Engineering

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

2023-2026 Copyright IJABE Editing and Publishing Office