Classification of forestry images based on the BoW Model

ZHANG Guangqun; LI Yingjie; WANG Hangjun

doi:10.11833/j.issn.2095-0756.2017.05.004

Volume 34 Issue 5

Sep. 2017

Turn off MathJax

Article Contents

Article Navigation > Journal of Zhejiang A&F University > 2017 > 34(5): 791-797

ZHANG Guangqun, LI Yingjie, WANG Hangjun. Classification of forestry images based on the BoW Model[J]. Journal of Zhejiang A&F University, 2017, 34(5): 791-797. DOI: 10.11833/j.issn.2095-0756.2017.05.004

Citation:

ZHANG Guangqun, LI Yingjie, WANG Hangjun. Classification of forestry images based on the BoW Model[J]. Journal of Zhejiang A&F University, 2017, 34(5): 791-797. DOI: 10.11833/j.issn.2095-0756.2017.05.004

Classification of forestry images based on the BoW Model

DOI: 10.11833/j.issn.2095-0756.2017.05.004

ZHANG Guangqun^{1,2
,},
LI Yingjie³,
WANG Hangjun^{3
,
,}

1.
School of Information Engineering, Zhejiang A & F University, Lin'an 311300, Zhejiang, China
2.
Zhejiang Provincial Key Laboratory of Forestry Intelligent Monitoring and Information Technology, Zhejiang A & F University, Lin'an 311300, Zhejiang, China
3.
Jiyang College, Zhejiang A & F University, Zhuji 311800, Zhejiang, China

Received Date: 2016-09-19
Rev Recd Date: 2016-11-30
Publish Date: 2017-10-20

Abstract

For characteristics of forestry images, an image classification method was put forward based on Dense SIFT and the BoW Model with support vector machine (SVM) using a histogram intersection kernel in order to improving to meet the need of the forest resources management. First, using the BoW Model, the Dense SIFT features of forestry images were extracted to describe the image. Then SVM was used for classification to identify the category of the images. Different kinds of kernel functions like Poly, RBF, Sigmoid, and the histogram intersection kernel were used to find the best recognition rate. Experimental results showed that using Dense SIFT had a shorter detection time (t=60.143 s) and a higher recognition rate (r=86.7%) than SIFT (t=95.567 s and r=83.3% respectively), and it was suited for high real-time applications. Also the histogram intersection kernel had a higher average recognition (r=86.7%). Combining Dense SIFT and the BoW Model with SVM and using the histogram intersection kernel, algorithms used with three kinds of forestry images had a better average recognition (r=86.7%).
- forest measurement,
- forestry images,
- image classification,
- feature extraction,
- BoW Model,
- support vector machine

Relative Article

[1]	LIANG Hao, CAI Chentao, ZHAO Wei, LI Yajie, WANG Wenkun, HU Yuhang, SHEN Yuxiao, LI Yonghua, SUN Tianxiao. Prediction of chlorophyll and nitrogen contents in leaves of Yulania based on smartphone RGB images . Journal of Zhejiang A&F University, 2025, 42(5): 1090-1101. doi: 10.11833/j.issn.2095-0756.20250460
[2]	YANG Fan, YANG Bokai, LI Rongrong. Surface defect detection technology of wood-based panel based on image segmentation and deep learning . Journal of Zhejiang A&F University, 2024, 41(1): 176-182. doi: 10.11833/j.issn.2095-0756.20230280
[3]	YU Lu, HUANG Yanxia, LIU Jingjian, DUAN Lian. Extraction of polarization characteristics of Oryza sativa growth under the rainfall fluctuation . Journal of Zhejiang A&F University, 2020, 37(5): 992-998. doi: 10.11833/j.issn.2095-0756.20190605
[4]	DU Yufei, WU Baoguo, CHEN Yuling. Eucalyptus suitability in Guangxi based on machine learning algorithms . Journal of Zhejiang A&F University, 2020, 37(1): 122-128. doi: 10.11833/j.issn.2095-0756.2020.01.016
[5]	GUO Ruixia, LI Chonggui, LIU Sihan, MA Ting, QUAN Qingqing. Classification of Larix gmelini plantation based on multi-temporal characteristics . Journal of Zhejiang A&F University, 2020, 37(2): 235-242. doi: 10.11833/j.issn.2095-0756.2020.02.006
[6]	WANG Li, HONG Zubing, FANG Luming, CHEN Xun, WU Chao. iOS-based recognition of ornamental plants . Journal of Zhejiang A&F University, 2018, 35(5): 900-907. doi: 10.11833/j.issn.2095-0756.2018.05.015
[7]	TAO Jiangyue, LIU Lijuan, PANG Yong, LI Dengqiu, FENG Yunyun, WANG Xue, DING Youli, PENG Qiong, XIAO Wenhui. Automatic identification of tree species based on airborne LiDAR and hyperspectral data . Journal of Zhejiang A&F University, 2018, 35(2): 314-323. doi: 10.11833/j.issn.2095-0756.2018.02.016
[8]	GUAN Fangli, XU Aijun. Tree DBH measurement method based on smartphone and machine vision technology . Journal of Zhejiang A&F University, 2018, 35(5): 892-899. doi: 10.11833/j.issn.2095-0756.2018.05.014
[9]	YANG Liyan, FENG Zhongke, LIU Yingchun, LIU Jincheng. Tree volume estimates based on QPSO-LSSVM . Journal of Zhejiang A&F University, 2018, 35(5): 868-876. doi: 10.11833/j.issn.2095-0756.2018.05.011
[10]	BAI Xuebing, XU Jingtao, GUO Jingqiu, CHEN Kai. Segmentation of wood surface knots and wormholes based on an improved LBF Model . Journal of Zhejiang A&F University, 2016, 33(2): 306-314. doi: 10.11833/j.issn.2095-0756.2016.02.017
[11]	YAO Fei, YE Kang, ZHOU Jianhua. Automatic image classification and retrieval by analyzing plant leaf features . Journal of Zhejiang A&F University, 2015, 32(3): 426-433. doi: 10.11833/j.issn.2095-0756.2015.03.015
[12]	CHEN Fang, ZHANG Guangqun, CUI Kunpeng, WANG Hangjun. Design and implementation of an embedded automatic plant recognition system . Journal of Zhejiang A&F University, 2013, 30(3): 379-384. doi: 10.11833/j.issn.2095-0756.2013.03.012
[13]	ZHANG Guang-qun, WU Wei-zhi, WANG Hang-jun. A new wood microscopic image registration approach based on speeded up robust features （SURF） . Journal of Zhejiang A&F University, 2012, 29(4): 600-605. doi: 10.11833/j.issn.2095-0756.2012.04.018
[14]	HAO Hong, XU Chang-qing, ZHANG Xin-ping. Aerial image information extraction based on non-negative matrix factorization . Journal of Zhejiang A&F University, 2012, 29(1): 72-77. doi: 10.11833/j.issn.2095-0756.2012.01.013
[15]	CHEN Jian-zhen, HE Chao, YUE Cai-rong. Atmospheric correction of an advance land imager （ALI） image based on the FLAASH module . Journal of Zhejiang A&F University, 2011, 28(4): 590-596. doi: 10.11833/j.issn.2095-0756.2011.04.011
[16]	JIN Ming, DING Gui-jie. Two-dimensional yield-rate tables for merchantable volumes of Pinus massoniana in Guizhou Province . Journal of Zhejiang A&F University, 2011, 28(4): 576-582. doi: 10.11833/j.issn.2095-0756.2011.04.009
[17]	FANG Yi－ming, ZHENG Hong－ping, FENG Hai－lin. Feature extraction and recognition of wood micrograph based on FFT and ICA . Journal of Zhejiang A&F University, 2010, 27(6): 826-830. doi: 10.11833/j.issn.2095-0756.2010.06.004
[18]	LIU Cheng-lin. Compilation and application of normal bamboo table of Phyllostachys pubescens stump diameter . Journal of Zhejiang A&F University, 2009, 26(4): 549-553.
[19]	GONG Zhi-wen, KANG Xin-gang, GU Li, ZHAO Jun-hui, ZHENG Yan-feng, YANG Hua. Research methods on natural forest stand structure：a review . Journal of Zhejiang A&F University, 2009, 26(3): 434-443.
[20]	LIN Xin-chun, YU Zhi-xiong. Characters of leaf epidermis of Magnoliaceae and its taxonomic significance . Journal of Zhejiang A&F University, 2004, 21(1): 33-39.

References

[1]	CHEN Jinbiao, ZHANG Chunhua. Design and implementation of forestry picture management information system based on classification [J]. Cent South For Invent Plann, 2010, 29(2): 30-33.
[2]	LIU Yihua, LI Yuanyuan. Distributed systematic analysis for massive data of forestry image [J]. For Invent Plann, 2010, 35(4): 10-14.
[3]	SIVIC J, ZISSERMAN A. Video Google: a text retrieval approach to object matching in videos [J]. IEEE Int Conf Comput Vis, 2003, 2: 1470-1478.
[4]	WU Lei, HOI S C H, YU Nenghai. Semantics-preserving bag-of-words models and applications [J]. IEEE Trans Image Proc, 2010, 19(7): 1908-1920.
[5]	UIJLINGS J R R, SMEULDERS A W M, SCHA R J H. Real-time visual concept classification [J]. IEEE Trans Multimedia, 2010, 12(7): 665-681.
[6]	ZHAO Chunhui, WANG Ying, KANEKO M. An optimized method for image classification based on bag of words model [J]. J Electron Inf Technol, 2012, 34(9): 2064-2070.
[7]	AI Haojun, ZHANG Min, FANG Yu, et al. Principal component linear coding for visual words [J]. J Software, 2013, 24(supp 2): 42-49.
[8]	ZHU Yingying, ZHU Yanyan, WEN Zhenkun. Sports video classification based on marked genre shots and bag of words model [J]. J Comput-Aid Des Comput Graph, 2013, 25(9): 1375-1383.
[9]	LI Zhen, YAP K H. An efficient approach for scene categorization based on discriminative codebook learning in bag-of-words framework [J]. Image Vision Comput, 2013, 31(10): 748-755.
[10]	MUMTAZ A, COVIELLO E, LANCKRIET G R, et al. A scalable and accurate descriptor for dynamic textures using bag of system trees [J]. IEEE Trans Pattern Anal Mach Intell, 2015, 37(4): 697-712.
[11]	SHENG Haidi. The Improvement of Bag-of-Visual-Words Model and Its Application Research in Images Classification [D]. Ji'nan: Shandong Normal University, 2015.
[12]	WANG Tao. Research on Bag of Words Model-Based Facial Expression Recognition [D]. Wuhan: Huazhong University of Science and Technology, 2013.
[13]	LOWE D G. Object recognition from local scale-invariant features [J]. IEEE Int Conf Comput Vision, 1999, 2: 1150-1157.
[14]	LOWE D G. Distinctive image features from scale-invariant keypoints [J]. Int J Computr Vision, 2004, 60(2): 91-110.
[15]	MATHUR A, FOODY G M. Multiclass and binary SVM classification: implications for training and classification users [J]. IEEE Trans Geosci Remote Sens Letter, 2008, 5(2): 241-245.
[16]	KALYANI S, SWARUP K S. Classification and assessment of power system security using multiclass SVM [J]. IEEE Trans Syst Man Cybern Part C Appl Rev, 2011, 41(5): 753-758.
[17]	GRAUMAN K, DARRELL T. The pyramid match kernel: discriminative classification with sets of image features [C]//Proceedings of the IEEE International Conference on Computer Vision. Beijing: IEEE Computer Society, 2005: 1458-1465.
[18]	CHANG C C, LIN C J. LIBSVM: a library for support vector machines [J]. ACM Trans Intell Syst Technol, 2007, 2(3): 389-396.

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(4) / Tables(3)

Get Citation

PDF

XML

Article views(3225) PDF downloads(369) Cited by()

Proportional views

HTML

森林具有巨大的生态、经济和社会功能，是应对经济全球化发展过程中造成的生态危机和气候变化问题的有效资源。森林资源清查和森林生态保护一直都是各级政府建设的重要内容。实际工作中，护林员通过手机拍摄到的林业现场数据传输回服务器后，可根据林业业务需求快速分类；其分类结果发送到相关管理部门，就可完成对相关事件及时有效的处理。这种森林资源监管模式避免了传统管理手段无法准确及时了解森林现状及动态的问题。要使林业各个管理部门全面配合、相互协调，增强决策支持和加快应急处理，其核心是实现林业业务图像迅速、准确的分类。陈锦标等^[1]使用.NET提出了基于分类的林业图像管理信息系统，解决林业图像管理分散、分类混乱、查找困难问题。刘义华等^[2]针对林业图像数据的特点，提出了海量数据服务器架设方式和需要解决的关键问题。这些研究的基础是对林业图像进行标注，系统代价高，人工成本也高。本研究中林业业务图像自动分类的理论基础是场景图像分类。场景图像分类是在20世纪90年代末开始兴起的一个研究领域，2006年麻省理工学院首次召开场景理解研讨会后成为了新的研究热点。2005年之前，场景图像分类主要采用基于底层特征（low level features）的方法和基于场景结构的方法；之后则采用基于图像视觉词汇的方法，该类方法由SIVIC等^[3]提出视觉词汇的概念，将文本分类中的词袋方法（bag of words, BoW）应用到图像分类中来。之后，由于视觉词汇在图像分类中具有特征表达能力强和简单有效的优点^[4]，被研究者应用在计算机视觉的图像分类领域^[5-12]。词袋方法的核心是提取图像特征构建视觉词汇本。近年来，多采用局部特征用于图像分类，例如，LOWE^[13]提出的高效区域检测算法SIFT（scale invariant feature transform）具有图像旋转、尺度缩放、平移保持不变性，该方法在2004年得到完善^[14]；Dense SIFT即密集SIFT，是在SIFT基础上发展而来的一种算法，相比传统SIFT特征后者具有实时性好、表达能力强的优点。本研究针对林业业务图像数据的特点，利用Dense SIFT提取图像中的业务信息，构建合理的视觉词汇本，描述林业业务图像；根据林业业务管理需求，联合直方图正交核的支持向量机对图像自动分类，并将各类信息传递至各职能管理，从而实现快速、及时、准确、有效的管理。

3. 结论

本研究提出了一种基于Dense SIFT特征的BoW模型，联合直方图正交核的支持向量机对林业业务图像进行自动分类。以收集到的林业业务图像数据集为对象进行实验，结论如下：① 本研究以3类林业业务图像的识别为例，验证发现BoW模型应用于林业业务图像分类可以取得比较好的识别效果。增加新的业务类别时，只要选择足够数量的新增类别的训练样本，重新建立“视觉词汇本”即可。② SIFT和Dense SIFT都能有效地提取到林业业务图像的识别特征；就对图像局部特征完整提取的效果而言，Dense SIFT特征提取法比SIFT在对林业业务图像分类上更有优势。利用BoW模型对特征进行组合，产生的直方图特征更能反映林业业务本身特点，因而识别的准确率得到了极大提高。③ 采用SVM对林业业务图像进行分类时，应用不同的核函数对最后的识别率会产生较大的影响。由于BoW模型使用直方图描述图像的特征，直方图正交核能更好地处理直方图的比较问题，故能取得最佳的识别效果。

综上所述，基于Dense SIFT的BoW模型方法为林业业务图像自动识别研究提供了一种重要思路。该问题的研究与应用有助于中国对森林资源监管模式的创新与实践，有利于加强林业各个管理部门配合，相互协调，增强决策支持和应急处理能力，进而为实现森林的快速、有效、及时的现代化管理打下基础。

Reference (18)

[1]	陈锦标, 张春花. 基于分类的林业图片管理信息系统的设计与实现[J]. 中南林业调查规划, 2010, 29(2): 30-33.	CHEN Jinbiao, ZHANG Chunhua. Design and implementation of forestry picture management information system based on classification[J]. Cent South For Invent Plann, 2010, 29(2): 30-33.
[2]	刘义华, 李媛媛. 海量林业图像数据的分布式体系分析[J]. 林业调查规划, 2010, 35(4): 10-14.	LIU Yihua, LI Yuanyuan. Distributed systematic analysis for massive data of forestry image[J]. For Invent Plann, 2010, 35(4): 10-14.
[3]	SIVIC J, ZISSERMAN A. Video Google: a text retrieval approach to object matching in videos[J]. IEEE Int Conf Comput Vis, 2003, 2(): 1470-1478.
[4]	WU Lei, HOI S C H, YU Nenghai. Semantics-preserving bag-of-words models and applications[J]. IEEE Trans Image Proc, 2010, 19(7): 1908-1920. doi: 10.1109/TIP.2010.2045169
[5]	UIJLINGS J R R, SMEULDERS A W M, SCHA R J H. Real-time visual concept classification[J]. IEEE Trans Multimedia, 2010, 12(7): 665-681. doi: 10.1109/TMM.2010.2052027
[6]	赵春晖, 王莹, KANEKOM. 一种基于词袋模型的图像优化分类方法[J]. 电子与信息学报, 2012, 34(9): 2064-2070.	ZHAO Chunhui, WANG Ying, KANEKO M. An optimized method for image classification based on bag of words model[J]. J Electron Inf Technol, 2012, 34(9): 2064-2070.
[7]	艾浩军, 张敏, 方禹. 视觉词汇的主成分线性编码方法[J]. 软件学报, 2013, 24(supp 2): 42-49.	AI Haojun, ZHANG Min, FANG Yu. Principal component linear coding for visual words[J]. J Software, 2013, 24(supp 2): 42-49.
[8]	朱映映, 朱艳艳, 文振焜. 基于类型标志镜头与词袋模型的体育视频分类[J]. 计算机辅助设计与图形学学报, 2013, 25(9): 1375-1383.	ZHU Yingying, ZHU Yanyan, WEN Zhenkun. Sports video classification based on marked genre shots and bag of words model[J]. J Comput-Aid Des Comput Graph, 2013, 25(9): 1375-1383.
[9]	LI Zhen, YAP K H. An efficient approach for scene categorization based on discriminative codebook learning in bag-of-words framework[J]. Image Vision Comput, 2013, 31(10): 748-755. doi: 10.1016/j.imavis.2013.07.001
[10]	MUMTAZ A, COVIELLO E, LANCKRIET G R. A scalable and accurate descriptor for dynamic textures using bag of system trees[J]. IEEE Trans Pattern Anal Mach Intell, 2015, 37(4): 697-712. doi: 10.1109/TPAMI.2014.2359432
[11]	生海迪. 视觉词袋模型的改进及其在图像分类中的应用研究[D]. 济南: 山东师范大学, 2015.	SHENG Haidi. The Improvement of Bag-of-Visual-Words Model and Its Application Research in Images Classification [D]. Ji'nan: Shandong Normal University, 2015.
[12]	王涛. 基于词袋模型的人脸表情识别研究[D]. 武汉: 华中科技大学, 2013.	WANG Tao. Research on Bag of Words Model-Based Facial Expression Recognition [D]. Wuhan: Huazhong University of Science and Technology, 2013.
[13]	LOWE D G. Object recognition from local scale-invariant features[J]. IEEE Int Conf Comput Vision, 1999, 2(): 1150-1157.
[14]	LOWE D G. Distinctive image features from scale-invariant keypoints[J]. Int J Computr Vision, 2004, 60(2): 91-110. doi: 10.1023/B:VISI.0000029664.99615.94
[15]	MATHUR A, FOODY G M. Multiclass and binary SVM classification: implications for training and classification users[J]. IEEE Trans Geosci Remote Sens Letter, 2008, 5(2): 241-245. doi: 10.1109/LGRS.2008.915597
[16]	KALYANI S, SWARUP K S. Classification and assessment of power system security using multiclass SVM[J]. IEEE Trans Syst Man Cybern Part C Appl Rev, 2011, 41(5): 753-758. doi: 10.1109/TSMCC.2010.2091630
[17]	GRAUMAN K, DARRELL T. The pyramid match kernel: discriminative classification with sets of image features [C]//Proceedings of the IEEE International Conference on Computer Vision. Beijing: IEEE Computer Society, 2005: 1458-1465.
[18]	CHANG C C, LIN C J. LIBSVM: a library for support vector machines[J]. ACM Trans Intell Syst Technol, 2007, 2(3): 389-396.

核函数	森林火灾/%	非法采伐/%	森林病虫害/%	平均识别率/%
多项式核函数	80.0	85.0	75.0	80.0
径向基核函数	85.0	85.0	80.0	83.3
多层感知器核函数	80.0	80.0	75.0	78.3
直方图交叉核函数	85.0	90.0	85.0	86.7

特征	分类识别时间/s	平均识别率/%
SIFT特征	33.652	48.5
Dense SIFT特征	21.312	52.3
Dense SIFT特征+BoW模型	60.143	86.7

Classification of forestry images based on the BoW Model

DOI: 10.11833/j.issn.2095-0756.2017.05.004

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Related

Proportional views