Scientia Agricultura Sinica, 2011, Vol. 44, Issue 23: 4833-4840. doi: 10.3864/j.issn.0578-1752.2011.23.009

• SOIL & FERTILIZER·WATER-SAVING IRRIGATION·AGROECOLOGY & ENVIRONMENT •

Applied Research of Combinatorial Algorithm of Clustering, Rough Set and Decision Tree Method in Productivity Evaluation

CHEN Gui-Fen, MA Li, DONG Wei, XIN Min-Gang

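  1. College of Information Technology, Jilin Agricultural University, Changchun 130118
  2. Nong'an County Agricultural Technology Extension Station, Jilin Province, Nong'an, Jilin 130200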
  • Received: 2010-07-26 Online: 2011-12-01 Published: 2011-06-14

Abstract: 【Objective】 Conventional fertility evaluation methods are somewhat subjective and give little consideration to the dependence relations among soil attributes. This paper seeks a new method of productivity evaluation based on data mining. 【Method】 Using cultivated land survey data from Nong'an, the paper applied an optimized K-means clustering algorithm, the Johnson rough set attribute reduction algorithm, and the C4.5 decision tree algorithm to evaluate the productivity grade. 【Result】 The best learning samples were obtained with K-means clustering. Rough set attribute reduction removed 7 redundant soil attributes. The resulting decision tree model has 317 nodes, 159 of which are leaf nodes, yields 159 rules, and reaches an accuracy of 82.08%. The number of decision tree nodes is 41.62% lower than with the approach that uses neither clustering nor reduction. 【Conclusion】 The combined algorithm preserves model accuracy while reducing the time and space complexity of the algorithm and improving mining efficiency.

Key words: clustering, rough set, decision tree, soil evaluation, productivity grade
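
The attribute reduction step named in the abstract can be illustrated with a minimal sketch of Johnson's greedy heuristic. This is not the authors' implementation: the function names, the four discretized soil attributes, and the five-sample toy table below are hypothetical and chosen only for illustration. The sketch builds, for every pair of samples with different productivity grades, the set of attributes that distinguishes them, then repeatedly keeps the attribute that appears in the most still-unresolved pairs.

```python
from collections import Counter
from itertools import combinations


def discernibility_sets(rows, decisions):
    """For each pair of samples with different grades, collect the
    condition attributes whose values differ between the two samples."""
    sets = []
    for i, j in combinations(range(len(rows)), 2):
        if decisions[i] != decisions[j]:
            diff = frozenset(a for a in rows[i] if rows[i][a] != rows[j][a])
            if diff:  # an empty set means the pair is inconsistent; skip it
                sets.append(diff)
    return sets


def johnson_reduct(rows, decisions):
    """Greedy Johnson heuristic: pick the attribute covering the most
    remaining discernibility sets until every set is covered."""
    remaining = discernibility_sets(rows, decisions)
    reduct = []
    while remaining:
        counts = Counter(a for s in remaining for a in s)
        # Ties are broken alphabetically so the result is deterministic.
        best = max(sorted(counts), key=lambda a: counts[a])
        reduct.append(best)
        remaining = [s for s in remaining if best not in s]
    return reduct


# Hypothetical, already-discretized toy data (not from the paper).
rows = [
    {"organic_matter": "high", "ph": "neutral", "slope": "flat", "texture": "loam"},
    {"organic_matter": "high", "ph": "acid", "slope": "flat", "texture": "loam"},
    {"organic_matter": "low", "ph": "neutral", "slope": "flat", "texture": "loam"},
    {"organic_matter": "low", "ph": "acid", "slope": "steep", "texture": "loam"},
    {"organic_matter": "high", "ph": "neutral", "slope": "steep", "texture": "loam"},
]
grades = [1, 1, 2, 3, 2]

reduct = johnson_reduct(rows, grades)
print(reduct)
```

On this toy table the heuristic keeps `organic_matter` and `slope` and drops `ph` and `texture` as redundant, since the two kept attributes already separate every pair of samples with different grades. Johnson's heuristic returns a small reduct quickly but does not guarantee the minimum one; in the workflow the abstract describes, a step of this kind runs on the K-means-selected samples and the reduced attribute set is then passed to the C4.5 tree learner.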
