中国农业科学 ›› 2014, Vol. 47 ›› Issue (16): 3231-3249.doi: 10.3864/j.issn.0578-1752.2014.16.010

• 土壤肥料·节水灌溉·农业生态环境 • 上一篇    下一篇

海量空间数据提取、整合与制图表达方法概要

 张维理   

  1. 中国农业科学院农业资源与农业区划研究所/农业部作物营养与施肥重点开放实验室,北京 100081
  • 收稿日期:2013-12-22 出版日期:2014-08-18 发布日期:2014-04-15
  • 作者简介:张维理,Tel:010-82108394;E-mail:zhangweili@caas.cn
  • 基金资助:

    科技部科技基础性工作专项(2006FY120200、2012FY112100)

A Summary of Methodology for Extracting, Integrating and Mapping of Massive Geo-Data

 ZHANG  Wei-Li   

  1. Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences/Key Laboratory of Crop Nutrition and Fertilization, Ministry of Agriculture, Beijing 100081
  • Received:2013-12-22 Online:2014-08-18 Published:2014-04-15

摘要: 海量空间数据提取、整合与制图表达方法是农业与环境科学、地学、制图学、信息科学及计算机科学多个学科方法融合形成的新方法,主要是通过数据模型设计和海量空间信息分析与制图表达流程设计,对农业与环境领域产生的海量、异源、异质、异构、异形信息进行有效的抽提、关联、分析与专题图制图表达。该方法可用于对不同地区、不同时段、不同调查获得的海量观测数据、地图数据、遥感影像数据进行抽提与分析,从而能适应现代农业与环境研究主题中,研究区域尺度变大、对区域内信息精度要求提高、对系统内多要素进行量化表征的需求。依据笔者多年科研实践,本文介绍了这一方法的边界与内涵、应用范围、相关概念、基本思想与主要内容,为农业与环境领域的科研及管理人员了解和在今后采用这新的一方法提供参考。本方法作为一种大数据分析方法,可广泛用于土壤资源数量与质量评价、气候变化、作物适生性分析、环境质量演变、农业面源污染源防治、水土流失防治、抗旱防涝减灾等多专业领域,也可用于对土壤肥力、环境质量等要素进行精准化、定量化和可视化表达,使农民和相关行业技术人员更易于采用现代技术和公益性科研成果,并为国家实施农业与环境奖惩政策提供科学依据。海量空间数据分析方法的核心是根据科学目标界定对海量空间信息的分类依据,并按照信息类型对异源空间信息进行赋码、抽提与表达。由于海量空间数据集数据结构的水平与垂直方向特征,在对海量空间信息进行分析时需要采用空间集四元表达式。利用四元表达式判定各异源数据逻辑结构与存储结构异同,以逻辑结构的归一化带动对实体库的抽提、整合与表达。在农业与环境科学研究范畴应用本方法时,易出现的问题是数据分析处理过程中科学目标的弱化以至迷失。因此不仅在进行海量空间信息分析之初需要准确界定科学及专业目标,分层次进行数据分析流程设计,在数据分析过程中还应当及时审视科学或专业目标的落实。农业与环境领域专业人员对数据分析科学目标理解最到位,应完成高层级分析流程设计,并按方法学规范编制流程设计文档。

关键词: 农业 , 环境 , 海量空间数据 , 数字制图 , 大数据方法

Abstract: The methodology for extracting, integrating and mapping of massive geo-data is a new method combined by different disciplinary approaches in agriculture and environmental science, geo science, cartography, information and computer science. The methodology consists mainly in the data model design and the working flow design for processing, analyzing and mapping massive geo-info. By using the methodology, big data in heterogeneous format and structure originated from different resources and regions of agricultural and environmental research and working programs can be effectively extracted, analyzed and correlated for mapping and thematic expression. The method can be used to analyze point observation data, map data, remote sensing data in different spaces and times. So that it supplies a useful tool for providing information with higher accuracy, larger covering area and more system concerning elements, through which major factors and mechanism in agricultural and environmental system can be much more exactly qualified and quantified. Based on many years of research and practical experience, the author of this paper introduced the connotation, the application range, the basic and the concerning conceptions and the main contents of the methodology, with the purpose to provide a brief understanding for researchers and managers in agriculture and the environment sectors to apply the new method in their working fields. As a big data approach, the method can be widely used for evaluation of the quantity and quality of soil resources, climate change, environmental quality, evaluation of crop varieties suitable distribution area, agricultural non-point source pollution control,erosion control, drought and flood disaster mitigation, and other working and research areas. It can also be applied for precise and visualized expressing of soil fertility and environmental quality, so that farmers or other users from different sectors can access to research results and progress much easier and get benefits from it. It provides also a useful approach to establish scientific basis for developing and implementing incentive policy in agriculture and environment sectors. The core of methodology is to define the rules according to scientific target for classifying massive geo-data. Based on the rules defined, grouping, coding, extracting and mapping of massive geo-data can be then carried out. Because of horizontal and vertical features of data structure of a massive geo-dataset, the four component expression of massive geo-dataset should be applied. Through which both logical and physical structure differences of massive geo information originated from different sources can be distinguished and displayed clearly. After normalization of data logical structure, the extracting, integrating and mapping of massive geo-datasets are then followed. In agriculture and environmental research works, however, frequent difficulty of big data analysis approach is the weakening or even lost of the scientific target during data treatment. Therefore, scientific or specific target should be defined as precise as possible at the beginning of the data analysis working program. A five-level designing process should be applied for drafting working flow of data extracting, integrating and mapping. Checking and examining the realization of defined target should be done time to time during the data processing. With deep understanding of the target, researchers and professionals from agriculture and environment sectors should be responsible for designing data processing flow of high-levels and drafting the corresponding design documents according to specifications of the methodology.

Key words: agriculture , environment , massive geo-data , digital mapping , big data methodology