期刊
  出版年
  关键词
结果中检索 Open Search
Please wait a minute...
选择: 显示/隐藏图片
1. Structured AJAX Data Extraction Based on Agricultural Ontology
LI Chuan-xi, SU Ya-ru, WANG Ru-jing, WEI Yuan-yuan, HUANG He
Journal of Integrative Agriculture    2012, 11 (5): 784-791.   DOI: 10.1016/S1671-2927(00)8600
摘要1265)      PDF    收藏
More web pages are widely applying AJAX (Asynchronous JavaScript XML) due to the rich interactivity and incremental communication. By observing, it is found that the AJAX contents, which could not be seen by traditional crawler, are well-structured and belong to one specific domain generally. Extracting the structured data from AJAX contents and annotating its semantic are very significant for further applications. In this paper, a structured AJAX data extraction method for agricultural domain based on agricultural ontology was proposed. Firstly, Crawljax, an open AJAX crawling tool, was overridden to explore and retrieve the AJAX contents; secondly, the retrieved contents were partitioned into items and then classified by combining with agricultural ontology. HTML tags and punctuations were used to segment the retrieved contents into entity items. Finally, the entity items were clustered and the semantic annotation was assigned to clustering results according to agricultural ontology. By experimental evaluation, the proposed approach was proved effectively in resource exploring, entity extraction, and semantic annotation.
参考文献 | 相关文章 | 多维度评价
2. Agricultural Ontology Based Feature Optimization for Agricultural Text Clustering
SU Ya-ru, WANG Ru-jing, CHEN Peng, WEI Yuan-yuan, LI Chuan-xi
Journal of Integrative Agriculture    2012, 11 (5): 752-759.   DOI: 10.1016/S1671-2927(00)8596
摘要1422)      PDF    收藏
Feature optimization is important to agricultural text mining. Usually, the vector space model is used to represent text documents. However, this basic approach still suffers from two drawbacks: the curse of dimension and the lack of semantic information. In this paper, a novel ontology-based feature optimization method for agricultural text was proposed. First, terms of vector space model were mapped into concepts of agricultural ontology, which concept frequency weights are computed statistically by term frequency weights; second, weights of concept similarity were assigned to the concept features according to the structure of the agricultural ontology. By combining feature frequency weights and feature similarity weights based on the agricultural ontology, the dimensionality of feature space can be reduced drastically. Moreover, the semantic information can be incorporated into this method. The results showed that this method yields a significant improvement on agricultural text clustering by the feature optimization.
参考文献 | 相关文章 | 多维度评价