
数据空间自相关性对关联规则的挖掘与实验分析
Application and Effects of Data Spatial Autocorrelation on Association Rule Mining


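Abstract Conventional spatial association rule mining generally applies attribute association rule mining algorithms to generalized spatial data. It takes into account neither the spatial autocorrelation of the data nor the relationship between spatial autocorrelation and spatial association rules. This paper mines spatial association rules from a data set with an improved Apriori algorithm, runs a spatial autocorrelation analysis on the same data, compares the attribute correlations that the two methods reveal, and discusses how the spatial autocorrelation of the data affects spatial association rule mining. The experimental data are the 2000 hay fever incidence data set for the United Kingdom together with temperature and rainfall data for the same period. Processing the same data set with both methods, Apriori and spatial autocorrelation analysis, shows that the two yield consistent one-item and two-item association rules. This demonstrates that accurate association rules can also be obtained by studying the spatial autocorrelation of the data, and that spatial autocorrelation influences association rule mining. Further work will address how to quantify the influence of univariate spatial autocorrelation on spatial association rules, and how to use bivariate spatial autocorrelation results as candidate frequent itemsets for spatial association rule mining in order to improve mining efficiency.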
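As a rough illustration of the Apriori side of the comparison, the sketch below mines frequent itemsets and rules from a toy table. It is plain textbook Apriori, not the improved variant the paper uses, and the county records, item labels such as `hayfever=high`, and the thresholds are all invented for the example; none of them come from the paper's data.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every frequent itemset (textbook Apriori)."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item of the itemset.
        return sum(itemset <= t for t in transactions) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {}
    for item in items:
        s = support(frozenset([item]))
        if s >= min_support:
            current[frozenset([item])] = s
    frequent = dict(current)

    k = 2
    while current:
        # Join step: merge frequent (k-1)-itemsets into candidate k-itemsets.
        candidates = {a | b for a, b in combinations(list(current), 2)
                      if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {c for c in candidates
                      if all(frozenset(s) in current
                             for s in combinations(c, k - 1))}
        current = {}
        for c in candidates:
            s = support(c)
            if s >= min_support:
                current[c] = s
        frequent.update(current)
        k += 1
    return frequent


def association_rules(frequent, min_confidence):
    """Return (antecedent, consequent, support, confidence) tuples."""
    found = []
    for itemset, s in frequent.items():
        for r in range(1, len(itemset)):
            for antecedent in map(frozenset, combinations(itemset, r)):
                confidence = s / frequent[antecedent]
                if confidence >= min_confidence:
                    found.append((antecedent, itemset - antecedent, s, confidence))
    return found


# Hypothetical discretized county records (one set of items per county).
counties = [
    {"hayfever=high", "temp=high", "rain=low"},
    {"hayfever=high", "temp=high", "rain=low"},
    {"hayfever=high", "temp=high", "rain=high"},
    {"hayfever=low", "temp=low", "rain=high"},
    {"hayfever=low", "temp=low", "rain=high"},
    {"hayfever=low", "temp=high", "rain=low"},
]

frequent = apriori(counties, min_support=0.5)
found_rules = association_rules(frequent, min_confidence=0.7)
for antecedent, consequent, s, c in found_rules:
    print(sorted(antecedent), "->", sorted(consequent), round(s, 2), round(c, 2))
```

The one-item and two-item itemsets in `frequent` correspond to what the abstract calls one-item and two-item association rules; continuous attributes such as temperature would first have to be discretized into labels like the ones assumed here.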
Abstract (other language) Spatial autocorrelation is a very general statistical property of spatial variables: it describes the correlation of a variable with itself across space. Spatial association rule mining, the discovery of interesting and meaningful rules in spatial databases, currently either ignores the autocorrelation of spatial data or simply generalizes spatial data into attribute data. Most approaches to spatial association rule mining convert spatial relations into non-spatial relations by means of spatial analysis, which separates spatial autocorrelation from spatial association rule mining. To study the relationship between the two, this paper mines spatial association rules with an improved Apriori algorithm and then performs spatial autocorrelation analysis on the same spatial data set. Many spatial association rule mining methods assume that no a priori information about the spatial attributes is available; here, the bivariate spatial autocorrelation results are used as prior knowledge for spatial association rule mining. The experimental data cover the number of hay fever (allergic rhinitis caused by pollen) patients and related factors, including temperature, precipitation and vegetation type, for each county in the United Kingdom in 2000. The frequent itemsets and spatial association rules obtained show that factors more strongly correlated with hay fever (larger correlation coefficient) co-occur with hay fever more frequently in the spatial database, which confirms that spatial autocorrelation affects spatial association rule mining. The analysis not only reveals the relationship between spatial autocorrelation and spatial association rule mining but also supplies prior knowledge to the mining process, making it more targeted. In addition, spatial autocorrelation analysis obtains the correlation coefficients efficiently without computing the Cartesian product required by the improved Apriori algorithm, which makes the mining process more efficient. Further work will focus on how to evaluate the effects of spatial autocorrelation on spatial association rule mining and how to derive candidate frequent spatial itemsets from the results of spatial autocorrelation analysis in practical applications.
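The abstract does not name the autocorrelation statistic, so the sketch below assumes Moran's I, a common choice, in its univariate form and in a bivariate form that correlates one variable with the spatially neighbouring values of another. The four-region chain, the binary adjacency weights, and the attribute values are made up for illustration and are not the paper's data.

```python
def morans_i(x, w):
    """Univariate Moran's I for values x and a spatial weight matrix w."""
    n = len(x)
    mean = sum(x) / n
    z = [v - mean for v in x]
    s0 = sum(sum(row) for row in w)  # sum of all weights
    cross = sum(w[i][j] * z[i] * z[j] for i in range(n) for j in range(n))
    return (n / s0) * cross / sum(v * v for v in z)


def bivariate_morans_i(x, y, w):
    """Bivariate Moran's I: x at each region against y at its neighbours."""
    n = len(x)

    def standardize(values):
        mean = sum(values) / n
        sd = (sum((v - mean) ** 2 for v in values) / n) ** 0.5
        return [(v - mean) / sd for v in values]

    zx, zy = standardize(x), standardize(y)
    s0 = sum(sum(row) for row in w)
    return sum(w[i][j] * zx[i] * zy[j] for i in range(n) for j in range(n)) / s0


# Hypothetical chain of four regions; 1 marks a shared border.
weights = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
hay_fever = [1.0, 2.0, 3.0, 4.0]     # smooth gradient: similar neighbours
temperature = [2.0, 3.0, 5.0, 6.0]   # rises along the same gradient
checkerboard = [1.0, 4.0, 1.0, 4.0]  # alternating: dissimilar neighbours

print(morans_i(hay_fever, weights))     # positive autocorrelation
print(morans_i(checkerboard, weights))  # negative autocorrelation
print(bivariate_morans_i(hay_fever, temperature, weights))
```

Each statistic needs one pass over the weight matrix per variable or variable pair, whereas Apriori generates and counts candidate itemsets level by level; how the paper turns the coefficients into candidate itemsets is not described in the abstract.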
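Source 地球信息科学学报 (Journal of Geo-information Science), 2011, 13(1):109-117 [Core collection]
Keywords spatial autocorrelation; association rule mining; spatial data mining; Apriori
Address

School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, 430079

Language Chinese
Document type Research article
ISSN 1560-8999
Subject Physical geography
Funding Young Scientists Fund of the National Natural Science Foundation of China; Ministry of Education research fund for scholars who studied abroad
Accession number CSCD:4137043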

