网格环境下分布式空间离群挖掘体系的设计与应用
Service and Application of Grid Based Distributed Spatial Outliers Mining
查看参考文献18篇
文摘
|
空间离群是指空间数据集中那些非空间属性值与邻域中其他空间对象明显不同的空间对象。空间数据一般按地理分布存储具有海量特性,传统的集中式处理模式不能满足海量数据处理的效率和空间数据本身的安全性等要求。因此,在研究小组开发的地理知识服务网格平台GeoKS-Grid的基础上,本文针对分布式空间离群挖掘,提出了一个基于网格的分布式体系框架,制定了网格环境下分布式空间离群挖掘的策略,实现了具体的分布式空间离群挖掘算法。另遵循分布式空间数据挖掘的一般过程和网格服务通用、可重用和可组合的原则,将算法按合理粒度进行分解,并封装成多个基本的原子服务,进而以网格工作流的方式进行服务发现与组合,完成包括局部离群挖掘和全局离群挖掘在内的分布式空间离群挖掘。最后,通过福建省生态地球化学调查土壤数据离群分析实例,验证了服务或系统的合理性和有效性。 |
其他语种文摘
|
A spatial outlier is a spatial object whose non-spatial attribute values are significantly deviated from the other data’s in the dataset.The identification of spatial outliers can lead to the discovery of some unexpected knowledge,and it has a number of practical applications.There are massive spatial data maintained over geographically distributed sites in WAN.It’s necessary to analyse and process the data by using the high-performance distributed parallel processing system.Grid is one of the most effective approaches to meet this requirement.The geographical knowledge grid platform(GeoKS-Grid)established by our research group is the application of knowledge grid in geo-information science,which integrate technologies of grid computing,web service,WebGIS,data mining,information visualization,knowledge base of ontology and knowledge reasoning,online analytical processing,decision analysis,data warehouse and workflow,to form a geographical problem solving environment.In this paper,a grid based distributed framework and the corresponding strategy for distributed spatial data mining system are discussed,and a distributed algorithm for spatial outlier mining is designed and implemented.In general,the process of distributed spatial outlier mining can be seen to be a series of services including atomic services and composite services.Furthermore,according to the principle of web service reusage and compositionality,the distributed spatial outlier mining algorithm is decomposed into several grid atomic services.Distributed spatial outlier mining including local spatial outlier mining and global spatial outlier mining is realized by grid workflow approach to discovery and composition of knowledge atomic grid services provided by knowledge grid.Finally,demonstration application is carried out on the basis of soil geochemistry data inspected by the Ecological Geochemistry Survey of Fujian Coastal Economic Belt,the efficiency and the validity of the distributed spatial outlier mining service and system are verified and confirmed. |
来源
|
地球信息科学学报
,2011,13(3):383-390 【核心库】
|
关键词
|
空间离群
;
分布式挖掘
;
知识网格
;
原子服务
;
服务组合
|
地址
|
1.
福州大学福建省空间信息工程研究中心, 福州大学福建省空间信息工程研究中心;;空间数据挖掘与信息共享教育部重点实验室, 福州, 350002
2.
福建省经济信息中心, 福州, 350001
|
语种
|
中文 |
ISSN
|
1560-8999 |
学科
|
自动化技术、计算机技术 |
基金
|
福建省科技计划重点项目
;
欧盟第七框架计划项目
;
中-匈政府间科技合作项目
;
国家自然科学基金
|
文献收藏号
|
CSCD:4222612
|
参考文献 共
18
共1页
|
1.
Shekhar S. A Unified Approach to Detecting Spatial Outliers.
GeoInformatica,2003,7(2):139-166
|
CSCD被引
33
次
|
|
|
|
2.
Aflori C. Grid Implementation of the Apriori Algorithm.
Advances in Engineering Software,2007,38(5):295-300
|
CSCD被引
2
次
|
|
|
|
3.
Rawat S S. Performance of Distributed Apriori Algorithms on a Computational Grid.
Services Computing Conference.APSCC 2009.IEEE Asia-Pacific,2009:163-167
|
CSCD被引
1
次
|
|
|
|
4.
Meligy A. A Grid-based Distributed SVM Data Mining Algorithm.
European Journal of Scientific Research,2009,27(3):313-321
|
CSCD被引
3
次
|
|
|
|
5.
Yang C T. Decision Tree Construction for Data Mining on Grid Computing Environments.
19th International Conference on Advanced Information Networking and Applications,AINA 2005,2005:421-425
|
CSCD被引
1
次
|
|
|
|
6.
Pérez M S. Design and Implementation of a Data Mining Grid-aware Architecture.
Future Generation Computer Systems,2007,23(1):42-47
|
CSCD被引
1
次
|
|
|
|
7.
Khoussainov R. Grid-enabled Weka:A Toolkit for Machine Learning on the Grid.
ERCIM News,2004,59:47-48
|
CSCD被引
1
次
|
|
|
|
8.
Senger H. Inhambu:Data Mining Using Idle Cycles in Clusters of PCs.
Network and Parallel Computing,2004:213-220
|
CSCD被引
1
次
|
|
|
|
9.
Ali A S. Web Services Composition for Distributed Data Mining.
ICPPW '05 Proceedings of the 2005 International Conference on Parallel Processing Workshops,2005:11-18
|
CSCD被引
1
次
|
|
|
|
10.
Talia D. Weka4WS:AWSRF-enabled Weka Toolkit for Distributed Data Mining on Grids.
Proc.PKDD 2005,2005:309-320
|
CSCD被引
1
次
|
|
|
|
11.
Brezany P. Gridminer:An Infrastructure for Data Mining on Computational Grids.
APAC Conference and Exhibition on Advanced Computing,Grid Applications and eResearch,2003
|
CSCD被引
1
次
|
|
|
|
12.
Stankovski V. Grid-enabling Data Mining Applications with DataMiningGrid:An Architectural Perspective.
Future Gener.Comput.Syst.,2008,24(4):259-279
|
CSCD被引
2
次
|
|
|
|
13.
Wu X. The Design,Development and Application of Geographical Knowledge Service Grid Portal.
Proc.of 17th International Conference on Geoinformatics,2009
|
CSCD被引
1
次
|
|
|
|
14.
林甲祥.
考虑约束条件的分布式空间离群挖掘及其应用研究[博士学位论文],2010
|
CSCD被引
1
次
|
|
|
|
15.
薛安荣. 局部离群点挖掘算法研究.
计算机学报,2007,30(8):1455-1463
|
CSCD被引
51
次
|
|
|
|
16.
Chawla S. SLOM:A New Measure for Local Spatial Outliers.
Knowledge and Information Systems,2006,9(4):412-429
|
CSCD被引
29
次
|
|
|
|
17.
郑琦. 基于Delaunay三角网的空间离群挖掘.
微计算机应用,2008,29(6):76-82
|
CSCD被引
1
次
|
|
|
|
18.
刘丰富.
基于网格的地理空间知识服务技术与原型系统开发[硕士学位论文],2007
|
CSCD被引
1
次
|
|
|
|
|