东北大学学报(自然科学版) ›› 2003, Vol. 24 ›› Issue (5): 449-452.DOI: -

• 论著 • 上一篇    下一篇

基于范数的多维数据模糊聚类方法

王丽娜;费如纯;董晓梅;于戈   

  1. 东北大学信息科学与工程学院;东北大学信息科学与工程学院;东北大学信息科学与工程学院;东北大学信息科学与工程学院 辽宁沈阳 110004
  • 收稿日期:2013-06-24 修回日期:2013-06-24 出版日期:2003-05-15 发布日期:2013-06-24
  • 通讯作者: Wang, L.-N.
  • 作者简介:-
  • 基金资助:
    国家自然科学基金资助项目(6017305);;

Norm-based fuzzy clustering method for multi-dimension data

Wang, Li-Na (1); Fei, Ru-Chun (1); Dong, Xiao-Mei (1); Yu, Ge (1)   

  1. (1) Sch. of Info. Sci. and Eng., Northeastern Univ., Shenyang 110004, China
  • Received:2013-06-24 Revised:2013-06-24 Online:2003-05-15 Published:2013-06-24
  • Contact: Wang, L.-N.
  • About author:-
  • Supported by:
    -

摘要: 根据多维数据间相似度的定义,阐述了数据相似度与向量范数之间的关系,提出了一种基于范数的多维数据模糊聚类方法·该方法把每一个多维数据看做一个多维向量,利用与向量有关的范数对其中的数据进行排序,得到一个近似聚类族解·同理,对每一个近似聚类使用另一个范数做进一步分解,求解多维数据模糊聚类的近似解·最后,对得到的每一个近似聚类使用传统方法求出准确聚类·使用该方法不需建立模糊相似关系即可进行多维数据的近似聚类,总共所需访问数据库的次数也较小,因此具有较好的性能,特别适合于针对大型数据库的聚类·

关键词: 模糊聚类, 相似度, 范数, 多维向量, 数据库

Abstract: The relationship between the similitude of data and the norm of a vector was discussed according to the definition of the similitude between multi-dimension data. A norm-based fuzzy clustering method for multi-dimension data was proposed. Each multi-dimension datum is regarded as a multi-dimension vector, and the data are sorted according to the vector-related norms. The approximate solution of clustering is presented. Each approximate cluster is further decomposed according to another norm by the same way. Then, the approximate solution of fuzzy clustering for multi-dimension data can be obtained. Finally, the exact clusters can be found from the approximate clusters using traditional methods. The fuzzy similarity relation does not need to be built when approximately clustering for multi-dimension data, so the total counts of accessing database is comparatively small. Hence, the method proposed is high efficient and fits to the clustering of large databases.

中图分类号: