[1]闫强强,张 敏,荀亚玲.采用有效邻近点和适应密度的密度聚类算法[J].计算机技术与发展,2022,32(09):14-22.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 003]
 YAN Qiang-qiang,ZHANG Min,XUN Ya-ling.A Density Clustering Algorithm Based on Effective Neighboring Points and Adaptive Density Distribution[J].,2022,32(09):14-22.[doi:10. 3969 / j. issn. 1673-629X. 2022. 09. 003]
点击复制

采用有效邻近点和适应密度的密度聚类算法()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
32
期数:
2022年09期
页码:
14-22
栏目:
大数据与云计算
出版日期:
2022-09-10

文章信息/Info

Title:
A Density Clustering Algorithm Based on Effective Neighboring Points and Adaptive Density Distribution
文章编号:
1673-629X(2022)09-0014-09
作者:
闫强强张 敏荀亚玲
太原科技大学 计算机科学与技术学院,山西 太原 030024
Author(s):
YAN Qiang-qiangZHANG MinXUN Ya-ling
School of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan 030024,China
关键词:
密度聚类伸缩半径有效邻近点适应密度分布相对距离
Keywords:
density clustering algorithmtelescopic radiuseffective neighboring pointsadaptive density distributionrelative distance
分类号:
TP311
DOI:
10. 3969 / j. issn. 1673-629X. 2022. 09. 003
摘要:
密度聚类作为一类重要的聚类分析方法,具有无需预先指定类簇数,可识别任意形状聚类族等优点,但在计算密度的过程中, K 近邻或邻域半径的选取对聚类效果具有较大的影响,且当数据集中存在类簇间距相差较大的情况时,密度聚类无法自适应类簇中数据对象密度变换,导致聚类效果与实际存在较大误差。 针对现有密度聚类分析存在的不足,利用有效邻近点和适应密度分布,提出了一种密度聚类分析算法。 该算法首先通过相对距离确定伸缩半径,定义了数据对象的有效邻近点,并有效地克服了近邻值 K 选取对聚类效果的影响;其次,计算核心点和边界点阈值,依据有效邻近点,并确定类簇中的核心区域数据对象,有效地改善了聚类分析效率;然后,调整簇内有效距离,改善了类簇密度分布不均匀、类簇间距离过大等问题;最后,在人工和 UCI 数据集上验证了该算法的有效性。
Abstract:
As an important cluster analysis method,density clustering has the advantages of unspecified number of cluster in advance andclustering with arbitrary shapes can be discovered. However,in the process of calculating the density,there is an important influence onthe clustering due to the selection of K-nearest neighboring or Eps. When cluster spacing vary a lot in the datasets,the density clusteringis unable to adapt to the data object density transformation in the clusters,which leads to a large deviation between the clustering and thereality datasets. In order to overcome shortcomings of existing density cluster analysis,a density clustering algorithm is proposed by usingeffective neighboring points and adaptive density distribution. Firstly, the telescopic radius is determined by the relative distance, the effective neighboring points of the data object is defined, and the influence of the selection of the nearest neighbor value K on the clustering effect is overcame. Secondly,core point and boundary point threshold are calculated using the relative distance,so that core area objects in the cluster are determined according to the effective neighboring points,which effectively improves the efficiency of cluster analysis. Thirdly,uneven density distribution and large distance between clusters are improved by adjusting the effective distance within the cluster. In the end,the effectiveness of the proposed algorithm is validated on artificial and UCI datasets.

相似文献/References:

[1]聂跃光 陈立潮 陈湖.基于密度的空间聚类算法研究[J].计算机技术与发展,2008,(08):91.
 NIE Yue-guang,CHEN Li-chao,CHEN Hu.Research of Spatial Clustering Algorithms Based on Density[J].,2008,(09):91.
[2]张付霞,蒋朝惠. 基于DSNPP算法的社交网络隐私保护方法[J].计算机技术与发展,2015,25(08):152.
 ZHANG Fu-xia,JIANG Chao-hui. Privacy-preserving Approach in Social Networks Based on DSNPP Algorithm[J].,2015,25(09):152.
[3]赵晓梅,刘兆邦,张正平,等.基于超像素和密度聚类算法的皮肤镜图像分割[J].计算机技术与发展,2020,30(06):167.[doi:10. 3969 / j. issn. 1673-629X. 2020. 06. 032]
 ZHAO Xiao-mei,LIU Zhao-bang,ZHANG Zheng-ping,et al.Dermoscopy Image Segmentation Based on Superpixel and Density Clustering Algorithm[J].,2020,30(09):167.[doi:10. 3969 / j. issn. 1673-629X. 2020. 06. 032]
[4]王 诚,高兴东.基于最小生成树的密度聚类算法研究[J].计算机技术与发展,2022,32(02):45.[doi:10. 3969 / j. issn. 1673-629X. 2022. 02. 007]
 WANG Cheng,GAO Xing-dong.Research on Density Clustering Algorithm Based on MST[J].,2022,32(09):45.[doi:10. 3969 / j. issn. 1673-629X. 2022. 02. 007]

更新日期/Last Update: 2022-09-10