[1]安计勇,闫子骥.基于半监督学习的蛋白质相互作用预测模型[J].计算机技术与发展,2021,31(07):7-12.[doi:10. 3969 / j. issn. 1673-629X. 2021. 07. 002]
 AN Ji-yong,YAN Zi-ji.Prediction Model of Protein-protein Interactions Based on Semi-supervised Learning[J].,2021,31(07):7-12.[doi:10. 3969 / j. issn. 1673-629X. 2021. 07. 002]
点击复制

基于半监督学习的蛋白质相互作用预测模型()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
31
期数:
2021年07期
页码:
7-12
栏目:
人工智能
出版日期:
2021-07-10

文章信息/Info

Title:
Prediction Model of Protein-protein Interactions Based on Semi-supervised Learning
文章编号:
1673-629X(2021)07-0007-06
作者:
安计勇12闫子骥12
1. 中国矿业大学 矿山数字化教育部工程研究中心,江苏 徐州 221000;
2. 中国矿业大学 计算机科学与技术学院,江苏 徐州 221000
Author(s):
AN Ji-yong12YAN Zi-ji12
1. Engineering Research Center of Mine Digitalization of Ministry of Education,China University of Mining and Technology,Xuzhou 221000,China;
2. School of Computer Science and Technology,China University of Mining and Technology,Xuzhou 221000,China
关键词:
相关向量机半监督学习自训练AP 聚类Renyi 熵分类预测
Keywords:
relevance vector machinesemi-supervised learningself-trainingAP clusteringRenyi entropyclassification prediction
分类号:
TP311. 13
DOI:
10. 3969 / j. issn. 1673-629X. 2021. 07. 002
摘要:
基于有监督学习的预测模型在预测过程中存在以下缺陷:一是过分依赖训练集中有标签样本的数量, 导致分类精度受有标签样本数量多少的制约;二是其预测分类一次完成, 导致大量的无标签样本无法用来修正分类器的预测精度,大量数据信息被浪费,从而影响分类性能。 针对以上问题,该文提出一种基于 AP 聚类与 Renyi 熵融合的自训练半监督相关向量机分类预测模型。 该模型通过 AP 聚类分析与 Renyi 熵来共同标记无标签样本的标签类别,筛选置信度高的无标签样本扩充原有训练集进行自训练迭代分类,降低噪声数据对分类器预测精度的影响,构造出了性能最优的基于半监督学习的蛋白质相互作用预测模型。 通过在 M. musculus、H. pylori 和 H. sapiens 蛋白质相互作用数据集上的实验验证,证明了提出的半监督分类预测模型的有效性。
Abstract:
In the prediction process,the disadvantage of the prediction model based on supervised learning are as follows:firstly,due to over-dependence on the number of labeled samples in the training set,the classification accuracy is limited by the number of labeled samples. Secondly,its prediction classification is completed once,resulting in a large number of unlabeled samples that cannot be used to amend the prediction accuracy of the classifier, and a large amount of data information is wasted, thus affecting the classification perform-ance. In view of the above problems,we propose a new self-training semi-supervised classification prediction model of RVM based on AP clustering and Renyi entropy fusion. This model can greatly reduce the influence of noise data on the prediction accuracy of classifier by using AP clustering and Renyi entropy fusion to assign labels for unlabeled samples. The semi-supervised classifier with optimal performance was constructed through adding the unlabeled samples with high degree of confidence to the training set and executing the self-training iteration classification with the expanded training set. It is demonstrated that the proposed prediction model is effective by experimenting validation on M. musculus,H. pylori and H. sapiens datasets.

相似文献/References:

[1]陈锦禾 沈洁.基于信息熵的主动学习半监督分类研究[J].计算机技术与发展,2010,(02):110.
 CHEN Jin-he,SHEN Jie.Active Learning Based on Information Entropy for Semi- supervised Classification[J].,2010,(07):110.
[2]陈东泳 钟尚平.基于半监督学习的JPEG图像通用隐写检测方法[J].计算机技术与发展,2009,(02):169.
 CHEN Dong-yong,ZHONG Shang-ping.A Universal Steganalysis Method for JPEG Images Based on Semi - supervised Learning[J].,2009,(07):169.
[3]徐庆伶 汪西莉.一种基于支持向量机的半监督分类方法[J].计算机技术与发展,2010,(10):115.
 XU Qing-ling,WANG Xi-li.A Novel Semi-Supervised Classification Method Based on SVM[J].,2010,(07):115.
[4]戴林 姜梅.基于半监督学习的入侵检测系统[J].计算机技术与发展,2011,(01):162.
 DAI Lin JIANG Mei.Semi-Supervised Learning-Based Network Intrusion Detection System[J].,2011,(07):162.
[5]李亚娥,汪西莉.一种自适应的半监督图像分类算法[J].计算机技术与发展,2013,(02):112.
 LI Ya-e,WANG Xi-li.An Adaptive Semi-supervised Image Classification Algorithm[J].,2013,(07):112.
[6]朱乔亚,陈可佳,方彪. 采用位置信息的半监督链接预测方法[J].计算机技术与发展,2015,25(07):63.
 ZHU Qiao-ya,CHEN Ke-jia,FANG Biao. A Semi-supervised Link Prediction Method Using Place Features[J].,2015,25(07):63.
[7]郑文静,李雷. 基于聚类核的半监督情感分类算法研究[J].计算机技术与发展,2016,26(12):87.
 ZHENG Wen-jing,LI Lei. Research on Semi-supervised Sentiment Classification Based on Cluster Kernel[J].,2016,26(07):87.
[8]张志武[],荆晓远[][],吴飞[]. 基于非负稀疏图的协同训练软件缺陷预测[J].计算机技术与发展,2017,27(07):38.
 ZHANG Zhi-wu[],JING Xiao-yuan[] [],WU Fei[]. Defect Prediction of Co-training Software with Non-negative Sparse Graph[J].,2017,27(07):38.
[9]奚晓钰,吴 飞,荆晓远.基于多视图半监督学习的图像识别[J].计算机技术与发展,2017,27(12):48.[doi:10.3969/ j. issn.1673-629X.2017.12.011]
 XI Xiao-yu,WU Fei,JING Xiao-yuan.Image Recognition Based on Semi-supervised Learning of Multi-view[J].,2017,27(07):48.[doi:10.3969/ j. issn.1673-629X.2017.12.011]
[10]陆子豪,荆晓远.基于改进 SMOTE 的半监督极限学习机缺陷预测[J].计算机技术与发展,2021,31(12):21.[doi:10. 3969 / j. issn. 1673-629X. 2021. 12. 004]
 LU Zi-hao,JING Xiao-yuan.Semi-supervised Extreme Learning Machine Based on Improved SMOTE for Software Defect Prediction[J].,2021,31(07):21.[doi:10. 3969 / j. issn. 1673-629X. 2021. 12. 004]

更新日期/Last Update: 2021-07-10