[1]谢 奔,张索非,吴晓富.基于热重启机制的胶囊投影网络快速训练算法[J].计算机技术与发展,2020,30(12):21-26.[doi:10. 3969 / j. issn. 1673-629X. 2020. 12. 004]
 XIE Ben,ZHANG Suo-fei,WU Xiao-fu.Learning Capsule Projection Network by Stochastic Gradient Descent with Warm Restarts[J].,2020,30(12):21-26.[doi:10. 3969 / j. issn. 1673-629X. 2020. 12. 004]
点击复制

基于热重启机制的胶囊投影网络快速训练算法()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
30
期数:
2020年12期
页码:
21-26
栏目:
智能、算法、系统工程
出版日期:
2020-12-10

文章信息/Info

Title:
Learning Capsule Projection Network by Stochastic Gradient Descent with Warm Restarts
文章编号:
1673-629X(2020)12-0021-06
作者:
谢 奔1张索非2吴晓富1
1. 南京邮电大学 通信与信息工程学院,江苏 南京 210003; 2. 南京邮电大学 物联网学院,江苏 南京 210003
Author(s):
XIE Ben1ZHANG Suo-fei2WU Xiao-fu1
1. School of Telecommunications and Information Engineering,Nanjing University of Posts and Telecommunications,Nanjing 210003,China; 2. School of Internet of Things,Nanjing University of Posts and Telecommunications,Nanjing 210003,China
关键词:
卷积神经网络胶囊投影结构热重启机制快速训练算法深度学习
Keywords:
convolutional neural networkcapsule projection structurewarm restartsfast training algorithmdeep learning
分类号:
TP391
DOI:
10. 3969 / j. issn. 1673-629X. 2020. 12. 004
摘要:
胶囊投影网络是一种新型的深度神经网络结构,将传统的卷积神经网络与胶囊投影结构结合来学习潜在的视觉特征。尽管胶囊投影网络在多个分类数据集上展现出了先进的性能,但是训练该算法模型往往需要较高的学习成本,这对胶囊投影网络在实际问题中的应用带来一定的限制。 针对该问题,将基于热重启机制的随机梯度下降算法引入到胶囊投影网络的学习中,提出了一种基于热重启机制的胶囊投影网络快速训练算法,并在多个分类数据集上对该方法进行实验评估。 实验结果表明,与原始的胶囊投影网络相比,该方法不仅解决了训练成本高昂的问题,同时所学模型也具有比较好的泛化性能。
Abstract:
Capsule projection network (CapProNet) is a recently proposed deep neural network architecture,which provides potential features by combining conventional deep networks with capsule projection structure. Although CapProNet shows competitive performance on various benchmark datasets, the model requires much expensive budget for training,which brings certain limitations to the applica-tion of CapProNet in practical problems. To address this problem,we introduce stochastic gradient descent with warm restarts (SGDR) into the learning of CapProNet and propose a CapProNet model fast training algorithm based on the warm restarts. Different learning strategies of methods are compared and evaluated. The experiment demonstrates that the proposed method can deliver better generalization performance with equivalent or even less training epochs compared with the traditional training method.

相似文献/References:

[1]崔凤焦.表情识别算法研究进展与性能比较[J].计算机技术与发展,2018,28(02):145.[doi:10.3969/j.issn.1673-629X.2018.02.031]
 CUI Feng-jiao.Research and Performance Comparison of Facial Expression Recognition Algorithm[J].,2018,28(12):145.[doi:10.3969/j.issn.1673-629X.2018.02.031]
[2]张丹丹,李雷. 基于PCANet-RF的人脸检测系统[J].计算机技术与发展,2016,26(02):31.
 ZHANG Dan-dan,LI Lei. Face Detection System Based on PCANet-RF[J].,2016,26(12):31.
[3]陈强锐,谢世朋.基于深度学习的肺部肿瘤检测方法[J].计算机技术与发展,2018,28(04):201.[doi:10.3969/ j. issn.1673-629X.2018.04.043]
 CHEN Qiang-rui,XIE Shi-peng.Lung Cancer Detection Method Based on Deep Learning[J].,2018,28(12):201.[doi:10.3969/ j. issn.1673-629X.2018.04.043]
[4]郭子琰,舒心,刘常燕,等.基于ReLU 函数的卷积神经网络的花卉识别算法[J].计算机技术与发展,2018,28(05):154.[doi:10.3969/j.issn.1673-629X.2018.05.035]
 GUO Ziyan,SHU Xin,LIU Changyan,et al.A Recognition Algorithm of Flower Based on Convolution Neural Network with ReLU Function[J].,2018,28(12):154.[doi:10.3969/j.issn.1673-629X.2018.05.035]
[5]缪宇杰,吴智钧,宫 婧.基于3D 卷积的视频错帧筛选方法[J].计算机技术与发展,2018,28(05):179.[doi:10.3969/ j. issn.1673-629X.2018.05.040]
 MIAO Yu-jie,WU Zhi-jun,GONG Jing.A Wrong Temporal-order Frames Identification Method Based on 3D Convolution[J].,2018,28(12):179.[doi:10.3969/ j. issn.1673-629X.2018.05.040]
[6]吴玉枝,吴志红,熊运余.基于卷积神经网络的小样本车辆检测与识别[J].计算机技术与发展,2018,28(06):1.[doi:10.3969/ j. issn.1673-629X.2018.06.001]
 WU Yu-zhi,WU Zhi-hong,XIONG Yun-yu.Vehicle Detection and Recognition of a Few Samples Based on Convolutional Neural Network[J].,2018,28(12):1.[doi:10.3969/ j. issn.1673-629X.2018.06.001]
[7]李相桥,李晨,田丽华,等.卷积神经网络并行训练的优化研究[J].计算机技术与发展,2018,28(08):12.[doi:10.3969/ j. issn.1673-629X.2018.08.003]
 LI Xiang-qiao,LI Chen,TIAN Li-hua,et al.Research on Optimization of Parallel Training for Convolution Neural Network[J].,2018,28(12):12.[doi:10.3969/ j. issn.1673-629X.2018.08.003]
[8]邓宗平,赵启军,陈虎. 基于深度学习的人脸姿态分类方法[J].计算机技术与发展,2016,26(07):11.
 DEND Zong-ping,ZHAO Qi-jun,CHEN Hu. Face Pose Classification Method Based on Deep Learning[J].,2016,26(12):11.
[9]河海大学 计算机与信息学院,江苏 南京 0098.卷积网络的无监督特征提取对人脸识别的研究[J].计算机技术与发展,2018,28(06):17.[doi:10.3969/ j. issn.1673-629X.2018.06.004]
 DU Bai-sheng.Research on Unsupervised Feature Extraction Based on Convolutional Neural Network for Face Recognition[J].,2018,28(12):17.[doi:10.3969/ j. issn.1673-629X.2018.06.004]
[10]高翔,陈志,岳文静,等.基于视频场景深度学习的人物语义识别模型[J].计算机技术与发展,2018,28(06):53.[doi:10.3969/ j. issn.1673-629X.2018.06.012]
 GAO Xiang,CHEN Zhi,YUE Wen-jing,et al.Human Semantic Recognition Model Based on Video Scene Deep Learning[J].,2018,28(12):53.[doi:10.3969/ j. issn.1673-629X.2018.06.012]

更新日期/Last Update: 2020-12-10