半导体光电, 2020, 41 (3): 414, 网络出版: 2020-06-18  

基于时空注意力网络的中国手语识别

Chinese Sign Language Recognition Based on Spatial-Temporal Attention Network
罗元 1,*李丹 1张毅 2
作者单位
1 重庆邮电大学光电工程学院, 重庆 400065
2 重庆邮电大学信息无障碍与服务机器人工程技术研究中心, 重庆 400065
引用该论文

罗元, 李丹, 张毅. 基于时空注意力网络的中国手语识别[J]. 半导体光电, 2020, 41(3): 414.

LUO Yuan, LI Dan, ZHANG Yi. Chinese Sign Language Recognition Based on Spatial-Temporal Attention Network[J]. Semiconductor Optoelectronics, 2020, 41(3): 414.

参考文献

[1] Li K, Zhou Z, Lee C H. Sign transition modeling and a scalable solution to continuous sign language recognition for real-world applications[J]. ACM Trans. on Accessible Computing, 2016, 8(2): 1-23.

[2] Lin Y, Chai X, Zhou Y, et al. Curve matching from the view of manifold for sign language recognition[J]. Lecture Notes in Computer Science, 2014, 9010: 233-246.

[3] Khotimah W N, Suciati N, Benedict I. Indonesian sign language recognition by using the static and dynamic features[C]// 2018 Inter. Seminar on Intelligent Technol. and Its Applications (ISITIA), 2018: 293-298.

[4] Huang J, Zhou W, Li H, et al. Sign language recognition using 3D convolutional neural networks[C]// 2015 IEEE Inter. Conf. on Multimedia and Expo, 2015: 1-6.

[5] Zhu G, Zhang L, Shen P, et al. Multimodal gesture recognition using 3-D convolution and convolutional LSTM[J]. IEEE Access, 2017, 5: 4517-4524.

[6] Zhang L, Zhu G, Shen P, et al. Learning spatiotemporal features using 3DCNN and convolutional LSTM for gesture recognition[C]// 2017 IEEE Inter. Conf. on Computer Vision Workshop, 2017: 3120-3128.

[7] Tran D, Ray J, Shou Z, et al. ConvNet architecture search for spatiotemporal feature learning[EB/OL]. [2019-10-15]. https://arxiv.org/abs/1708.05038.

[8] Li Z, Gavrilyuk K, Gavves E, et al. VideoLSTM convolves, attends and flows for action recognition[J]. Computer Vision and Image Understanding, 2018, 166: 41-50.

[9] Sharma S, Kiros R, Salakhutdinov R. Action recognition using visual attention[EB/OL]. [2019-10-15]. https://arxiv.org/abs/1511.04119.

[10] Hu Y, Wong Y, Wei W, et al. A novel attention-based hybrid CNN-RNN architecture for sEMG-based gesture recognition[J]. PLoS ONE, 2018, 13(10): e0206049.

[11] Li D, Yao T, Duan L, et al. Unified spatio-temporal attention networks for action recognition in videos[J]. IEEE Trans. on Multimedia, 2019, 21(2): 416-428.

[12] Wang L, Xu Y, Cheng J, et al. Human action recognition by learning spatio-temporal features with deep neural networks[J]. IEEE Access, 2018, 6: 17913-17922.

[13] Molchanov P, Gupta S, Kim K, et al. Hand gesture recognition with 3D convolutional neural networks[C]// 2015 IEEE Conf. on Computer Vision and Pattern Recognition Workshops, 2015: 1-7.

[14] Peng M, Wang C, Chen T. Attention based residual network for micro-gesture recognition[C]// 13th IEEE Inter. Conf. on Automatic Face & Gesture Recognition, 2018: 790-794.

[15] Zhengyuan Y, Yuncheng L, Jianchao Y, et al. Action recognition with spatio-temporal visual attention on skeleton image sequences[J]. IEEE Trans. on Circuits and Systems for Video Technol., 2019, 29(8): 2405-2415.

[16] Zhang H, Goodfellow I, Metaxas D, et al. Self-attention generative adversarial networks[C]// Proc. of the 36th Inter. Conf. on Machine Learning, 2019, 97: 7354-7363.

[17] Huang J, Zhou W, Li H, et al. Attention based 3D-CNNs for large-vocabulary sign language recognition[J]. IEEE Trans. on Circuits and Systems for Video Technol., 2019, 29(9): 2822-2832.

[18] Pu J, Zhou W, Zhang J, et al. Sign language recognition based on trajectory modeling with HMMs[C]// The Inter. Conf. on Multimedia Modeling, 2016, 9516: 686-697.

[19] Zhang J, Zhou W, Xie C, et al. Chinese sign language recognition with adaptive HMM[C]// 2016 IEEE Inter. Conf. on Multimedia and Expo, 2016: 1-6.

[20] Liu T, Zhou W, Li H. Sign language recognition with long short-term memory[C]// 2016 IEEE Inter. Conf. on Image Proc., 2016: 2871-2875.

[21] Miao Q, Li Y, Ouyang W, et al. Multimodal gesture recognition based on the ResC3D network[C]// 2017 IEEE Inter. Conf. on Computer Vision Workshop, 2017: 3047-3055.

[22] Zhang Liang, Zhu Guangming, Lin Mei, et al. Attention in convolutional LSTM for gesture recognition[C]// Neural Information Processing Systems (NIPS), 2018: 1953-1962.

罗元, 李丹, 张毅. 基于时空注意力网络的中国手语识别[J]. 半导体光电, 2020, 41(3): 414. LUO Yuan, LI Dan, ZHANG Yi. Chinese Sign Language Recognition Based on Spatial-Temporal Attention Network[J]. Semiconductor Optoelectronics, 2020, 41(3): 414.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!