帮助 关于我们

返回检索结果

基于核化双线性卷积网络的细粒度图像分类
Kernelized Bilinear CNN Models for Fine-Grained Visual Recognition

查看参考文献30篇

文摘 双线性卷积网络( Bilinear CNN,B-CNN)在计算机视觉任务中有着广泛的应用. B-CNN通过对卷积层输出的特征进行外积操作,能够建模不同通道之间的线性相关,从而增强了卷积网络的表达能力.由于没有考虑特征图中通道之间的非线性关系,该方法无法充分利用通道之间所蕴含的更丰富信息.为了解决这一不足,本文提出了一种核化的双线性卷积网络,通过使用核函数的方式有效地建模特征图中通道之间的非线性关系,进一步增强卷积网络的表达能力.本文在三个常用的细粒度数据库CUB-200-2011、FGVC-Aircraft以及Cars上对本文方法进行了验证,实验表明本文方法在三个数据库上均优于同类方法.
其他语种文摘 The bilinear convolutional neural network( B-CNN) has been widely used in computer vision. B-CNN can capture the linear correlation between different channels by performing the outer product operation on the features of the convolutional layer output, thus enhancing the representative ability of the convolutional network. Since the non-linear relationship between the channels in the feature map is not taken account of, this method cannot make full use of the richer information contained between the channels. In order to solve this problem, this paper proposes a kernelized bilinear convolutional neural network employing the kernel function to effectively capture the non-linear relationship between the channels in the feature map, and further enhancing the representative ability of the convolutional network. In this paper, the method is evaluated on three common fine-grained benchmarks CUB-200-2011,FGVC-Aircraft and Cars. Experiments show that our method is superior to its counterparts on all three benchmarks.
来源 电子学报 ,2019,47(10):2134-2141 【核心库】
DOI 10.3969/j.issn.0372-2112.2019.10.015
关键词 核化双线性聚合 ; 双线性卷积网络 ; 端到端学习 ; 细粒度图像分类
地址

大连理工大学信息与通信工程学院, 辽宁, 大连, 116024

语种 中文
文献类型 研究性论文
ISSN 0372-2112
学科 自动化技术、计算机技术
基金 国家自然科学基金
文献收藏号 CSCD:6668666

参考文献 共 30 共2页

1.  Krizhevsky A. Imagenet classification with deep convolutional neural networks. Advances in Neural Information Processing Systems,2012:1097-1105 CSCD被引 3074    
2.  Deng J. Imagenet: A largescale hierarchical image database. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition,2009:248-255 CSCD被引 86    
3.  柯圣财. 基于卷积神经网络和监督核哈希的图像检索方法. 电子学报,2017,45(1):157-163 CSCD被引 15    
4.  王泽宇. 基于空间结构化推理深度融合网络的RGB-D场景解析. 电子学报,2018,46(5):1253-1258 CSCD被引 3    
5.  李康. 基于卷积神经网络的鲁棒高精度目标跟踪算法. 电子学报,2018,46(9):2087-2093 CSCD被引 15    
6.  邹承明. 基于多特征组合的细粒度图像分类方法. 计算机应用,2018,38(7):1853-1856,1861 CSCD被引 2    
7.  Lin T Y. Bilinear CNN models for fine-grained visual recognition. Proceedings of IEEE International Conference on Computer Vision,2015:1449-1457 CSCD被引 7    
8.  Li P. Is second-order information helpful for large-scale visual recognition. Proceedings of IEEE International Conference on Computer Vision,2017:2070-2078 CSCD被引 1    
9.  Lin T Y. Improved bilinear pooling with CNNs. British Machine Vision Conference,2017:1-12 CSCD被引 1    
10.  Ioffe S. Batch normalization: Accelerating deep network training by reducing internal covariate shift. International Conference on Machine Learning,2015:448-456 CSCD被引 209    
11.  Maji S. Fine-Grained Visual Classification of Aircraft,2013 CSCD被引 18    
12.  Wah C. The Caltech-Ucsd Birds-200-2011 Dataset. Technical report, Caltech,2011 CSCD被引 2    
13.  Krause J. 3D object representations for fine-grained categorization. Proceedings of IEEE International Conference on Computer Vision Workshops,2013:554-561 CSCD被引 5    
14.  Gao Y. Compact bilinear pooling. Proceedings of IEEE Conference on Computer Vision and attern Recognition,2016:317-326 CSCD被引 1    
15.  Li Y. Factorized bilinear models for image recognition. Proceedings of IEEE International Conference on Computer Vision,2017:2098-2106 CSCD被引 1    
16.  Cui Y. Kernel pooling for convolutional neural networks. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition,2017:3049-3058 CSCD被引 1    
17.  Wang Q. G2DeNet: Global Gaussian distribution embedding network and its application to visual recognition. Proceedings of IEEE Conference on Computer Vision and Pattern Recognition,2017:2730-2739 CSCD被引 1    
18.  Ionescu C. Training deep networks with structured layers by matrix backpropagation,2015 CSCD被引 1    
19.  Lin H T. A Study on Sigmoid Kernels for SVM and the Training of Non-PSD Kernels by SMO-Type Methods. Technical Report,2003 CSCD被引 1    
20.  Vedaldi A. Matconvnet: convolutional neural networks for matlab. ACM International Conference on Multimedia,2015:689-692 CSCD被引 10    
引证文献 7

1 李祥霞 细粒度图像分类的深度学习方法 计算机科学与探索,2021,15(10):1830-1842
CSCD被引 5

2 杨倩文 基于改进双线性细粒度模型的压板状态识别 激光与光电子学进展,2021,58(20):2010007
CSCD被引 0 次

显示所有7篇文献

论文科学数据集
PlumX Metrics
相关文献

 作者相关
 关键词相关
 参考文献相关

版权所有 ©2008 中国科学院文献情报中心 制作维护:中国科学院文献情报中心
地址:北京中关村北四环西路33号 邮政编码:100190 联系电话:(010)82627496 E-mail:cscd@mail.las.ac.cn 京ICP备05002861号-4 | 京公网安备11010802043238号