【Paper Notes: Detection】(2017 ICCV) Deformable Convolutional Networks

こ雲淡風輕ζ, submitted 2020-03-25 10:25:21

Abstract

I've written many paper reviews in Chinese before, so for a change of pace I will gradually switch to English for later paper reviews.

A few days ago, we reviewed STN. Because of its pooling layers, a CNN has some spatial invariance (for example, to translation and rotation). The larger the pooling and convolution kernels are, the stronger this invariance becomes. At the same time, though, the larger the kernels are, the more local information the CNN loses.
As a result, the downsampling ratio needs to be adjusted for each dataset.

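If there is too little downsampling, spatial invariance is too weak and the model generalizes poorly; if there is too much, too much local information is lost and the model's results suffer badly.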
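The trade-off above can be illustrated with a toy 1-D example. This is only a sketch under stated assumptions: `max_pool_1d` and `impulse` are hypothetical helpers written for this note, the pooling is non-overlapping (stride equal to the window size), and the "feature" is a single non-zero pixel.

```python
import numpy as np

def max_pool_1d(x, k):
    """Non-overlapping max pooling with window size and stride k (illustrative helper)."""
    x = np.asarray(x, dtype=float)
    n = (len(x) // k) * k              # drop any ragged tail
    return x[:n].reshape(-1, k).max(axis=1)

def impulse(length, pos):
    """A signal that is zero everywhere except for a 1 at index pos."""
    x = np.zeros(length)
    x[pos] = 1.0
    return x

a = impulse(8, 4)   # feature at index 4
b = impulse(8, 5)   # same feature shifted by one pixel
c = impulse(8, 7)   # same feature shifted by three pixels

# A window of 2 absorbs the one-pixel shift but not the three-pixel shift.
same_k2_small = np.array_equal(max_pool_1d(a, 2), max_pool_1d(b, 2))
same_k2_large = np.array_equal(max_pool_1d(a, 2), max_pool_1d(c, 2))
# A window of 4 absorbs the three-pixel shift too, but the output can no
# longer tell where inside the window the feature was.
same_k4_large = np.array_equal(max_pool_1d(a, 4), max_pool_1d(c, 4))
```

Note that the invariance here depends on the shift staying inside one pooling window; a one-pixel shift that crosses a window boundary (for example from index 3 to index 4) would still change the output.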

However, the spatial invariance that pooling layers provide is not enough for natural scenes. Image deformations include rotation, distortion, scaling, aliasing, etc. STN proposed a spatial transformer, a module that learns how to adjust the spatial feature map.

A conventional (regular) convolution operates on a pre-defined rectangular grid, usually of size 3×3 or 5×5. However, the objects we need to classify or detect can be deformed or occluded within the image.
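In the paper's notation, this fixed grid can be written as a set of relative positions $\mathcal{R}$ around each output location $\mathbf{p}_0$. For a 3×3 kernel with dilation 1, a regular convolution then reads as follows, where $x$ is the input feature map, $y$ the output feature map, and $w$ the kernel weights:

```latex
\mathcal{R} = \{(-1,-1),\ (-1,0),\ \ldots,\ (0,1),\ (1,1)\}, \qquad
y(\mathbf{p}_0) = \sum_{\mathbf{p}_n \in \mathcal{R}} w(\mathbf{p}_n)\, x(\mathbf{p}_0 + \mathbf{p}_n)
```

Every output location samples the input at the same nine relative positions, so the sampling pattern cannot adapt to the shape of the object.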

In DCN, the grid is deformable: each grid point is shifted by a learnable offset. Building on this deformable convolution, the paper also proposes Deformable RoI Pooling. With these two new modules, DCN improves the accuracy of DeepLab, Faster R-CNN, R-FCN, FPN, etc.
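To make the idea concrete, here is a minimal NumPy sketch of a single-channel 3×3 deformable convolution (stride 1, zero padding 1). Each grid point is sampled at its regular position plus a fractional offset, and the value at that fractional position is obtained by bilinear interpolation, as in the paper. The function names `bilinear_sample` and `deform_conv2d_single` are hypothetical, and the offsets are simply passed in as an array; in the paper they are predicted by an additional convolutional layer over the same input feature map, which is not modelled here.

```python
import numpy as np

def bilinear_sample(x, py, px):
    """Bilinearly interpolate the 2-D array x at fractional location (py, px).
    Neighbours that fall outside the array contribute zero (zero padding)."""
    h, w = x.shape
    y0, x0 = int(np.floor(py)), int(np.floor(px))
    val = 0.0
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                # Separable weight max(0, 1 - |distance|) in each dimension.
                wgt = (1.0 - abs(py - yy)) * (1.0 - abs(px - xx))
                val += wgt * x[yy, xx]
    return val

def deform_conv2d_single(x, weight, offsets):
    """Illustrative single-channel 3x3 deformable convolution.

    x:       (H, W) input feature map
    weight:  (3, 3) kernel
    offsets: (H, W, 9, 2) array of (dy, dx) per output location and grid point
             (assumption: supplied by the caller rather than learned)
    Computes y(p0) = sum_n w(p_n) * x(p0 + p_n + delta_p_n).
    """
    h, w = x.shape
    grid = [(gy, gx) for gy in (-1, 0, 1) for gx in (-1, 0, 1)]
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            for n, (gy, gx) in enumerate(grid):
                dy, dx = offsets[i, j, n]
                out[i, j] += weight[gy + 1, gx + 1] * bilinear_sample(
                    x, i + gy + dy, j + gx + dx)
    return out
```

With all offsets set to zero, this reduces to an ordinary 3×3 convolution (in the cross-correlation convention used by deep learning frameworks), which is a convenient sanity check. The explicit loops are for readability only; a practical implementation would be vectorized or run on the GPU.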

Contents

1. Deformable Convolution

2. Deformable RoI Pooling

3. Deformable Position-Sensitive (PS) RoI Pooling

4. Deformable ConvNets Using ResNet-101 & Aligned-Inception-ResNet

5. Ablation Study & Results

6. More results on COCO Detection Challenge Using Aligned Xception

Conclusion

Reference
