Identity Mappings in Deep Residual Networks.pdf


立即下载 NetworkAttachedStorage
2024-12-31
ity residual Neton Res ident Deep 机器 200-layer CIFAR-100 4.62%
1.1 MB

Identity Mappings in Deep Residual Networks
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun
Microsoft Research
Abstract Deep residual networks [1] have emerged as a family of ex-
tremely deep architectures showing compelling accuracy and nice con-
vergence behaviors. In this paper, we analyze the propagation formu-
lations behind the residual building blocks, which suggest that the for-
ward and backward signals can be directly propagated from one block
to any other block, when using identity mappings as the skip connec-
tions and after-addition activation. A series of ablation experiments sup-
port the importance of these identity mappings. This motivates us to
propose a new residual unit, which makes training easier and improves
generalization. We report improved results using a 1001-layer ResNet
on CIFAR-10 (4.62% error) and CIFAR-100, and a 200-layer ResNet
on ImageNet. Code is available at: https://github.com/KaimingHe/
resnet-1k-layers.
1 Introduction
Deep


ity/residual/Neton/Res/ident/Deep/机器/200-layer/CIFAR-100/4.62%/ ity/residual/Neton/Res/ident/Deep/机器/200-layer/CIFAR-100/4.62%/
-1 条回复
登录 后才能参与评论
-->