ResNeSt: Split-Attention Networks

May 2020

tl;dr: A new drop-in replacement for ResNet for object detection and segmentation task.

Overall impression

It is almost a combination of ResNeXt and SKNet, with improvement in implementation (cardinality-major to radix major).

I do feel that the paper uses too much tricks (MixUp, AutoAugment, distributed training, etc) and is too similar to SKNet, especially that the hyperparameter selection reduces this work. Engineering contribution > innovation.

Key ideas

Technical details