MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time

November 2020

tl;dr: Encodes the local geometric consistency (spatial correlation of neighboring objects) into learning.

Overall impression

The idea is similar to enforcing a certain order in the predictions. It learns the second-order information hidden in the GT labels, i.e. the relationships between objects rather than each object's label alone. It incorporates prior knowledge of geometric locality as a regularization term during training. The mining of pairwise relationships is similar to MonoPair.
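
To make the regularization idea concrete, here is a minimal sketch of a pairwise consistency term. The exact loss used in the paper is not recorded in these notes, so everything below is an assumption for illustration: the function name `pairwise_consistency_loss`, the Gaussian weighting on 2D center distance, and the bandwidth `sigma` are all hypothetical. The only thing taken from the notes is the principle that neighboring objects should receive correlated predictions.

```python
import numpy as np


def pairwise_consistency_loss(centers_2d, pred_3d, sigma=50.0):
    """Hypothetical regularizer: objects that are close in the image
    are encouraged to have similar predicted 3D locations.

    centers_2d: (N, 2) 2D box centers in pixels.
    pred_3d:    (N, 3) predicted 3D locations.
    sigma:      assumed bandwidth (pixels) of the neighbor weighting.
    """
    centers_2d = np.asarray(centers_2d, dtype=float)
    pred_3d = np.asarray(pred_3d, dtype=float)
    n = len(centers_2d)
    if n < 2:
        # No pairs, so nothing to regularize.
        return 0.0

    # Squared pixel distance between every pair of 2D centers, shape (N, N).
    dist2_2d = ((centers_2d[:, None, :] - centers_2d[None, :, :]) ** 2).sum(-1)

    # Assumed similarity weight: near 1 for close neighbors, near 0 for far pairs.
    weights = np.exp(-dist2_2d / (2.0 * sigma ** 2))
    np.fill_diagonal(weights, 0.0)  # ignore self-pairs

    # Squared difference between predicted 3D locations for every pair.
    diff2_3d = ((pred_3d[:, None, :] - pred_3d[None, :, :]) ** 2).sum(-1)

    # Average weighted disagreement over all ordered pairs.
    return float((weights * diff2_3d).sum() / (n * (n - 1)))
```

In a training loop this term would be added to the usual per-object localization loss with a small weight, so that it acts as a regularizer rather than the main objective. With this weighting, two objects that sit next to each other in the image but are predicted 10 m apart are penalized far more than two objects on opposite sides of the image with the same disagreement.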

The writing is quite poor, with heavy use of non-standard terminology. There is no ablation study on the effect of the newly introduced regularization, so its contribution is hard to judge.

Key ideas

Technical details