The speed at which China seems to be adopting the latest deep learning technology and bringing it into everyday life is truly astounding. There is no doubt it will be one of the top countries contending for world leadership in AI over the next decade or so.
Does anyone have a hint on what technique they might have used?
Simple: One-sentence method summary: use a keypoint detection technique to detect the bounding box center point, then regress all other object properties, such as bounding box size, 3D information, and pose.
Versatile: The same framework works for object detection, 3D bounding box estimation, and multi-person pose estimation with minor modifications.
Fast: The whole process runs in a single network feed-forward pass. No NMS post-processing is needed. Our DLA-34 model runs at 52 FPS with 37.4 COCO AP.
Strong: Our best single model achieves 45.1 AP on COCO test-dev.
Easy to use: We provide a user-friendly testing API and webcam demos.
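To make the "Simple" and "Fast" points above concrete, here is a rough sketch of how boxes might be read off a center-point heatmap without NMS. It is not the project's code: the function name `decode_centers`, the score threshold, the output stride of 4, and the plain nested-list inputs are all assumptions for illustration. The idea is that a cell counts as an object center only if it is the highest score in its 3x3 neighbourhood, and the box is then built from the size regressed at that cell.

```python
def decode_centers(heatmap, sizes, threshold=0.3, stride=4):
    """Turn a center heatmap into boxes (hypothetical helper, pure Python).

    heatmap: list of rows of center scores in [0, 1].
    sizes:   same shape, each cell a (width, height) pair in input pixels.
    stride:  assumed downsampling factor between input image and heatmap.
    Returns a list of (x1, y1, x2, y2, score) tuples in row-major order.
    """
    height, width = len(heatmap), len(heatmap[0])
    boxes = []
    for y in range(height):
        for x in range(width):
            score = heatmap[y][x]
            if score < threshold:
                continue
            # Keep the cell only if no neighbour in its 3x3 window scores higher.
            # This local-maximum check is what stands in for NMS here.
            neighbours = [
                heatmap[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
            ]
            if score < max(neighbours):
                continue
            w, h = sizes[y][x]
            cx, cy = x * stride, y * stride  # map back to input coordinates
            boxes.append((cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2, score))
    return boxes


# Toy 4x4 heatmap: one strong peak, one weaker neighbour, one separate peak.
heatmap = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.9, 0.5, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.6],
]
sizes = [[(8, 4)] * 4 for _ in range(4)]
boxes = decode_centers(heatmap, sizes)
```

In this toy input the 0.5 cell is dropped because it sits next to the 0.9 peak, so two boxes come out. A real model would also predict a sub-pixel offset and per-class heatmaps, which this sketch leaves out.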