Newer
Older
Demo-Maker / modules / rtmpose / configs / body_2d_keypoint / topdown_heatmap / README.md

Top-down heatmap-based pose estimation

Top-down methods divide the task into two stages: object detection, followed by single-object pose estimation given object bounding boxes. Instead of estimating keypoint coordinates directly, the pose estimator will produce heatmaps which represent the likelihood of being a keypoint, following the paradigm introduced in Simple Baselines for Human Pose Estimation and Tracking.

Results and Models

COCO Dataset

Results on COCO val2017 with detector having human AP of 56.4 on COCO val2017 dataset

ModelInput SizeAPARDetails and Download
ViTPose-h256x1920.7900.840vitpose_coco.md
HRNet-w48+UDP256x1920.7680.817hrnet_udp_coco.md
MSPN 4-stg256x1920.7650.826mspn_coco.md
HRNet-w48+Dark256x1920.7640.814hrnet_dark_coco.md
HRNet-w48256x1920.7560.809hrnet_coco.md
HRFormer-B256x1920.7540.807hrformer_coco.md
RSN-50-3x256x1920.7500.814rsn_coco.md
CSPNeXt-l256x1920.7500.800cspnext_udp_coco.md
HRNet-w32256x1920.7490.804hrnet_coco.md
Swin-L256x1920.7430.798swin_coco.md
ViTPose-s256x1920.7390.792vitpose_coco.md
HRFormer-S256x1920.7380.793hrformer_coco.md
Swin-B256x1920.7370.794swin_coco.md
SEResNet-101256x1920.7340.790seresnet_coco.md
SCNet-101256x1920.7330.789scnet_coco.md
ResNet-101+Dark256x1920.7330.786resnet_dark_coco.md
CSPNeXt-m256x1920.7320.785cspnext_udp_coco.md
ResNetV1d-101256x1920.7320.785resnetv1d_coco.md
SEResNet-50256x1920.7290.784seresnet_coco.md
SCNet-50256x1920.7280.784scnet_coco.md
ResNet-101256x1920.7260.783resnet_coco.md
ResNeXt-101256x1920.7260.781resnext_coco.md
HourglassNet256x2560.7260.780hourglass_coco.md
ResNeSt-101256x1920.7250.781resnest_coco.md
RSN-50256x1920.7240.790rsn_coco.md
Swin-T256x1920.7240.782swin_coco.md
MSPN 1-stg256x1920.7230.788mspn_coco.md
ResNetV1d-50256x1920.7220.777resnetv1d_coco.md
ResNeSt-50256x1920.7200.775resnest_coco.md
ResNet-50256x1920.7180.774resnet_coco.md
ResNeXt-50256x1920.7150.771resnext_coco.md
PVT-S256x1920.7140.773pvt_coco.md
CSPNeXt-s256x1920.6970.753cspnext_udp_coco.md
LiteHRNet-30256x1920.6760.736litehrnet_coco.md
CSPNeXt-tiny256x1920.6650.723cspnext_udp_coco.md
MobileNet-v2256x1920.6480.709mobilenetv2_coco.md
LiteHRNet-18256x1920.6420.705litehrnet_coco.md
CPM256x1920.6270.689cpm_coco.md
ShuffleNet-v2256x1920.6020.668shufflenetv2_coco.md
ShuffleNet-v1256x1920.5870.654shufflenetv1_coco.md
AlexNet256x1920.4480.521alexnet_coco.md

MPII Dataset

ModelInput SizePCKh@0.5PCKh@0.1Details and Download
HRNet-w48+Dark256x2560.9050.360hrnet_dark_mpii.md
HRNet-w48256x2560.9020.303hrnet_mpii.md
HRNet-w48256x2560.9010.337hrnet_mpii.md
HRNet-w32256x2560.9000.334hrnet_mpii.md
HourglassNet256x2560.8890.317hourglass_mpii.md
ResNet-152256x2560.8890.303resnet_mpii.md
ResNetV1d-152256x2560.8880.300resnetv1d_mpii.md
SCNet-50256x2560.8880.290scnet_mpii.md
ResNeXt-152256x2560.8870.294resnext_mpii.md
SEResNet-50256x2560.8840.292seresnet_mpii.md
ResNet-50256x2560.8820.286resnet_mpii.md
ResNetV1d-50256x2560.8810.290resnetv1d_mpii.md
CPM368x368*0.8760.285cpm_mpii.md
LiteHRNet-30256x2560.8690.271litehrnet_mpii.md
LiteHRNet-18256x2560.8590.260litehrnet_mpii.md
MobileNet-v2256x2560.8540.234mobilenetv2_mpii.md
ShuffleNet-v2256x2560.8280.205shufflenetv2_mpii.md
ShuffleNet-v1256x2560.8240.195shufflenetv1_mpii.md

CrowdPose Dataset

Results on CrowdPose test with YOLOv3 human detector

ModelInput SizeAPARDetails and Download
HRNet-w32256x1920.6750.816hrnet_crowdpose.md
CSPNeXt-m256x1920.6620.755hrnet_crowdpose.md
ResNet-101256x1920.6470.800resnet_crowdpose.md
HRNet-w32256x1920.6370.785resnet_crowdpose.md

AIC Dataset

Results on AIC val set with ground-truth bounding boxes.

ModelInput SizeAPARDetails and Download
HRNet-w32256x1920.3230.366hrnet_aic.md
ResNet-101256x1920.2940.337resnet_aic.md

JHMDB Dataset

ModelInput SizePCK(norm. by person size)PCK (norm. by torso size)Details and Download
ResNet-50256x25696.080.1resnet_jhmdb.md
CPM368x36889.865.7cpm_jhmdb.md

PoseTrack2018 Dataset

Results on PoseTrack2018 val with ground-truth bounding boxes.

ModelInput SizeAPDetails and Download
HRNet-w48256x19284.6hrnet_posetrack18.md
HRNet-w32256x19283.4hrnet_posetrack18.md
ResNet-50256x19281.2resnet_posetrack18.md

Human-Art Dataset

Results on Human-Art validation dataset with detector having human AP of 56.2 on Human-Art validation dataset

ModelInput SizeAPARDetails and Download
ViTPose-s256x1920.3810.448vitpose_humanart.md
ViTPose-b256x1920.4100.475vitpose_humanart.md

Results on Human-Art validation dataset with ground-truth bounding-box

ModelInput SizeAPARDetails and Download
ViTPose-s256x1920.7380.768vitpose_humanart.md
ViTPose-b256x1920.7590.790vitpose_humanart.md