
[Feature] Support ViTPose #1937

Merged
merged 18 commits into from
Apr 20, 2023

Conversation

Annbless
Contributor

@Annbless Annbless commented Jan 16, 2023

Motivation

Merge the ViTPose variants code and pre-trained models into mmpose.

Modification

  1. Add a ViT backbone model in mmpose/models/backbones; the __init__ file is modified accordingly.
  2. Add the config files and the corresponding markdown files in the configs folder.
  3. Fix a bug in the registration of layer-wise learning rate decay.
  4. Add a 'resize_upsample4' input transformation in mmpose/models/heads/topdown_heatmap_simple_head.py to support the simple decoder in ViTPose. It has no effect on other models.
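
Layer-wise learning-rate decay (item 3) assigns each backbone layer a learning rate scaled by a decay factor raised to the layer's distance from the output, so early layers train more slowly than late ones. A minimal sketch of that scaling rule; the function and parameter names here are illustrative, not mmpose's actual API:

```python
def layerwise_lr(base_lr, num_layers, decay_rate):
    """Return a per-layer learning rate for a transformer backbone.

    Layer 0 (closest to the input) gets the smallest rate; the last
    layer trains at decay_rate * base_lr.
    """
    return [base_lr * decay_rate ** (num_layers - layer_id)
            for layer_id in range(num_layers)]

# Example: a 12-layer ViT-B backbone with a decay rate of 0.75.
rates = layerwise_lr(base_lr=5e-4, num_layers=12, decay_rate=0.75)
assert rates[0] < rates[-1]  # earlier layers get smaller rates
```

The registration bug fixed in this PR concerned wiring such per-layer rates into the optimizer's parameter groups, which is why it only surfaces when a config actually enables the decay.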

BC-breaking (Optional)

No.

Use cases (Optional)

Checklist

Before PR:

  • I have read and followed the workflow indicated in the CONTRIBUTING.md to create this PR.
  • Pre-commit or linting tools indicated in CONTRIBUTING.md are used to fix the potential lint issues.
  • Bug fixes are covered by unit tests, the case that causes the bug should be added in the unit tests.
  • New functionalities are covered by complete unit tests. If not, please add more unit tests to ensure correctness.
  • The documentation has been modified accordingly, including docstring or example tutorials.

After PR:

  • CLA has been signed and all committers have signed the CLA in this PR.

@ly015
Member

ly015 commented Jan 16, 2023

Thank you very much for your help! For now, there are lint issues in the code. Could you please install the pre-commit hooks (see our docs) and run `pre-commit run --all-files` in your local repo? The lint issues will be fixed automatically.

@codecov

codecov bot commented Jan 16, 2023

Codecov Report

❗ No coverage uploaded for pull request base (dev-0.x@fd98b11). Click here to learn what that means.
Patch has no changes to coverable lines.

❗ Current head 22fbc7b differs from pull request most recent head 52ee52b. Consider uploading reports for the commit 52ee52b to get more accurate results

Additional details and impacted files
@@            Coverage Diff             @@
##             dev-0.x    #1937   +/-   ##
==========================================
  Coverage           ?   84.10%           
==========================================
  Files              ?      242           
  Lines              ?    21227           
  Branches           ?     3652           
==========================================
  Hits               ?    17853           
  Misses             ?     2450           
  Partials           ?      924           
| Flag | Coverage Δ |
| :--- | :--- |
| unittests | 84.01% <0.00%> (?) |

Flags with carried forward coverage won't be shown. Click here to find out more.


☔ View full report in Codecov by Sentry.

@Annbless
Contributor Author

Thanks for your instructions. The code passes the lint checks now. How can I upload the pre-trained weights and logs for the ViTPose variants? Can I provide links via OneDrive or Google Drive?

@jin-s13
Collaborator

jin-s13 commented Jan 16, 2023

Thanks. Both OneDrive and Google Drive are welcome.

BTW, would you mind adding some unit tests? An example can be found at https://github.com/open-mmlab/mmpose/pull/1907/files#diff-dadc2075341a40335f28131ceaf3d0d415e5c316c54bb6b0a0741aeb002db24e

@ly015 ly015 changed the title Merge ViTPose into mmpose [Feature] Support ViTPose Jan 17, 2023
@ly015
Member

ly015 commented Jan 17, 2023

The unit test of ViTPose seems to have failed. For quick debugging, you can run the unit tests locally with `pytest tests/`.

@Annbless
Contributor Author

Thanks a lot for your help! The files and configs have been updated. The pre-trained models and logs are available on OneDrive. The lines not covered by the unit tests mostly belong to the weight-initialization code that loads the pre-trained models (for example, renaming tensors between the MAE pre-trained checkpoints and the backbones). We are wondering how we can cover these parts in the unit tests. Could we produce pseudo checkpoints via torch.save in the unit tests to cover the renaming code? We have tested these parts by re-training the models for several epochs and found that they work well.
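
The pseudo-checkpoint idea can be sketched as follows. Plain strings stand in for tensors and pickle stands in for torch.save here, and `rename_mae_keys` is a hypothetical stand-in for the PR's actual conversion code; a real unit test would use small torch tensors instead:

```python
import io
import pickle

def rename_mae_keys(state_dict):
    """Map MAE-checkpoint key names onto the backbone's names.

    Only the patch-embedding rename is shown; the real converter
    may handle more prefixes.
    """
    return {k.replace('patch_embed.proj', 'patch_embed.projection'): v
            for k, v in state_dict.items()}

# Build a pseudo checkpoint with the old key names.
fake_ckpt = {
    'patch_embed.proj.weight': 'w',
    'patch_embed.proj.bias': 'b',
    'blocks.0.attn.qkv.weight': 'q',
}

# Round-trip through a byte stream, as torch.save/torch.load would.
buf = io.BytesIO()
pickle.dump(fake_ckpt, buf)
buf.seek(0)
loaded = pickle.load(buf)

converted = rename_mae_keys(loaded)
assert 'patch_embed.projection.weight' in converted
assert 'blocks.0.attn.qkv.weight' in converted  # untouched keys survive
```

This lets the test exercise the renaming branch without shipping a real multi-hundred-megabyte checkpoint.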

@Annbless
Contributor Author

We also fixed some bugs in the dataset files caused by the updated NumPy version. Please check the recent commits. By the way, it seems that the current failed build is caused by an HTTP error.

@Annbless
Contributor Author

Annbless commented Feb 8, 2023

Hi @ly015, would you mind restarting the failed checks? I just checked the logs, and it seems that the pip installation caused the error. Thanks a lot.

@Annbless
Contributor Author

Annbless commented Feb 8, 2023

It seems that the current failure is related to the Docker version... Should I open a new PR for the dev-1.x branch and close this PR instead? Thanks a lot for your patience.

@Annbless
Contributor Author

Annbless commented Feb 9, 2023

Hi @ly015, it seems that the current failure is in loading the video in test_inference.py, where no frames are detected after the command runs.
Is there anything we can do to help get this merged?

Thanks a lot.

@ly015
Member

ly015 commented Feb 9, 2023

We will help check and fix the CI problem.

@Annbless
Contributor Author

Hi, is there anything we can do to help fix the CI problem? Also, could we open a new PR based on the dev-1.x branch to merge the ViTPose variants into mmpose? Thanks for your response.

new_key = k.replace('patch_embed.proj',
                    'patch_embed.projection')
new_ckpt[new_key] = v
else:
Collaborator

The downloaded backbone checkpoint has keys norm.weight and norm.bias, but in the model these are called last_norm.weight and last_norm.bias. Is it necessary to add another conversion?

Contributor Author

No need. The last norm layer is re-initialized for the pose dataset.
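
For illustration, a hedged sketch of why no extra conversion is needed: with a non-strict load (in the spirit of PyTorch's `load_state_dict(strict=False)`), checkpoint keys that were never mapped are simply ignored, and the re-initialized last_norm parameters keep their fresh values. `load_nonstrict` below is a toy stand-in, with plain strings in place of tensors:

```python
def load_nonstrict(model_params, ckpt):
    """Copy only the checkpoint entries whose names exist in the model,
    mimicking load_state_dict(..., strict=False). Returns the names of
    model parameters that had no matching checkpoint entry."""
    missing = []
    for name in model_params:
        if name in ckpt:
            model_params[name] = ckpt[name]
        else:
            missing.append(name)  # e.g. the re-initialized last_norm
    return missing

model = {'last_norm.weight': 'init_w', 'blocks.0.mlp.weight': 'init_m'}
ckpt = {'norm.weight': 'pre_w', 'blocks.0.mlp.weight': 'pre_m'}

missing = load_nonstrict(model, ckpt)
assert missing == ['last_norm.weight']        # stays at its fresh init
assert model['blocks.0.mlp.weight'] == 'pre_m'  # mapped keys are loaded
```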

@LareinaM
Collaborator

LareinaM commented Mar 8, 2023

I have trained these models using your code and the downloaded pretrained backbones. However, the results for some models do not match your reported numbers.

With classic decoder

| Arch | Input Size | AP | AP50 | AP75 | AR | AR50 |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| ViTPose-S | 256x192 | 0.737 | 0.905 | 0.813 | 0.790 | 0.943 |
| ViTPose-B | 256x192 | 0.751 | 0.905 | 0.823 | 0.803 | 0.944 |
| ViTPose-L | 256x192 | 0.777 | 0.915 | 0.850 | 0.828 | 0.953 |
| ViTPose-H | 256x192 | 0.785 | 0.914 | 0.853 | 0.835 | 0.951 |

With simple decoder

| Arch | Input Size | AP | AP50 | AP75 | AR | AR50 |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| ViTPose-S | 256x192 | 0.736 | 0.904 | 0.812 | 0.790 | 0.942 |
| ViTPose-B | 256x192 | 0.750 | 0.905 | 0.827 | 0.805 | 0.944 |
| ViTPose-L | 256x192 | 0.774 | 0.911 | 0.847 | 0.826 | 0.950 |
| ViTPose-H | 256x192 | 0.785 | 0.915 | 0.854 | 0.835 | 0.952 |

The validation accuracy is the same for all models.

I also noticed that the highest accuracy for the large and huge models occurs at around the 80th epoch. Maybe there is a problem with the optimizer? Have you validated the training process in this PR?

@Annbless
Contributor Author

Hi,
We have re-trained the models and found that the performance drop is caused by differences between the transformer layers implemented in mmcv and in timm. Is it possible for us to use timm for the backbone implementation?

@ly015
Member

ly015 commented Mar 20, 2023

Yes, you can use timm for the backbone implementation. There is a tutorial in MMDetection on how to use timm backbones through an MMClassification wrapper, which should also be applicable to MMPose: https://mmdetection.readthedocs.io/en/latest/tutorials/how_to.html#use-backbone-network-in-timm-through-mmclassification

The above tutorial is just for your reference. You can use any approach to integrate timm backbones in your implementation.
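
For reference, a config along the lines of the linked tutorial might look roughly like this; the model name, field set, and out-of-the-box applicability to MMPose are assumptions to verify against the MMClassification docs:

```python
# Sketch of a config using a timm backbone through the
# MMClassification wrapper, adapted from the MMDetection tutorial.
# 'vit_base_patch16_224' is an illustrative timm model name.
model = dict(
    backbone=dict(
        _delete_=True,               # drop the default backbone config
        type='mmcls.TIMMBackbone',   # wrapper registered by MMClassification
        model_name='vit_base_patch16_224',
        pretrained=True))
```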

@ly015 ly015 mentioned this pull request Mar 20, 2023
@Annbless
Contributor Author

Hi there,

Thanks for your patience. We have uploaded a timm version of ViTPose.

The training logs are available here.
vitpose_base.log
vitpose_simple_base.log
vitpose_small.log
vitpose_simple_small.log

@Annbless
Contributor Author

Hi there,

Is there anything we can do to help get this PR merged? We are happy to provide more information.

Best,

@Tau-J Tau-J changed the base branch from master to dev-0.x April 20, 2023 08:07
@Tau-J Tau-J merged commit 6fb1280 into open-mmlab:dev-0.x Apr 20, 2023
@Tau-J Tau-J mentioned this pull request Apr 20, 2023
Ben-Louis pushed a commit to Ben-Louis/mmpose that referenced this pull request Apr 28, 2023