Hard to run eval_video_official.py & some training questions #8
I found that it's the annotation_path that matters: I was running preprocess.py from inside the data/ folder, but annotation_path already starts with data/, so the path did not resolve correctly.
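For anyone hitting the same issue, here is a minimal sketch (folder and variable names are illustrative assumptions, not taken from the repo) of making the annotation path independent of the working directory:

```python
import os

# Hedged sketch: resolve the annotation path relative to this script's own
# location, so preprocess.py behaves the same whether it is launched from
# the repo root or from inside data/. 'outf_all' is an assumed folder name.
SCRIPT_DIR = os.path.dirname(os.path.abspath(__file__))
annotation_path = os.path.join(SCRIPT_DIR, 'outf_all')
```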
I followed the tutorial in the Readme inside the objectron_eval folder; however, it did not work out of the box. After I modified the code, I figured out that I needed to put the tfrecord data folder inside the path mentioned above. Then it runs a batch of videos, but some debug windows appear and get stuck. It's hard to run the evaluation-related scripts; please update the tutorial readme with more details. Thank you very much!
Sorry for the inconvenience. I may update the tutorial to make it more user-friendly when I have more time.
We ran into some problems (annotation inconsistencies or missing videos) when processing the raw data from the Objectron dataset for training purposes, so we skip those clips. You can find the related code in the preprocessing script.
Thanks for the reply. I encountered some other problems while studying your code. The variable names confused me: even with your comments, I still wonder why 'hp' stands for 'keypoints' instead of 'kp'. My second question is about the number of channels of the keypoint head when I start to train the code.
Hi, I have another question about https://github.com/NVlabs/CenterPose/blob/main/src/lib/models/networks/pose_dla_dcn.py#L253: what are opt.pre_img, opt.pre_hm, and opt.pre_hm_hp? I cannot find any comments about these parameters.
That's for CenterPoseTrack. Its pipeline plot may give you some ideas.
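In case it helps others: these options follow CenterTrack-style conditioning, where the network also receives the previous frame and heatmaps rendered from the previous predictions. A hedged sketch of the idea (tensor shapes and names are illustrative assumptions, not taken from the repo):

```python
import torch

# Hedged sketch of CenterTrack-style inputs for CenterPoseTrack.
# pre_img:   the previous RGB frame
# pre_hm:    a single-channel heatmap rendered from the previous object centers
# pre_hm_hp: heatmaps rendered from the previous keypoint predictions
img       = torch.zeros(1, 3, 512, 512)   # current frame
pre_img   = torch.zeros(1, 3, 512, 512)   # previous frame
pre_hm    = torch.zeros(1, 1, 512, 512)   # previous center heatmap
pre_hm_hp = torch.zeros(1, 8, 512, 512)   # previous keypoint heatmaps (8 corners)

# Conceptually, the tracker conditions on all of these; a plain concatenation
# here stands in for the extra input convolutions used in CenterTrack.
x = torch.cat([img, pre_img, pre_hm, pre_hm_hp], dim=1)
```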
Sorry for the confusion. I had some other thoughts about the name before but used "keypoints" in the end.
We have 8 keypoints, and each of them has 2 channels, so the total number of channels is 16.
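A minimal sketch of how that channel count falls out, following the CenterNet-style head convention that CenterPose builds on (the exact dictionary keys and head set here are illustrative assumptions):

```python
num_keypoints = 8  # the 8 corners of the 3D bounding box

# Hedged sketch: each keypoint is regressed as a 2D offset (dx, dy) from the
# object center, so the keypoint-offset head needs 2 channels per keypoint.
heads = {
    'hm': 1,                   # object center heatmap
    'wh': 2,                   # 2D box width/height
    'hps': num_keypoints * 2,  # keypoint offsets -> 16 channels
    'hm_hp': num_keypoints,    # optional per-keypoint heatmaps
}
assert heads['hps'] == 16
```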
Thanks for the reply. I ran into another problem: if I remove the 'wh' head for training, then I get an error at inference time. https://github.com/NVlabs/CenterPose/blob/main/src/lib/models/decode.py#L102 As you can see, if I don't have the 'wh' head, I run into an error there. Could you explain the decode processing in more detail, please? Or point me to some reference papers? I got confused when trying to understand the decoding part. Thank you very much!
More questions: I tried to remove the 'scale' head and set opt.obj_scale = False, and demo.py crashes where the code asks for the scale variable. I tried setting the scale to a constant 1, but the visualization just becomes a square cuboid, which is not correct. Can I get the pose without predicting the scale? Is it possible to use PnP to get the output 3D points and infer the scale from the predicted 3D points?
Sorry about the confusion. I once tried to see the impact of removing the 'wh' option for CenterPose. I found that it might not be a good choice, so I did not go further. Then, when I developed CenterPoseTrack with some additional parameters, e.g., kps_heatmap_std / kps_heatmap_mean / kps_heatmap_height, I assumed the 'wh' option was already enabled. In your case, you probably have to give kps_heatmap_std / kps_heatmap_mean / kps_heatmap_height some default values there (if you do not care about tracking). As for references, our implementation is based on CenterNet. I do not think detailed documentation is available right now, as most of the code is too low-level to describe in a paper.
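A hedged sketch of the kind of guard the reply suggests; the key names come from the reply above, but the surrounding decode context and tensor shapes are assumptions:

```python
import torch

def get_with_default(output, key, like):
    # Hedged sketch: fall back to zeros when a tracking-only output is
    # missing, e.g. when kps_heatmap_std / kps_heatmap_mean /
    # kps_heatmap_height were never produced because 'wh' was disabled.
    return output[key] if key in output else torch.zeros_like(like)

# Illustrative usage inside a decode step (names are assumptions):
# kps_heatmap_mean = get_with_default(output, 'kps_heatmap_mean', ref_tensor)
```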
In my implementation, the 'scale' head predicts the ratio between width/height/length. If the absolute height is not given from somewhere else, its prediction is width/1/length (we call this the relative scale). PnP is used to get the pose given the 2D keypoints on the image and the 3D keypoints (prior information). In our case, width/1/length or width/height/length can be used to construct the 3D keypoints (as we work on a 3D bounding box). I think your so-called "predicted 3D points" are really the 3D keypoints (prior information) transformed into camera space by the pose computed from PnP. In other words, you cannot get the pose without predicting the scale.
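To make the PnP step concrete, here is a minimal sketch under stated assumptions (OpenCV as the solver; the corner ordering and the intrinsics K are illustrative, not necessarily the repo's exact convention):

```python
import numpy as np
import cv2

def pose_from_relative_scale(kps_2d, rel_scale, K):
    """Recover the object pose from 8 predicted 2D corners via PnP.

    kps_2d:    (8, 2) predicted 2D keypoints on the image
    rel_scale: (w, 1.0, l) relative scale predicted by the 'scale' head
    K:         (3, 3) camera intrinsic matrix
    """
    w, h, l = rel_scale
    # 3D box prior: 8 corners of a box with the given relative dimensions,
    # centered at the origin (the corner ordering here is an assumption).
    corners_3d = np.array([[sx * w / 2, sy * h / 2, sz * l / 2]
                           for sx in (-1, 1)
                           for sy in (-1, 1)
                           for sz in (-1, 1)], dtype=np.float64)
    ok, rvec, tvec = cv2.solvePnP(corners_3d, np.asarray(kps_2d, np.float64),
                                  K, None, flags=cv2.SOLVEPNP_EPNP)
    return rvec, tvec  # pose is correct up to the unknown absolute height
```

As the reply notes, without the relative scale there is no 3D prior to feed into PnP, which is why the pose cannot be recovered when the 'scale' head is removed.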
Thanks for the clarification on the questions above. I have another question: how do I train the cup category?
Did you succeed with the test? Some errors occur when I run the test code "python eval_video_official.py".
Hi, I encountered some problems when using the code.
I followed data/Readme.md, downloaded the chair data, and then preprocessed it. After that, I got nothing inside the output folder (e.g., outf_all) except another empty folder named 'chair_train'.
I got a 'bug_lists.txt' file, which seems to include the names of all the 'chair' category videos after preprocessing.
I see there's also a 'bug_list.txt' in the 'label' folder. What do you mean by 'bug_lists'?
Please help me figure out what's going on. Thank you very much!
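For debugging, a quick hedged sketch for inspecting the skip list (the file name follows the 'bug_lists.txt' mentioned above; the path is an assumption):

```python
# Hedged sketch: count and preview the clips recorded in bug_lists.txt,
# which (per the maintainer's earlier reply) lists clips skipped due to
# annotation inconsistencies or missing videos.
from pathlib import Path

skipped = [line for line in Path('bug_lists.txt').read_text().splitlines()
           if line.strip()]
print(f'{len(skipped)} clips skipped; first few: {skipped[:3]}')
```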