Missing sparse video feature extraction module #2
Comments
Hi, thanks for the interest. I have uploaded the related code (for reference only). To extract region features, you need to sample frames in the same way and use the tool provided by BUTD.
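For reference, a minimal sketch of what uniform sparse frame sampling could look like; `sample_frame_ids`, the clip count, and the frames-per-clip value here are illustrative assumptions, not the repo's confirmed settings.

```python
import numpy as np

def sample_frame_ids(num_frames, num_clips=8, frames_per_clip=4):
    """Split the video into num_clips equal segments and pick
    frames_per_clip evenly spaced frame indices inside each segment.
    (Clip/frame counts are assumptions, not the repo's settings.)"""
    ids = []
    bounds = np.linspace(0, num_frames, num_clips + 1, dtype=int)
    for start, end in zip(bounds[:-1], bounds[1:]):
        seg = np.linspace(start, max(start, end - 1), frames_per_clip, dtype=int)
        ids.extend(seg.tolist())
    return ids

print(sample_frame_ids(120))  # 8 clips x 4 frames = 32 sampled indices
```

Whatever the exact counts are, the key point is that the same sampling must be applied both when extracting appearance features and when extracting region features, so the two stay aligned frame-for-frame.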
Thank you very much for providing them. It would also be helpful if you could add some documentation to the files and functions, so that we can better understand the starting point and the steps to follow in order to extract features properly.
Basically, you can follow a coarse pipeline: extract_video.py (decode mp4 into frames) -> preprocess_feature.py (sample and encode frames into CNN representations) -> split_dataset_feat.py (split the features into train/val/test).
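To make that pipeline concrete, here is a hedged sketch of the first two steps; `decode_video`, `encode_frames`, and the torchvision ResNet-101 encoder are illustrative stand-ins for the actual scripts, not their confirmed contents.

```python
import cv2
import torch
import torchvision.models as models
import torchvision.transforms as T

def decode_video(path):
    """Step 1 (extract_video.py): decode an mp4 into a list of RGB frames."""
    cap = cv2.VideoCapture(path)
    frames = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        frames.append(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    cap.release()
    return frames

# Step 2 (preprocess_feature.py): sample frame indices, then encode each
# sampled frame with an ImageNet-pretrained CNN (ResNet-101 assumed here).
resnet = models.resnet101(pretrained=True)
resnet.fc = torch.nn.Identity()  # drop the classifier, keep 2048-d features
resnet.eval()
transform = T.Compose([
    T.ToPILImage(), T.Resize(256), T.CenterCrop(224), T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

@torch.no_grad()
def encode_frames(frames, ids):
    batch = torch.stack([transform(frames[i]) for i in ids])
    return resnet(batch)  # (len(ids), 2048) appearance features

# Step 3 (split_dataset_feat.py) then partitions the per-video features
# into train/val/test files according to the dataset's split lists.
```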
That's so helpful. Thanks for explaining it.
Which mode, 'caffe' or 'd2', did you use to extract the regional features?
Please choose resnet-101 with d2.
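For readers unfamiliar with the options: 'd2' refers to the detectron2 backend of the BUTD extractor (the actual tool uses its own Visual-Genome-trained weights). Below is a rough sketch of detecting region boxes with a ResNet-101 detectron2 model; the COCO config here is a stand-in assumption, not the repo's exact setup.

```python
import cv2
from detectron2 import model_zoo
from detectron2.config import get_cfg
from detectron2.engine import DefaultPredictor

# Assumed stand-in config: a ResNet-101 Faster R-CNN from the d2 model zoo.
cfg = get_cfg()
cfg.merge_from_file(model_zoo.get_config_file(
    "COCO-Detection/faster_rcnn_R_101_FPN_3x.yaml"))
cfg.MODEL.WEIGHTS = model_zoo.get_checkpoint_url(
    "COCO-Detection/faster_rcnn_R_101_FPN_3x.yaml")
cfg.MODEL.ROI_HEADS.SCORE_THRESH_TEST = 0.2  # keep more candidate boxes

predictor = DefaultPredictor(cfg)
frame = cv2.imread("frame_0001.jpg")               # one sampled frame (BGR)
instances = predictor(frame)["instances"]
boxes = instances.pred_boxes.tensor.cpu().numpy()  # (N, 4) xyxy boxes
scores = instances.scores.cpu().numpy()            # (N,) confidences
```

Running this per sampled frame, then keeping a fixed number of top-scoring boxes and their pooled region features, is the general shape of what the BUTD tool produces.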
@doc-doc It seems that object_align.py does not give a complete method for obtaining the bounding boxes, but directly reads region_8c10b_{}.h5. Is there any complete code that detects the bounding boxes and then writes them to region_8c10b_{}.h5?
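While that question is open, here is a rough sketch of how boxes and region features might be packed into an h5 file of that kind; the dataset names and dimensions are guesses inferred from the file name, not the repo's confirmed layout.

```python
import h5py
import numpy as np

# Illustrative shapes only: e.g. 8 clips x 4 frames = 32 sampled frames,
# 10 boxes per frame, 2048-d region features ("8c10b" may encode this,
# but that reading is an assumption).
num_videos, num_frames, num_boxes, feat_dim = 100, 32, 10, 2048
feats = np.zeros((num_videos, num_frames, num_boxes, feat_dim), np.float32)
bboxes = np.zeros((num_videos, num_frames, num_boxes, 4), np.float32)

with h5py.File("region_8c10b_train.h5", "w") as f:
    f.create_dataset("feat", data=feats)   # region features per frame
    f.create_dataset("bbox", data=bboxes)  # matching box coordinates
    f.create_dataset("ids", data=np.arange(num_videos))  # video-id lookup
```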
To run inference on a custom video dataset, we need to sparsely extract video features in the same way you do in order to get good results. It would be great if you could make that module accessible in the repo.