-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alignment key for the A/V features in the .npy/.hdf5 files #20
Comments
Hi 👋 ! Indeed! I am afraid, I don't have the exact snippet which does
I am not sure what you mean here.
Yes, it is. |
Hi Vladimir, Do the audio features for each video cover the entire video? Did you filter out the audio segments that are not inside event proposals? Thank you! |
Yes, similar to visual features and speech, the audio is available for the entire video. And yes we trim the modalities to be in a segment as shown here Line 67 in df3b88a
|
Hi Vladimir,
Long time no talk :) I was wondering if you can share the code that converted the .npy features (from your VGGish and I3D feature extractor) that you made available to me mid last year, to .hdf5 in MDVC Readme: Usage. In particular, I am interested in understanding how you "align" the audio and video features (based on the code below).
Questions:
Thanks again for your help!
The text was updated successfully, but these errors were encountered: