Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions on verification_RFW and training procedure #61

Open
lizhenstat opened this issue Aug 10, 2021 · 7 comments
Open

Questions on verification_RFW and training procedure #61

lizhenstat opened this issue Aug 10, 2021 · 7 comments

Comments

@lizhenstat
Copy link

lizhenstat commented Aug 10, 2021

Hi,
Thanks for your great work and sharing of the code on these two papers !
It takes me days to read the paper and go through the repository and I have a few questions:

(2)
Do you have the code for training the features (asian_females, asian_males, black_females, black_males, indian_females, indian_males,...). Since I have a hard time finding something like train.py (e.g. the loss function and training process).
(I suppose the released code is mainly on image pre-processing and result analysis)
(Since BFW dataset is not as large as other face dataset and it may possible for me to train it from scratch on one GPU)

(3)
I am little confused about how the BFW is used in two papers, as I understand:

in paper Face Recognition: Too Bias, or Not Too Bias? , the train and test model are as follows:
train: CASIA_webface trained using Sphereface loss
test: LFW
where does BFW dataset not used in training in this set of experiments?

in paper Balancing Biases and Preserving Privacy on Balanced Faces in the Wild the train, test model are as follows:
tain:
(1) MS1M trained using Arcface loss --> to get 512-dim embedding (f_in in Fig.6)
(2) BFW dataset is used to train the encoder and two classifiers in Fig 6
test: 4-folds used for training and 1-fold used for testing (using the best threshold chosen)

is that right?

(4)
There are some difference from "bfw-v0.1.5-datatable.csv" and the TABLE-2 in paper 2:
for example: there are 921379 records in TABLE-2 while ther are 923898 records from the csv file?
and there is no "{dir_meta}thresholds.pkl" file.

Thanks for your time and any help would be appreciated !

@visionjo
Copy link
Owner

visionjo commented Aug 11, 2021 via email

@lizhenstat
Copy link
Author

@visionjo hi, thanks for your reply!
Besides, I also found the pictures in the following folder:
"jrobby\bfw\bfw-cropped-aligned\asian_females\n009042"
the following three pictures(0015_01.jpg, 0034_01.jpg, 0394_01.jpg) belong to Guo Jing Jing while folder "n009042" is of Wu Min Xia, both of them are famous Diving Athlete in China,
0034_010394_010394_01

I think there are many picutures of them together from the internet then there maybe some confuse.
guojingjing

Here is the snapshot of the folder "n009042":
snapshot

@lizhenstat
Copy link
Author

@visionjo Hi, any update here? thanks!

@visionjo
Copy link
Owner

visionjo commented Sep 7, 2021

Thanks for pointing this out. We are currently integrating the data as part of the fiftyone datasets, and as part of this an updated version of data will be available. This bug will be addressed and released with this ASAP (within weeks). Thanks again and do post additional items that come up 👌

@visionjo
Copy link
Owner

visionjo commented Sep 7, 2021

I'll provide the features and answer your original question sooner (a night this week). My apologies, I lost track of this. Thanks for reaching back out.

@lizhenstat
Copy link
Author

@visionjo Hi, thanks, can you notice me if you update the dataset. Thanks a lot !

@jrobinson-vs
Copy link
Collaborator

@lizhenstat, I have been busy with our other dataset (Families In the Wild (FIW)), as the challenge just ended and we are preparing the white paper. With my time limited to spare time around full-time work, progress has been a bit slow here. Nonetheless, as mentioned prior, we are returning to BFW shortly (within the month), and have a few plans (including releasing the next version of data). Feel free to PR any changes (whether to data or code)-- it would be highly appreciated :)

In any case, I will check back shortly and get back to you on this. Thanks for reaching out!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants