The question of computing the token prediction Acc. #11

Xiyu-AI · 2023-09-25T12:21:50Z

train.py:

# compute the token prediction Acc.

non_pad_mask = cap_labels[:, 1:].ne(Constants.PAD)
n_word = non_pad_mask.sum().item()
cms_non_pad_mask = cms_labels[:, 1:].ne(Constants.PAD)
cms_n_word = cms_non_pad_mask.sum().item()
cap_loss /= n_word
cms_loss /= n_word

I'm a bit curious about this calculation. When cap_loss and cms_loss are normalized, why are both divided by n_word? Why isn't cms_loss divided by cms_n_word, which is computed in the same snippet but not used there? I'd appreciate your clarification. Thank you!
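To make the difference concrete, here is a minimal sketch of the two normalizations. It is not the repository's code: plain Python lists stand in for the tensors so it runs without PyTorch, and the PAD value, the toy batches, and the summed loss values are all made up for illustration. It also assumes cap_loss and cms_loss are summed (not averaged) over tokens before the division.

```python
# Assumed padding index; in train.py the real value comes from Constants.PAD.
PAD = 0


def count_non_pad(labels, pad=PAD):
    """Count non-PAD tokens after dropping the first column of each row.

    Mirrors labels[:, 1:].ne(PAD).sum().item() for a list of lists.
    """
    return sum(1 for row in labels for tok in row[1:] if tok != pad)


# Hypothetical toy batches: the caption rows hold more real tokens
# than the cms rows, so the two counts differ.
cap_labels = [[1, 5, 6, 7, 2], [1, 8, 9, 2, 0]]
cms_labels = [[1, 4, 2, 0, 0], [1, 3, 2, 0, 0]]

n_word = count_non_pad(cap_labels)
cms_n_word = count_non_pad(cms_labels)

# Made-up summed losses, purely for illustration.
cap_loss_sum = 14.0
cms_loss_sum = 8.0

cap_loss = cap_loss_sum / n_word
cms_loss_as_in_snippet = cms_loss_sum / n_word   # what the snippet does
cms_loss_per_token = cms_loss_sum / cms_n_word   # per-cms-token average

print(n_word, cms_n_word)
print(cap_loss, cms_loss_as_in_snippet, cms_loss_per_token)
```

With these toy numbers, dividing cms_loss by n_word gives a smaller value than dividing by cms_n_word, because the caption batch contains more non-PAD tokens. Whether that scaling is intentional (for example, to weight the two losses against each other) or an oversight is exactly what I'd like to understand.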
