
I believe there is an implementation error in SAN #4

Closed
waystogetthere opened this issue Dec 28, 2022 · 4 comments

Comments

@waystogetthere (Contributor)

Hello, I believe that this line is an error:

attended_matrix = self.multi_head[k](input_space) * input_space

I think it should be:

attended_matrix = self.softmax(self.multi_head[k](input_space)) * input_space

as shown in the paper:

[screenshot: the attention equation from the paper, with the softmax applied to the k-th head's linear transformation]
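For context, the corrected line corresponds to something like the following minimal sketch (the class name, dimensions, and head count here are illustrative, not the repository's actual API):

```python
import torch
import torch.nn as nn


class AttentionHeadSketch(nn.Module):
    """Minimal sketch of SAN-style attention heads.

    Names are hypothetical; only the corrected line inside forward()
    mirrors the fix discussed in this issue.
    """

    def __init__(self, input_dim: int, num_heads: int = 2):
        super().__init__()
        # one linear map per head, corresponding to the paper's W^k and b^k
        self.multi_head = nn.ModuleList(
            nn.Linear(input_dim, input_dim) for _ in range(num_heads)
        )
        self.softmax = nn.Softmax(dim=1)

    def forward(self, input_space: torch.Tensor) -> torch.Tensor:
        out = torch.zeros_like(input_space)
        for k in range(len(self.multi_head)):
            # corrected line: softmax over features, then elementwise product
            attended_matrix = self.softmax(self.multi_head[k](input_space)) * input_space
            out = out + attended_matrix
        return out
```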

@SkBlaz (Owner)

SkBlaz commented Dec 29, 2022

Hello! Thanks for the report, nice catch. I'd be grateful if you opened a PR with the change. The left-out softmax was a remnant of refactoring that made the source a bit more user-friendly (and it appears that's starting to pay off!).

@waystogetthere (Contributor, Author)

waystogetthere commented Dec 29, 2022

Hello,
Thanks for your fast reply! Merry Xmas & Happy New Year!
I'll open a pull request soon.

I think `self.multi_head[k]` is the k-th attention head, a linear layer with weight $W_{l_{att}}^k$ and bias $b^k_{l_{att}}$, so the softmax is needed.

@SkBlaz (Owner)

SkBlaz commented Dec 30, 2022

Great, thanks. Indeed, to be aligned with the paper, the activation is required. Surprisingly, the current version (essentially multilinear blocks) also seems to work; that might be worth further exploration at some point.
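The difference between the two variants can be illustrated in a few lines (dimensions and names here are made up for the demo): without the softmax the per-feature weights are unbounded, while with it each sample's weights form a distribution over features.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
lin = nn.Linear(5, 5)  # stands in for one self.multi_head[k]
x = torch.randn(2, 5)

# pre-fix ("multilinear") variant: unbounded elementwise weights
multilinear = lin(x) * x

# paper-aligned variant: softmax bounds the weights to (0, 1)
weights = torch.softmax(lin(x), dim=1)
attended = weights * x

# each row of the softmax weights sums to 1, i.e. a distribution over features
print(weights.sum(dim=1))
```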

@SkBlaz (Owner)

SkBlaz commented Jan 1, 2023

The suggested change was merged to master.

@SkBlaz SkBlaz closed this as completed Jan 1, 2023