Bad Results with Albert #2609

Closed
chikubee opened this issue Jan 22, 2020 · 4 comments

@chikubee
chikubee commented Jan 22, 2020

❓ Questions & Help

I'm trying to understand why the cosine similarity between token embeddings is so much worse with ALBERT than with DistilBERT.
Any insights would be helpful.
Thanks in advance.

Embeddings: each token's embedding is the sum of its hidden states from the last 4 encoder layers.
Distance metric: cosine
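
For readers following along, the setup described above (sum the last 4 layers per token, then compare tokens with cosine similarity) could be sketched roughly as below. This is not the original code from this issue, which was never posted. The helper names `sum_last_layers` and `cosine_similarity` and the toy `layers` data are hypothetical stand-ins. With `transformers`, the per-layer hidden states would typically be requested with `output_hidden_states=True`.

```python
import math

def sum_last_layers(hidden_states, n_layers=4):
    """Sum the last `n_layers` layers token by token.

    hidden_states: list of layers; each layer is a list of token vectors
    (hypothetical plain-list stand-in for the model's hidden-state tensors).
    """
    last = hidden_states[-n_layers:]
    seq_len = len(last[0])
    return [
        [sum(vals) for vals in zip(*(layer[t] for layer in last))]
        for t in range(seq_len)
    ]

def cosine_similarity(u, v):
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

# Toy data (made up for illustration): 5 layers, 2 tokens, 2 dimensions.
# The first layer is deliberately different to show that it is ignored.
layers = [
    [[9.0, 9.0], [9.0, 9.0]],
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.0, 0.0], [0.0, 1.0]],
]

token_vectors = sum_last_layers(layers)
similarity = cosine_similarity(token_vectors[0], token_vectors[1])
```

When comparing models this way, it is worth checking that the token vectors really line up with the intended words, since each model uses its own subword tokenizer and special tokens.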

Results with DistilBert
[Screenshot 2020-01-22 at 4:05:06 PM]
[Screenshot 2020-01-22 at 4:12:27 PM]

Results with Albert
[Screenshot 2020-01-22 at 4:04:56 PM]
[Screenshot 2020-01-22 at 4:10:16 PM]

@BramVanroy
Collaborator

Hi. Many people are reporting unstable or unexpected results with ALBERT. You can search for issues in this library, and even in other ones (e.g. deepset-ai/FARM#202 (comment)). It seems that ALBERT is very sensitive to hyperparameters, and even then... For now there seems to be no solution. It is probably best to stick with another model. I'd recommend RoBERTa, but it depends on your use case.

@chikubee
Author

@BramVanroy I tried roberta-base as well, and the token-level similarity is also coming out very poor.
For example, "smoking" is getting matched with "software".

@BramVanroy
Collaborator

Can you share a repo with your full code?

@stale
stale bot commented Mar 24, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the wontfix label Mar 24, 2020
@stale stale bot closed this as completed Mar 31, 2020