Weird results when translating English to Finnish (using EasyNMT with opus-mt) #55
Comments
Yes, that looks a bit weird. The model on Hugging Face does not seem to handle that kind of input well. A newer OPUS-MT model, at least, no longer does this. You can try it here: https://translate.ling.helsinki.fi/ui/memad
Thanks for the reply. I was able to solve the problem by using the new Tatoeba model. In case someone else has the same problem, follow these instructions to convert Tatoeba models into Hugging Face format: Then you can use the model with this code (copied from here):
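The snippet referred to above is not preserved in this copy of the thread. As a stand-in, here is a minimal sketch of how a converted model might be loaded and used. It assumes the conversion wrote the model to a local directory (the path `converted/eng-fin` is hypothetical) and that the result loads with the Marian classes from `transformers`, as OPUS-MT models generally do. The `batched` helper and the `batch_size` default are illustrative choices, not something from the original post.

```python
from typing import Iterator, List


def batched(items: List[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive chunks of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def translate(texts: List[str], model_dir: str, batch_size: int = 8) -> List[str]:
    """Translate `texts` with a Marian-format model stored in `model_dir`.

    Requires `transformers`, `torch` and `sentencepiece`; they are imported
    here so the helper above stays usable without them.
    """
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(model_dir)
    model = MarianMTModel.from_pretrained(model_dir)

    results: List[str] = []
    for batch in batched(texts, batch_size):
        # Pad to the longest sentence in the batch and return PyTorch tensors.
        encoded = tokenizer(batch, return_tensors="pt", padding=True, truncation=True)
        generated = model.generate(**encoded)
        results.extend(tokenizer.batch_decode(generated, skip_special_tokens=True))
    return results


# Example call (hypothetical path to the converted eng-fin model):
# print(translate(["=== Inclusions"], "converted/eng-fin"))
```

Loading the model directly like this bypasses EasyNMT, which makes it easier to tell whether odd output comes from the model itself or from the wrapper.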
PS. Conversion worked only for the "eng-fin" model. The "fin-eng" model failed with what looks like a dimension mismatch error: "raise ValueError(f"Hidden size {hidden_size} and configured size {cfg['dim_emb']} mismatched or not 512") KeyError: 'dim_emb'"
While translating English to Finnish using your model via EasyNMT, I noticed something weird. Check this code and the results.
The output is:
So "=== Inclusions" is translated into "Suomennos: Michael T. Francis Pinmontagne SUBHEAVEN.ORG".
What is going on here? Is this a problem with the OPUS-MT model or with its EasyNMT implementation?
PS. The sample text is from the ESCO ontology.
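The code and output mentioned above did not survive in this copy of the issue. For anyone trying to reproduce the behaviour, a rough sketch of one way to isolate the offending line follows. It assumes the `easynmt` package with its `opus-mt` model and translates the text one line at a time, so a markup-like line such as "=== Inclusions" can be inspected on its own. The helper name `translate_line_by_line` and the second sample line are hypothetical. Only "=== Inclusions" is taken from the report.

```python
from typing import Callable, List, Tuple


def translate_line_by_line(
    text: str, translate_fn: Callable[[str], str]
) -> List[Tuple[str, str]]:
    """Translate each non-empty line separately and pair it with its output."""
    pairs: List[Tuple[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines, there is nothing to translate
        pairs.append((line, translate_fn(line)))
    return pairs


def run_with_easynmt() -> None:
    """Run the check against EasyNMT (downloads the model on first use)."""
    from easynmt import EasyNMT

    model = EasyNMT("opus-mt")
    # Only the first line comes from the issue; the second is a placeholder.
    sample = "=== Inclusions\nThis is a placeholder sentence."
    pairs = translate_line_by_line(
        sample,
        lambda line: model.translate(line, source_lang="en", target_lang="fi"),
    )
    for source, target in pairs:
        print(f"{source!r} -> {target!r}")


# Call run_with_easynmt() to try it; it is not invoked automatically.
```

Because the translation function is passed in, the same helper can be pointed at a converted Tatoeba model to compare the two outputs line by line.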