Restrict model options in transformers examples #2189

Closed
KickItLikeShika opened this issue Sep 9, 2021 · 7 comments · Fixed by #2190
Comments

@KickItLikeShika
Contributor

KickItLikeShika commented Sep 9, 2021

In the transformers example https://github.com/pytorch/ignite/tree/master/examples/contrib/transformers, it's up to the user to override the default model, which is bert-base-uncased. Many models take inputs and produce outputs similar to BERT's, but models like distilbert-base-uncased, distilroberta-base, and bart-base (and many others) will not work here, because they handle inputs and outputs in a slightly different way. Check here for more info: https://huggingface.co/transformers/model_doc/distilbert.html#distilbertmodel

So we get an error similar to this while using DistilBERT:

KeyError: 'token_type_ids'
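A minimal sketch of the failure mode (the dicts and the `build_batch` helper are illustrative, not the example's exact code): the example's dataset builds a batch by indexing fixed keys, but DistilBERT's tokenizer never emits `token_type_ids`, so the lookup raises.

```python
# Hypothetical encodings: BERT-style tokenizers emit token_type_ids,
# DistilBERT-style tokenizers do not.
bert_encoding = {"input_ids": [101, 2023, 102], "attention_mask": [1, 1, 1],
                 "token_type_ids": [0, 0, 0]}
distilbert_encoding = {"input_ids": [101, 2023, 102], "attention_mask": [1, 1, 1]}

def build_batch(encoding):
    # mirrors dataset.py's approach of indexing the keys unconditionally
    return {k: encoding[k] for k in ("input_ids", "attention_mask", "token_type_ids")}

build_batch(bert_encoding)        # works
try:
    build_batch(distilbert_encoding)
except KeyError as e:
    print(e)                      # 'token_type_ids'
```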

Also, something like XLNet won't work well either: we get a dimension issue, because XLNet doesn't return a pooler_output. Check here: https://huggingface.co/transformers/model_doc/xlnet.html#transformers.XLNetModel

So what we should do is one of the following:

  • Drop the option of choosing a model and keep only BERT; if users want to change the model, they can do that manually.
  • Keep it as is and mention in the docs which models can be used here: BERT, RoBERTa (and any others that work).

EDIT: After deciding to go with the first option, what we need to do now is to remove model from the arguments, set config['model'] = 'bert-base-uncased' manually inside the run method, and mention in the docs that we use BERT by default, perhaps with a note that users who want to try another model should do that themselves.
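The change described above could look roughly like this (a sketch; the argument names and `run` signature are assumptions, not the example's exact code):

```python
# Before: "model" was a configurable argument. After: it is pinned inside run().
def run(batch_size=32, max_length=256, **other_config):
    config = {"batch_size": batch_size, "max_length": max_length, **other_config}
    # BERT is hard-coded; users who want another model must ensure it matches
    # BERT's inputs (token_type_ids) and outputs (pooler_output), then edit this line.
    config["model"] = "bert-base-uncased"
    return config
```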

@vfdev-5
Collaborator

vfdev-5 commented Sep 9, 2021

Thanks for reporting that @KickItLikeShika, I think the first option is good enough.

Could you please provide a few links to the code and explain in detail how to do that, so that someone new could try to tackle the issue? Thanks

@KickItLikeShika
Contributor Author

@vfdev-5 Updated the description.

@Ishan-Kumar2
Contributor

@KickItLikeShika Another way I think we can fix this is by passing the dictionary that the tokenizer returns instead of building our own dictionary in dataset.py. Models that don't require token_type_ids have tokenizers that won't produce them either. Have a look at this example of classification from the HF repo.
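A sketch of this suggestion (the class and its signature are illustrative, not ignite's actual dataset.py): the dataset returns the tokenizer's own dict unchanged, so the model only ever receives the keys its tokenizer actually produced.

```python
class TransformerDataset:
    """Hypothetical dataset that forwards the tokenizer's output as-is."""

    def __init__(self, texts, tokenizer, max_length=256):
        self.texts = texts
        self.tokenizer = tokenizer
        self.max_length = max_length

    def __getitem__(self, idx):
        # Whatever keys the tokenizer emits (with or without token_type_ids)
        # are passed through untouched -- no hand-picked key list to go stale.
        return self.tokenizer(self.texts[idx],
                              truncation=True, max_length=self.max_length)

    def __len__(self):
        return len(self.texts)
```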

@KickItLikeShika
Contributor Author

@Ishan-Kumar2 That seems like a good idea for resolving the input issues, but we still have the same issue with the transformer output: most other transformers don't have a pooler_output. I don't see how your suggestion solves that, what do you think?

@Ishan-Kumar2
Contributor

Ishan-Kumar2 commented Sep 9, 2021

We can replace the model with AutoModelForSequenceClassification and then have the loss computed like this:

outputs = model(**inputs, labels=labels)
loss = outputs.loss
logits = outputs.logits

This should work for both, looking at these examples for XLNet and BERT, right?
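To make the proposed training step concrete, here is a self-contained sketch; the `model` function below is a stand-in for AutoModelForSequenceClassification, which (when labels are passed) returns an object carrying `.loss` and `.logits`. The placeholder values are assumptions for illustration only.

```python
from types import SimpleNamespace

def model(input_ids, attention_mask, labels=None, **unused):
    # Stand-in for AutoModelForSequenceClassification.forward:
    # one logits row per input sequence, loss only when labels are given.
    logits = [[0.1, 0.9] for _ in input_ids]
    loss = 0.5 if labels is not None else None  # placeholder loss value
    return SimpleNamespace(loss=loss, logits=logits)

def train_step(batch, labels):
    # The pattern from the comment above: unpack the tokenizer's dict,
    # pass labels, and read loss/logits off the returned object.
    outputs = model(**batch, labels=labels)
    return outputs.loss, outputs.logits
```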

@KickItLikeShika
Contributor Author

Initially it looks good, what do you think @vfdev-5? If that's OK, can you try this out please, @Ishan-Kumar2?

@vfdev-5
Collaborator

vfdev-5 commented Sep 9, 2021

OK for me, if you think it would be understandable to other users.
