Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evaluation for MegatronT5 Model #1907

Closed
wangyanbao666 opened this issue May 31, 2024 · 4 comments
Closed

Evaluation for MegatronT5 Model #1907

wangyanbao666 opened this issue May 31, 2024 · 4 comments

Comments

@wangyanbao666
Copy link

Hi I have noticed that there are existing support for some nemo models. But it does not seem that there is a support for MegatronT5 Model. Anyone has ideas how to evaluate this model?

@haileyschoelkopf
Copy link
Contributor

Hi! We don't currently support this, though would welcome contributions adding it.

Perhaps @sergiopperez has input on how simple it would be to extend the current NeMo integration into Megatron-T5?

@wangyanbao666
Copy link
Author

Hi any updates on how to evaluate T5? Could you please give some clue on how to extend to Megatron-T5?

@StellaAthena
Copy link
Member

Hi any updates on how to evaluate T5? Could you please give some clue on how to extend to Megatron-T5?

If you are interested in evaluating a pretrained model such as NeMo Megatron T5 that has been released on Hugging Face, the Hugging Face version is compatible with this library. However we still lack support for models native to the Megatron library.

@haileyschoelkopf
Copy link
Contributor

Closing as NeMo Megatron-T5 support won't be prioritized, though we'd welcome a contribution for it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants