Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[MII] catch error wrt HF version and Mistral #4634

Merged
merged 4 commits into from
Nov 13, 2023
Merged

[MII] catch error wrt HF version and Mistral #4634

merged 4 commits into from
Nov 13, 2023

Conversation

jeffra
Copy link
Contributor

@jeffra jeffra commented Nov 6, 2023

If a user is trying to use Mistral with MII and running transformers <= 4.34.0 they will get a cryptic KeyError from transformers. This is documented on the Mistral model page https://huggingface.co/mistralai/Mistral-7B-v0.1 but is an easy error to run into for users.

This PR adds a version check to error out more gracefully. This should address at least part of the issue this user reported: microsoft/DeepSpeed-MII#258

Copy link
Contributor

@cmikeh2 cmikeh2 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Only comment from me is if it might make more sense to do this in the engine factory rather than the policy itself.

@jeffra jeffra enabled auto-merge November 6, 2023 17:45
@jeffra jeffra disabled auto-merge November 6, 2023 19:57
@jeffra jeffra merged commit 0a6095f into master Nov 13, 2023
9 checks passed
@jeffra jeffra deleted the mistral-check branch November 13, 2023 19:29
mauryaavinash95 pushed a commit to mauryaavinash95/DeepSpeed that referenced this pull request Feb 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants