Code Cleanup #208
Labels
documentation
Comments
I think with the latest pull from #269 we can call this done. Lots of terrible megatron code has been removed, and we now have to maintain only a single model. Most unused codepaths are removed.
Basically want the code to be easier for new devs to get their head around. Some suggested steps we could take toward this:
- `Pooler` class in `megatron/model/language_model.py`
- `GPT2Model`, which then calls `get_language_model`, which then calls `TransformerLanguageModel`, which then calls `ParallelTransformer` ... Dumb as hell.
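To make the complaint about the call chain concrete, here is a rough sketch of that kind of layered indirection. It borrows the names from the list above, but every class body, the `num_layers` parameter, and `FlatGPT2Model` are hypothetical stand-ins for illustration, not the actual Megatron code.

```python
# Toy stand-ins only: the names mirror the issue, the bodies are invented.

class ParallelTransformer:
    """Innermost layer: the only class that does any work in this sketch."""

    def __init__(self, num_layers):
        self.num_layers = num_layers

    def forward(self, x):
        # Placeholder computation: add 1 per layer.
        return x + self.num_layers


class TransformerLanguageModel:
    """Wrapper that only forwards to ParallelTransformer."""

    def __init__(self, num_layers):
        self.transformer = ParallelTransformer(num_layers)

    def forward(self, x):
        return self.transformer.forward(x)


def get_language_model(num_layers):
    """Factory that only constructs TransformerLanguageModel."""
    return TransformerLanguageModel(num_layers)


class GPT2Model:
    """Outermost wrapper: three hops before reaching the transformer."""

    def __init__(self, num_layers):
        self.language_model = get_language_model(num_layers)

    def forward(self, x):
        return self.language_model.forward(x)


class FlatGPT2Model:
    """Hypothetical flattened version: owns the transformer directly."""

    def __init__(self, num_layers):
        self.transformer = ParallelTransformer(num_layers)

    def forward(self, x):
        return self.transformer.forward(x)
```

Both models return the same result for the same input; the flat one just gives a new dev one class to read instead of four hops to trace.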