Updated qlora.py to fix freezing of embedding layers #217
Updated the way embeddings are frozen by changing the order of operations. In the original codebase, the model was loaded, LoRA-fied, and only then was the tokenizer resized. Resizing resets the `requires_grad` flag of the embedding layers back to its default (`True`), which is not what you want. The fix is simply a different order: first load the base model, then resize the token embeddings, then call `prepare_for_kbit_training` to freeze the model's original weights, then LoRA-fy, and then train.
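
A minimal sketch of the corrected order of operations, using the standard `transformers`/`peft` APIs. The model name, added pad token, and LoRA hyperparameters below are placeholders for illustration, not the exact values used in qlora.py; `prepare_model_for_kbit_training` is peft's name for the freezing step referred to above.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_name = "huggyllama/llama-7b"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.add_special_tokens({"pad_token": "[PAD]"})  # example of growing the vocab

# 1) Load the quantized base model.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    torch_dtype=torch.bfloat16,
)

# 2) Resize the embeddings FIRST, so the freezing step below sees the final layers.
model.resize_token_embeddings(len(tokenizer))

# 3) Freeze the base weights (sets requires_grad=False on the original parameters).
model = prepare_model_for_kbit_training(model)

# 4) Only now attach the LoRA adapters; their parameters stay trainable.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# 5) Train as usual; the embedding layers stay frozen because nothing after
#    step 3 resets their requires_grad flag.
```

With the original order (LoRA-fy, then resize), step 2 would run last and flip the embedding layers back to trainable, which is exactly the bug this PR fixes.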