
Update: These fixes have been included in the install.bat file at https://github.com/jllllll/one-click-installers, so you should be able to just clone that repo and run its install.bat without needing to run the code here.
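
For reference, using that repo instead would look roughly like this from a command prompt (assuming git is installed; otherwise download and extract the repo's zip):

  git clone https://github.com/jllllll/one-click-installers
  cd one-click-installers
  install.bat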

oobabooga-one-click-bandaid (Windows native solution)

A simple batch file that makes the oobabooga one click installer compatible with LLaMA 4-bit models and able to run on CUDA.

  1. Download and extract the Windows zip file from here: https://github.com/oobabooga/text-generation-webui/releases
  2. Place bandaid.bat in the same folder as install.bat.
  3. Double click install.bat and let it run all the way through.
  4. Double click bandaid.bat and let it run all the way through. (It will run install.bat again to fix some things that this hacky workaround breaks. Don't worry: everything runs from within the oobabooga folder, so the worst case is deleting everything and starting from scratch.)
  5. Place your models in the text-generation-webui\models folder. The folder structure should look like this:
models\
 |
 |- model-name-4bit.pt
 |- model-name\
     |- config.json
     |- generation_config.json
     |- pytorch_model.bin.index.json
     |- special_tokens_map.json
     |- tokenizer.model
     |- tokenizer_config.json
  6. Make sure tokenizer_config.json says "tokenizer_class": "LlamaTokenizer" and not "tokenizer_class": "LLaMATokenizer" (see the quick check after this list).
  7. Double click on start-webui.bat.
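
If you would rather sanity-check steps 5 and 6 from a command prompt than by eyeballing the files, something like the following should work (run it from the folder containing text-generation-webui; model-name is a placeholder for your actual model folder):

  rem List the models folder to confirm the layout shown above
  dir /b text-generation-webui\models
  rem Print the tokenizer_class line from the model's tokenizer config
  findstr "tokenizer_class" text-generation-webui\models\model-name\tokenizer_config.json

If findstr prints LLaMATokenizer instead of LlamaTokenizer, open the file in Notepad and fix the capitalization.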

And that's it! You should have a working install now. Just double click on start-webui.bat.

Note: Apparently some people are having trouble on Windows 11. You may have to manually edit your start-webui.bat file and add --gptq-bits 4 --gptq-model-type LLaMa to the call python server.py line (shown below), then double click install.bat and let it run all the way through one more time. If you are still getting CUDA errors, you are on your own; this is what worked for me. Good luck!
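
For clarity, that edit changes the launch line in start-webui.bat from:

  call python server.py --auto-devices --cai-chat

to:

  call python server.py --auto-devices --cai-chat --gptq-bits 4 --gptq-model-type LLaMa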

Credit: I just slapped this .bat file together. Most of the hard work was done by the users in this thread: qwopqwop200/GPTQ-for-LLaMa#11 (comment)
