ValueError: Cannot find the config file for gptq #22

vpurandara · 2024-06-09T14:41:40Z

ValueError: Cannot find the config file for gptq

vpurandara · 2024-06-15T15:46:38Z

PATH:
  INPUT: "./raw_txt_input"
  OUTPUT: "./output"
  DEFAULT_PROMPTS: "./prompts" # the baseline prompt folder that Augmentoolkit falls back to if it can't find a step in the PROMPTS path
  PROMPTS: "./prompts" # Where Augmentoolkit first looks for prompts
API:
  API_KEY: "" # Add the API key for your favorite provider here
  BASE_URL: "https://api.together.xyz" # add the base url for a provider, or local server, here. Some possible values:  http:https://127.0.0.1:5000/v1/ # <- local models. # https://api.together.xyz # <- together.ai, which is real cheap, real flexible, and real high-quality, if a tad unreliable. # https://api.openai.com/v1/ # <- OpenAI. Will bankrupt you very fast. # anything else that accepts OAI-style requests, so basically any API out there (openrouter, fireworks, etc etc etc...)
  LOGICAL_MODEL: "NousResearch/Hermes-2-Pro-Mistral-7B" # model used for everything except conversation generation at the very end
  LARGE_LOGICAL_MODEL: "NousResearch/Hermes-2-Pro-Mistral-7B" # model used for conversation generation at the very end. A pretty tough task, if ASSISTANT_MODE isn't on.
  QUANTIZATION_SMALL: "gptq" # Only use if Aphrodite mode is on.
  QUANTIZATION_LARGE: "gptq" # Only use if Aphrodite mode is on.
SYSTEM:
  USE_FILENAMES: False # give the AI context from the filenames provided to it. Useful if the filenames are meaningful, otherwise turn them off.
  ASSISTANT_MODE: True # If True, the conversations generated are between a user and an AI assistant. If False, the generated convs are between fictional characters in historical or fictional settings, with randomized personalities (some are nsfw by default, because a lot of model creators make models for that purpose. Change this (or amplify it) in ./augmentoolkit/generation_functions/special_instructions.py, it only requires changes to some strings.)
  DOUBLE_CHECK_COUNTER: 3 # How many times to check a question and answer pair during each validation step. Majority vote decides if it passes that step. There are three steps. So most questions are by default checked around 9 times (fewer if the first two checks for a step pass, obviously).
  USE_SUBSET: True # Whether to take only the first 13 chunks from a text during the run. Useful for experimenting and iterating and seeing all the steps without costing too much money or time.
  REARRANGEMENTS_TO_TAKE: 3 # How many times to rearrange the questions and answers for generating different conversations from the same group of questions and answers.
  CONCURRENCY_LIMIT: 50 # Hard limit of how many calls can be run at the same time, useful for API mode (aphrodite automatically manages this and queues things, as far as I know)
  COMPLETION_MODE: True # Change to false if you want to use chat (instruct) mode; this requires .json files in your chosen prompts directory, in the OpenAI API format. Not all APIs support completion mode.
  MODE: "aphrodite" # can be one of "api"|"aphrodite"
  GRAPH: False # Whether to show a pretty graph after filtering out stuff not worthy for questions, useful for seeing whether or not your text is suitable for making data from using Augmentoolkit by default. Will pause the pipeline's execution until you close the window, which is why this is false by default.
  STOP: True # True = Use stop tokens, False = do not use stop tokens. OpenAI's API restricts you to four stop tokens and all steps have way more than four stop tokens, so you'll need to turn this to False if you're using OAI's API. Also NOTE that if you turn this OFF while using COMPLETION MODE, EVERYTHING WILL BREAK and it will cost you money in the process. Don't do that.

this is the config file

e-p-armstrong · 2024-06-27T00:48:25Z

Hmm that's a strange one... do you have a traceback I can peek at? Like, the full error?

And is the config file still named config.yaml?

Also this config looks a bit broken, you're making requests to together.ai but there's no API key, meaning the requests will be incorrectly formatted. IDK if that's the cause of your problem but you should probably fix that.

In the most recent version of Augmentoolkit, aphrodite is supported only by running the aphrodite enigne in server mode (this is to make things easier to use, settings-wise). So, the problem you describe may have been patched out, since Augmentoolkit no longer has a quantization mode setting.

LoopControl · 2024-07-01T15:20:30Z

The error is because you're trying to use GPTQ with a non-GPTQ model. The model you linked is a full precision transformers model.

You want something more like this: https://huggingface.co/qeternity/Nous-Hermes-2-Mistral-7B-DPO-GPTQ-4bit-128g-actorder_False-Marlin/tree/main (except this is not hermes pro 2).

Look for a model that has GPTQ in the name on Huggingface (or convert your Hermes-Pro model to GPTQ).

e-p-armstrong · 2024-07-03T21:42:41Z

LoopControl is right. Closing this since it's not an issue with the project itself, thanks for your report!

vpurandara changed the title ~~ValueError: Cannot find the config file for awq~~ ValueError: Cannot find the config file for gptq Jun 18, 2024

e-p-armstrong closed this as completed Jul 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ValueError: Cannot find the config file for gptq #22

ValueError: Cannot find the config file for gptq #22

vpurandara commented Jun 9, 2024 •

edited

Loading

vpurandara commented Jun 15, 2024

e-p-armstrong commented Jun 27, 2024

LoopControl commented Jul 1, 2024 •

edited

Loading

e-p-armstrong commented Jul 3, 2024

ValueError: Cannot find the config file for gptq #22

ValueError: Cannot find the config file for gptq #22

Comments

vpurandara commented Jun 9, 2024 • edited Loading

vpurandara commented Jun 15, 2024

e-p-armstrong commented Jun 27, 2024

LoopControl commented Jul 1, 2024 • edited Loading

e-p-armstrong commented Jul 3, 2024

vpurandara commented Jun 9, 2024 •

edited

Loading

LoopControl commented Jul 1, 2024 •

edited

Loading