Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
can_alpaca_vicuna_answer.py		can_alpaca_vicuna_answer.py
does_alpaca_vicuna_lie.py		does_alpaca_vicuna_lie.py
evaluate_alpaca_vicuna.sh		evaluate_alpaca_vicuna.sh
evaluate_alpaca_vicuna_sweep.sh		evaluate_alpaca_vicuna_sweep.sh
generate_alpaca_vicuna_logprobs.py		generate_alpaca_vicuna_logprobs.py
lying_and_detection_results.ipynb		lying_and_detection_results.ipynb
minimal_alpaca_vicuna_test.py		minimal_alpaca_vicuna_test.py

README.md

Test Vicuna and Alpaca models

Alpaca and Vicuna are instruction-finetuned versions of Llama. As such, we tried the same lying prompts which were used for GPT-3.5 (text-davinci-003). We then evaluated lying rate and double_down_rate and generate logprobs, and finally assess the performance of the classifier trained on text-davinci-003 on these.

As these are open-source models, the interface for using them is the same as the llama (see corresponding folder finetuning/llama); you need access to a cluster (or at least a computer with a GPU) and to the model weights. They also rely on the deepspeed_llama codebase.

can_alpaca_vicuna_answer.py tests whether the original alpaca/vicuna model can answer to questions in the dataset
does_alpaca_vicuna_lie.py tests whether the alpaca/vicuna model actually lie to the questions with the different prompts
generate_alpaca_vicuna_logprobs.py generates the logprobs for the truthful and lying prompts.
lying_and_detection_results.ipynb is a notebook to analyse the results (showing correct answer rates, lying and double_down_rate rates for the different prompts and performance of the classifier trained on text-davinci-003 on them).
The two *sh files are example of slurm to run the above experiments

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experiments_alpaca_vicuna

experiments_alpaca_vicuna

README.md

Test Vicuna and Alpaca models

Files

experiments_alpaca_vicuna

Directory actions

More options

Directory actions

More options

Latest commit

History

experiments_alpaca_vicuna

Folders and files

parent directory

README.md

Test Vicuna and Alpaca models