RDRec

Paper

  • RDRec: Rationale Distillation for LLM-based Recommendation, ACL 2024 Main (short).
  • Xinfeng Wang, Jin Cui, Yoshimi Suzuki, Fumiyo Fukumoto.

Note

  • Please use the latest code released on June 11th, 2024.
  • The checkpoints of the RDRec model have been uploaded to Google Drive and Baidu Drive.
  • The experimental setup follows POD. If you run into any problems, please check our code or this paper.

Instructions

Step 1. Distill rationales before running RDRec

(a) Install Llama 2 (download model weights and tokenizer)

    Get a license from [the site](https://llama.meta.com/llama-downloads/)
    >> cd llama 
    >> ./download.sh (License required)
    >> pip install -e .

(b) Test the Llama 2 environment (under ./llama )

    >> torchrun --nproc_per_node 1 example_chat_completion.py \
      --ckpt_dir llama-2-7b-chat/ \
      --tokenizer_path tokenizer.model \
      --max_seq_len 512 --max_batch_size 6

(c) Rationale distillation ({dataset}: beauty, sports, or toys) (under ./RDRec )

    >> torchrun --nproc_per_node 1 data/{dataset}/distillation_{dataset}.py \
      --ckpt_dir llama/llama-2-7b-chat/ \
      --tokenizer_path llama/tokenizer.model \
      --max_seq_len 512 --max_batch_size 6

Step 2. Train and test RDRec

(a) Install requirements

    >> pip install -r requirement.txt

(b) Pre-training ({dataset}: beauty, sports, or toys) (under ./RDRec )

    >> python pretrain.py --data_dir ./data/{dataset}/ --cuda --batch_size 64 --checkpoint ./checkpoint/{dataset}/

(c) Recommendation inference

    >> python seq.py --data_dir ./data/{dataset}/ --cuda --batch_size 32 --checkpoint ./checkpoint/{dataset}/
    >> python topn.py --data_dir ./data/{dataset}/ --cuda --batch_size 32 --checkpoint ./checkpoint/{dataset}/
    >> python exp.py --data_dir ./data/{dataset}/ --cuda --batch_size 32 --checkpoint ./checkpoint/{dataset}/
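
To process all three datasets in one go, the pre-training and inference commands above can be chained. The following sketch is not part of the repository; it simply replays those commands via subprocess from the ./RDRec directory.

    import subprocess

    # Replay the README commands for every dataset (run from ./RDRec).
    for d in ["beauty", "sports", "toys"]:
        data_dir, ckpt = f"./data/{d}/", f"./checkpoint/{d}/"
        subprocess.run(["python", "pretrain.py", "--data_dir", data_dir, "--cuda",
                        "--batch_size", "64", "--checkpoint", ckpt], check=True)
        for task in ["seq.py", "topn.py", "exp.py"]:
            subprocess.run(["python", task, "--data_dir", data_dir, "--cuda",
                            "--batch_size", "32", "--checkpoint", ckpt], check=True)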

Others

  • All experiments, including rationale distillation, can be conducted on a single Nvidia GeForce RTX 3090 (24 GB memory). Reduce the batch size if you encounter an out-of-memory (OOM) error on some datasets.
  • RDRec's sequential recommendation results fluctuate somewhat across runs. The paper reports results averaged over 10 trials (see t_test.py for details; a minimal paired t-test sketch follows this list). If the results are not satisfactory, please pre-train the model again.
  • If you have any questions, please feel free to contact me at [email protected].
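
As a reference for interpreting the 10-trial averages, here is a minimal paired t-test sketch using scipy; the numbers are placeholders, not results from the paper, and the repository's t_test.py may be implemented differently.

    from scipy import stats

    # Placeholder per-trial scores (e.g., NDCG@5) -- not numbers from the paper.
    rdrec_trials    = [0.051, 0.049, 0.050, 0.052, 0.048, 0.050, 0.051, 0.049, 0.050, 0.052]
    baseline_trials = [0.047, 0.046, 0.048, 0.047, 0.046, 0.047, 0.048, 0.046, 0.047, 0.047]

    t_stat, p_value = stats.ttest_rel(rdrec_trials, baseline_trials)  # paired t-test across trials
    print(f"mean RDRec = {sum(rdrec_trials) / len(rdrec_trials):.4f}, t = {t_stat:.3f}, p = {p_value:.4f}")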

Code Reference

Citation

If this repository helps you, please cite:

@inproceedings{wang2024rdrec,
  title={RDRec: Rationale Distillation for LLM-based Recommendation},
  author={Wang, Xinfeng and Cui, Jin and Suzuki, Yoshimi and Fukumoto, Fumiyo},
  booktitle={Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)},
  year={2024}
}
