This repository hosts the components necessary for implementing all the methods proposed in the paper Time Travel in LLMs: Tracing Data Contamination in Large Language Models, authored by Shahriar Golchin and Mihai Surdeanu.
Our research is the first to systematically uncover and detect data contamination in fully black-box large language models (LLMs). The core idea is that if an LLM has seen a dataset instance during its pre-training phase, it is able to replicate it. This is supported by two observations: (1) LLMs have enough capacity to memorize data; and (2) LLMs are trained to follow instructions effectively. However, because of the safety filters built into LLMs to keep them from generating copyrighted content, explicitly asking an LLM to reproduce these instances is ineffective, as it triggers those safety mechanisms. Our method circumvents these filters by having the LLM replicate dataset instances given only their random-length initial segments. Below is an example of our strategy in action, in which the subsequent segment of an instance from the train split of the IMDB dataset is exactly replicated by GPT-4.
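To make the idea concrete, here is a minimal, self-contained sketch of the guided-completion setup described above. The function names, the prefix-length range, and the use of `difflib.SequenceMatcher` as a crude stand-in for ROUGE-L are all illustrative assumptions, not the repository's actual implementation; in practice the completion comes from prompting the LLM with the prefix.

```python
import random
from difflib import SequenceMatcher

def split_at_random_point(instance, rng, lo=0.3, hi=0.7):
    # Cut the instance after a random-length word prefix: the prefix is
    # shown to the LLM, and the remainder is the hidden reference
    # completion. The (lo, hi) fraction range is an illustrative choice.
    words = instance.split()
    cut = max(1, min(len(words) - 1, int(len(words) * rng.uniform(lo, hi))))
    return " ".join(words[:cut]), " ".join(words[cut:])

def overlap_score(candidate, reference):
    # Crude stand-in for ROUGE-L: similarity ratio over word sequences.
    # A (near-)exact match between the LLM's completion and the reference
    # signals that the instance was likely seen during pre-training.
    return SequenceMatcher(None, candidate.split(), reference.split()).ratio()
```

For example, `split_at_random_point` applied to a dataset instance yields a prefix to place in the prompt and a reference tail to score the model's completion against.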
Start the process by cloning the repository, executing the command below:
git clone https://github.com/shahriargolchin/time-travel-in-llms.git
Afterward, proceed to the project's directory with this command:
cd time-travel-in-llms
Next, establish a conda environment to streamline the installation of required packages:
conda create --name time-travel-env python=3.11.8
Now, activate your environment:
conda activate time-travel-env
Lastly, use pip to install all the requisite packages:
pip install -r requirements.txt
Important
Note that the above command installs the packages needed for running evaluations via ROUGE-L and GPT-4 in-context learning (ICL). Evaluation with BLEURT requires an additional installation, since BLEURT is used as a dependency of this project. To set it up, execute the following commands or refer to the BLEURT repository, but make sure it is located within dependencies/bleurt_scorer. You may skip these steps if you do not need to perform evaluations using BLEURT.
git clone https://github.com/google-research/bleurt.git dependencies/bleurt_scorer
cd dependencies/bleurt_scorer
pip install .
Then, download a BLEURT model checkpoint by running the following commands. (Note that we used the BLEURT-20 checkpoint for our study, and the commands below download this particular checkpoint. You can use any other checkpoint from the list available here.)
wget https://storage.googleapis.com/bleurt-oss-21/BLEURT-20.zip
unzip BLEURT-20.zip
Alternatively, if you do not have wget installed, you can use the following commands instead:
curl -O https://storage.googleapis.com/bleurt-oss-21/BLEURT-20.zip
unzip BLEURT-20.zip
For every scenario discussed in the paper, we provide a corresponding bash file in the scripts directory. Running these scripts detects data contamination for the examined subset of data. In the results directory, individual text files are generated for each evaluation method applied (ROUGE-L, BLEURT, and GPT-4 ICL) showing pass/fail results for contamination detection. The input CSV file, along with all intermediate results, is also stored in the same directory.
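As a rough illustration of how per-instance scores might be turned into the pass/fail verdicts written to those text files, here is a hedged sketch. The function name and threshold value are hypothetical; the paper's actual decision rules differ per metric (ROUGE-L, BLEURT, GPT-4 ICL) and are not reproduced here.

```python
def contamination_verdicts(scores, threshold):
    # Map each per-instance similarity score (e.g. ROUGE-L or BLEURT of the
    # LLM's completion against the reference completion) to a verdict.
    # `threshold` is an illustrative placeholder, not a value from the paper.
    return ["contaminated" if s >= threshold else "clean" for s in scores]
```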
Before running the experiments, export your OpenAI API key so that the OpenAI models used in this project (GPT-4 and GPT-3.5) can be accessed, with the following command:
export OPENAI_API_KEY=your-api-key
To run an experiment, first navigate to the scripts/dataset-name directory, where bash scripts for each partition of a dataset (e.g., train, test/validation) are located. Assuming you are in the root directory, you can do this with the command below:
cd scripts/dataset-name
Once inside the relevant directory, make the bash file executable by running this command:
chmod +x bash-file-name.sh
Finally, run the experiment by executing:
./bash-file-name.sh
To cite our paper, use the following BibTeX entry:
@article{DBLP:journals/corr/abs-2308-08493,
author = {Shahriar Golchin and
Mihai Surdeanu},
title = {Time Travel in LLMs: Tracing Data Contamination in Large Language
Models},
journal = {CoRR},
volume = {abs/2308.08493},
year = {2023},
url = {https://doi.org/10.48550/arXiv.2308.08493},
doi = {10.48550/ARXIV.2308.08493},
eprinttype = {arXiv},
eprint = {2308.08493},
timestamp = {Thu, 24 Aug 2023 12:30:27 +0200},
biburl = {https://dblp.org/rec/journals/corr/abs-2308-08493.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}
If you are interested in data contamination detection in LLMs, you may also find our second paper, Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models (repo available here), particularly useful. In it, we present a novel method not only for detecting contamination in LLMs but also for estimating its amount in fully closed-source LLMs. You can cite this paper using the BibTeX entry below:
@article{DBLP:journals/corr/abs-2311-06233,
author = {Shahriar Golchin and
Mihai Surdeanu},
title = {Data Contamination Quiz: {A} Tool to Detect and Estimate Contamination
in Large Language Models},
journal = {CoRR},
volume = {abs/2311.06233},
year = {2023},
url = {https://doi.org/10.48550/arXiv.2311.06233},
doi = {10.48550/ARXIV.2311.06233},
eprinttype = {arXiv},
eprint = {2311.06233},
timestamp = {Wed, 15 Nov 2023 16:23:10 +0100},
biburl = {https://dblp.org/rec/journals/corr/abs-2311-06233.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}