Does memorization in small models predict memorization in large models? #19

Closed
uSaiPrashanth opened this issue Nov 26, 2022 · 3 comments
Labels: good first issue, help wanted

Comments

@uSaiPrashanth
Member

No description provided.

@uSaiPrashanth uSaiPrashanth self-assigned this Nov 26, 2022
@StellaAthena StellaAthena self-assigned this Dec 1, 2022
@StellaAthena StellaAthena added the good first issue and help wanted labels Dec 1, 2022
@StellaAthena
Member

We currently have the following correlation heat-map, which indicates that the answer is "no." We should probably also make confusion matrices for the classifier that predicts whether the 13B model memorized a sequence by assuming its behavior matches that of a small model.

[Image: correlation heat-map]
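A minimal sketch of the confusion matrix suggested above, assuming each model's results are available as one boolean "memorized" label per sequence, aligned by index. The function name `confusion_matrix` and the toy labels are hypothetical and for illustration only; they are not taken from this repository.

```python
from typing import Dict, Sequence


def confusion_matrix(small: Sequence[bool], large: Sequence[bool]) -> Dict[str, int]:
    """Use the small model's labels as predictions of the large model's labels."""
    if len(small) != len(large):
        raise ValueError("label sequences must have the same length")
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for predicted, actual in zip(small, large):
        if predicted and actual:
            counts["tp"] += 1  # memorized by both models
        elif predicted and not actual:
            counts["fp"] += 1  # memorized by the small model only
        elif not predicted and actual:
            counts["fn"] += 1  # memorized by the large model only
        else:
            counts["tn"] += 1  # memorized by neither model
    return counts


# Made-up labels, not real results.
small_labels = [True, False, True, False, False, True]
large_labels = [True, True, False, False, True, True]
cm = confusion_matrix(small_labels, large_labels)
print(cm)
```

A large `fn` count relative to `tp` would mean the bigger model memorizes many sequences that the small model does not, which is one way the "no" answer could show up.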

@StellaAthena StellaAthena removed their assignment Dec 1, 2022
@lintangsutawika
Contributor

There seems to be a weak trend: the larger the model, the better it predicts whether a sequence is memorized by the 13B model.
[8 images]
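One way to put a number on a trend like this is to compute precision and recall for each smaller model against the 13B labels. This is a sketch under assumptions: the model names and labels below are made up, and `precision_recall` is a hypothetical helper, not code from this repository.

```python
from typing import Dict, Sequence, Tuple


def precision_recall(pred: Sequence[bool], actual: Sequence[bool]) -> Tuple[float, float]:
    """Precision and recall of `pred` as a predictor of `actual`."""
    tp = sum(1 for p, a in zip(pred, actual) if p and a)
    fp = sum(1 for p, a in zip(pred, actual) if p and not a)
    fn = sum(1 for p, a in zip(pred, actual) if not p and a)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Made-up labels: True means the sequence counts as memorized by that model.
target_13b = [True, True, False, False, True, False]
labels_by_model: Dict[str, Sequence[bool]] = {
    "small": [True, False, False, True, False, False],
    "medium": [True, True, False, True, False, False],
}

scores = {
    name: precision_recall(labels, target_13b)
    for name, labels in labels_by_model.items()
}
for name, (precision, recall) in scores.items():
    print(f"{name}: precision={precision:.2f} recall={recall:.2f}")
```

If both numbers rise with model size on the real labels, that would support the trend; the toy data here shows only how the comparison is set up.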

@StellaAthena
Member

The same comments about follow-ups I made on #29 apply here. The conclusions are different but the methodologies are the same.

Projects
Status: Done