Fit the exponential decay curve to accuracy distribution #23

StellaAthena · 2022-11-27T04:50:10Z

We hypothesize that the Scatter SDE summary plot of the accuracy distribution is an exponential decay with a bump at acc = 1 corresponding to the sum of the tail probabilities (since the memorization score can't go above 1). Specifically, let p(x) = [the number of sequences in the training data that have accuracy x]. We want to do the following:

Fit an exponential decay curve to p(x) looking only at x in [0, k] for k in [0.25, 0.5, 0.75, 0.9, 0.99]
Check how well the curves agree on [k, infinity)
Check whether the sum from i = 1 to infinity of p(i) according to the fit model equals the observed p(1) value.

StellaAthena · 2022-12-08T14:36:36Z

We were recently discussing this theory in Discord, and it occurred to @norabelrose and I that an exponential decay pattern does not actually agree with the tail probability theory, as the tails an exponential decay are too fat to produce noticable bumps. However, power laws do have fat tails and a "rich get richer" dynamic makes sense in the context of memorization as the more detail one specifies about the generated sequence the more locked-in the model should be to the correct distribution.

Last night Nora decided to run some basic analysis, and lo and behold:

CalmDownKarm · 2023-02-11T01:51:18Z

Does this issue still need help?

StellaAthena · 2023-02-13T20:21:16Z

@CalmDownKarm Thanks for reaching out! We took care of this, and are currently preparing a paper for release detailing the results.

StellaAthena changed the title ~~Fit the exponential decay curve to accuracy distribution, confirm the hypothesis that the cut-off tail sums to the spike.~~ Fit the exponential decay curve to accuracy distribution Dec 1, 2022

StellaAthena added good first issue Good for newcomers help wanted This issue needs assistance labels Dec 1, 2022

StellaAthena assigned norabelrose Dec 8, 2022

StellaAthena closed this as completed Feb 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fit the exponential decay curve to accuracy distribution #23

Fit the exponential decay curve to accuracy distribution #23

StellaAthena commented Nov 27, 2022 •

edited

Loading

StellaAthena commented Dec 8, 2022

CalmDownKarm commented Feb 11, 2023

StellaAthena commented Feb 13, 2023

Fit the exponential decay curve to accuracy distribution #23

Fit the exponential decay curve to accuracy distribution #23

Comments

StellaAthena commented Nov 27, 2022 • edited Loading

StellaAthena commented Dec 8, 2022

CalmDownKarm commented Feb 11, 2023

StellaAthena commented Feb 13, 2023

StellaAthena commented Nov 27, 2022 •

edited

Loading