The following use cases should be kept in mind for the refactor:
- It should actually not be a pile of spaghetti.
- It should be possible to implement modules that recognize that one LM query may be usable to answer multiple requests: for example, when the same query is repeated, or when two loglikelihood queries share the same context but have different single-token continuations (there are a load of these; anything that continues with " yes" and " no" fits the bill). While it would be a violation of the abstraction to actually look at how many tokens the continuation is, it still makes sense to aggregate requests with the same context and let the LM know somehow that these are potentially optimizable; even if it's not a single-token continuation, it might still be possible to cache the context or something. (This will require moderate changes to the LM interface.) Tentatively, I'm thinking of a flag the evaluator can pass along per-request to say "hey, this should be cached for x uses" (the evaluator counts how many times it expects the context to be reused), so only the things that actually get reused get cached, and they get evicted when no longer necessary. There is a chance that a new LM implementation might not handle this in exactly the same way and get out of sync on the count, but that would just introduce inefficiency rather than break anything. Any better proposals are welcome. A minimal sketch of the idea follows this list.
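
A minimal Python sketch of the reuse-count idea, purely to make the proposal concrete: `reuse_count`, `encode_context`, and `loglikelihood_from_state` are all hypothetical names, nothing here exists in the current interface.

```python
from collections import Counter

def annotate_reuse(requests):
    """Attach a reuse count to each (context, continuation) request so
    the LM knows which contexts are worth caching."""
    counts = Counter(ctx for ctx, _ in requests)
    return [(ctx, cont, counts[ctx]) for ctx, cont in requests]

class CachingLM:
    """Hypothetical wrapper: caches an encoded context for exactly the
    number of expected reuses and evicts it once the count runs out."""

    def __init__(self, lm):
        self.lm = lm
        self._cache = {}  # context -> (encoded_state, remaining_uses)

    def loglikelihood(self, context, continuation, reuse_count=1):
        # reuse_count is the evaluator's estimate of how many requests
        # share this context; 1 means "don't bother caching".
        if context in self._cache:
            state, remaining = self._cache[context]
        else:
            # encode_context is an assumed hook on the underlying LM.
            state = self.lm.encode_context(context)
            remaining = reuse_count
        remaining -= 1
        if remaining > 0:
            self._cache[context] = (state, remaining)
        else:
            # Evict once the expected reuses are exhausted. If an LM
            # implementation miscounts, the entry is simply re-encoded
            # later: inefficient, but nothing breaks.
            self._cache.pop(context, None)
        # loglikelihood_from_state is likewise an assumed hook.
        return self.lm.loglikelihood_from_state(state, continuation)
```

The evaluator would call something like `annotate_reuse` over the full request list up front, then dispatch each request with its count; eviction is driven entirely by the count, so a miscounting implementation only pays a re-encoding cost.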