improve eval performance by caching per-repo/version conda environments #104

waterson · 2024-04-26T21:00:45Z

Describe the feature

Right now, running an eval (e.g., with the SWE-agent evaluation/evaluation.py script) creates a temporary conda environment on every run. The conda environments could instead be created once per repo/version and then reused across evaluations.

Potential Solutions

One way to do this (for which I'll attach a PR) is to configure a reasonable path_conda in the eval args; e.g.,

args.path_conda = os.path.join(testbed, "conda", repo.rsplit('__', 1)[-1], version)
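As a rough sketch of the line above, the path derivation can be pulled into a small helper. The function name `cached_conda_path` and the example values are hypothetical; the only assumption carried over from the snippet is that `repo` is a string whose last `__`-separated component names the project.

```python
import os


def cached_conda_path(testbed: str, repo: str, version: str) -> str:
    """Build a stable conda directory for one repo/version pair.

    Hypothetical helper: mirrors the one-liner from the issue and is not
    part of the actual evaluation script.
    """
    # Keep only the part after the last "__", e.g. "django__django" -> "django".
    project = repo.rsplit("__", 1)[-1]
    # Same inputs always give the same path, so later evals can find it again.
    return os.path.join(testbed, "conda", project, version)


if __name__ == "__main__":
    print(cached_conda_path("/tmp/testbed", "django__django", "3.0"))
```

Because the path depends only on the testbed, project name, and version, two evaluations of the same repo/version resolve to the same directory, which is what makes reuse possible.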


This change provides a `path_conda` to use for the eval in the testbed directory that will be reused across evaluations, and modifies the context manager's behavior so that a non-existent `path_conda` will be initialized and populated in the same way that a temporary context would be. Fixes princeton-nlp#104.
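The behavior described here might look roughly like the following. This is a minimal sketch, not the code from the PR: `conda_root` and the `populate` callback are hypothetical names, and `populate` stands in for whatever installation step the real context manager performs.

```python
import contextlib
import os
import tempfile


@contextlib.contextmanager
def conda_root(path_conda=None, populate=None):
    """Yield a conda root directory, reusing path_conda if it already exists.

    Hypothetical sketch: populate(path) stands in for the real install step.
    """
    if path_conda is None:
        # No persistent path given: use a throwaway directory, removed on exit.
        with tempfile.TemporaryDirectory() as tmp:
            if populate is not None:
                populate(tmp)
            yield tmp
        return
    if not os.path.isdir(path_conda):
        # First use: create the directory and populate it like a temporary one.
        os.makedirs(path_conda)
        if populate is not None:
            populate(path_conda)
    # Later uses skip population and leave the directory in place on exit.
    yield path_conda
```

One caveat with this sketch: it treats any existing directory as fully populated, so a run interrupted during population would leave a partial environment that later runs reuse. A marker file written after a successful install would be one way to guard against that.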

thisdotmatt · 2024-06-21T17:46:18Z

I believe Auto Code Rover has an implementation of this in which they group non-redundant conda environments and cache them.

waterson mentioned this issue Apr 26, 2024

Share conda environment across evals #105

Closed

aorwall mentioned this issue Apr 28, 2024

Fixes to not have to reinstall testbeds and conda envs #109

Closed

carlosejimenez mentioned this issue Jun 27, 2024

Containerize SWE-bench evaluation #142

Merged

john-b-yang closed this as completed in #142 Jun 27, 2024
