Share conda environment across evals #105

waterson · 2024-04-26T21:04:24Z

This change provides a path_conda to use for the eval in the testbed directory that will be reused across evaluations, and modifies the context manager's behavior so that a non-existent path_conda will be initialized and populated in the same way that a temporary context would be.

I realize that it might make some sense to have a bit of discussion about optionalizing this behavior, but I wanted to just get something out there to talk about. :)

Fixes #104.

This change provides a `path_conda` to use for the eval in the testbed directory that will be reused across evaluations, and modifies the context manager's behavior so that a non-existent `path_conda` will be initialized and populated in the same way that a temporary context would be. Fixes princeton-nlp#104.

This simplifies the parsing a bit by simply using the `--json` option to provide an easy-to-parse list.

john-b-yang · 2024-06-27T20:22:57Z

Hi @waterson thanks for the suggestion + contribution and your patience!

We have just published a major release swebench==2.0.0 today which should take care of this issue. I totally agree - the motivation behind this fix for the original code makes a lot of sense.

I left a more extensive reply to @aorwall at #109. In a nutshell, with swebench.harness.run_evaluation, it is now possible to cache [base, env, instance] images.

Put simply, if SWE-bench evaluation is run 2+ times, if the instance tier of images are cached, then environments don't need to be rebuilt at all.

Also, the env layer of images represent conda environments that multiple instances use, and this intermediate layer is what makes creating instance tier images a lot more efficient.

Thanks again for the issue - it provided the confirmation we needed to move forward with incorporating this feature, I really appreciate it a lot!

waterson added 3 commits April 26, 2024 14:03

Use --json to more reliably detect environment names

1b47fd1

This simplifies the parsing a bit by simply using the `--json` option to provide an easy-to-parse list.

Fix path to avoid creating the environment twice

5e23c19

john-b-yang closed this Jun 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Share conda environment across evals #105

Share conda environment across evals #105

waterson commented Apr 26, 2024

john-b-yang commented Jun 27, 2024

Share conda environment across evals #105

Share conda environment across evals #105

Conversation

waterson commented Apr 26, 2024

john-b-yang commented Jun 27, 2024