Add unit test for `models.py` #247

mikanfactory · 2024-04-17T15:09:46Z

Reference Issues/PRs

Fixes #185

What does this implement/fix? Explain your changes.

Add unit tests for available models and refactor
Refactored to a test format that can be reused even when adding other models or new models
Fix bugs for anthropic's claude-2.1

Any other comments?

There is a conflict with #236 , so I will resolve after #236 gets merged.

🧡 Thanks for contributing!

… project

codecov · 2024-04-17T15:18:36Z

Codecov Report

Attention: Patch coverage is 83.06452% with 21 lines in your changes missing coverage. Please review.

Project coverage is 78.25%. Comparing base (ad58268) to head (1f62a50).
Report is 437 commits behind head on main.

Files	Patch %	Lines
sweagent/agent/models.py	67.69%	21 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #247      +/-   ##
==========================================
+ Coverage   76.23%   78.25%   +2.02%     
==========================================
  Files          18       18              
  Lines        2845     2907      +62     
==========================================
+ Hits         2169     2275     +106     
+ Misses        676      632      -44

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

klieret · 2024-04-18T14:41:57Z

Thanks a lot, this is awesome! Let me know once this is ready for review (it's still marked as a draft)

sweagent/agent/models.py

mikanfactory · 2024-04-19T11:05:33Z

sweagent/agent/models.py

@@ -429,8 +430,7 @@ class OllamaModel(BaseModel):

 def __init__(self, args: ModelArguments, commands: list[Command]):
 super().__init__(args, commands)
- from ollama import Client
- self.client = Client(host=args.host_url)
+ self.client = OllamaClient(host=args.host_url)


As I mentioned in the comment below, I changed it to a format that can be imported from tests like patch("sweagent.agent.models.OllamaClient").

mikanfactory · 2024-04-19T11:14:55Z

tests/test_models.py

+@pytest.mark.parametrize("model_name", CLAUDE2_MODELS)
+def test_anthropic2_model(model_name, mock_anthropic2_response):
+ with patch("sweagent.agent.models.config.Config"), \
+ patch("sweagent.agent.models.Anthropic") as mock_anthropic:


The patch here also works when specified as follows.

patch("anthropic.resources.messages.Messages.create")

While above format works, when we add a model, we need to look up the method file from the library. I thought this was difficult for both the implementer and the reviewer.
That's why I adopted this approach.

mikanfactory · 2024-04-19T11:19:10Z

tests/test_models.py

+ }
+
+
+def split_claude_model_by_version():


For Claude-2, we were sending requests with completions.create(), while for Claude-3, we were sending requests with messages.create(), so we needed to separate the test cases and divide the models.

mikanfactory · 2024-04-19T11:57:03Z

@klieret
I was planning to mark it as "Ready for review" after a self-review, but It seems that the test I created is broken due to #207.
I will Request review once this is resolved.

mikanfactory · 2024-04-20T14:25:54Z

After merging #207, the AnthropicModel tests broke. To fix this, we have two options: either modify the unit tests or fix the implementation of AnthropicModel. After looking at the implementation of AnthropicModel, I was confident that I could cleanly refactor it, so I decided to fix that this time.

Originally, the implementations of the history_to_messages and query methods in AnthropicModel had branches that executed different code snippets for Claude-2 and Claude-3 (using early returns). This time, an additional branch for Bedrock was added. Therefore:

I created mixins for Claude-2 and Claude-3, and split code snippets.
The branching for Bedrock or not was mostly commonizable, so I made it common.

I read through the comments on #207, and I think this mixin format will not become redundant when adding other models. (e.g. Add Bedrock to cohere, meta.)

I'm still new here, so I'm not sure if this PR follows the development rules here. If the PR is too large or the changes are too big, please feel free to comment!

ofirpress · 2024-06-21T02:54:31Z

@klieret can we merge or close ?

mikanfactory added 4 commits April 17, 2024 22:38

test: add unit test for anthropic, ollama, human model

b016d39

test all supported anthropic models

76cd4d2

refactor: change the patch specification to the model imported in the…

a72d6c0

… project

fix ci

55b0336

klieret added the 🧪 CI Continuous integration/testing label Apr 17, 2024

mikanfactory commented Apr 19, 2024

View reviewed changes

mikanfactory changed the title ~~[WIP] Add unit test for models.py~~ Add unit test for models.py Apr 19, 2024

mikanfactory added 2 commits April 19, 2024 20:36

Merge branch 'main' into fix-185

dadcd33

fix import

695518e

mikanfactory added 4 commits April 20, 2024 00:14

refactor: split Anthropic Claude-2 and 3 API methods

a59d77a

refactor: consolidate the same logic into a Claude2Mixin.

e2acfb4

Merge remote-tracking branch 'upstream/main' into fix-185

ef9d533

refactor: remove static method decorator

82b019f

remove comment

46983f8

mikanfactory marked this pull request as ready for review April 20, 2024 14:27

mikanfactory requested a review from klieret April 20, 2024 14:27

Merge branch 'main' into fix-185

1f62a50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add unit test for `models.py` #247

Add unit test for `models.py` #247

mikanfactory commented Apr 17, 2024 •

edited

Loading

codecov bot commented Apr 17, 2024 •

edited

Loading

klieret commented Apr 18, 2024

mikanfactory Apr 19, 2024

mikanfactory Apr 19, 2024

mikanfactory Apr 19, 2024

mikanfactory commented Apr 19, 2024

mikanfactory commented Apr 20, 2024 •

edited

Loading

ofirpress commented Jun 21, 2024

Add unit test for models.py #247

Are you sure you want to change the base?

Add unit test for models.py #247

Conversation

mikanfactory commented Apr 17, 2024 • edited Loading

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

codecov bot commented Apr 17, 2024 • edited Loading

Codecov Report

klieret commented Apr 18, 2024

mikanfactory Apr 19, 2024

Choose a reason for hiding this comment

mikanfactory Apr 19, 2024

Choose a reason for hiding this comment

mikanfactory Apr 19, 2024

Choose a reason for hiding this comment

mikanfactory commented Apr 19, 2024

mikanfactory commented Apr 20, 2024 • edited Loading

ofirpress commented Jun 21, 2024

Add unit test for `models.py` #247

Add unit test for `models.py` #247

mikanfactory commented Apr 17, 2024 •

edited

Loading

codecov bot commented Apr 17, 2024 •

edited

Loading

mikanfactory commented Apr 20, 2024 •

edited

Loading