Signature examples on site don't work out of the box? #1261

d2kagw · 2024-07-08T08:07:39Z

d2kagw
Jul 8, 2024

I've been working through the examples from the site to learn about DSpy and have noticed that none of the examples produce the output as described in the documentation.

For example:

Notice that it's returning the sentence as well as the sentiment.

All the signature examples on the site are producing the same output on both GPT4o and Claude 3.5 (I've only tested those two).

I wrote a short script that tests some of the prominent LLMs and got these results:

Testing on gpt_4
LLM Test: ✅ Looking good
-------------------------
Testing on claude_35
LLM Test: ❌ LLM did not respond as expected: Expected "Positive", got 

Here's the sentiment analysis for the given sentence:

Sentence: it's a charming and often affecting journey.
Sentiment: Positive

The sentence expresses a favorable opinion about a journey, describing it as "charming" and "affecting," which are both positive descriptors. This indicates an overall positive sentiment towards the subject.

-------------------------
Testing on llama70b
LLM Test: ✅ Looking good
-------------------------
Testing on llama8b
LLM Test: ❌ LLM did not respond as expected: Expected "Positive", got 

The sentiment of the sentence is positive.

-------------------------
Testing on claude_3_haiku
LLM Test: ❌ LLM did not respond as expected: Expected "Positive", got 

Sentence: it's a charming and often affecting journey.
Sentiment: Positive

Here's the script:

# -------------------------
# Setup Environment
# %pip install dspy-ai==2.4.9 boto3

# Check creds have been properly set
import os
def obsfucate(var): 
    return f"{var[:4]}{'*' * (len(var) - 4)}"

def check_env_vars():
    aws_vars = ['OPENAI_API_KEY', 'AWS_ACCESS_KEY_ID', 'AWS_SECRET_ACCESS_KEY', 'AWS_SESSION_TOKEN']
    for var in aws_vars:
        value = os.environ.get(var)
        if value:
            print(f"{var} is set. Value: {obsfucate(value)}")
        else:
            print(f"{var} is not set.")

check_env_vars()

# Check the version of dspy running
from importlib.metadata import version
version("dspy-ai")



# -------------------------
# Define LLMs
import dspy

aws_provider_ue1 = dspy.Bedrock(region_name="us-east-1")
aws_provider_uw2 = dspy.Bedrock(region_name="us-west-2")

llms = {
   "gpt_4": dspy.OpenAI(
      model='gpt-4',
      max_tokens=1000,
      api_key=os.environ.get('OPENAI_API_KEY')
    ),
    "claude_35": dspy.AWSAnthropic(
        aws_provider=aws_provider_ue1,
        model="anthropic.claude-3-5-sonnet-20240620-v1:0",
    ),
    "llama70b":  dspy.AWSMeta(
        aws_provider=aws_provider_uw2,
        model="meta.llama3-70b-instruct-v1:0",
        max_tokens=1000,
    ),
    "llama8b":  dspy.AWSMeta(
        aws_provider=aws_provider_uw2,
        model="meta.llama3-8b-instruct-v1:0",
        max_tokens=1000,
    ),
    "claude_3_haiku": dspy.AWSAnthropic(
        aws_provider=aws_provider_uw2,
        model="anthropic.claude-3-haiku-20240307-v1:0"
    )
}

def basic_test(lm):
    sentence = "it's a charming and often affecting journey."

    classify = dspy.Predict('sentence -> sentiment')
    sent = classify(sentence=sentence).sentiment

    if sent == "Positive":
        print('LLM Test: ✅ Looking good')
    else:
        print(f'LLM Test: ❌ LLM did not respond as expected: Expected "Positive", got \n```\n{sent}\n```')

for llm in llms:
    lm = llms[llm]
    dspy.settings.configure(lm=lm)

    print("-------------------------")
    print("Testing on", llm)
    basic_test(lm)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Signature examples on site don't work out of the box? #1261

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 0 comments

Select a reply

Signature examples on site don't work out of the box? #1261

d2kagw Jul 8, 2024

Replies: 0 comments

d2kagw
Jul 8, 2024