
Use Llama 2 in watsonx conversational search #207

Merged
jwm4 merged 4 commits into watson-developer-cloud:master from cs-llama-1 on Sep 14, 2023

Conversation

@jwm4 (Member) commented on Sep 13, 2023

The Research team ran a large assessment and found good overall accuracy with Llama 2 using the prompt template included in this update.

@jwm4 requested a review from anibapat on September 13, 2023 at 22:59
@anirudhaBapat (Contributor) commented:
I tried this locally and it works! However, I have seen a couple of issues.

  1. There is a strikethrough on verbose.
     (Screenshot 2023-09-13 at 5:35:31 PM)
  2. I asked a question like "When was Pinnacles National Park founded?" and it gave me an answer that was not present at all in the document that came back from Discovery, so this is a hallucination. I'm not sure if we can control that, especially with the prompt given by the Research team.
     (Screenshot 2023-09-13 at 5:41:39 PM)

Before this change:

- `model_id`: The id of the watsonx model that you select for this action. Defaults to `google/flan-ul2`.
- `model_input`: The input to the watsonx model. You MAY change this to do prompt engineering, but a default will be used by the model if you don’t pass a prompt here.

After this change:

- `model_id`: The id of the watsonx model that you select for this action. Defaults to `meta-llama/llama-2-70b-chat`. If you keep this default, be sure to comply with the [Acceptable Use Policy for this model](https://ai.meta.com/llama/use-policy/). (See the sketch after this list for how these two settings fit together.)
- `model_input`: The input to the watsonx model. This is set in an expression in Step 5 of the "Generate Answer" action. You MAY change that expression to do prompt engineering. If you wish to do so and are using the default model, be sure to research [guidelines for prompting Llama 2](https://www.pinecone.io/learn/llama-2/). In our experience, this combination of prompt and model is quite effective at producing high-quality answers when it has useful content and it does _often_ say that it doesn't know when it does not have useful content (as instructed in our prompt). However, it does _sometimes_ provide answers that are not supported by its evidence so consider other models, prompt expressions, or additional logic to reduce the generation of answers that are not
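For context, here is a minimal sketch of how `model_id` and a Llama 2 chat-format `model_input` could be sent to watsonx.ai text generation from Python. The endpoint path, API version, decoding parameters, response shape, and the `build_llama2_prompt` helper are illustrative assumptions, not the exact expression used in Step 5 of the "Generate Answer" action.

```python
import requests

# Hypothetical configuration; the endpoint path and version string are assumptions.
WATSONX_URL = "https://us-south.ml.cloud.ibm.com/ml/v1-beta/generation/text?version=2023-05-29"
IAM_TOKEN = "<IAM access token>"
PROJECT_ID = "<watsonx.ai project id>"


def build_llama2_prompt(documents: str, question: str) -> str:
    """Assemble a Llama 2 chat-style prompt that asks the model to answer
    only from the supplied documents and to say it doesn't know otherwise
    (roughly the behavior described in this PR)."""
    system = (
        "Answer the question using only the documents provided. "
        "If the documents do not contain the answer, say that you don't know."
    )
    return (
        f"[INST] <<SYS>>\n{system}\n<</SYS>>\n\n"
        f"{documents}\n\nQuestion: {question} [/INST]"
    )


def generate_answer(documents: str, question: str) -> str:
    payload = {
        "model_id": "meta-llama/llama-2-70b-chat",           # the new default model_id
        "input": build_llama2_prompt(documents, question),   # plays the role of model_input
        "parameters": {"decoding_method": "greedy", "max_new_tokens": 300},
        "project_id": PROJECT_ID,
    }
    resp = requests.post(
        WATSONX_URL,
        headers={
            "Authorization": f"Bearer {IAM_TOKEN}",
            "Content-Type": "application/json",
        },
        json=payload,
        timeout=60,
    )
    resp.raise_for_status()
    # Response shape assumed from the watsonx.ai generation API.
    return resp.json()["results"][0]["generated_text"]
```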
Contributor

> However, it does sometimes provide answers that are not supported by its evidence so consider other models, prompt expressions, or additional logic to reduce the generation of answers that are not

That are not...? Did you forget to add something?

Member Author

Fixed!

@jwm4 jwm4 merged commit 7b0b480 into watson-developer-cloud:master Sep 14, 2023
1 check passed
@jwm4 jwm4 deleted the cs-llama-1 branch September 14, 2023 13:59