Port evaluation from 1.x and extend to LLMs #6672

Closed
2 tasks done
julian-risch opened this issue Jan 2, 2024 · 0 comments
Assignees
Labels
2.x Related to Haystack v2.0 epic


julian-risch commented Jan 2, 2024

We need to extend the evaluation features, in particular for RAG pipelines, so that users can answer questions like:

  • Is this pipeline good enough?
  • What should I focus on for optimization?
  • Is pipeline A better than B? (performance, costs, latency)

This includes the following components typically appearing in RAG pipelines:
Retrievers, Rankers, DocumentJoiners
a) labels available => statistical metrics
b) no labels available => model-based heuristics / pseudo-label generator
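
For case a), a minimal sketch of two common statistical retrieval metrics, recall@k and mean reciprocal rank, could look like the following. The function names and the plain document-ID representation are assumptions for illustration only, not Haystack API.

```python
from typing import Iterable, Sequence, Set, Tuple


def recall_at_k(retrieved_ids: Sequence[str], relevant_ids: Set[str], k: int) -> float:
    """Fraction of the labelled relevant documents found in the top-k results."""
    if not relevant_ids:
        return 0.0
    top_k = set(retrieved_ids[:k])
    return len(top_k & relevant_ids) / len(relevant_ids)


def reciprocal_rank(retrieved_ids: Sequence[str], relevant_ids: Set[str]) -> float:
    """1 / rank of the first relevant document, or 0.0 if none was retrieved."""
    for rank, doc_id in enumerate(retrieved_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs: Iterable[Tuple[Sequence[str], Set[str]]]) -> float:
    """Average reciprocal rank over (retrieved_ids, relevant_ids) pairs."""
    scores = [reciprocal_rank(retrieved, relevant) for retrieved, relevant in runs]
    return sum(scores) / len(scores) if scores else 0.0
```

The same functions apply to the output of a Ranker or DocumentJoiner, since each of them also produces an ordered list of documents that can be compared against the labels.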

Generators
a) labels available => model-based metrics (SAS, answer correctness, ...)
b) no labels available => model-based metrics (groundedness score)
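
As a rough illustration of case a) for Generators, a similarity-style score can be sketched as the cosine similarity between an embedding of the predicted answer and an embedding of the labelled answer. Everything named here is an assumption: `answer_similarity` and the `embed` parameter are hypothetical, and the bag-of-words `toy_embed` is only a stand-in so the sketch runs without a model. A real setup would plug in a sentence-embedding model instead.

```python
import math
from collections import Counter
from typing import Callable, Mapping


def toy_embed(text: str) -> Mapping[str, float]:
    """Hypothetical stand-in embedder: lowercase token counts as a sparse vector."""
    return Counter(text.lower().split())


def cosine(a: Mapping[str, float], b: Mapping[str, float]) -> float:
    """Cosine similarity of two sparse vectors; 0.0 if either vector is empty."""
    dot = sum(value * b.get(token, 0.0) for token, value in a.items())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def answer_similarity(
    predicted: str,
    reference: str,
    embed: Callable[[str], Mapping[str, float]] = toy_embed,
) -> float:
    """Score a generated answer against a labelled answer via embedding similarity."""
    return cosine(embed(predicted), embed(reference))
```

Case b) has no reference answer to compare against, so a groundedness score would instead have to judge the answer against the retrieved documents, typically with a model acting as the judge.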

Tasks

  1. 19 of 19
    2.x epic topic:eval type:feature
    silvanocerza
  2. 13 of 14
    2.x P1 epic
    julian-risch
julian-risch added the epic and 2.x (Related to Haystack v2.0) labels on Jan 2, 2024
julian-risch self-assigned this on Jan 3, 2024
masci changed the title from "Extend evaluation to LLMs" to "Port evaluation from 1.x and extend to LLMs" on Mar 12, 2024
masci closed this as completed on May 25, 2024

4 participants