Implement Sentence-Window Retrieval for benchmark evaluation #7843
Comments
Might be related: #7389
@davidsbatista Could you please clarify in the issue description and title whether this issue includes adding sentence retrieval to our benchmarks? Or is this issue limited to adding a new feature/component to Haystack? I would expect the latter, to keep the issue small.
Initially, the idea was to benchmark it against other architectures on some dataset, and I did an initial naive version:
But then talking with Mathis and Stefano pointing out the issue, I've been working on extending the So the idea for this issue for now is just to have the
It was added the to architectures in the An example of an evaluation using the sentence-window is here:
The sentence-window approach splits documents into smaller chunks (sentences) and indexes each one separately.
During retrieval, we fetch the sentences most relevant to the query via similarity search and replace each retrieved
sentence with its full surrounding context, using a static window of sentences around it.
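
For illustration only, here is a minimal, library-free sketch of the idea described above. It is not the Haystack implementation: the function names (`split_sentences`, `sentence_window_retrieve`), the regex-based sentence splitter, and the bag-of-words cosine similarity are all assumptions standing in for a real splitter and an embedding-based retriever.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter (assumption): break after ., ! or ? followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def bag_of_words(text):
    # Lowercased token counts; a stand-in for a real embedding model.
    return Counter(re.findall(r"\w+", text.lower()))


def cosine(a, b):
    # Cosine similarity between two token-count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def sentence_window_retrieve(document, query, window_size=1, top_k=1):
    # Index each sentence separately, rank sentences by similarity to the query,
    # then return each hit together with `window_size` sentences on either side.
    sentences = split_sentences(document)
    query_vec = bag_of_words(query)
    ranked = sorted(
        range(len(sentences)),
        key=lambda i: cosine(bag_of_words(sentences[i]), query_vec),
        reverse=True,
    )
    results = []
    for i in ranked[:top_k]:
        start = max(0, i - window_size)
        end = min(len(sentences), i + window_size + 1)
        results.append(" ".join(sentences[start:end]))
    return results
```

With `window_size=0` this reduces to plain sentence retrieval; larger values trade precision of the returned context for more surrounding text.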