WhisperTranscriber to add filename to document metadata #5716
Labels
2.x
Related to Haystack v2.0
P3
Low priority, leave it in the backlog
topic:LLM
topic:metadata
type:feature
New feature or request
Comments
Additional learning with @anakin87: The indexing pipeline:
As Tuana said, see, for example, haystack/haystack/nodes/audio/whisper_transcriber.py, lines 176 to 186 in a5b8156
It would be great if we provided the option to add the filename to the metadata of the documents that the WhisperTranscriber creates. Currently there's no good way of doing this. This would really help when building RAG pipelines where you want to query videos and also reference the source video in the response.
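To make the request concrete, here is a minimal sketch of the behaviour being asked for. It does not use the Haystack API: the `Document` dataclass, the `transcribe_with_filename` helper, the `fake_transcribe` placeholder, and the `file_name` / `file_path` metadata keys are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field
from pathlib import Path
from typing import Any, Callable, Dict, Iterable, List


@dataclass
class Document:
    # Hypothetical stand-in for a document object, not Haystack's own class.
    content: str
    meta: Dict[str, Any] = field(default_factory=dict)


def transcribe_with_filename(
    transcribe: Callable[[Path], str],
    audio_files: Iterable[str],
) -> List[Document]:
    """Transcribe each file and record its source file in the document metadata."""
    documents = []
    for audio_file in audio_files:
        path = Path(audio_file)
        text = transcribe(path)
        # Assumed metadata keys; a real implementation might name them differently.
        meta = {"file_name": path.name, "file_path": str(path)}
        documents.append(Document(content=text, meta=meta))
    return documents


def fake_transcribe(path: Path) -> str:
    # Placeholder for a real Whisper call, so the sketch runs without a model.
    return f"transcript of {path.stem}"


docs = transcribe_with_filename(fake_transcribe, ["videos/intro.mp4", "videos/demo.mp4"])
for doc in docs:
    print(doc.meta["file_name"], "->", doc.content)
```

With the filename carried in `meta`, a RAG pipeline could surface it next to the retrieved passage, which is the use case described above.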