Add splitting/chunking utilities in order to do in-memory RAG. #2092
homanp started this conversation in Ideas & Feedback
Replies: 1 comment, 3 replies
-
Splitting/chunking can quickly become a very large topic: tokenizers, different splitters for different content types, and so on. Which splitters are most important to you?
-
Since cosine search is implemented in the package, wouldn't it make sense to also add utilities for splitting/chunking text? This would allow users to do "in-memory RAG" without having to store vectors in a database.
We use this approach for our web research agents and would be up for contributing.
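To make the idea concrete, here is a minimal sketch of the in-memory flow: split text into overlapping chunks, then rank the embedded chunks against a query embedding by cosine similarity. This is an illustration only and not the package's API. The names `chunkText`, `cosineSimilarity`, `topK`, and `EmbeddedChunk` are hypothetical, the splitter is a naive fixed-size character splitter (one of many possible strategies), and the embeddings are assumed to come from whatever embedding model the caller already uses.

```typescript
// Hypothetical helper: fixed-size character chunks with overlap.
// A real splitter might work on tokens, sentences, or markup instead.
function chunkText(text: string, chunkSize: number, overlap: number): string[] {
  if (chunkSize <= 0 || overlap < 0 || overlap >= chunkSize) {
    throw new Error("expected chunkSize > 0 and 0 <= overlap < chunkSize");
  }
  const chunks: string[] = [];
  const step = chunkSize - overlap;
  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    // Stop once a chunk has reached the end of the text.
    if (start + chunkSize >= text.length) break;
  }
  return chunks;
}

// Local cosine similarity so the sketch is self-contained.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vectors must have equal length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  a.forEach((x, i) => {
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  });
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  // Treat a zero vector as having no similarity to anything.
  return denom === 0 ? 0 : dot / denom;
}

// A chunk paired with its embedding, held in a plain array (no database).
interface EmbeddedChunk {
  text: string;
  embedding: number[];
}

// Return the k chunks most similar to the query embedding.
function topK(query: number[], store: EmbeddedChunk[], k: number): EmbeddedChunk[] {
  return store
    .map((item) => ({ item, score: cosineSimilarity(query, item.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((x) => x.item);
}
```

In use, each string returned by `chunkText` would be embedded once, stored in an array of `EmbeddedChunk`, and `topK` would select the context to pass to the model for each query. A linear scan like this is only reasonable for small collections, which is the case the in-memory approach targets.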