Discussion on EmbeddingBasedDocumentSplitter component
#356
Replies: 3 comments 4 replies
-
Hi @davidsbatista, I just tested it, and I would like to share some feedback:
-
Hi @d-kleine! Thanks for your feedback! Appreciated! :)
That's how it should work, although when we move it to the main Haystack package I should add the LazyImport warning so that the message is clean. I noticed that's not being used in
It provides another way to split a large document into smaller, semantically meaningful documents. Instead of splitting by sections, paragraphs, or sentences, it tries to place each split where the semantics of the text diverge, which it detects by comparing embeddings.
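To make the idea concrete, here is a minimal, self-contained sketch of splitting where consecutive sentences diverge. It is not the component's actual implementation. `toy_embed`, `cosine_distance`, `split_by_embedding`, and the `threshold` value are all hypothetical names chosen for illustration, and the bag-of-words vector is only a stand-in for a real embedding model.

```python
import math
import re
from collections import Counter


def toy_embed(sentence):
    """Hypothetical stand-in for a real embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def split_by_embedding(sentences, threshold=0.9):
    """Start a new chunk whenever two consecutive sentences are farther apart
    than `threshold` (an assumed, hand-picked value for this toy example)."""
    if not sentences:
        return []
    vectors = [toy_embed(s) for s in sentences]
    chunks = [[sentences[0]]]
    for prev, cur, sentence in zip(vectors, vectors[1:], sentences[1:]):
        if cosine_distance(prev, cur) > threshold:
            chunks.append([sentence])  # semantics diverge: open a new chunk
        else:
            chunks[-1].append(sentence)  # still similar: extend the current chunk
    return [" ".join(chunk) for chunk in chunks]


sentences = [
    "The cat sat on the mat.",
    "The cat chased the mouse.",
    "Stock markets fell sharply today.",
    "Investors sold stock in a panic.",
]
chunks = split_by_embedding(sentences, threshold=0.9)
for chunk in chunks:
    print(chunk)
```

With a real embedding model the distances would reflect meaning rather than shared words, and the cut-off would likely be derived from the distribution of distances instead of being fixed by hand.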
-
Hi again @d-kleine, and thank you for all the good feedback. I've opened a PR to improve the docstrings and the code example. Thanks for pointing that out; I hope this makes it easier for users seeing this component for the first time. Regarding the last comment:
Can you elaborate a bit on your suggestion, maybe with an example?
-
This is the discussion board for the EmbeddingBasedDocumentSplitter component.
We introduced the QueryExpander component, which generates a list of semantically similar queries to the user query to improve retrieval recall in RAG.
Usage example
Try it out and let us know what you think in the comments 👇