I'm excited to implement Neuron in one of my projects. However, I'm curious how the stream differs between chat and RAG, and why a separate function is needed. In a current (non-Neuron) project, I simply have a tool which creates an embedding from the user prompt, searches the vector store, and then returns the results to the LLM. In that case the same LLM is used, and retrieval only happens when needed. Clarity on this would be appreciated. Thanks
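For context, here's a rough sketch of the tool-based flow I'm describing. None of these names (`embed`, `vector_store.search`, `llm.chat`) are Neuron APIs; they're hypothetical stand-ins just to show where the extra LLM round trip happens:

```python
# Hypothetical sketch of retrieval-as-a-tool (not Neuron's API).

def search_docs(query: str, vector_store, embed) -> str:
    """Tool the LLM can call: embed the query, search the store, return snippets."""
    query_vector = embed(query)                      # turn the user prompt into an embedding
    hits = vector_store.search(query_vector, k=3)    # nearest-neighbour lookup
    return "\n".join(hit.text for hit in hits)       # plain text the LLM can read

def answer(user_prompt: str, llm, vector_store, embed) -> str:
    # First call: the LLM decides whether retrieval is needed at all.
    decision = llm.chat(user_prompt, tools=[search_docs])
    if decision.tool_call:                           # retrieval only happens on demand
        context = search_docs(decision.tool_call.arguments["query"], vector_store, embed)
        # Second call: the LLM answers with the retrieved context included.
        return llm.chat(user_prompt, context=context).text
    return decision.text
```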
-
Having played around with it now: the RAG stream is used when you want to search the vector DB and include the results with every interaction with the LLM. For example, if you have an agent which simply returns results from your company wiki, there is no point setting it up as a tool, where an extra LLM call is needed. Otherwise, setting RAG up as a tool works fine.
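To make the contrast concrete, here is the "retrieve on every interaction" version, using the same hypothetical `embed` / `vector_store` / `llm` stand-ins as the sketch in the question (again, not Neuron's actual API). Retrieval is unconditional and there is only one LLM call per turn:

```python
# Hypothetical sketch of always-on RAG (not Neuron's API).

def rag_answer(user_prompt: str, llm, vector_store, embed) -> str:
    query_vector = embed(user_prompt)                 # retrieval happens on every turn
    hits = vector_store.search(query_vector, k=3)
    context = "\n".join(hit.text for hit in hits)
    # Single call: the retrieved context is injected into every interaction.
    return llm.chat(user_prompt, context=context).text
```

So the trade-off is: the tool approach lets the model skip retrieval when it isn't needed, at the cost of an extra LLM call when it is; the RAG stream skips that decision entirely because every answer should be grounded in the vector DB anyway.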