You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* feat: added smaller qwen models for debugging
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* feat(vllm): copied from huggingface
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* fix(vllm): remove alora and cache
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* fix(vllm): remove tool calls
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* fix(vllm): finished the implementation with limited functionality: free-form and constrained generation
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* fix(vllm): passing mypy and linter
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* fix(vllm): added vllm optional dep in pyproject.toml
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* feat(vllm test): copied from huggingface
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* fix(vllm test): implemented the test
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* test: require V0 in vllm test
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
* refactor: ctx to chat conversion function
* refactor: use_alora function
* refactor: moved _extract_model_tool_requests to mellea.backends.utils
* feat(vllm): added tool calls
* test(tools): run test with mistral
* fix(vllm): rename model_options -> engine_args
* fix(vllm): use FancyLogger
* fix(vllm): ignore type checking for vllm and msgspec
* fix(vllm): fixed the backend name in the log
* feat(vllm): asynchronous call support
* test(vllm): asynchronous call support
* fix(vllm): avoid unnecessary incremental processing in non-streaming mode
* fix(vllm): fix for the new return format
* fix(vllm): fixed vllm test for the new contexts
* fix(vllm): addressed minor comments
* fix(vllm): uv lock
* fix(vllm): mark V0 api test qualitative; will be removed in a future PR that migrates to V1
---------
Signed-off-by: Masataro Asai <guicho2.71828@gmail.com>
Co-authored-by: MASATARO ASAI Masataro.Asai@ibm.com <masataro@login1.bluevela.rmf.ibm.com>
Co-authored-by: Nathan Fulton <nathan@ibm.com>
0 commit comments