A production-grade agentic context stack: data streaming + knowledge graphs + vector search + LLM orchestration, all in a single deployment for self-hosting, BYOC, or cloud.
Companion resources for 《大模型项目实战:多领域智能应用开发》 (Hands-On Large Model Projects: Multi-Domain Intelligent Application Development).
A curated collection of production-ready, open-source Large Language Model (LLM) projects for solving real-world problems. This repository focuses on high-performance, scalable LLM solutions across industries and applications.
🧠 A comprehensive toolkit for benchmarking, optimizing, and deploying local Large Language Models. Includes performance testing tools, optimized configurations for CPU/GPU/hybrid setups, and detailed guides to maximize LLM performance on your hardware.
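As a rough illustration of the kind of throughput test such a benchmarking toolkit runs, the sketch below times token generation with the Hugging Face transformers library. The model name, prompt, and token counts are placeholders, not the toolkit's own configuration.

```python
# Minimal throughput benchmark sketch (illustrative, not the toolkit's actual code).
# Assumes the Hugging Face `transformers` library and a small stand-in model.
import time
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "distilgpt2"  # placeholder; swap in the local model under test
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME).to(device)

prompt = "Explain the benefits of local LLM deployment."
inputs = tokenizer(prompt, return_tensors="pt").to(device)

# Warm-up run so first-call overhead does not skew the measurement.
model.generate(**inputs, max_new_tokens=16, do_sample=False,
               pad_token_id=tokenizer.eos_token_id)

start = time.perf_counter()
output = model.generate(**inputs, max_new_tokens=128, do_sample=False,
                        pad_token_id=tokenizer.eos_token_id)
elapsed = time.perf_counter() - start

new_tokens = output.shape[-1] - inputs["input_ids"].shape[-1]
print(f"{new_tokens} tokens in {elapsed:.2f}s -> {new_tokens / elapsed:.1f} tokens/s on {device}")
```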
An API for efficiently deploying Large Language Model (LLM) applications, built with Flask.
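A minimal sketch of what a Flask endpoint wrapping an LLM can look like, assuming the transformers text-generation pipeline as a stand-in for the deployed model; the route name, port, and payload shape are illustrative assumptions, not this repository's actual API.

```python
# Illustrative Flask serving sketch; endpoint, port, and model are assumptions.
from flask import Flask, jsonify, request
from transformers import pipeline

app = Flask(__name__)
# A small text-generation pipeline stands in for the deployed LLM.
generator = pipeline("text-generation", model="distilgpt2")

@app.route("/generate", methods=["POST"])
def generate():
    payload = request.get_json(force=True)
    prompt = payload.get("prompt", "")
    if not prompt:
        return jsonify({"error": "missing 'prompt'"}), 400
    result = generator(prompt, max_new_tokens=64, do_sample=True)
    return jsonify({"completion": result[0]["generated_text"]})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8000)
```

A client would then POST JSON such as `{"prompt": "..."}` to `/generate` and read the `completion` field from the response.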
AWS EKS + IRSA, volumes, Istio & KServe + Next.js app + FastAPI serving + Kubernetes + Helm charts for multi-model LLM deployment. The School of AI EMLO-V4 course assignment: https://theschoolof.ai/#programs
An LLM app for summarizing Terms and Conditions agreements found on the internet.
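As a rough sketch of the summarization step such an app performs, the snippet below runs a generic transformers summarization pipeline over agreement text; the model choice and truncation are assumptions, and fetching and cleaning the HTML is omitted.

```python
# Illustrative summarization sketch, not the app's actual pipeline.
from transformers import pipeline

summarizer = pipeline("summarization", model="sshleifer/distilbart-cnn-12-6")

def summarize_terms(agreement_text: str) -> str:
    """Return a short summary of a Terms and Conditions document."""
    # Real agreements exceed the model's context window; a production app would
    # chunk the document and summarize section by section. Truncate for this sketch.
    chunk = agreement_text[:3000]
    result = summarizer(chunk, max_length=120, min_length=40, do_sample=False)
    return result[0]["summary_text"]
```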
Python code generation from LaTeX expressions, using a synthetic dataset and the CodeT5-base model.
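To show the call pattern for this kind of seq2seq setup, here is a hedged sketch using the public Salesforce/codet5-base checkpoint via transformers; without the repository's fine-tuned weights the output is not meaningful, so this only illustrates the interface.

```python
# Sketch of the inference shape for a LaTeX-to-Python seq2seq model.
# Uses the public Salesforce/codet5-base checkpoint; the repository fine-tunes it
# on a synthetic dataset, which this sketch does not reproduce.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL_NAME = "Salesforce/codet5-base"
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_NAME)

latex = r"\frac{a + b}{2}"  # example input expression
inputs = tokenizer(latex, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```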