Cloud APIM - Otoroshi LLM Extension

Connect, setup, secure and seamlessly manage LLM models using an Universal/OpenAI compatible API

Unified interface: Simplify interactions and minimize integration hassles
Use multiple providers: 10+ LLM providers supported right now, a lot more coming
Load balancing: Ensure optimal performance by distributing workloads across multiple providers
Fallbacks: Automatically switch LLMs during failures to deliver uninterrupted & accurate performance
Automatic retries: LLM APIs often have inexplicable failures. You can rescue a substantial number of your requests with our in-built automatic retries feature.
Semantic cache: Speed up repeated queries, enhance response times, and reduce costs
Custom quotas: Manage LLM tokens quotas per consumer and optimise costs
Key vault: securely store your LLM API keys in Otoroshi vault or any other secret vault supported by Otoroshi.
Observability and reporting: every LLM request is audited with details about the consumer, the LLM provider and usage. All those audit events are exportable using multiple methods for further reporting
Fine grained authorizations: Use Otoroshi advanced fine grained authorizations capabilities to constrains model usage based on whatever you want: user identity, apikey, consumer metadata, request details, etc
Guardrails: Validate your prompts and prompts responses to avoid sensitive or personal informations leakage, irrelevant or unhelpful responses, gibberish content, etc
Prompt engineering: enhance your experience by providing contextual information to your prompts, storing them in a library for reusability, and using prompt templates for increased efficiency
Multi-modal: Audio (TTS, STT, translate), Image and Video model supported
Embeddings support: compute embeddings from various providers and models through a unified API
Vector Stores support: search through vector store to enrich LLM calls
Persistent memories: automatically stores conversation messages and re-inject them on the next calls
Agentic workflows: orchestrate LLM interactions using Otoroshi workflows

Otoroshi LLM Extension is set of Otoroshi plugins and resources to interact with LLMs. To know more about it, go to documentation

Supported LLM providers

All supported providers are available here

Anthropic
Azure OpenAI
Azure AI Foundry
Cloudflare
Cohere
Gemini
Groq
Huggingface 🇫🇷 🇪🇺
Mistral 🇫🇷 🇪🇺
Ollama (Local Models)
OpenAI
OVH AI Endpoints 🇫🇷 🇪🇺
Scaleway 🇫🇷 🇪🇺
X.ai
Deepseek

Supported Moderation models

OpenAI
- omni-moderation-latest

Supported Audio Text-to-Speech models

OpenAI
- gpt-4o-mini-tts
- tts-1
- tts-1-hd
Groq
- playai-tts
- playai-tts-arabic
ElevenLabs

Supported Audio Speech-to-text models

OpenAI
- whisper-1
Groq
- whisper-large-v3

Supported LLM Embeddings models

OpenAI
- text-embedding-3-small
- text-embedding-3-large
- text-embedding-ada-002
Anthropic
- voyage-3-large
- voyage-3
- voyage-3-lite
- voyage-code-3
- voyage-finance-2
- voyage-law-2
Mistral 🇫🇷 🇪🇺
- mistral-embed

Supported Image generation models

OpenAI
- dall-e-2
- dall-e-3
- gpt-image-1
Azure OpenAI
Grok
- grok-2-image
Luma
- photon-1 (default)
- photon-flash-1
Hive
- black-forest-labs/flux-schnell

Supported Video generation models

Luma
- ray-flash-2

Requirements

Run it on JDK17+

Name		Name	Last commit message	Last commit date
Latest commit History 462 Commits
.github		.github
docs		docs
documentation		documentation
js-runtime		js-runtime
project		project
src		src
testserver		testserver
.env		.env
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
SECURITY.md		SECURITY.md
build.sbt		build.sbt
doc.sh		doc.sh
langchain.xml		langchain.xml
logo-no-bg-no-text.png		logo-no-bg-no-text.png
logo-no-bg.png		logo-no-bg.png
logo-no-text.png		logo-no-text.png
logo-old.png		logo-old.png
logo.png		logo.png
pyproject.toml		pyproject.toml
python-deps.sh		python-deps.sh
rebuild.sh		rebuild.sh
uv.lock		uv.lock
version.sbt		version.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cloud APIM - Otoroshi LLM Extension

Supported LLM providers

Supported Moderation models

Supported Audio Text-to-Speech models

Supported Audio Speech-to-text models

Supported LLM Embeddings models

Supported Image generation models

Supported Video generation models

Requirements

About

Uh oh!

Releases 51

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

License

cloud-apim/otoroshi-llm-extension

Folders and files

Latest commit

History

Repository files navigation

Cloud APIM - Otoroshi LLM Extension

Supported LLM providers

Supported Moderation models

Supported Audio Text-to-Speech models

Supported Audio Speech-to-text models

Supported LLM Embeddings models

Supported Image generation models

Supported Video generation models

Requirements

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 51

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages