Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI services.
Envoy AI Gateway follows a two-tier gateway pattern: the Tier One Gateway is the centralized entry point for all client traffic, while the Tier Two Gateway handles ingress traffic to a self-hosted model serving cluster.
- The Tier One Gateway handles authentication, top-level routing, and global rate limiting.
- The Tier Two Gateway provides fine-grained control over self-hosted model access, with endpoint picker support for LLM inference optimization.
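From a client's point of view, this pattern means a single entry point that routes each request by the model it names. Below is a minimal sketch, assuming the Tier One Gateway is reachable at `http://localhost:8080` and that routes for the model names shown have been configured; the hostname, API key, and model names are placeholders, not project defaults:

```python
# Sketch: all client traffic enters through the Tier One Gateway, which
# routes each request based on the model named in the body. The gateway
# address, API key, and model names below are assumptions for illustration.
import requests

GATEWAY = "http://localhost:8080"  # Tier One Gateway entry point (assumed)

def chat(model: str, prompt: str) -> str:
    """Send an OpenAI-style chat completion request through the gateway."""
    resp = requests.post(
        f"{GATEWAY}/v1/chat/completions",
        headers={"Authorization": "Bearer <client-api-key>"},  # placeholder
        json={"model": model, "messages": [{"role": "user", "content": prompt}]},
        timeout=60,
    )
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]

# Routed by the gateway to a hosted provider (per its configured routes):
print(chat("gpt-4o", "Hello!"))
# Routed to the Tier Two Gateway fronting a self-hosted serving cluster:
print(chat("my-self-hosted-llama", "Hello!"))
```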
Envoy AI Gateway supports a wide range of AI providers, making it easy to integrate with your preferred LLM services:
OpenAI | Azure OpenAI | Google Gemini | Vertex AI | AWS Bedrock | Mistral | Cohere | Groq | Together AI | DeepInfra | DeepSeek | Hunyuan | SambaNova | Grok
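Because the gateway exposes a unified, OpenAI-compatible API in front of these providers, existing OpenAI SDK clients can typically be pointed at it by changing only the base URL. A hedged sketch follows; the gateway address and model name are assumptions, and provider credentials live in the gateway's configuration rather than in client code:

```python
# Sketch: using the official OpenAI Python SDK against the gateway's
# OpenAI-compatible endpoint. The base_url and model are placeholders;
# actual values depend on how your gateway and routes are deployed.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8080/v1",  # Envoy AI Gateway endpoint (assumed)
    api_key="dummy",  # upstream provider credentials are held by the gateway
)

completion = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder; any model your routes expose
    messages=[{"role": "user", "content": "Say hello."}],
)
print(completion.choices[0].message.content)
```

The same client code should work unchanged when the route behind a model name is switched from one provider to another, since the translation to each provider's schema happens in the gateway.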
- Blog introducing Envoy AI Gateway.
- Documentation for Envoy AI Gateway.
- Quickstart to use Envoy AI Gateway in a few simple steps.
- Concepts to understand the architecture and resources of Envoy AI Gateway.
- Slack: Join the Envoy Slack workspace if you're not already a member. Otherwise, use the Envoy AI Gateway channel to start collaborating with the community.
We adhere to the CNCF Code of Conduct.
The Envoy AI Gateway team and community members meet every Thursday. Please register for the meeting, add agenda points, and get involved. The meeting details are available in the public document.
To contribute to the project via pull requests, please read the CONTRIBUTING.md file which includes information on how to build and test the project.
The proposal to use Envoy Gateway as a Cloud Native LLM Gateway inspired the creation of this project.