[docs] Create page on inference servers with transformers backend #39550
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
docs/source/en/_toctree.yml
Outdated
- local: serving
  title: Serving
- local: transformers_as_backend
  title: Transformers as a Unified Modeling Backend
Not sure if we can put docs here, I see Agents and Tools are deprecated. Though I can't say it belongs in LLMs, it is more about generative models overall.
@stevhliu or should we rename the LLMs section to something like Text Generation models or similar?
I think it's ok to leave the LLM section as is and have serving.md and transformers_as_backend.md hang out in the overall Inference section.
Great work, thanks!
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Thanks 🤗
What does this PR do?
As per the title, I added basic info about the existing inference engines, so feel free to add more examples/tips etc. This PR creates a space where we can host docs on all third-party servers, and we can submit PRs in vLLM/SGLang/TGI pointing to this page.
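For context, a minimal sketch of the kind of usage the new page documents, assuming vLLM's `model_impl="transformers"` option for falling back to the Transformers modeling code (the model ID and sampling settings are illustrative, not part of this PR):

```python
# Minimal sketch: serve a Hugging Face checkpoint through vLLM while forcing the
# Transformers modeling backend. Assumes vLLM exposes a `model_impl` argument that
# selects between its native implementation and Transformers; check the vLLM docs
# for the exact option name in your installed version.
from vllm import LLM, SamplingParams

llm = LLM(
    model="Qwen/Qwen2.5-0.5B-Instruct",  # any Transformers-compatible checkpoint (illustrative)
    model_impl="transformers",           # use Transformers as the modeling backend
)

params = SamplingParams(max_tokens=64, temperature=0.8)
outputs = llm.generate(["What is the Transformers library?"], params)
print(outputs[0].outputs[0].text)
```

The same idea applies to SGLang and TGI: the engine handles batching/serving while Transformers provides the model definition, which is what the new docs page is meant to centralize.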