[CLI] Add Inference Endpoints Commands #3428

hanouticelina · 2025-10-09T15:05:16Z

This PR implements a CLI to manage Inference Endpoints, this provides "one liners" to deploy/delete/update/etc. endpoints, which could be handy in many cases. The DX intentionally mirrors a bit the UI instead of the API, to quote @ErikKaum :

we're renaming things in the UI quite fast to adapt and make things make more sense. And in many cases in the UI things are configured with slightly different names/groupings that in the API. Just because it's faster than in the API.

I explored a few layouts (e.g. a single deploy command with --catalog), but the cleanest UX ended up being two explicit paths:

hf inference-endpoints deploy hub ... – minimal set of hardware/task configs for Hub models.
hf inference-endpoints deploy catalog ... – one liner using optimized configs from the model catalog.
delete and update endpoints currently live under the "Settings" group in the UI, but feels more natural to keep them top-level in the CLI 🤷‍♀️

> hf inference-endpoints --help
Usage: hf inference-endpoints [OPTIONS] COMMAND [ARGS]...

  Manage Hugging Face Inference Endpoints.

Options:
  --help  Show this message and exit.

Commands:
  delete         Delete an Inference Endpoint permanently.
  deploy         Deploy Inference Endpoints from the Hub or the Catalog.
  describe       Get information about an Inference Endpoint.
  list           Lists all inference endpoints for the given namespace.
  list-catalog   List available Catalog models.
  pause          Pause an Inference Endpoint.
  resume         Resume an Inference Endpoint.
  scale-to-zero  Scale an Inference Endpoint to zero.
  update         Update an existing endpoint.

happy to more iterate if there are more suggestions to make the DX better (and simpler?)

… into inference-endpoints-cli

hanouticelina · 2025-10-09T15:07:57Z

src/huggingface_hub/cli/jobs.py

 from ._cli_utils import TokenOpt, get_hf_api, typer_factory


-logger = logging.get_logger(__name__)


not related to this PR, but the logger isn't used here (same in the other command files)

hanouticelina · 2025-10-09T15:12:30Z

src/huggingface_hub/cli/inference_endpoints.py

+
+
+@deploy_app.command(name="hub", help="Deploy an Inference Endpoint from a Hub repository.")
+def deploy_from_hub(


splitted the deploy into two subcommands instead of using a flag to deploy from the Model Catalog because Typer doesn't easily allow conditional requirements (i.e., "these parameters are required unless that flag (e.g. --from-catalog) is set"), which makes validation and type hints messy so using subcommands lets Typer enforce required options cleanly for each case

HuggingFaceDocBuilderDev · 2025-10-09T15:43:42Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

hanouticelina added 8 commits October 9, 2025 16:17

add inference endpoints cli

4faf7e5

fix naming

30c13d6

update docs

e670188

Merge branch 'v1.0-release' of github.com:huggingface/huggingface_hub…

f387a11

… into inference-endpoints-cli

wording

b49a70a

remove logging

7b7b122

don't instantiate logger when not needed

0862c4a

refactor

d81a59c

hanouticelina requested review from Wauplin and ErikKaum October 9, 2025 15:05

hanouticelina commented Oct 9, 2025

View reviewed changes

hanouticelina added 2 commits October 9, 2025 17:13

remove unused import

6a50b0b

nit

c5b0638

nit

5b4111d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CLI] Add Inference Endpoints Commands #3428

[CLI] Add Inference Endpoints Commands #3428

Uh oh!

hanouticelina commented Oct 9, 2025

Uh oh!

hanouticelina Oct 9, 2025

Uh oh!

hanouticelina Oct 9, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Oct 9, 2025

Uh oh!

Uh oh!

		from ._cli_utils import TokenOpt, get_hf_api, typer_factory


		logger = logging.get_logger(__name__)



		@deploy_app.command(name="hub", help="Deploy an Inference Endpoint from a Hub repository.")
		def deploy_from_hub(

[CLI] Add Inference Endpoints Commands #3428

Are you sure you want to change the base?

[CLI] Add Inference Endpoints Commands #3428

Uh oh!

Conversation

hanouticelina commented Oct 9, 2025

Uh oh!

hanouticelina Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

hanouticelina Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Oct 9, 2025

Uh oh!

Uh oh!