
Conversation

@saar-win (Contributor) commented Dec 2, 2025

Title

Relevant issues

Pre-Submission checklist

Please complete all items before asking a LiteLLM maintainer to review your PR

  • I have Added testing in the tests/litellm/ directory, Adding at least 1 test is a hard requirement - see details
  • I have added a screenshot of my new test passing locally
  • My PR passes all unit tests on make test-unit
  • My PR's scope is as isolated as possible, it only solves 1 specific problem

Type

🆕 New Feature

Changes

Add support for ElevenLabs' /with-timestamps endpoint, which returns
character-level timing information alongside the generated audio. This
enables use cases like lip-sync, captions, and audio-text synchronization.

Changes

Core Implementation

  • litellm/llms/elevenlabs/text_to_speech/transformation.py:

    • Add ELEVENLABS_WITH_TIMESTAMPS_KEY constant
    • Modify get_complete_url() to append /with-timestamps when requested
    • Update transform_text_to_speech_response() to handle JSON responses
    • Extract with_timestamps from optional_params in map_openai_params() (see the sketch after this list)
  • litellm/main.py:

    • Add with_timestamps parameter to speech() function signature
    • Extract and pass with_timestamps flag to litellm_params_dict
    • Update return type to Union[HttpxBinaryResponseContent, dict]
  • litellm/proxy/proxy_server.py:

    • Handle dict responses in audio_speech endpoint
    • Return ORJSONResponse for with_timestamps requests instead of StreamingResponse
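
A minimal sketch of the transformation-layer changes, using the constant and function names listed above; the actual LiteLLM signatures differ, so treat this as illustrative rather than the real diff:

# Illustrative sketch only; real signatures in litellm differ.
ELEVENLABS_WITH_TIMESTAMPS_KEY = "with_timestamps"

def map_openai_params(non_default_params: dict, optional_params: dict) -> dict:
    # Pull the provider-specific flag out of the request params so it can be
    # forwarded through litellm_params to the URL builder below.
    if ELEVENLABS_WITH_TIMESTAMPS_KEY in non_default_params:
        optional_params[ELEVENLABS_WITH_TIMESTAMPS_KEY] = non_default_params.pop(
            ELEVENLABS_WITH_TIMESTAMPS_KEY
        )
    return optional_params

def get_complete_url(api_base: str, voice_id: str, with_timestamps: bool) -> str:
    # Append the /with-timestamps suffix only when the caller asked for it.
    url = f"{api_base}/v1/text-to-speech/{voice_id}"
    if with_timestamps:
        url += "/with-timestamps"
    return url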

Supporting Changes

  • litellm/types/llms/elevenlabs.py (new file):

    • Add type definitions for ElevenLabsAlignment
    • Add type definitions for ElevenLabsNormalizedAlignment
    • Add ElevenLabsTextToSpeechWithTimestampsResponse TypedDict (sketched after this list)
  • litellm/llms/base_llm/text_to_speech/transformation.py:

    • Update transform_text_to_speech_response return type to support dict
  • litellm/llms/custom_httpx/llm_http_handler.py:

    • Update handler return types to support dict responses
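
The new type definitions mirror the shape of the ElevenLabs /with-timestamps payload. A sketch follows; field names are taken from the ElevenLabs API and may differ slightly from the committed file:

# Sketch of litellm/types/llms/elevenlabs.py; verify against the actual file.
from typing import List, Optional
from typing_extensions import TypedDict

class ElevenLabsAlignment(TypedDict):
    characters: List[str]
    character_start_times_seconds: List[float]
    character_end_times_seconds: List[float]

class ElevenLabsNormalizedAlignment(TypedDict):
    characters: List[str]
    character_start_times_seconds: List[float]
    character_end_times_seconds: List[float]

class ElevenLabsTextToSpeechWithTimestampsResponse(TypedDict):
    audio_base64: str
    alignment: Optional[ElevenLabsAlignment]
    normalized_alignment: Optional[ElevenLabsNormalizedAlignment]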

Tests

  • tests/llm_translation/test_elevenlabs.py:
    • Add test_with_timestamps_parameter
    • Add test_without_timestamps_parameter
    • Add test_transform_response_json
    • Add test_transform_response_binary

Documentation

  • docs/my-website/docs/providers/elevenlabs.md:
    • Add "Text-to-Speech with Timestamps" section
    • Include Python SDK and curl examples
    • Document response format with alignment data

Usage

# Python SDK
import litellm

response = litellm.speech(
    model="elevenlabs/eleven_turbo_v2",
    input="Hello world",
    voice="alloy",
    with_timestamps=True
)
print(response["audio_base64"])
print(response["alignment"])

# curl via LiteLLM Proxy
curl -X POST "http://localhost:4000/v1/audio/speech" \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model": "eleven_turbo_v2", "input": "Hello", "voice": "alloy", "with_timestamps": true}'

Response Format

When with_timestamps=True, the call returns JSON instead of binary audio (example below):

  • audio_base64: Base64-encoded audio data
  • alignment: Character-level start/end times
  • normalized_alignment: Normalized timing data
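
An illustrative payload (shape based on the ElevenLabs /with-timestamps response; values are made up and shortened):

# Rough shape of the returned dict; values are illustrative only.
{
    "audio_base64": "SUQzBAAAAAAA...",
    "alignment": {
        "characters": ["H", "e", "l", "l", "o"],
        "character_start_times_seconds": [0.0, 0.08, 0.14, 0.2, 0.26],
        "character_end_times_seconds": [0.08, 0.14, 0.2, 0.26, 0.35],
    },
    "normalized_alignment": {
        "characters": ["H", "e", "l", "l", "o"],
        "character_start_times_seconds": [0.0, 0.08, 0.14, 0.2, 0.26],
        "character_end_times_seconds": [0.08, 0.14, 0.2, 0.26, 0.35],
    },
}
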
[Screenshot attached: 2025-12-02 at 10:15:03]

vercel bot commented Dec 2, 2025

The latest updates on your projects:

Project: litellm | Deployment: Error | Preview: Error | Updated (UTC): Dec 2, 2025 8:29am

@saar-win changed the title from "Elevenlabs/enrichment" to "feat(elevenlabs): Add with_timestamps support for TTS alignment data" on Dec 2, 2025
  raw_response: httpx.Response,
  logging_obj: LiteLLMLoggingObj,
- ) -> "HttpxBinaryResponseContent":
+ ) -> Union["HttpxBinaryResponseContent", Dict]:
Contributor commented:

@saar-win wouldn't this be non-OpenAI-compatible?

Let me know if there is a scenario where OpenAI returns a dictionary-like object.

@saar-win (Contributor, Author) replied Dec 3, 2025

ElevenLabs supports a with_timestamps option which uses their /with-timestamps endpoint. This returns a JSON dict containing audio_base64 (base64-encoded audio) and an alignment object with character-level timing data. This is a provider-specific extension—OpenAI TTS has no equivalent and always returns binary audio.
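
So existing OpenAI-style callers are unaffected unless they opt in. A rough illustration of how a caller can branch on the two return types (a sketch only, assuming the usual HttpxBinaryResponseContent helpers):

# Only callers that opt in to with_timestamps ever see a dict;
# the default path keeps returning binary audio as before.
import litellm

resp = litellm.speech(
    model="elevenlabs/eleven_turbo_v2",
    input="Hello world",
    voice="alloy",
    with_timestamps=True,
)

if isinstance(resp, dict):
    # Provider-specific JSON: base64 audio plus character-level timing.
    alignment = resp["alignment"]
else:
    # OpenAI-compatible binary audio response.
    resp.stream_to_file("speech.mp3")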

@krrishdholakia (Contributor) commented

  • @Sameerlite can you monitor this PR and ensure we get this / something like this in main

@saar-win (Contributor, Author) commented Dec 7, 2025

@krrishdholakia, any update on this?
