more flexibility on Audio Transcriber #1

first of all, just want to say this is a great package! 💯 Thanks for putting this out there ❤️

on to the audio transcription issue...

* have you thought about supporting a "local" Transcriber, like the one in `openscenesense-ollama`?
* or, alternatively, loading Whisper through the Hugging Face serverless Inference API (e.g. see below)?
* and an option to turn off audio transcription entirely (useful when the video has no audio track)

```python
import requests

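# Serverless Inference API endpoint for the whisper-small checkpoint
API_URL = "https://api-inference.huggingface.co/models/openai/whisper-small"
# Placeholder token: replace with your own Hugging Face access token
headers = {"Authorization": "Bearer hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"}

def query(filename):
    """Send the raw audio bytes to the API and return the parsed JSON response."""
    with open(filename, "rb") as f:
        data = f.read()
    response = requests.post(API_URL, headers=headers, data=data)
    response.raise_for_status()  # fail loudly on HTTP errors instead of parsing an error body
    return response.json()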

output = query("sample1.flac")
```
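
To make the three suggestions above a bit more concrete, here is a rough sketch of what a pluggable transcriber could look like. None of these names (`Transcriber`, `NullTranscriber`, `FunctionTranscriber`, `run_transcription`) come from the package; they are hypothetical and only meant to illustrate the shape of the idea. A local model or the `query` function above could be wrapped the same way.

```python
from typing import Callable, Optional, Protocol


class Transcriber(Protocol):
    """Hypothetical interface: anything with a transcribe(path) -> str method."""

    def transcribe(self, audio_path: str) -> str: ...


class NullTranscriber:
    """Stands in when transcription is turned off (e.g. no audio track)."""

    def transcribe(self, audio_path: str) -> str:
        return ""


class FunctionTranscriber:
    """Wraps any callable (local model, remote API call) as a Transcriber."""

    def __init__(self, fn: Callable[[str], str]) -> None:
        self._fn = fn

    def transcribe(self, audio_path: str) -> str:
        return self._fn(audio_path)


def run_transcription(audio_path: str, transcriber: Optional[Transcriber] = None) -> str:
    """Return the transcript, or an empty string when no transcriber is given."""
    if transcriber is None:
        return ""
    return transcriber.transcribe(audio_path)
```

With something like this, passing `None` (or a `NullTranscriber`) would skip transcription, and a Hugging Face backend could be plugged in as `FunctionTranscriber(lambda path: query(path).get("text", ""))`, assuming the response JSON carries the transcript under a `"text"` key.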
