Quantization support #5

@starkdmi

Description

There is a one-line change that allows loading quantized MLX models in the Python version of f5-tts-mlx - source

I tried to patch F5TTS.fromPretrained in Swift to accept quantized weights by calling quantize(model:groupSize:bits:filter:) on the f5tts module, but did not succeed.

Any idea how to add support for quantized weights like f5-tts-mlx-4bit and f5-tts-mlx-8bit?
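For reference, the Python change boils down to calling nn.quantize on the model *before* load_weights, restricted to the layers that actually have quantized tensors in the checkpoint. A rough Swift equivalent might look like the sketch below. This is an assumption-laden sketch, not the actual f5-tts-mlx-swift API: `F5TTS()`, the file names, and the exact quantize filter signature depend on the MLXNN version you're building against.

```swift
import Foundation
import MLX
import MLXNN

// Sketch only: `F5TTS()` and the file names here are assumptions,
// not the real f5-tts-mlx-swift API.
func loadQuantizedF5TTS(from directory: URL) throws -> F5TTS {
    // Quantized MLX conversions record their parameters in config.json,
    // e.g. "quantization": { "group_size": 64, "bits": 4 }.
    let configData = try Data(contentsOf: directory.appendingPathComponent("config.json"))
    let config = try JSONSerialization.jsonObject(with: configData) as? [String: Any]
    let quantization = config?["quantization"] as? [String: Int]

    let model = F5TTS()

    // Load the flat [name: MLXArray] dictionary from the checkpoint.
    let weights = try loadArrays(url: directory.appendingPathComponent("model.safetensors"))

    if let quantization {
        // Swap Linear/Embedding layers for their quantized counterparts
        // *before* applying weights, so parameter names like "...scales"
        // and "...biases" line up. Only quantize layers that actually
        // have scales in the checkpoint.
        quantize(
            model: model,
            groupSize: quantization["group_size"] ?? 64,
            bits: quantization["bits"] ?? 4
        ) { path, module in
            weights["\(path).scales"] != nil
        }
    }

    try model.update(parameters: ModuleParameters.unflattened(weights), verify: [.all])
    eval(model)
    return model
}
```

The ordering is the key detail: quantize has to run before the weights are applied, otherwise update(parameters:verify:) fails because the quantized checkpoint contains scales/biases tensors that the float model doesn't declare. The filter closure limits quantization to layers that really were quantized during conversion, mirroring what the Python one-liner relies on.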
