-## rapid_paraformer
-
-<p align="left">
-    <a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
-    <a href=""><img src="https://img.shields.io/badge/Python->=3.6,<3.12-aff.svg"></a>
-    <a href="https://pepy.tech/project/rapid_paraformer"><img src="https://static.pepy.tech/personalized-badge/rapid_paraformer?period=total&units=abbreviation&left_color=grey&right_color=blue&left_text=Downloads"></a>
-    <a href="https://pypi.org/project/rapid_paraformer/"><img alt="PyPI" src="https://img.shields.io/pypi/v/rapid_paraformer"></a>
-    <a href="https://semver.org/"><img alt="SemVer2.0" src="https://img.shields.io/badge/SemVer-2.0-brightgreen"></a>
-    <a href="https://github.com/psf/black"><img src="https://img.shields.io/badge/code%20style-black-000000.svg"></a>
-</p>
-
-- The model comes from Alibaba DAMO Academy's [Paraformer语音识别-中文-通用-16k-离线-large-pytorch](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
-- 🎉 The core code of this project has been merged into [FunASR](https://github.com/alibaba-damo-academy/FunASR)
-- This repository only converts the model and uses ONNXRuntime as its sole inference engine
-
-#### TODO
+<div align="center">
+<div align="center">
+    <h1><b>Rapid Paraformer</b></h1>
+</div>
+
+<a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
+<a href=""><img src="https://img.shields.io/badge/Python->=3.6,<3.13-aff.svg"></a>
+<a href="https://pepy.tech/project/rapid_paraformer"><img src="https://static.pepy.tech/personalized-badge/rapid_paraformer?period=total&units=abbreviation&left_color=grey&right_color=blue&left_text=Downloads"></a>
+<a href="https://pypi.org/project/rapid_paraformer/"><img alt="PyPI" src="https://img.shields.io/pypi/v/rapid_paraformer"></a>
+<a href="https://semver.org/"><img alt="SemVer2.0" src="https://img.shields.io/badge/SemVer-2.0-brightgreen"></a>
+<a href="https://github.com/psf/black"><img src="https://img.shields.io/badge/code%20style-black-000000.svg"></a>
+</div>
+
+## Introduction
+rapid_paraformer is a speech recognition tool based on Alibaba DAMO Academy's [Paraformer语音识别-中文-通用-16k-离线-large-pytorch](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary).
+
+🎉 The core code of this project has been merged into [FunASR](https://github.com/alibaba-damo-academy/FunASR).
+
+This repository only converts the model and uses ONNXRuntime as its sole inference engine.
+
+## TODO
 - [ ] Integrate the VAD, ASR, and punctuation models into a deployable solution

-#### Usage steps
-1. Install
-    1. Install `rapid_paraformer`
-        ```bash
-        pip install rapid_paraformer
-        ```
-    2. Download **resources.zip** ([Google Drive](https://drive.google.com/drive/folders/1RVQtMe0eB_k6G5TJlmXwPELx4VtF2oCw?usp=sharing) | [Baidu Netdisk](https://pan.baidu.com/s/1zf8Ta6QxFHY3Z75fHNYKrQ?pwd=6ekq))
-        ```bash
-        resources
-        ├── [ 700] config.yaml
-        └── [4.0K] models
-            ├── [ 11K] am.mvn
-            ├── [824M] asr_paraformerv2.onnx
-            └── [ 50K] token_list.pkl
-        ```
-    3. The **asr_paraformerv2.onnx** file can also be converted yourself in the ModelScope notebook environment:
-        1. Open [Quick Start](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
-        2. Open the notebook, enter the following command in a cell, and run it.
-            ```python
-            !python -m funasr.export.export_model --model-name 'damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch' --export-dir "./export"
-            ```
-
-2. Usage
-    ```python
-    from rapid_paraformer import RapidParaformer
-
-    config_path = "resources/config.yaml"
-
-    paraformer = RapidParaformer(config_path)
-
-    wav_path = [
-        "test_wavs/0478_00017.wav",
-        "test_wavs/asr_example_zh.wav",
-    ]
-
-    result = paraformer(wav_path)
-    print(result)
-    ```
-3. View the result
-    ```text
-    ['y', '欢迎大家来体验达摩院推出的语音识别模型']
+## Installation
+```bash
+pip install rapid_paraformer
+```
+
+## Model download
+Method 1: download from Hugging Face ([link](https://huggingface.co/SWHL/RapidParaformer))
+```python
+from rapid_paraformer import download_hf_model
+
+download_hf_model(repo_id="SWHL/RapidParaformer", save_dir=".")
+```
+
+Method 2: download manually ([Google Drive](https://drive.google.com/drive/folders/1RVQtMe0eB_k6G5TJlmXwPELx4VtF2oCw?usp=sharing) | [Baidu Netdisk](https://pan.baidu.com/s/1zf8Ta6QxFHY3Z75fHNYKrQ?pwd=6ekq))
+```bash
+resources
+├── [ 700] config.yaml
+└── [4.0K] models
+    ├── [ 11K] am.mvn
+    ├── [824M] asr_paraformerv2.onnx
+    └── [ 50K] token_list.pkl
+```
+
+## Model conversion
+You can convert the model yourself in the ModelScope notebook environment:
+1. Open [Quick Start](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
+2. Open the notebook, enter the following command in a cell, and run it.
+    ```bash
+    !python -m funasr.export.export_model --model-name 'damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch' --export-dir "./export"
     ```
+
+## Usage
+```python
+from rapid_paraformer import RapidParaformer
+
+config_path = "resources/config.yaml"
+
+paraformer = RapidParaformer(config_path)
+
+wav_path = [
+    "test_wavs/0478_00017.wav",
+    "test_wavs/asr_example_zh.wav",
+]
+
+result = paraformer(wav_path)
+print(result)
+# ['y', '欢迎大家来体验达摩院推出的语音识别模型']
+```
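
The download and usage steps in this README can be guarded with a little standard-library code. The sketch below is not part of `rapid_paraformer`: `EXPECTED_FILES`, `find_missing_resources`, and `transcribe` are hypothetical helper names. It assumes the `resources` layout shown in the README and that the recognizer returns one transcript per input path, in input order, as the sample output suggests.

```python
from pathlib import Path
from typing import Callable, Dict, List, Sequence

# File layout copied from the resources tree shown in the README.
EXPECTED_FILES = (
    "config.yaml",
    "models/am.mvn",
    "models/asr_paraformerv2.onnx",
    "models/token_list.pkl",
)


def find_missing_resources(resources_dir: str) -> List[str]:
    """Return the expected files that are absent from resources_dir."""
    root = Path(resources_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


def transcribe(
    recognizer: Callable[[List[str]], Sequence[str]],
    wav_paths: Sequence[str],
) -> Dict[str, str]:
    """Pair each input path with its transcript.

    Assumption: the recognizer returns exactly one transcript per input,
    in the same order as the inputs.
    """
    results = recognizer(list(wav_paths))
    if len(results) != len(wav_paths):
        raise ValueError(
            f"expected {len(wav_paths)} transcripts, got {len(results)}"
        )
    return dict(zip(wav_paths, results))


# Intended use with the README's API (requires the downloaded resources):
#
#   from rapid_paraformer import RapidParaformer
#
#   missing = find_missing_resources("resources")
#   if missing:
#       raise SystemExit(f"Missing resource files: {missing}")
#   paraformer = RapidParaformer("resources/config.yaml")
#   print(transcribe(paraformer, ["test_wavs/asr_example_zh.wav"]))
```

Checking the files first turns a missing 824M model download into an explicit message naming the absent files, before the recognizer is constructed.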