Small scorer file #10

@ellenhp

Hey! Very excited to see this. It still seems to build and work in 2024. To get it to build, I had to update the vosk dependency to version 0.3.47. For DeepSpeech/English, I had to replace the model files with the ones from the DeepSpeech 0.8.2 release and add org.gradle.jvmargs=-Xmx16g to gradle.properties so that Gradle could build with the massive multilingual .scorer file. After that I changed the paths in app/src/main/java/cat/oreilly/localstt/DeepSpeechRecognitionService.java and it works. I also had to use NDK 20.1.5948944; it wouldn't work with the latest NDK.
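
For anyone following along, the gradle.properties change amounts to something like the fragment below. This is only a sketch: the one setting is the value mentioned above, the comment is illustrative, and any other lines already in the file are assumed to stay as they are.

```properties
# Give the Gradle daemon a larger JVM heap so the build can handle the large .scorer asset.
# 16g is the value that worked here; a smaller heap may be enough on other machines (untested assumption).
org.gradle.jvmargs=-Xmx16g
```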

I was wondering how you got the very small .scorer file for Catalan? Is there a script somewhere I could run to extract single languages from the big scorer file? Or did you generate your own? It looks like there might be ways to do that, but I haven't looked into it too closely yet. The scorer file from the DeepSpeech release seems to work with the little bits of Spanish I know, so I'm guessing it works with every language that DeepSpeech supports. No wonder it's so big! 😅

Thank you so much for publishing this. It's really helpful.
