French (doc add) #83

@EtienneAb3d

As requested in #41, here is how I succeeded in running contextualSpellCheck for French.

Use a French spaCy model:

import spacy
nlp = spacy.load("fr_core_news_sm")

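Use camembert/camembert-base-ccnet as the language model:

import contextualSpellCheck  # registers the "contextual spellchecker" component
nlp.add_pipe("contextual spellchecker", config={"max_edit_dist": 4, "model_name": "camembert/camembert-base-ccnet"})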

These dependencies are also needed:

pip install sentencepiece
pip install protobuf==3.20
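
Putting the steps above together, the setup could look roughly like the sketch below. The model names and config values are the ones given in this issue; the helper names `build_french_spellchecker` and `spellcheck` are only illustrative, and the imports are kept inside the function so the heavy libraries are only loaded when the pipeline is actually built.

```python
# Values taken from this issue.
FRENCH_SPACY_MODEL = "fr_core_news_sm"
SPELLCHECK_CONFIG = {
    "max_edit_dist": 4,
    "model_name": "camembert/camembert-base-ccnet",
}


def build_french_spellchecker():
    """Load the French spaCy model and attach the spell checker (illustrative helper)."""
    # Imported lazily: both packages are heavy and pull in model weights.
    import spacy
    import contextualSpellCheck  # noqa: F401  (importing registers the component)

    nlp = spacy.load(FRENCH_SPACY_MODEL)
    nlp.add_pipe("contextual spellchecker", config=SPELLCHECK_CONFIG)
    return nlp


def spellcheck(text, nlp=None):
    """Run the pipeline on `text` and return (performed, suggestions)."""
    nlp = nlp or build_french_spellchecker()
    doc = nlp(text)
    return doc._.performed_spellCheck, doc._.suggestions_spellCheck
```

Calling `spellcheck("...")` with a French sentence assumes that `fr_core_news_sm` has been installed beforehand (for example with `python -m spacy download fr_core_news_sm`) and that the CamemBERT weights can be downloaded on first use.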

Remark: spaces are lost in the result, so post-processing is needed to restore them properly.
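
One possible way to do that post-processing, as a minimal sketch rather than a tested fix: instead of repairing the output string, rebuild the text from the original spaCy tokens, which still carry their trailing whitespace, and substitute only the corrected tokens. This assumes `doc._.suggestions_spellCheck` maps the misspelled tokens to their replacement strings; the helper names `rebuild_with_spaces` and `corrected_text` are hypothetical.

```python
def rebuild_with_spaces(tokens, corrections):
    """Join tokens back into text, keeping the original spacing.

    tokens:      list of (text, trailing_whitespace) pairs in document order
    corrections: dict mapping a token index to its replacement string
    """
    return "".join(
        corrections.get(i, text) + whitespace
        for i, (text, whitespace) in enumerate(tokens)
    )


def corrected_text(doc):
    """Apply the suggestions to a processed spaCy doc (hypothetical helper).

    Assumption: the keys of doc._.suggestions_spellCheck are spaCy tokens.
    """
    corrections = {tok.i: fix for tok, fix in doc._.suggestions_spellCheck.items()}
    tokens = [(tok.text, tok.whitespace_) for tok in doc]
    return rebuild_with_spaces(tokens, corrections)
```

For example, `rebuild_with_spaces([("une", " "), ("pome", ""), (".", "")], {1: "pomme"})` keeps the space after "une" and adds none before the period.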

PS: the flaubert/flaubert_large_cased model additionally needs this dependency:

pip install sacremoses

Labels: documentation, enhancement