How to handle 'ValueError: empty vocabulary; perhaps the documents only contain stop words' in group_similar_strings #66

@gw00207

Currently I have to wrap calls to group_similar_strings in a try/except clause in case all of the strings contain only stop words. Could this case be handled differently, e.g. by returning all strings ungrouped? Or perhaps by raising a more descriptive error, so that I can catch and handle OnlyStopwordsError (or similar) instead of any ValueError? Great package, many thanks.
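
For illustration, here is a minimal sketch of the kind of workaround described above. It is not code from the package: `OnlyStopwordsError`, `raise_descriptive` and `group_or_passthrough` are hypothetical names, and the grouping function is passed in as a parameter so the sketch does not depend on any particular signature. The only assumption about the real library is that the failure surfaces as a `ValueError` whose message contains "empty vocabulary", as quoted in the title.

```python
class OnlyStopwordsError(ValueError):
    """Hypothetical, more descriptive error; not part of string_grouper."""


def raise_descriptive(group_fn, strings, **kwargs):
    """Call group_fn and re-raise the 'empty vocabulary' case as OnlyStopwordsError."""
    try:
        return group_fn(strings, **kwargs)
    except ValueError as err:
        # Assumption: the message text matches the one quoted in the issue title.
        if "empty vocabulary" in str(err):
            raise OnlyStopwordsError(str(err)) from err
        raise  # any other ValueError is left untouched


def group_or_passthrough(group_fn, strings, **kwargs):
    """Call group_fn, but return the input ungrouped if it contains only stop words."""
    try:
        return raise_descriptive(group_fn, strings, **kwargs)
    except OnlyStopwordsError:
        return strings
```

With the real package this would presumably be called as `group_or_passthrough(group_similar_strings, my_series)`. Because `OnlyStopwordsError` subclasses `ValueError`, existing `except ValueError` handlers would keep working.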
