Skip to content

Dealing with high frequency less important stop words #245

@almasaeed2010

Description

@almasaeed2010

Often our indexed data is full of less important words (stop words) such as "the", "or" and "a". ES documentation suggests we deal with this using a common terms query, which prioritizes more important (low frequency) words. It's important that we keep stop words in our indexed data we just have to treat them differently. Removing stop words completely causes the index to lose some of the meaning in each string. For example, removing stop words from "to be or not to be" would yield an empty string "".

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions