Add Chinese StopWords. #331

Open
wants to merge 1 commit into
base: main


PrinOrange

Add a supplementary set of Chinese stopwords.
These stopwords are collected from https://github.com/yuanjie-ai/stopwords-zh, which is widely used across many Chinese corpus domains and is kept up to date.

Collaborator

eklem commented Dec 24, 2024

I'll look through the tests. But just looking at Google Translate, it seems a bit excessive? There seem to be some words here and there that are not stopwords. What do you think, @PrinOrange?


2 participants