This package provides comprehensive stopword lists for natural language processing and text mining.
For example, filtering Persian stopwords out of a tokenized sentence (the import path below is assumed from the package name):

```python
from nltk.tokenize import word_tokenize
from stopwords_guilannlp import stopwords_output  # import path assumed from the package name

def remove_stopwords(s):
    # The package does not tokenize, so NLTK's word_tokenize is used here
    tokens = word_tokenize(s)
    # Persian stopwords, returned as a NumPy array ("nar")
    stopwords = stopwords_output("Persian", "nar")
    filtered_tokens = []
    for w in tokens:
        if w not in stopwords:
            filtered_tokens.append(w)
    return filtered_tokens
```
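A quick, illustrative call of the helper above; the sample sentence is arbitrary and the exact output depends on the shipped word list:

```python
print(remove_stopwords("این یک کتاب است"))
# e.g. ['کتاب'] if «این», «یک», and «است» appear in the Persian stopword list
```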
- Note: the package does not perform tokenization itself; pair it with an external tokenizer such as NLTK's `word_tokenize` (as in the example above) or a language-specific one (as sketched below).
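For Persian input specifically, a language-aware tokenizer may behave better than NLTK's default. A minimal sketch using the third-party hazm library, which is not part of this package:

```python
from hazm import word_tokenize  # third-party Persian NLP toolkit

tokens = word_tokenize("این یک کتاب است")  # Persian-aware tokenization
```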
Supported languages:

- Afrikaans
- Arabic
- Armenian
- Basque
- Bengali
- Breton
- Bulgarian
- Catalan
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Esperanto
- Estonian
- Finnish
- French
- Galician
- German
- Greek
- Hausa
- Hebrew
- Hindi
- Hungarian
- Indonesian
- Irish
- Italian
- Japanese
- Korean
- Latin
- Latvian
- Marathi
- Norwegian
- Persian
- Polish
- Portuguese
- Romanian
- Russian
- Slovak
- Slovenian
- Somali
- Southern Sotho
- Spanish
- Swahili
- Swedish
- Thai
- Turkish
- Yoruba
- Zulu
Supported output formats:

- pandas DataFrame
- NumPy array ("nar" in the example above)
- Set
- List
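A sketch of working with the returned container; only the "nar" code appears in this document, so the other format codes hinted at in the comment are assumptions to verify against the package's documentation:

```python
from stopwords_guilannlp import stopwords_output  # import path assumed

sw_array = stopwords_output("Persian", "nar")  # NumPy array, as in the example above
sw_set = set(sw_array)                         # convert for O(1) average membership tests
# stopwords_output("Persian", "set")           # assumed format code, unverified

print("است" in sw_set)
```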