diff --git a/DESCRIPTION b/DESCRIPTION index 77cbc2f..6970280 100644 --- a/DESCRIPTION +++ b/DESCRIPTION @@ -1,7 +1,7 @@ Package: stopwords Type: Package Title: Multilingual Stopword Lists -Version: 2.1.9000 +Version: 2.2 Authors@R: c(person("Kenneth", "Benoit", email = "kbenoit@lse.ac.uk", role = c("aut", "cre")), person("David", "Muhr", email = "muhrdavid+github@gmail.com", role = "aut"), person("Kohei", "Watanabe", email = "watanabe.kohei@gmail.com", role = "aut")) diff --git a/README.md b/README.md index a1496fc..9c3a161 100644 --- a/README.md +++ b/README.md @@ -3,7 +3,7 @@ [![CRAN Version](https://www.r-pkg.org/badges/version/stopwords)](https://CRAN.R-project.org/package=stopwords) -[![](https://img.shields.io/badge/devel%20version-2.1.9000-royalblue.svg)](https://github.com/quanteda/stopwords) +[![](https://img.shields.io/badge/devel%20version-2.2-royalblue.svg)](https://github.com/quanteda/stopwords) [![R build status](https://github.com/quanteda/stopwords/workflows/R-CMD-check/badge.svg)](https://github.com/quanteda/stopwords/actions) [![codecov](https://codecov.io/gh/quanteda/stopwords/branch/master/graph/badge.svg)](https://codecov.io/gh/quanteda/stopwords) @@ -81,7 +81,7 @@ source over the “stopwords-iso” source, for instance.) The following languages are currently available: | Language | Code | snowball | marimo | nltk | stopwords-iso | Other | -|:----------------|:-----|:--------:|:------:|:----:|:-------------:|:-----------------| +| :-------------- | :--- | :------: | :----: | :--: | :-----------: | :--------------- | | Afrikaans | af | | | | ✓ | | | Arabic | ar | | ✓ | ✓ | ✓ | misc | | Armenian | hy | | | | ✓ | | @@ -186,7 +186,7 @@ stopwords::stopwords_getlanguages("snowball") ## Modifying stopword lists It is now possible to edit your own stopword lists, using the -interactive editor, with functions from the **quanteda** package (>= +interactive editor, with functions from the **quanteda** package (\>= v2.02). For instance to edit the English stopword list for the Snowball source: @@ -205,7 +205,7 @@ my_stopwordlist <- quanteda::list_edit(stopwords("en", source = "marimo", simpli Finally, it’s possible to remove stopwords using pattern matching. The default is the easy-to-use [“glob” style -matching](https://en.wikipedia.org/wiki/Glob_(programming)), which is +matching](https://en.wikipedia.org/wiki/Glob_\(programming\)), which is equivalent to fixed matching when no wildcard characters are used. So to remove personal pronouns from the English Snowball word list, for instance, this would work: @@ -213,9 +213,7 @@ instance, this would work: ``` r library("quanteda", warn.conflicts = FALSE) ## Package version: 2.9.9000 -## Unicode version: 10.0 -## ICU version: 61.1 -## Parallel computing: 12 of 12 threads used. +## Parallel computing: 8 of 8 threads used. ## See https://quanteda.io for tutorials and examples. posspronouns <- stopwords::data_stopwords_marimo$en$pronoun$possessive posspronouns diff --git a/docs/404.html b/docs/404.html index aa8c07b..e702506 100644 --- a/docs/404.html +++ b/docs/404.html @@ -73,7 +73,7 @@
diff --git a/docs/LICENSE-text.html b/docs/LICENSE-text.html index e2e5bd9..7eb2075 100644 --- a/docs/LICENSE-text.html +++ b/docs/LICENSE-text.html @@ -73,7 +73,7 @@ diff --git a/docs/authors.html b/docs/authors.html index 05f5b10..3c0af2c 100644 --- a/docs/authors.html +++ b/docs/authors.html @@ -73,7 +73,7 @@ diff --git a/docs/index.html b/docs/index.html index fe3ee0b..60705be 100644 --- a/docs/index.html +++ b/docs/index.html @@ -32,7 +32,7 @@ @@ -741,7 +741,7 @@
library("quanteda", warn.conflicts = FALSE)
## Package version: 2.9.9000
-## Parallel computing: 12 of 12 threads used.
+## Parallel computing: 8 of 8 threads used.
## See https://quanteda.io for tutorials and examples.
posspronouns <- stopwords::data_stopwords_marimo$en$pronoun$possessive
posspronouns
@@ -763,20 +763,20 @@
Adding stopwords to your own package
-As of version 1.1, we’ve made it a one-step process to add stopwords()
to your package through a re-export. Simply call use_stopwords()
like this:
-> stopwords::use_stopwords()
-✔ Setting active project to '/Users/me/GitHub/mypackage'
-✔ Adding 'stopwords' to Imports field in DESCRIPTION
-✔ Writing 'R/use-stopwords.R'
-● Run `devtools::document()` to update 'NAMESPACE'
-
-> devtools::document()
-Updating mypackage documentation
-Updating collate directive in /Users/me/GitHub/mypackage/DESCRIPTION
-Writing NAMESPACE
-Loading mypackage
-Writing NAMESPACE
-Writing stopwords.Rd
+In v2.2, we’ve removed the function use_stopwords()
because the dependency on usethis added too many downstream package dependencies, and stopwords is meant to be a lightweight package.
+However it is very easy to add a re-export for stopwords()
to your package by adding this file as stopwords.R
:
+
+#' Stopwords
+#'
+#' @description
+#' Return a character vector of stopwords.
+#' See \code{stopwords::\link[stopwords:stopwords]{stopwords()}} for details.
+#' @usage stopwords(language = "en", source = "snowball")
+#' @name stopwords
+#' @importFrom stopwords stopwords
+#' @export
+NULL
+and add stopwords
to the list of Imports:
in your DESCRIPTION
file.
@@ -828,9 +828,9 @@ Developers
Dev status
diff --git a/docs/news/index.html b/docs/news/index.html
index fab4703..1e78e87 100644
--- a/docs/news/index.html
+++ b/docs/news/index.html
@@ -73,7 +73,7 @@
@@ -107,9 +107,17 @@ Changelog
Source: NEWS.md
use_stopword()
, as this introduces too many higher-order dependencies via usethis.ancient
source, and moves older ancient stopwords lists to a new perseus
source.use_stopwords()
, for adding a re-export of this function to any package.use_stopwords()
, for adding a re-export of this function to any package.list available stopwords sources
Use stopwords in your package