idiolect

The idiolect R package provides a comprehensive suite of tools for comparative authorship analysis in a forensic context using the Likelihood Ratio Framework (e.g. Ishihara 2021; Nini 2023). Its authorship analysis functions take a set of texts as input and return scores that can then be calibrated into likelihood ratios. The package depends on quanteda (Benoit et al. 2018) for all Natural Language Processing functions.

Installation

You can install idiolect from CRAN:

install.packages("idiolect")

Workflow

The main functions in the package follow the typical workflow of a forensic authorship analysis:

1. Input data using create_corpus();
2. Optionally mask the content/topic of the texts using contentmask();
3. Launch an analysis (e.g. delta(), ngram_tracing(), impostors());
4. Test the performance of the method on ground truth data using performance();
5. Finally, apply the method to the questioned text and generate a likelihood ratio with calibrate_LLR().

Check the website and the vignette for examples.
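
To make the final calibration step more concrete, the sketch below shows one common way of turning a raw similarity score into a log-likelihood ratio: logistic regression fitted on validation scores with known ground truth. It uses base R only and does not call idiolect or reproduce the internals of calibrate_LLR(). The scores, the data frame `validation`, and the helper `score_to_llr()` are invented for illustration.

```r
# Toy validation scores with known ground truth (made-up numbers, not package output).
# target = TRUE for same-author comparisons, FALSE for different-author comparisons.
validation <- data.frame(
  score  = c(0.91, 0.84, 0.78, 0.69, 0.55, 0.35,
             0.62, 0.58, 0.48, 0.30, 0.22, 0.15),
  target = rep(c(TRUE, FALSE), each = 6)
)

# Fit a logistic regression that maps a score to posterior log-odds.
fit <- glm(target ~ score, data = validation, family = binomial)

# The model's log-odds include the prior odds implied by the class proportions
# in the validation set; subtracting them leaves the log-likelihood ratio.
prior_log_odds <- log(sum(validation$target) / sum(!validation$target))

# Hypothetical helper: convert one or more scores into base-10 LLRs.
score_to_llr <- function(score) {
  log_odds <- predict(fit, newdata = data.frame(score = score), type = "link")
  unname((log_odds - prior_log_odds) / log(10))
}

# Score obtained for the questioned text (also invented here).
questioned_score <- 0.80
llr <- score_to_llr(questioned_score)
print(llr)
```

A positive value supports the same-author hypothesis and a negative value supports the different-author hypothesis. With real data, the validation scores would come from running the chosen method on ground truth comparisons, as in step 4 of the workflow.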

References

Benoit, Kenneth, Kohei Watanabe, Haiyan Wang, Paul Nulty, Adam Obeng, Stefan Müller, and Akitaka Matsuo. 2018. “quanteda: An R Package for the Quantitative Analysis of Textual Data.” Journal of Open Source Software 3 (30). https://doi.org/10.21105/joss.00774.

Ishihara, Shunichi. 2021. “Score-Based Likelihood Ratios for Linguistic Text Evidence with a Bag-of-Words Model.” Forensic Science International 327: 110980. https://doi.org/10.1016/j.forsciint.2021.110980.

Nini, Andrea. 2023. A Theory of Linguistic Individuality for Authorship Analysis. Elements in Forensic Linguistics. Cambridge, UK: Cambridge University Press.
