Training documents for the OBON projects meeting in November 2024.
This training is meant for eDNA data providers interested in the formatting and addition of eDNA data to the OBIS database. The training runs through simple steps in adding eDNA data. In addition, we will try out how to access and work with eDNA data already in OBIS.
The example eDNA dataset is available at in the dataset folder. We are going to process this dataset to a darwin core archive, including all of the information available in this folder. As is typically the case for amplicon data, the data is provided in multiple tables:
- An otu table
- A taxonomy table
- A sample information table
- A fasta file for the sequences
An example R markdown solution to the data cleaning exercise is also available. To start with your own work, you can clone the repository to your computer, and copy the solution "pieter.Rmd", to a file with your own name.
- The DNADerivedData extension
- Download the repository to work with the dataset
- Process the eDNA data to Darwin Core
- Access and work with eDNA data from OBIS
To prepare for training (in case no internet available), clone this repository to your computer. Make sure that the following libraries are installed:
if (!requireNamespace("pacman", quietly = TRUE)) install.packages("pacman")
pacman::p_load(robis, dplyr, ggplot2, tidyr, knitr, ggtree, mapview, sf, RColorBrewer, rmarkdown, readxl, rmarkdown, lubridate, purrr, leaflet, Biostrings, dwcawriter, devtools, obistools, worrms)
remotes::install_github("pieterprovoost/r-dwca-writer")