title | parent | nav_order |
---|---|---|
FAQ |
About the Playbook Template |
6 |
The answers to these questions may vary based on the resources, policies, and capacities of different organizations. Always refer to your institution’s guidelines and data management policies for specifics.
- This depends on your research. If you're re-using data, ensure it is properly cited and conforms to data-sharing agreements. If you're generating new data, detail the types, formats, and volumes of data expected, such as species occurrence records or environmental data.
- Estimate your storage needs early on. For example, raw sequencing data may generate several terabytes (TB) of data. Image and sound files can vary significantly in size based on resolution and length. Your institution should have guidance on where to store this data, such as on institutional servers or cloud-based solutions.
- Essential documentation may include a Data Management Plan (DMP), README files, and codebooks. Metadata standards like Ecological Metadata Language (EML) and Darwin Core (DwC) are common for environmental and biodiversity data.
- Data storage options include institutional networked storage, cloud-based services, or repositories like Zenodo, GitHub for smaller datasets. Backup procedures should be clearly defined, ensuring data is regularly copied to secure storage locations. Also check with your national resources as well (often provided by a science foundation or a similar organization).
- If handling personal data, ensure compliance with legal frameworks like GDPR. Define data ownership clearly, especially when collaborating with multiple institutions. Institutional data-sharing agreements should outline how IP is protected.
- Select data for long-term preservation based on scientific value, uniqueness, and reusability. Data repositories such as GBIF and ENA can be used, with licensing arrangements like Creative Commons (CC-BY).
- Data such as species occurrence or environmental data will often be made available under licenses that encourage re-use, like Creative Commons. Ensure this data is well-documented and accessible through appropriate repositories.
- Data management costs can include data curation, repository fees, and training. Some costs, such as cloud storage or software licenses, can be minimized by using open-source tools or institutional resources. Check with your organization's data and ICT support for guidance.