Introduction to FAIR Data Management for Long-term Agriculture Experiments
Good research data management for LTE data matters because
- LTE data is costly to produce
- LTE data is unique and cannot be reproduced
- LTE data takes many years to generate
- Good RDM is necessary to ensure the continuity of data over time
- Good RDM is necessary to ensure the accessibility of data over time
- Good RDM is necessary to ensure the usability and interpretation of data over time
FAIR Principles for long-term agricultural experiments data
- Applying FAIR Principles make it easier to find research data.
- Following FAIR principles makes you start to follow best practices for research data management.
- FAIR data is not open data, but open data should be FAIR data.
- Good research data management through adopting FAIR principles has a cost.
Metadata for long-term experiments
- Metadata provides essential context for understanding a dataset
- To be reusable, metadata should be consistent across datasets of the same type
Organising data for long-term experiments
- Humans and computers can interpret data in Excel differently.
- Be aware of common problems using Excel for managing data.
- Excel is not a database.
Making LTE data FAIR
- Providing a well described table makes it easier for researchers to understand what data it contains.
- Standardising table structures and using open data formats make it easier to use the data in statistical packages.
- Semantic annotation allows different datasets to be combined on common concepts.
databases-for-lte-data
- Use
.md
files for episodes when you want static content - Use
.Rmd
files for episodes when you need to generate output - Run
sandpaper::check_lesson()
to identify any issues with your lesson - Run
sandpaper::build_lesson()
to preview your lesson locally
Publishing LTE data
- Data repositories provide features supporting FAIR data.
- Published datasets are immutable - the data must not be altered.
- Published datasets are versioned.