Putting it all together
Overview
Teaching: 0 min
Exercises: 0 minQuestions
What will your journey to be FAIR productive entail?
Objectives
Discuss steps and changes in your habits you will take after this course
List of resources that will help you achieve these goals
For the past three days we have discussed how you can embark on a journey to be FAIR, we also showed you tools you can use in your daily life and throughout the research data life cycle to make your data more: Findable, Accessible, Interoperable and Reproducible
Simple things like thinking about the metadata that can accompany your data, use of templates and preparation of your data for future sharing can help in achieving Open Research and better FAIR practices. We now want to hear from you what you will change after this course to be more FAIR:
Exercise 1: Your journey to be FAIRproductive
What are the things you are going to change in your work habits, or actions you are going to take after this course?
Solution
- For each paper I publish I will make a Zenodo deposit with all the data files
- I will use GitHub for all my code instead of using folders
- I will start using ELNs
- Write a data management plan for my project
- Reorganise my folder structure
- Use templates
- Start using readme files (make a readme template)
- Write protocols on protocols.io
- Back up my data accordingly
- Think as a user of my data instead of just a creator
- Start using Jupyter notebooks, or RMarkdown to describe analysis workflows
- Write intelligible metadata
As you can see, anything counts. Even simple things like reorganisation of your folder structure for a better overview or to stop using USB sticks as backup material are steps towards FAIR. We have collated here the list of resources throughout the course for a better overview which will hopefully help you to guide future practices.
Resources
The BioRDM team has a lot of information about the here taught course material on their BioRDM wiki.
- Details of funding bodies and their involvement and requirements can be found at Plan S/cOAlition S.
- cOAlition S journal checker tool to assess compliance being developed.
- The Directory of Open Access Journals (DOAJ) is a tool to find which journals are Open Access
- Challenges & benefits of OS
- Centre for Open Science
- Ted talk supporting OS
- Wiki Open Science
- European Open Science Cloud
- Science is necessarily collaborative - The Biochemist article
- Digital Object Identifiers (DOI)
- ORCID is a free, unique, persistent identifier that you own and control—forever
- Minimum Information for Biological and Biomedical Investigations (MIBBI) is the collection of the most known standards
- FAIRSharing offers excellent search service for finding standards
- Metadata - How to FAIR
- Good enough practices in scientific computing (Wilson et al., 2017)
- Good Enough Practices for Scientific Computing
- Organising your data
- Organising files and folders
- File naming
- Library Carpentry FAIR Data
- Why Jupyter is a data scientists’ computational notebook of choice
- Coding Standards best practices
- Reproducible analysis and Research Transparency
- DMP Online is a UK tool that is available via subscription to many UK Universities and Institutions
- BioRDM wiki DMPs
- Benefits of writing a DMP - UoE RDS
Repositories and Ontologies
Data agnostic repositories
There are general “data agnostic” repositories, for example:
Or domain specific, for example:
- UniProt protein data,
- GenBank sequence data,
- MetaboLights metabolomics data
- GitHub for code.
Similar to ORCID, there are other registries that can be used to identify many of biological concepts and entities:
- species e.g. NCBI taxonomy
- chemicals e.g. ChEBI
- proteins e.g. UniProt
- genes e.g. GenBank
- metabolic reactions, enzymes e.g KEGG
BioPortal or NCBI
are good places to start searching for a registry or a term.Here some links to help you find the right ontologies for your research:
- Finding ontologies: https://bioportal.bioontology.org/
- List of recommended ontologies: http://www.obofoundry.org/
Licences we recommend using
Licenses explicitly declare conditions and terms by which data and software can be re-used. Here, we recommend:
- for data Creative Commons Attribution (CC BY) license
- for code a permissive open source license such as the MIT, BSD, or Apache license.
List of ELNs and online protocols
- Benchling
- Procols.io
- Colabra (free version available)
- SciNote (free version available)
- Rspace (trial possible)
- Labarchives (trial possible contacting the company)
- WikiBench (works on top of Confluence)
ELN Resources
- Comprehensive summary of ELNs on the University of Edinburgh Wiki by BioRDM
- Our Benchling tutorial: BioRDM Wiki
- Benchling’s training kit for academics
- ELN Guide
- Colabra’s lab management course
Other useful courses
You can explore more courses developed by the Carpentries Incubator
Key Points
There are simple steps to help make your data more FAIR throughout the research data lifecycle
Implement these steps throughout to keep track of your data and changes