Putting it all together

Overview

Teaching: 0 min
Exercises: 0 min

Questions

What will your journey to be FAIR productive entail?

Objectives

Discuss steps and changes in your habits you will take after this course

List of resources that will help you achieve these goals

For the past three days we have discussed how you can embark on a journey to be FAIR, we also showed you tools you can use in your daily life and throughout the research data life cycle to make your data more: Findable, Accessible, Interoperable and Reproducible

Simple things like thinking about the metadata that can accompany your data, use of templates and preparation of your data for future sharing can help in achieving Open Research and better FAIR practices. We now want to hear from you what you will change after this course to be more FAIR:

Exercise 1: Your journey to be FAIRproductive

What are the things you are going to change in your work habits, or actions you are going to take after this course?

Solution

For each paper I publish I will make a Zenodo deposit with all the data files

I will use GitHub for all my code instead of using folders

I will start using ELNs

Write a data management plan for my project

Reorganise my folder structure

Use templates

Start using readme files (make a readme template)

Write protocols on protocols.io

Back up my data accordingly

Think as a user of my data instead of just a creator

Start using Jupyter notebooks, or RMarkdown to describe analysis workflows

Write intelligible metadata

As you can see, anything counts. Even simple things like reorganisation of your folder structure for a better overview or to stop using USB sticks as backup material are steps towards FAIR. We have collated here the list of resources throughout the course for a better overview which will hopefully help you to guide future practices.

Resources

The BioRDM team has a lot of information about the here taught course material on their BioRDM wiki.

Details of funding bodies and their involvement and requirements can be found at Plan S/cOAlition S.

cOAlition S journal checker tool to assess compliance being developed.

The Directory of Open Access Journals (DOAJ) is a tool to find which journals are Open Access

Challenges & benefits of OS

Centre for Open Science

Ted talk supporting OS

Wiki Open Science

European Open Science Cloud

Science is necessarily collaborative - The Biochemist article

Digital Object Identifiers (DOI)

ORCID is a free, unique, persistent identifier that you own and control—forever

Minimum Information for Biological and Biomedical Investigations (MIBBI) is the collection of the most known standards

FAIRSharing offers excellent search service for finding standards

Metadata - How to FAIR

Good enough practices in scientific computing (Wilson et al., 2017)

Good Enough Practices for Scientific Computing

Organising your data

Organising files and folders

File naming

Library Carpentry FAIR Data

Why Jupyter is a data scientists’ computational notebook of choice

Coding Standards best practices

Reproducible analysis and Research Transparency

DMP Online is a UK tool that is available via subscription to many UK Universities and Institutions

BioRDM wiki DMPs

Benefits of writing a DMP - UoE RDS

Repositories and Ontologies

Data agnostic repositories

There are general “data agnostic” repositories, for example:

Dryad,

Zenodo,

FigShare,

Dataverse.

Or domain specific, for example:

UniProt protein data,

GenBank sequence data,

MetaboLights metabolomics data

GitHub for code.

Similar to ORCID, there are other registries that can be used to identify many of biological concepts and entities:

species e.g. NCBI taxonomy

chemicals e.g. ChEBI

proteins e.g. UniProt

genes e.g. GenBank

metabolic reactions, enzymes e.g KEGG

BioPortal or NCBI
are good places to start searching for a registry or a term.

Here some links to help you find the right ontologies for your research:

Finding ontologies: https://bioportal.bioontology.org/

List of recommended ontologies: http://www.obofoundry.org/

Licences we recommend using

Licenses explicitly declare conditions and terms by which data and software can be re-used. Here, we recommend:

for data Creative Commons Attribution (CC BY) license

for code a permissive open source license such as the MIT, BSD, or Apache license.

List of ELNs and online protocols

Benchling

Procols.io

Colabra (free version available)

SciNote (free version available)

Rspace (trial possible)

Labarchives (trial possible contacting the company)

WikiBench (works on top of Confluence)

ELN Resources

Comprehensive summary of ELNs on the University of Edinburgh Wiki by BioRDM

Our Benchling tutorial: BioRDM Wiki

Benchling’s training kit for academics

ELN Guide

Colabra’s lab management course

Other useful courses

You can explore more courses developed by the Carpentries Incubator

Data Cleaning with OpenRefine for Ecologists

Open Science with R

Key Points

There are simple steps to help make your data more FAIR throughout the research data lifecycle

Implement these steps throughout to keep track of your data and changes

previous episode

FAIR in (biological) practice

next episode

Putting it all together

Overview

Exercise 1: Your journey to be FAIRproductive

Solution

Resources

Repositories and Ontologies

Data agnostic repositories

Or domain specific, for example:

Here some links to help you find the right ontologies for your research:

List of ELNs and online protocols

ELN Resources

Other useful courses

Key Points

previous episode

next episode