Summary and Schedule
This short course teaches tools and practices for producing and sharing quality, sustainable and FAIR (Findable, Accessible, Interoperable and Reusable) research software to support open and reproducible research. The course can be delivered over 2 full or 4 half days.
Target audience
- Post-graduate students, early career researchers or junior Research Software Engineers (RSEs) who are starting their research or software projects, have foundational knowledge of Python, version control and using software tools from command line shell, and want to develop software to support their research using established best practices
- Researchers or scientists who had foundational software training before but wish to refresh, reinforce or improve their skills and practices in the wider context of FAIR research and sharing and writing software for open and reproducible research
Check out a few example learner profiles, to see if this course is a right fit for you.
Prerequisites
Foundational knowledge of the following is required to be able to understand code examples used in the course:
- Python used to write scientific code
- Version control with Git
- Working in a command line interface (shell)
Attending a Software Carpentry workshop or a similar course will help you gain the skills and experience needed.
Please also make sure you have all the required software installed before attending this course.
Learning objectives
After attending this training, you will be able to:
- List challenges typically faced by researchers developing software and managing data for modern computational, reproducible research, including those commensurate with the FAIR (Findable, Accessible, Interoperable, Reusable) principles.
- Build on top of your existing knowledge of Python, Git and command line computing to enhance your research software development workflow with some good open and reproducible research software practices around structuring, writing, documenting, testing, sharing and reusing code (including software licencing and citation).
Acknowledgements
This course was originally developed by the UK’s Software Sustainability Institute and funded by the UK Reproducibility Network (UKRN). See CITATION.cff for the full list of authors.
Setup Instructions | Download files required for the lesson | |
Duration: 00h 00m | 1. Course introduction |
What is open and reproducible research? Why are these practices important, in particular in the context of software used to support such research? |
Duration: 00h 20m | 2. Better start with a software project |
What is a version control system? Why is version control essential to building good software What does a standard version control workflow look like? |
Duration: 01h 20m | 3. Reproducible software environments |
What are virtual environments in software development and why use
them? How can we manage Python virtual coding environments and external (third-party) libraries on our machines? |
Duration: 01h 50m | 4. Code readability |
Why does code readability matter? How can I organise my code to be more readable? What types of documentation can I include to improve the readability of my code? |
Duration: 03h 20m | 5. Code structure |
How can we best structure code? What is a common code structure (pattern) for creating software that can read input from command line? What are conventional places to store data, code, results, tests, auxiliary information and metadata within our software or research project? |
Duration: 04h 50m | 6. Code correctness |
How can we verify that our code is correct? How can we automate our software tests? What makes a “good” test? Which parts of our code should we prioritise for testing? |
Duration: 06h 20m | 7. Software documentation |
How should we document our code? Why are documentation and repository metadata important? What are the minimum elements of documentation needed? |
Duration: 07h 50m | 8. Open software management & collaboration |
How do I ensure my code is citable? How do we track issues with code in GitHub? How can we ensure that multiple developers can work on the same files simultaneously? |
Duration: 09h 20m | 9. Wrap-up |
What are the FAIR principles? How can FAIR principles help us develop better research software? What are the wider research software development principles in the context of you team, peers and the world? |
Duration: 10h 05m | Finish |
The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.
To go through the course material on your own or at a workshop, you will need the following software installed and working correctly on your system:
-
Command
line terminal (shell) (such as Bash,
Zsh or Git Bash)
- Git version control tool
- Python 3
- Visual Studio Code (VS Code) integrated development environment (IDE)
You will also need to create a GitHub account if you do not have one already, make sure that you are able to log into it, and download the Spacewalks data and analysis code which we will be used for exercises in the course.
Please follow the installation instructions to install the above tools and set up for the course.