Coding conventions and modular coding
Last updated on 2025-12-19 | Edit this page
Estimated time: 75 minutes
Overview
Questions
- Why should you follow software code style conventions?
- How can formatters and linters help you with this in Python and R?
- How can nested code be targeted and improved through modularization?
- How can I write a new function in R?
Objectives
- Know how to write readable code
- Know how to write modular code
Coding formatting and linting
Readable code - for others and our future selves - should be descriptive, cleanly and consistently formatted, and use sensible, descriptive names for variables, functions and modules. Furthermore, our code should be efficient and well designed.
Programs called formatters and linters can help us with the above. What do they do and how do they differ?
Formatters
With a light touch, formatters ensure the code adheres to standard style guides without altering functionality. A style guide is a set of conventions that we agree upon with our colleagues or community, to ensure that people produce code which looks similar in style. For examples, check out the PEP 8 Style Guide or the Tidyverse style guide for Python and R, respectively.
Good, consistent formatting makes the code easier to read and can prevent merge conflicts.
Linters
The work of “linters” is a bit more heavy-handed. These programs can check your code for common programming errors, convoluted syntax, unnecessary complexity, unused code, performance bottlenecks and other violations of best practices. Often they can fix the issues they encounter, or suggest appropriate improvements otherwise.
Improving your code
Challenge
Install and use a formatter and a linter to improve the style of your code.
(Optional) Git pre-commit hooks
So far, formatting and linting were conscious choices: you have to
remember to execute them yourself every once in a while. A more robust
approach would be to take away this mental load and automate linting and
formatting. This can be achieved through “git hooks”, which are a set of
scripts git can run every time a certain action is
performed. Here we have a look at “pre-commit hooks” that check your
code changes before you commit them.
Challenge
The amount of git pre-commit hook scripts can grow rather large on
bigger projects. pre-commit manages your commit hooks and
helper programs in a declarative way and makes them easy to share
between collaborators.
Use a git pre-commit hook to run formatters and linters
every time before a git commit.
⚠️ Failed pre-commit checks will also fail the commit! Make sure to resolve the issues - be it fixing the code, or excluding certain lines from being checked - and commit again.
Further reading: CI/CD
We went from running formatting and linting locally to automating the checks for every commit. This still leaves a margin for error since your collaborators will need to set up the git hooks first. The next line of defense against messy contributions is CI/CD (Continuous Integration / Continuous Delivery).
It goes beyond the scope of this module, but you can set up your remote repository such that every push (or pull/merge request) triggers a set of checks. For GitHub it is named GitHub Actions, while GitLab just calls it CI/CD. It is useful beyond just code style; other common usecases include automated testing (even on different platforms, like Linux, Windows or MacOS), compilation and packaging, deployment, benchmarking, security checks… There is a whole marketplace for reusing CI/CD recipes.
Usually the tools you use will supply instructions on how to run them
in this automated way. Check out ruff
integrations for Python, or lintr
integrations and styler
integrations for R. Both rely
on the usethis package.
Modular coding
What is modularity?
Modularity refers to the practice of building software from smaller, self-contained, and independent elements. Each element is designed to handle a specific set of tasks, contributing to the overall functionality of the system.
Modular coding is explained in more detail in these slides.
Writing functions
One of the best ways to improve your code and to make it more modular is to write functions. Functions allow you to automate common tasks in a more powerful and general way than copy-and-pasting. Writing a function has four big advantages over using copy-and-paste:
You can give a function an evocative name that makes your code easier to understand.
As requirements change, you only need to update code in one place, instead of many.
You eliminate the chance of making incidental mistakes when you copy and paste (i.e. updating a variable name in one place, but not in another).
It makes it easier to reuse work from project-to-project, increasing your productivity over time.
A good rule of thumb is to consider writing a function whenever you’ve copied and pasted a block of code more than twice (i.e. you now have three copies of the same code).
Defining a function
Challenge: Identify code that can be put in a function
In your own project: identify code that would fit better in a function. Try to look for pieces of code that you repeat throughout your project.
Create an issue in your project for each possible function that you find. (Actually implementing the function is beyond the scope of this workshop).
GitHub issues are a good way to track your progress and to-do list. As well as a way for others to signal issues with your code.
(Optional) Modularity in Python
Challenge
Carefully review the following code snippet:
PYTHON
def convert_temperature(temperature, unit):
if unit == "F":
# Convert Fahrenheit to Celsius
celsius = (temperature - 32) * (5 / 9)
if celsius < -273.15:
# Invalid temperature, below absolute zero
return "Invalid temperature"
else:
# Convert Celsius to Kelvin
kelvin = celsius + 273.15
if kelvin < 0:
# Invalid temperature, below absolute zero
return "Invalid temperature"
else:
fahrenheit = (celsius * (9 / 5)) + 32
if fahrenheit < -459.67:
# Invalid temperature, below absolute zero
return "Invalid temperature"
else:
return celsius, kelvin
elif unit == "C":
# Convert Celsius to Fahrenheit
fahrenheit = (temperature * (9 / 5)) + 32
if fahrenheit < -459.67:
# Invalid temperature, below absolute zero
return "Invalid temperature"
else:
# Convert Celsius to Kelvin
kelvin = temperature + 273.15
if kelvin < 0:
# Invalid temperature, below absolute zero
return "Invalid temperature"
else:
return fahrenheit, kelvin
elif unit == "K":
# Convert Kelvin to Celsius
celsius = temperature - 273.15
if celsius < -273.15:
# Invalid temperature, below absolute zero
return "Invalid temperature"
else:
# Convert Celsius to Fahrenheit
fahrenheit = (celsius * (9 / 5)) + 32
if fahrenheit < -459.67:
# Invalid temperature, below absolute zero
return "Invalid temperature"
else:
return celsius, fahrenheit
else:
return "Invalid unit"
Refactor the code by extracting functions without altering its functionality.
- What functions did you create?
- What strategies did you use to identify them?
Share your answers in the collaborative document.
PYTHON
def celsius_to_fahrenheit(celsius):
"""
Converts a temperature from Celsius to Fahrenheit.
Args:
celsius (float): The temperature in Celsius.
Returns:
float: The temperature in Fahrenheit.
"""
return (celsius * (9 / 5)) + 32
def fahrenheit_to_celsius(fahrenheit):
"""
Converts a temperature from Fahrenheit to Celsius.
Args:
fahrenheit (float): The temperature in Fahrenheit.
Returns:
float: The temperature in Celsius.
"""
return (fahrenheit - 32) * (5 / 9)
def celsius_to_kelvin(celsius):
"""
Converts a temperature from Celsius to Kelvin.
Args:
celsius (float): The temperature in Celsius.
Returns:
float: The temperature in Kelvin.
"""
return celsius + 273.15
def kelvin_to_celsius(kelvin):
"""
Converts a temperature from Kelvin to Celsius.
Args:
kelvin (float): The temperature in Kelvin.
Returns:
float: The temperature in Celsius.
"""
return kelvin - 273.15
def check_temperature_validity(temperature, unit):
"""
Checks if a temperature is valid for a given unit.
Args:
temperature (float): The temperature to check.
unit (str): The unit of the temperature. Must be "C", "F", or "K".
Returns:
bool: True if the temperature is valid, False otherwise.
"""
abs_zero = {"C": -273.15, "F": -459.67, "K": 0}
if temperature < abs_zero[unit]:
return False
return True
def check_unit_validity(unit):
"""
Checks if a unit is valid.
Args:
unit (str): The unit to check. Must be "C", "F", or "K".
Returns:
bool: True if the unit is valid, False otherwise.
"""
if not unit in ["C", "F", "K"]:
return False
return True
def convert_temperature(temperature, unit):
"""
Converts a temperature from one unit to another.
Args:
temperature (float): The temperature to convert.
unit (str): The unit of the temperature. Must be "C", "F", or "K".
Returns:
tuple: A tuple containing the converted temperature in Celsius and Kelvin units.
Raises:
ValueError: If the unit is not "C", "F", or "K".
ValueError: If the temperature is below absolute zero for the given unit.
Examples:
>>> convert_temperature(32, "F")
(0.0, 273.15)
>>> convert_temperature(0, "C")
(32.0, 273.15)
>>> convert_temperature(273.15, "K")
(0.0, -459.67)
"""
if not check_unit_validity(unit):
raise ValueError("Invalid unit")
if not check_temperature_validity(temperature, unit):
raise ValueError("Invalid temperature")
if unit == "F":
celsius = fahrenheit_to_celsius(temperature)
kelvin = celsius_to_kelvin(celsius)
return celsius, kelvin
if unit == "C":
fahrenheit = celsius_to_fahrenheit(temperature)
kelvin = celsius_to_kelvin(temperature)
return fahrenheit, kelvin
if unit == "K":
celsius = kelvin_to_celsius(temperature)
fahrenheit = celsius_to_fahrenheit(celsius)
return celsius, fahrenheit
if __name__ == "__main__":
print(convert_temperature(0, "C"))
print(convert_temperature(0, "F"))
print(convert_temperature(0, "K"))
print(convert_temperature(-500, "K"))
print(convert_temperature(-500, "C"))
print(convert_temperature(-500, "F"))
print(convert_temperature(-500, "B"))
PYTHON
class TemperatureConverter:
"""
A class for converting temperatures between Celsius, Fahrenheit, and Kelvin.
"""
def __init__(self):
"""
Initializes the TemperatureConverter object with a dictionary of absolute zero temperatures for each unit.
"""
self.abs_zero = {"C": -273.15, "F": -459.67, "K": 0}
def celsius_to_fahrenheit(self, celsius):
"""
Converts a temperature from Celsius to Fahrenheit.
Args:
celsius (float): The temperature in Celsius.
Returns:
float: The temperature in Fahrenheit.
"""
return (celsius * (9 / 5)) + 32
def fahrenheit_to_celsius(self, fahrenheit):
"""
Converts a temperature from Fahrenheit to Celsius.
Args:
fahrenheit (float): The temperature in Fahrenheit.
Returns:
float: The temperature in Celsius.
"""
return (fahrenheit - 32) * (5 / 9)
def celsius_to_kelvin(self, celsius):
"""
Converts a temperature from Celsius to Kelvin.
Args:
celsius (float): The temperature in Celsius.
Returns:
float: The temperature in Kelvin.
"""
return celsius + 273.15
def kelvin_to_celsius(self, kelvin):
"""
Converts a temperature from Kelvin to Celsius.
Args:
kelvin (float): The temperature in Kelvin.
Returns:
float: The temperature in Celsius.
"""
return kelvin - 273.15
def check_temperature_validity(self, temperature, unit):
"""
Checks if a given temperature is valid for a given unit.
Args:
temperature (float): The temperature to check.
unit (str): The unit to check the temperature against.
Returns:
bool: True if the temperature is valid for the unit, False otherwise.
"""
if temperature < self.abs_zero[unit]:
return False
return True
def check_unit_validity(self, unit):
"""
Checks if a given unit is valid.
Args:
unit (str): The unit to check.
Returns:
bool: True if the unit is valid, False otherwise.
"""
if unit not in ["C", "F", "K"]:
return False
return True
def convert_temperature(self, temperature, unit):
"""
Converts a temperature from one unit to another.
Args:
temperature (float): The temperature to convert.
unit (str): The unit of the temperature.
Returns:
tuple: A tuple containing the converted temperature in the other two units.
"""
if not self.check_unit_validity(unit):
raise ValueError("Invalid unit")
if not self.check_temperature_validity(temperature, unit):
raise ValueError("Invalid temperature")
if unit == "F":
celsius = self.fahrenheit_to_celsius(temperature)
kelvin = self.celsius_to_kelvin(celsius)
return celsius, kelvin
if unit == "C":
fahrenheit = self.celsius_to_fahrenheit(temperature)
kelvin = self.celsius_to_kelvin(temperature)
return fahrenheit, kelvin
if unit == "K":
celsius = self.kelvin_to_celsius(temperature)
fahrenheit = self.celsius_to_fahrenheit(celsius)
return celsius, fahrenheit
if __name__ == "__main__":
converter = TemperatureConverter()
print(converter.convert_temperature(0, "C"))
print(converter.convert_temperature(0, "F"))
print(converter.convert_temperature(0, "K"))
print(converter.convert_temperature(-500, "K"))
print(convert_temperature(-500, "C"))
print(convert_temperature(-500, "F"))
print(convert_temperature(0, "X"))
(Optional): Writing good functions in R
Challenge 1
Write a function called kelvin_to_celsius() that takes a
temperature in Kelvin and returns that temperature in Celsius.
Hint: To convert from Kelvin to Celsius you subtract 273.15
Write a function called kelvin_to_celsius that takes a
temperature in Kelvin and returns that temperature in Celsius
R
kelvin_to_celsius <- function(temp) {
celsius <- temp - 273.15
return(celsius)
}
Combining functions
The real power of functions comes from mixing, matching and combining them into ever-larger chunks to get the effect we want.
Let’s define two functions that will convert temperature from Fahrenheit to Kelvin, and Kelvin to Celsius:
R
fahr_to_kelvin <- function(temp) {
kelvin <- ((temp - 32) * (5 / 9)) + 273.15
return(kelvin)
}
kelvin_to_celsius <- function(temp) {
celsius <- temp - 273.15
return(celsius)
}
Challenge 2
Define the function to convert directly from Fahrenheit to Celsius, by reusing the two functions above (or using your own functions if you prefer).
Define the function to convert directly from Fahrenheit to Celsius, by reusing these two functions above
R
fahr_to_celsius <- function(temp) {
temp_k <- fahr_to_kelvin(temp)
result <- kelvin_to_celsius(temp_k)
return(result)
}
The Modular coding section is based on the following sources:
- Modular Code Development from Good practices in research software development
- Functions explained from R for Reproducible Scientific Analysis Software Carpentry lesson
- Functions chapter from R for Data Science (2e)
- Coding conventions help you create more readable code that is easier to reuse and contribute to.
- Consistently formatted code including descriptive variable and function names is easier to read and write
- Software is built from smaller, self-contained elements, each handling specific tasks.
- Modular code enhances robustness, readability, and ease of maintenance.
- Modules can be reused across projects, promoting efficiency.
- Good modules perform limited, defined tasks and have descriptive names.