Measurement
Last updated on 2025-03-21 | Edit this page
Overview
Questions
- How are emissions measured and classified under the GHG protocol?
- How do I use the GHG protocol to estimate emissions from my use of HPC?
Objectives
- Understand how emissions are classified in the widely-used GHG protocol
- Know which GHG protocol classifications are relevant to use of HPC systems
- Learn a methodology to use GHG protocol to estimate the emissions associated with our use of HPC systems
- Understand how emission rates from HPC can be calculated
Introduction
The Greenhouse Gas (GHG) protocol is the most commonly-used method for organisations to measure their total carbon emissions. Understanding GHG scopes and how to measure your software against industry standards will help you see to what extent you are reducing carbon emissions and how that fits with wider activities to reduce emissions.
To complement the GHG protocol, and if yo develop software that is used by others, you can also use the Software Carbon Intensity (SCI) specification. While the GHG is a more generic measurement suitable for all types of organisations, the SCI is specifically for measuring a rate of software emissions and designed to incentivise the elimination of those emissions.
The GHG protocol
The Greenhouse Gas protocol is the most widely used and internationally recognized greenhouse gas accounting standard. 92% of Fortune 500 companies use the GHG protocol when calculating and disclosing their carbon emissions and it provides the basis of emissions reporting for most countries (including the UK). Using the GHG protocol allows us to compare our emissions from use of HPC systems to other sources of emissions in a quantitative way.
The GHG protocol divides emissions into three scopes:
- Scope 1: Direct emissions from operations owned or controlled by the reporting organisation, such as on-site fuel combustion or fleet vehicles.
- Scope 2: Indirect emissions related to emission generation of purchased energy, such as heat and electricity.
-
Scope 3: Other indirect emissions from all the
other activities you are engaged in. Scope 3 emissions are typically
split into two further categories: Upstream Emissions and
Downstream Emissions:
- Upstream Scope 3 Emissions: Includes all emissions from an organisation’s supply chain, e.g. emissions from manufacturing and shipping a product
- Downstream Scope 3 Emissions: Emissions resulting from the use of a product, e.g. the electricity customers may consume when using your product or waste output from the product
Scope 3, sometimes referred to as value chain emissions, is the most significant source of emissions and the most complex to calculate for many organisations. These encompass the full range of activities needed to create a product or service, from conception to distribution. In the case of a laptop, for example, every raw material used in its production emits carbon when being extracted and processed (part of the upstream scope 3 emissions). Value chain emissions also include emissions from the use of the laptop, meaning the emissions from the energy used to power the laptop after it has been sold to a customer (part of downstream scope 3 emissions).
Through this approach, it’s possible to sum up all the GHG emissions from every organization and person in the world and reach a global total.
What scope does my application fall into?
We have already seen how the GHG protocol asks us to bucket emissions from HPC system use according to scopes 1-3. But how do we actually do this?
Exercise: What scope for HPC emissions?
Throughout this course we have spoken about emissions from two different sources associated with our use of HPC: emissions from the electricity used to run our models/simulations on HPC systems and embodied emissions from the HPC system hardware. Given the definitions of scope 1-3 emissions given above, what scope do you think these two different sources of HPC system use emissions fall into?
- Emissions from electricity used: these would be classified as Scope 2 emissions
- Embodied emissions from HPC system hardware: these would be classified as Upstream Scope 3 emissions
HPC electricity: Scope 2 or Downstream Scope 3?
Whether the emissions from electricity use on HPC systems are Downstream Scope 3 or Scope 2 really depends on who is computing the emissions and for what purpose. From the viewpoint of the hardware vendor who sells and manufactures the HPC system, the electricity use falls into Downstream Scope 3 emissions but for operators and users of the HPC system they would classified as Scope 2 emissions. As we are approaching this subject as users of HPC systems we will always classify the emissions from our electricity use on HPC systems as Scope 2.
In short, Downstream Scope 3 emissions are not usually relevant for use of HPC systems as this part of the emissions is reported in the Scope 2 emissions.
How to calculate your HPC emissions
Quantifying the emissions from your work (and generating an emissions
rate, as described below) are critical steps on the path to reducing and
potentially eliminating emissions from your use of HPC systems. The
formula for calculating your emissions from use of HPC systems
(HPC-E
) is straightforward:
HPC-E = (E * I) + M
-
E
= Energy consumed by HPC use (in kWh) -
I
= Location-based marginal carbon intensity (in kgCO2/kWh) -
M
= Embodied emissions
You can calculate this on a per job basis or for a larger grouping of HPC use - even for a full lifetime of an HPC service.
Include failed jobs and jobs that did not produce useful output
While it can be tempting to only include the use of HPC that produced useful output in our emissions calculations this should be avoided. The true amount of emissions includes all of our HPC system use to get us to the results we use and so failed jobs (due to errors) or calculations that did not produce useful output must be included. On the positive side, one of the ways we can reduce our emissions from use of HPC systems is to be more careful and eliminate emissions arising from these types of non-productive jobs.
Instead of bucketing the carbon emissions of HPC use into scopes 1-3,
it buckets them into operational emissions (carbon
emissions from the electricity required for your HPC use, represented by
E * I
) and embodied emissions (carbon
emissions from the physical HPC system components, represented by
M
).
Follow these steps to calculate your HPC emissions.
- Gather your energy use - this can be measured or estimated and is often a combination of measured and estimated data
- Determine the carbon intensity at the location of the HPC system you are using
- Determine the embodied emissions associated with your use of HPC
- Compute your total HPC emissions
1. Gather energy use
Many HPC systems now provide energy use data for jobs run on the system. If this is the case, you can use these as the starting point for calculating your energy use of HPC resources. If this is not available, then you may need to estimate the energy use of your use of resources from component power draw. Even if you have energy use data available this may only cover energy use of compute nodes (or even processors on compute nodes) so you do typically have to do some estimation of power draw of other components to know how much extra energy to add on to include them in your calculation.
If you are really lucky, the HPC system staff will have done this calculation for you. As has been done for the UK National Supercomputing Service, ARCHER2: Estimating emissions from ARCHER2
We will cover two different ways to estimate the power draw of HPC systems which can then be used to compute energy use.
- Use the total power draw of the system
- Use the per-component power draw of the system
Heterogeneous systems
The methodologies outlined below all assume the compute nodes are heterogeneous. What do you do if this is not the case and some nodes on the system have GPU and others do not, for example? In this case, you should try, as much as possible, to treat each homogeneous partition as its own smaller HPC system to help calculate energy use.
a. Using total power draw
One of the simplest ways to estimate your energy use is to use the
total power draw of the HPC system and divide it by the number of
components to get a mean power draw per component that can be used to
estimate energy use. For example, if the total power draw of the system
is 250 kW and it contains 512 GPU, then the mean power draw per GPU is
250 kW / 512 GPU = 0.488 kW/GPU
. This, in turn, means the
energy used for 12 hours use of 2 GPU is
12 hours * 2 GPU * 0.488 kW/GPU = 11.7 kWh
.
You should use the component that you measure resource use in to compute the mean power draw. For example. if your usage is measured in GPUh, then compute the power draw per GPU; if your usage is measured in nodeh, compute the power draw per node.
b. Using per-component power draws
This approach requires more detailed information being available on the power draw of different components though measurement or from information from the vendors of the components. If you are getting your energy use from counters on the compute nodes (as is sometimes possible on HPC systems) then this approach allows you to estimate additional energy overheads that need to be added on in addition to the measured power draw.
We illustrate this approach using the estimates for the ARCHER2 HPC system:
Component | Count | Loaded power draw per unit (kW) | Loaded power draw (kW) | % Total | Notes |
---|---|---|---|---|---|
Compute nodes | 5,860 nodes | 0.41 | 2,400 | 85% | Measured by on system counters |
Interconnect switches | 768 switches | 0.24 | 240 | 9% | Measured by on system counters |
Lustre storage | 5 file systems | 8 | 40 | 1% | Estimate from vendor |
NFS storage | 4 file systems | 8 | 32 | 1% | Estimate from vendor |
Coolant distribution units | 6 CDU | 16 | 96 | 3% | Estimate from vendor |
Total | 2,808 | 99% |
In this case, we have a mix of data measured on the system (power
draw of the compute nodes and power draw of the interconnect switches)
and estimates from the vendor (storage systems and CDU). Here, the total
power draw is estimated at 2,808 kW, there are 5,860 compute nodes and
the unit of resource is nodeh so we can calculate the mean per node
power draw (including all the components in the table) in the same way
as we did for method (a) with
2,808 kW / 5,860 nodes = 0.480 kW/node
and use this to
compute energy consumption based on how many nodeh we use.
However, on ARCHER2 we also have the total compute node energy use available per job to users from the Slurm scheduler. The table above shows that the compute nodes contribute around 85% of the total power draw of ARCHER2 (of the components included) so an alternative method to compute the energy use is to use the measurement from the scheduler and add an additional 15% to cover the energy used by other components. This is, in fact, the methodology used for computing per job energy use on ARCHER2.
Add in energy from plant overheads
As well as the energy used by the system itself, there is also the energy used by the plant that supplies power and cooling to the HPC system. Different data centres have different sizes of overheads and this is given by PUE (Power Use Efficiency). For example, a PUE of 1.25 indicates that an additional 25% energy use is added on top of the system energy use to account for the plant.
The PUE will vary with outside weather conditions at the data centre. For the ARCHER2 example, PUE is typically less than 10% so, s a conservative estimate, they add an additional 10% energy use to the total to account for plant overheads.
So, for the ARCHER2 example, the process for computing your total energy use becomes:
- Measure total energy use from all jobs run
- Add 15% extra energy to cover energy use from other components
- Add another 10% energy use top of this new total to cover plant overheads
2. Determine local carbon intensity
Once you have your energy use then you need to convert this to emissions using the carbon intensity for the electricity supply for the HPC system. In most cases, HPC systems are powered by the energy grid and many energy grids provide details on the carbon intensity as a function of time.
As we saw earlier, for the UK, the carbon intensity is dependent on location and time. You can access the values through different web services but one that is commonly used is the Carbon Intensity API. Carbon intensity is reported for every region every 30 minute interval. To estimate your emissions you can either use the fine grained intensity matched to the run times of your HPC system use or use an aggregate value over a longer period. The aggregate value is a simpler choice for a first estimate. The table below shows the approximate average carbon intensities for the different regions of the UK national grid for 2024 ordered from lowest to highest.
Type | Regions | Carbon Intensity (gCO2e/kWh) |
---|---|---|
Low | N. Scotland, S. Scotland, N.E. England, N.W. England | 22 - 48 |
Low Medium | N. Wales | 77 |
Medium | E. England, London, W. Midlands, S.E. England, Yorkshire | 108 - 135 |
High Medium | S. England, E. Midlands | 186 - 203 |
High | S.W. England, S. Wales, | 242 - 255 |
3. Determine embodied emissions
Upstream Scope 3 emissions
Remember that we are considering only upstream Scope3 emissions here. The emissions from electricity use are captured in the Scope 2 emissions estimates.
Calculating the embodied emissions can be more difficult than the operational emissions from energy consumption as it can be more difficult to get information on embodied emissions associated with HPC hardware. You may, of course, be lucky and the HPC system you are using could already provide estimates of the embodied emissions which you can use!
If you need to estimate this yourself, the major contributors to embodied emissions are likely to be:
- Compute nodes
- Interconnect switches
- Storage
so these are the best place to start. Bear in mind that each HPC system is different so other components may need to be taken into account. As a rule of thumb, you should look at the HPC system to see which components there are lots of and use that as the starting place. Complex components (such as nodes, storage and switches) are likely to have much higher embodied emissions than simpler components (pumps, fans, cables etc.).
As an example, here is how we have estimated embodied emissions for ARCHER2:
Component | Count | Estimated kgCO2e per unit | Estimated kgCO2e | % Total Scope 3 | References |
---|---|---|---|---|---|
Compute nodes | 5,860 nodes | 1,100 | 6,400,000 | 84% | (1) |
Interconnect switches | 768 switches | 280 | 150,000 | 2% | (2) |
Lustre HDD | 19,759,200 GB | 0.02 | 400,000 | 6% | (3) |
Lustre SSD | 1,900,800 GB | 0.16 | 300,000 | 4% | (3) |
NFS HDD | 3,240,000 GB | 0.02 | 70,000 | 1% | (3) |
Total | 7,320,000 | 100% |
References:
- IRISCAST Final Report
- Estimate taken from IBM z16™ multi frame 24-port Ethernet Switch Product Carbon Footprint
- Tannu and Nair, 2023
Note that there is a large amount of uncertainty for Scope 3 emissions due to lack of high quality embodied emissions data. The number used for the compute node emissions is at the high end of estimated values for a CPU-only compute node and the actual value could be as much as 15% lower at around 900 kgCO2e/node. If the lower value is used, it reduces the overall estimated embodied emissions but does not significantly change the fraction of emissions attributed to the compute nodes.
Other embodied emissions sources
We have not included embodied emissions associated with the data centre buildings and plant in the analysis above. The IRISCAST report mentioned above provides an evaluation of these values. While the total embodied emissions can be high for these items, their long lifespan means that their contribution to the embodied emissions during the lifespan of a particular HPC system are generally much less significant than the embodied emissions from the HPC system hardware itself.
4. Compute your total HPC emissions
Now we should have all the data we need to compute our total emissions from HPC system use:
-
E
- Total energy used -
I
- Carbon intensity -
M
- Embodied emissions estimate
Remember the equation for computing total emissions from HPC system
use (HPC-E
):
HPC-E = (E * I) + M
we can plug the numbers in and come up with a value for the total emissions arising from our use of HPC.
E * I
on a per job basis
Rather than computing total energy use and then using an aggregate
value for the carbon intensity, it may make more sense to compute
E * I
on a per-job basis using the carbon intensity value
at the job time.
Exercise: Computing total emissions from HPC system use
You are using (or running) a GPU-based HPC service and a particular project has used the following amounts of resource over 3 months
- 1,100 GPUh
- 3,542,000 kWh
The total embodied emissions for the service are 6,500,000 kgCO2e, the service lifetime is 7 years and there are 1000 compute nodes each with 8 GPU. The service is hosted in a location with a carbon intensity of 40 gCO2e/kWh.
- Compute the scope 2 emissions for the project use
- Compute the scope 3 emissions rate in kgCO2e/GPUh
- Compute the scope 3 emissions for the project use
- Compute the total emissions for the project use
- Do scope 2 or scope 3 emissions dominate or are they evenly matched?
The scope 2 emissions from energy use by the project are given by
E * I
, the energy used multiplied by the carbon intensity of the electricity supply. In this case, this is given by3,542,000 kWh x 0.040 kgCO2e/kWh = 14,170 kgCO2e
.-
The scope 3 emissions rate per GPUh is the total scope 3 emissions for the service divided by number of GPUh available over the lifetime of the service.
- The total GPUh over the service lifetime is estimated by
7 years x 365 days x 24 hours x 1000 nodes x 8 GPU per node = 490,560,000 GPUh
. - The scope 3 emissions rate is given by
6,500,000 kgCO2e / 490,560,000 GPUh = 0.013 kgCO2e/GPUh
- The total GPUh over the service lifetime is estimated by
The scope 3 emissions for the project use is the number of GPUh used multiplied by the scope 3 emissions per GPUh:
1,100 GPUh * 0.013 kgCO2e/GPUh = 14.3 kgCO2e
Total emissions are scope 2 + scope 3 emissions:
14,170 kgCO2e + 14.3 kgCO2e = 14,184 kgCO2e
Scope 2 emissions (from electricity use) heavily dominate the emissions in this example.
HPC Carbon Intensity (HPC-CI) specification
Software Carbon Intensity (SCI)
The definition, application and calculation of the HPC-CI is heavily inspired by the Software Carbon Intensity (SCI) from the Green Software Foundation.
We now describe a methodology (the HPC Carbon Intensity specification, HPC-CI) to calculate your emissions from HPC system use and to encourage action towards eliminating emissions.
It is not a replacement for the GHG protocol, but an additional metric that helps you understand how your HPC system use can be measured in terms of carbon emissions so they can make more informed decisions. While the GHG protocol calculates the total emissions, the HPC-CI is about calculating the rate of emissions. In automotive terms, HPC-CI is more like a miles per gallon measurement and the GHG protocol is more like the total carbon footprint of a car manufacturer and all their cars they produce every year.
An important thing to note is that it is not possible to reduce your HPC-CI rate by purchasing offsets in the form of neutralisations, compensations, or by offsetting electricity in the form of renewable energy credits (we will cover this in more detail in the next section of the workshop). This means that HPC system use that makes no effort toward reducing emissions but spends money on carbon credits cannot reduce the associated HPC-CI rate.
Offsets are an essential component of any climate strategy; however, offsets are not eliminations and therefore are not included in the HPC-CI metric.
If you make your HPC use more energy efficient, hardware efficient, or carbon aware, your HPC-CI rate will decrease. The only way to reduce your rate is to invest time or resources into one of those three principles. As such, adopting the HPC-CI metric for your HPC use, will drive investment into one of the three pillars of green HPC use.
The HPC-CI equation
The equation to calculate an HPC-CI rate is simple and very closely related to the calculation of total emissions presented above:
HPC-CI = [(E * I) + M] per R
-
E
= Energy consumed by HPC use (in kWh) -
I
= Location-based marginal carbon intensity (in kgCO2/kWh) -
M
= Embodied emissions -
R
= Functional unit (e.g. iterations, simulated time, calculation cycles, research papers published, cost)
This yields an emissions rate in carbon emissions per functional unit
(HPC-E per R
), e.g. kgCO2e/iteration.
The steps to calculate your HPC-CI score are the similar as calculating your emissions described above with additional steps to produce the rate. Steps 1-4 are identical the methodology described above:
- Gather your energy use
- Determine the carbon intensity at the location of the HPC system you are using
- Determine the embodied emissions associated with your use of HPC
- Compute your total HPC emissions
- Select your functional unit (
R
) - Calculate your HPC-CI rate
5. Select your functional unit (R
)
As we have seen, the HPC-CI is a rate rather than a total and measures the intensity of emissions according to the chosen functional unit. The specification currently does not prescribe the functional unit and you are free to pick whichever best describes the output from your use of HPC systems. For example, this could be a metric from the software you use (ns simulated, number of years simulated, iterations) or a metric tied to research progress (number of compounds modelled, data points analysed). There may also be ideas in literature related to your research area of what a good choice may be. You may want to trial different functional units to see which one works best for your work.
As a concrete example, imagine that you are simulating the dynamics of a biomolecular system (using software such as GROMACS, Amber or NAMD) then you could well chose the number of ns simulated as your functional unit.
You can use multiple functional units simultaneously to have multiple HPC-CI values for your use of HPC systems. Different HPC-CI units may be more or less useful in different contexts.
6. Calculate HPC-CI rate
Now you have both the total emisssions for your use of HPC systems and the number of functional units arising from the same use of HPC systems you can calculate the HPC-CI by dividing the total emissions by the total number of functional units.
To continue our example of biomolecular simulation that we mentioned above, we would take the total emissions from our HPC system use - let us say this came out to be 1500 kgCO2e - and the total number of functional units - say 950 ns simulated - and combine them:
HPC-CI = 1500 kgCO2e / 950 ns = 1.58 kgCO2e/ns
Exercise: HPC-CI rate
In the previous exercise we computed the total HPC system emissions for 3 months of project use to be 14,184 kgCO2e. The project was modelling the climate and managed to simulate 3,680 years of Earth’s climate during that 3 month period using 1,100 GPUh of resource.
- What is the HPC-CI in kgCO2e per simulated year?
- What is the HPC-CI in kgCO2e per GPUh?
Given by:
14,184 kgCO2e / 3,680 simulated years = 3.85 kgCO2e/simulated year
Given by:
14,184 kgCO2e / 1,100 GPUh = 12.89 kgCO2e/GPUh
Estimating emissions associated with future use of HPC systems
As well as using the HPC-CI metric as a tool to help quantify and reduce your emissions from HPC system use it can also be used to project the emissions from HPC system use from future or planned projects. Many funding bodies are starting to ask for emissions estimates as part of the submission process. Calculating HPC-CI values can help you provide these estimates.
Now we understand how to estimate our emissions (HPC-E) and how to define a useful metric to understand our emissions rate linked to some concrete output from our use of HPC systems (HPC-CI). The next section looks at how emissions can be reduced and, ideally, eliminated.
Key Points
- The GHG protocol is a metric for measuring an organisation’s total carbon emissions and is used by organisations all over the world.
- The GHG protocol puts carbon emissions into three scopes. Scope 3, also known as value chain emissions, refers to the emissions from organisations that supply others in a chain. In this way, one organisation’s scope 1 and 2 will sum up into another organization’s scope 3.
- You can use the GHG protocol to estimate your emissions from HPC system use but it requires access to good quality information from the HPC systems you are using.
- The HPC-CI is a metric designed specifically to calculate emissions from HPC systems and is a rate rather than a total. This can be used to measure improvements in emissions efficiency and drive reductions in emissions.