Available PhD Projects 

ExaGEO equips students with the skills, knowledge, and principles of exascale computing — drawing from geoscience, computer science, statistics, and computational engineering — to tackle some of the most pressing challenges in Earth and environmental sciences and computational research. Students will work under expert supervision in the following fields:

  • Atmosphere, hydrosphere, cryosphere, and ecosystem processes and evolution
  • Geodynamics, geoscience and environmental change
  • Geologic hazard analysis, prediction and digital twinning
  • Sustainability solutions in engineering, environmental, and social sciences

Each student will be positioned within a multidisciplinary supervisory team: one computational supervisor, one domain expert, and one supervisor from an Earth, environmental, and/or social science research background. This ‘team-based’ supervisory approach is designed to enhance multidisciplinary training.

Please note that some projects may currently have incomplete supervisory teams; however, the full teams will be finalised before the start of the PhD.

Overview of the ExaGEO Student Experience

Project Selection and Information

  • You must apply for three projects. Each project has two variations (‘teaser projects’). During your first year, after working on both teaser projects (under the same supervisory team), you will select the one that best aligns with your interests. For further information on how this process works, please see the FAQs section on our Apply page.
  • Your PhD institution will be determined by the Principal Supervisor’s institutional affiliation.
  • You can apply for projects at different institutions. 
  • Projects are grouped by research field.
  • If you have any queries regarding a specific project, please contact the supervisor listed first (this will be the Principal Supervisor). 
  • Projects are funded via ExaGEO; this includes fees, stipends and a Research Training Support Grant. For further information, please see our Apply page.

 

Projects with a focus on Atmosphere, Hydrosphere, Cryosphere, and Ecosystem Processes and Evolution:

 

  • A Dangerous Duo: Exploring the Impact of Heatwaves on Air Pollution

    Project institution:
    Project supervisor(s):
    Prof Ryan Hossaini (Lancaster University), Dr Andrea Mazzeo (Lancaster University), Mr Michael Thomas (Reliable Insights), Dr Emma Eastoe (Lancaster University), Dr James Keeble (Lancaster University) and Dr Helen Macintyre (UK Health Security Agency)

    Overview and Background

    Heatwaves (i.e. sustained periods of exceptionally hot weather) are a well-recognised public health hazard. Strong evidence shows that the co-occurrence of extreme air pollution events during heatwaves amplifies health risks [1,2]. The 2022 European heatwave, when the UK recorded its first ever temperature >40°C, was accompanied by a widespread deterioration in air quality, with air pollutants at ground level exceeding safe limits across much of the continent [3]. Causal relationships between extreme temperature and air pollution are complex, involving weather patterns that promote air stagnation, pollutant emissions (e.g. from wildfires) and atmospheric photochemistry [4,5]. These factors are not yet well understood but are important to disentangle, as the frequency and intensity of summer heatwaves are expected to rise due to climate change [6].

    This project’s overarching goals are to (1) characterise the response of air pollutants to heatwaves across Europe and to assess their combined health impacts, (2) provide new process-level insight into the causal relationship between extreme temperature and air pollution, and (3) evaluate and improve current systems for forecasting extreme air pollution events. This will be achieved through ultra-high-resolution air quality model simulations, analysis of air pollutant measurement data, and by exploring data-driven approaches to air quality forecasting and inference, including machine learning.

    The successful candidate will join LEC’s vibrant atmospheric science research group (AtMOS) and benefit from a diverse supervisory team. This includes a placement with a UK industry partner (Reliable Insights Ltd www.reliable-insights.com / https://www.tangentworks.co.uk/) and partnership with the UK Health Security Agency. 

    Methodology and Objectives

    Teaser Project 1: What drives adverse air quality during heatwaves?
    In Year 1, the goal will be to characterise the observed response of ground-level ozone (an important air pollutant) during European heatwaves. Using the severe summer 2022 heatwave as a case study, an analysis of surface temperature and ozone measurements from hundreds of sites across Europe will be performed (e.g. exploiting the extensive TOAR-II measurement database). Output from the UCI chemical transport model (CTM), a global model developed and maintained by the Lancaster atmospheric science group, will also be analysed. The ability of the model to simulate the behaviour of ozone during the heatwave, including the observed ozone-temperature relationship, will be determined. This work will provide a strong grounding in the observational datasets and atmospheric model used in subsequent years.

    In Year 2, emphasis will be placed on understanding the various processes (meteorological, chemical, physical) responsible for elevating ozone during heatwaves. This will be achieved through carefully designed model sensitivity experiments that allow these factors to be disentangled and quantified. One line of enquiry will be to assess the importance of temperature-dependent ozone precursor emissions, such as emissions from wildfires (CO, NOx) and volatile organic compound emissions from stressed vegetation. Another will be to assess long-range transport of ozone into continental Europe from other world regions. As the project progresses in Years 2 and 3, the scope will expand to consider other notable heatwave years and air pollutants. The ability of regional-scale models to forecast extreme air pollutant events will be explored, as will innovative data-driven approaches (e.g. machine learning).

    Teaser Project 1 Objectives:

    • Characterise the ozone-heatwave response across multiple European summers using surface and satellite measurements.
    • Assess the ability of atmospheric models to capture extreme ozone events and the observed ozone-temperature relationship.
    • Interpret the observed ozone-heatwave responses using high resolution model simulations and assess the roles of meteorological, chemical and physical factors.
    • Explore data-driven approaches to air quality forecasting.

    Teaser Project 2: Optimising air quality forecasts: exploring data-driven methods

    Process-based air quality models are frequently used to simulate past (i.e. ‘hindcast’) air quality. This provides information needed to quantify how air pollutant levels have changed over time (e.g. due to policy interventions) and the implications for human exposure and health. Such models are also increasingly used to alert the public in advance of upcoming air pollution episodes (i.e. ‘forecast’). Like weather forecasts, air quality forecasts are provided up to several days ahead, though confidence generally decreases as the forecast range increases. As the skill of air quality models is often inadequate, particularly for the most ‘extreme’ episodes, various approaches to ‘bias correct’ model forecasts (before they are issued) have emerged [7,8].

    In Year 1, the goal will be to examine the ability of WRF-Chem to simulate UK air quality in hindcast mode, with an emphasis on quantifying the model’s skill during recent heatwaves. WRF-Chem is a well-evaluated and widely adopted process model suitable for national-scale high resolution simulations. The model’s skill in simulating extreme air pollutant levels will be assessed using the UK’s extensive network of air pollutant measurements and by considering a range of key performance metrics (e.g. hit rate, false alarm rate etc.). This will provide a solid grounding on the strengths and weaknesses of air quality models and approaches to evaluate them.

    In Year 2, the effectiveness of a range of bias correction techniques applied to the WRF-Chem simulations will be explored, including machine learning-based approaches [9]. Methods that improve the simulated tail of the ozone distribution (e.g. ‘quantile mapping’) will be examined. Bias-corrected hindcasts will be produced, and the annual mortality burden attributable to long-term air pollutant exposure, along with the health impacts of elevated air pollution in conjunction with heatwaves, will be quantified [10].
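
    To make the quantile-mapping idea concrete, below is a minimal sketch of empirical quantile mapping in Python: new model values are mapped through the empirical distributions of a paired model-observation training period, pulling the simulated tail towards the observed one. The synthetic ozone data and variable names are illustrative assumptions, not project code.

        import numpy as np

        def quantile_map(model_train, obs_train, model_new):
            """Map model values onto the observed distribution via paired empirical CDFs."""
            q = np.linspace(0, 1, 101)
            model_q = np.quantile(model_train, q)   # model quantiles
            obs_q = np.quantile(obs_train, q)       # observed quantiles
            # Find each new value's position in the model distribution,
            # then read off the observed value at that same quantile.
            return np.interp(model_new, model_q, obs_q)

        # Example: correct a synthetic low-biased ozone hindcast (units: ug/m3).
        rng = np.random.default_rng(1)
        obs = rng.gamma(shape=4.0, scale=15.0, size=5000)           # "observed" ozone
        model = 0.8 * rng.gamma(shape=4.0, scale=15.0, size=5000)   # biased "model" ozone
        corrected = quantile_map(model, obs, model)
        print(np.percentile(obs, 99), np.percentile(model, 99), np.percentile(corrected, 99))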

    In Year 3, the focus of the project will be to explore how data science approaches can improve the skill of air quality forecasts across a range of forecast lead times (24 to 96 hours). Important predictor variables for adverse UK air quality will be assessed and ranked, including the potential for their near real-time assimilation (e.g. surface and satellite data). These data will be used to train machine learning models, and the resulting data-driven forecasts will be evaluated against process models. A key question will be whether a data-driven approach outperforms a traditional process model forecast and, if so, under what conditions. Whether the two may be combined to produce optimal results will also be examined.
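
    As a simple, hedged illustration of this comparison, the sketch below trains an off-the-shelf machine learning regressor on lagged predictors to forecast ozone at a fixed lead time and scores it against a persistence baseline (standing in here for a reference forecast; a process model would play that role in practice). The synthetic temperature-driven ozone series, the 48-hour feature window and the 24-hour lead time are illustrative assumptions.

        import numpy as np
        from sklearn.ensemble import RandomForestRegressor
        from sklearn.metrics import mean_absolute_error

        rng = np.random.default_rng(0)
        n = 2000                                   # hourly records
        temp = 20 + 8 * np.sin(2 * np.pi * np.arange(n) / 24) + rng.normal(0, 1, n)
        ozone = 40 + 2.5 * (temp - 20) + rng.normal(0, 5, n)   # temperature-driven ozone

        lead, lags = 24, 48                        # forecast lead time and history window (hours)
        X = np.column_stack(
            [temp[i:n - lead - lags + i] for i in range(lags)]
            + [ozone[i:n - lead - lags + i] for i in range(lags)]
        )
        y = ozone[lags + lead:]                    # target: ozone ~24 h after the feature window

        split = int(0.8 * len(y))                  # chronological train/test split
        model = RandomForestRegressor(n_estimators=200, random_state=0)
        model.fit(X[:split], y[:split])

        pred = model.predict(X[split:])
        persistence = ozone[lags - 1:n - lead - 1][split:]   # "no change" baseline forecast
        print("ML MAE:         ", mean_absolute_error(y[split:], pred))
        print("Persistence MAE:", mean_absolute_error(y[split:], persistence))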

    The development of data-driven solutions to air quality forecasting is a rapidly evolving field with strong opportunities for impactful science. During Year 3, a placement with Reliable Insights (a leading Lancaster-based time-series data specialist) will be undertaken. This placement will provide an excellent opportunity to develop industry links and to implement the project’s data-driven forecasting approaches in an applied setting.

    Teaser Project 2 Objectives:

    • Evaluate the skill of the WRF-Chem model in simulating air pollutants during recent UK heatwaves employing a range of metrics.
    • Test and apply bias correction techniques to produce optimised WRF-Chem hindcasts and use these to quantify the human health effects of UK air pollution over time.
    • Explore a data-driven air quality forecasting system for the UK, exploiting machine learning and other innovative data science approaches.

    References & Further Reading

    [1] Schnell, J.L., and Prather, M.J. (2017). Co-occurrence of extremes in surface ozone, particulate matter, and temperature over eastern North America. Proc. Natl. Acad. Sci., 114, 2854-2859, https://doi.org/10.1073/pnas.1614453114
    [2] Gouldsbrough, L., Hossaini, R., Eastoe, E., & Young, P.J.Y. (2022). A temperature-dependent extreme value analysis of UK surface ozone, 1980-2019. Atmos. Env., 273, 118975.
    [3] https://atmosphere.copernicus.eu/copernicus-scientists-warn-very-high-ozone-pollution-heatwave-continues-across-europe
    [4] Pope, R. J., et al. (2023). Investigation of the summer 2018 European ozone air pollution episodes using novel satellite data and modelling, Atmos. Chem. Phys., 23, 13235-13253, https://doi.org/10.5194/acp-23-13235-2023.
    [5] Otero, N., Jurado, O. E., Butler, T., and Rust, H. W. (2022). The impact of atmospheric blocking on the compounding effect of ozone pollution and temperature: a copula-based approach, Atmos. Chem. Phys., 22, 1905-1919, https://doi.org/10.5194/acp-22-1905-2022.
    [6] Doherty, R.M., Heal, M.R., and O’Connor, F.M. (2017). Climate change impacts on human health over Europe through its effect on air quality, Environ. Health, 16, https://doi.org/10.1186/s12940-017-0325-2.
    [7] Staehle, C., et al. (2024). Technical note: An assessment of the performance of statistical bias correction techniques for global chemistry–climate model surface ozone fields, Atmos. Chem. Phys., 24, 5953-5969, https://doi.org/10.5194/acp-24-5953-2024.
    [8] Neal, L.S., Agnew, P., Moseley, S., Ordóñez, C., Savage, N.H. and Tilbee, M. (2014). Application of a statistical post-processing technique to a gridded, operational, air quality forecast, Atmos. Env., 98, 385-393, https://doi.org/10.1016/j.atmosenv.2014.09.004.
    [9] Gouldsbrough, L., Hossaini, R., Eastoe, E., Young, P.J.Y. & Vieno, M. (2024). A machine learning approach to downscale EMEP4UK: analysis of UK ozone variability and trends. Atmos. Chem. Phys., 24, 3163-3196, https://doi.org/10.5194/acp-24-3163-2024.
    [10] Macintyre, H.L., et al. (2023). Impacts of emissions policies on future UK mortality burdens associated with air pollution. Environ. Int., 174, 107862, https://doi.org/10.1016/j.envint.2023.107862.

  • Antarctic Ice Loss in High Definition: Analysing novel high-resolution satellite data streams for quantifying 21st century change

    Project institution:
    Project supervisor(s):
    Prof Mal McMillan (Lancaster University), Dr Dave McKay (University of Edinburgh), Dr Jenny Maddalena (Lancaster University) and Dr Israel Martinez Hernandez (Lancaster University)

    Overview and Background

    This project offers an exciting opportunity to be at the forefront of research exploiting the potential of exascale computing for satellite monitoring of Earth’s polar ice sheets, at scale.

    The polar regions are among the most rapidly warming regions on Earth, with ongoing melting of ice sheets and ice caps making a significant contribution to global sea level rise. As Earth’s climate continues to warm throughout the 21st Century, ice melt is expected to accelerate, leading to large-scale social and economic disruption.

    Satellites provide a unique tool for monitoring the impact of climate change upon the polar regions, and are key to tracking the ongoing contribution that ice masses make to sea level rise. With recent increases in data volumes, computing power and the use of data science comes huge potential to rapidly advance our ability to monitor and predict changes across this vast and inaccessible region. However, this potential is not yet fully realised.

    This project will place you at the forefront of this research, working to advance our current capabilities towards exascale computing, through a combination of state-of-the-art satellite datasets, high performance compute, and innovative data science methods. You will be supported by a multidisciplinary supervisory team of statisticians, computer scientists and environmental scientists, with opportunities to contribute to projects run by the European Space Agency. Specifically, this project aims to develop new large-scale, high-resolution estimates of 21st century Antarctic ice loss and, in doing so, better constrain our estimates of past and future sea level rise. 

    Methodology and Objectives

    Project Aim: This project aims to utilise new streams of satellite data, alongside advanced statistical algorithms and compute, to transform our ability to monitor Antarctic Ice Sheet mass loss at high spatial resolution and at the pan-Antarctic scale. More specifically, the successful candidate will develop new estimates of ice sheet mass loss from high-volume, high-resolution satellite-derived Digital Elevation Models (DEMs), using efficient GPU-enabled processing flows. These will be used to determine unique, large-scale estimates of 21st century ice sheet mass loss and glacier evolution.

    Methods Used:

    This project will build upon recent proof-of-concept work that has developed a novel pipeline for processing high-volume (hundreds of terabytes), extremely high-resolution (metre-scale) time series of Digital Elevation Models. The focus of this PhD will be to adapt these methods so that – for the first time – it is computationally feasible to apply them at the ice sheet scale, and then to develop a comprehensive, ice-sheet-wide assessment, which will ultimately improve our understanding of the impact of climate change on polar ice loss and sea level rise. This will necessitate the use of Graphics Processing Units (GPUs) on High Performance Computing (HPC) clusters. As such, developing the code to work on this high-level computing architecture will be a key element of the project.
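
    As a hedged indication of the GPU-friendly core of such a pipeline, the sketch below computes a per-pixel elevation-change rate (dh/dt) via a vectorised linear fit through a stack of co-registered DEMs, using CuPy as a drop-in NumPy replacement where a GPU is available. The stack dimensions and synthetic thinning signal are illustrative assumptions, not the project’s actual code.

        try:
            import cupy as xp          # runs on the GPU if CuPy + CUDA are available
        except ImportError:
            import numpy as xp         # otherwise fall back to NumPy on the CPU

        n_t, ny, nx = 12, 512, 512                     # 12 DEM epochs, 512 x 512 pixels
        t = xp.linspace(0.0, 11.0, n_t)                # acquisition times in years
        dems = (100.0 - 0.5 * t[:, None, None]         # synthetic thinning at 0.5 m/yr
                + 0.1 * xp.random.standard_normal((n_t, ny, nx)))

        # Closed-form least-squares slope per pixel, fully vectorised over the stack:
        # slope = cov(t, h) / var(t), computed along the time axis.
        t_mean = t.mean()
        h_mean = dems.mean(axis=0)
        cov_th = ((t - t_mean)[:, None, None] * (dems - h_mean)).sum(axis=0)
        var_t = ((t - t_mean) ** 2).sum()
        dhdt = cov_th / var_t                          # metres per year, per pixel

        print(float(dhdt.mean()))                      # ~ -0.5 m/yr for this synthetic stack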

    Within the first year of the PhD, the successful candidate will have the opportunity to explore two teaser projects, one of which will then be taken forward into subsequent years.

    Teaser Project 1: Towards pan-Antarctic, high-resolution monitoring of ice loss

    This teaser project will work to translate the current proof-of-concept DEM pipeline into a system which can be run efficiently at scale, and then to test its use at a number of key Antarctic study sites. Specifically, state-of-the-art satellite altimetry will be combined with high-resolution DEMs from the Reference Elevation Model of Antarctica (REMA) project, to generate estimates of glacier elevation change covering the period 2010-present. Study sites will be selected to cover those of high scientific interest (e.g. Pine Island Glacier, Totten Glacier), and to explore performance within diverse glaciological settings (e.g. large ice streams, narrow outlet glaciers with nearby nunataks). A key component of this initial project will be to refactor the current code for GPU-enabled systems, alongside developing better approaches to memory management, data infrastructure, etc. If continued beyond Year 1, the ultimate ambition of this teaser project will be to exploit the full, pan-Antarctic archive of REMA strips, to (1) generate new ultra-high-resolution Antarctic mass balance estimates, and (2) improve process understanding of the physical drivers of current ice loss.

    Teaser Project 2: Multi-sensor integration for multi-decadal monitoring

    The second teaser project will again aim to establish efficient, scalable DEM-processing pipelines for resolving Antarctic Ice Sheet mass loss, but this time focusing on extending the observational record to generate time series spanning a quarter of a century (2000-2025). This necessitates adapting the proof-of-concept pipeline which forms the basis of Teaser Project 1, to add functionality to process data from the ASTER mission (2000-present) over Antarctica, thus enabling a long-term record to be derived. Initial work has already been performed to process ASTER data over Greenland using conventional (CPU) systems. Hence, the purpose of this teaser project will be (1) to adapt this existing code so that it runs over Antarctica, and (2) to refactor it for deployment on GPUs, with a view to future scale-up. If continued beyond Year 1, the ultimate ambition of this teaser project would be to (1) generate new long-term records of Antarctic glacier evolution, and (2) improve process understanding of the physical drivers of current ice loss.

    References & Further Reading

    Here are some tasters of our work and its impact: 

    https://www.newscientist.com/article/2490250-meltwater-bursts-through-greenland-ice-in-first-of-a-kind-eruption/ 

    https://www.weforum.org/agenda/2019/05/antarctica-s-ice-is-melting-5-times-faster-than-in-the-90s/ 

    https://www.bbc.co.uk/news/science-environment-47461199 

    https://www.washingtonpost.com/news/energy-environment/wp/2016/07/19/greenland-lost-a-trillion-tons-of-ice-in-just-four-years/ 

    The candidate will also join the UK Centre for Polar Observation and Modelling, and the Centre of Excellence in Environmental Data Science: 

    https://cpom.org.uk/ 

    https://ceeds.ac.uk/ 


  • Computationally scalable data fusion for real-time water quantity and quality forecasting

    Project institution:
    Project supervisor(s):
    Dr Abdollah Jalilian (University of Glasgow), Dr Faiza Samreen (UKCEH), Dr Andrew Elliott (University of Glasgow), Prof Claire Miller (University of Glasgow), Prof Andrew Tyler (University of Stirling) and Prof Peter Hunter (University of Stirling)

    Overview and Background

    This project will develop machine learning and statistical methods for real-time forecasting via data fusion with uncertainty quantification for water catchments. It will use and develop advanced AI models (building on and expanding work such as Allen et al. (2025) and relevant foundation models) to fuse in situ sensor and satellite data (optical and/or radar) for hydrology and surface water quality in river catchments. The project will also assess the generalisability of the developed methods across multiple river catchments.

    Data demands can be very large, with, for example, data collected approximately every 15 minutes from multiple in situ sensors and satellite imagery at 10 m resolution. To obtain fast (real-time or near-real-time), reliable and computationally efficient (and therefore environmentally friendly) models, this project specifically targets the development of GPU programming for scalable analytics, and will consider the advantages of cloud GPUs and other related platforms, for example EDITO, to support transferability and impact.

    This project is a collaboration with domain expert colleagues at University of Stirling and Scottish Water and will link to NERC projects such as SenseH20 and MOT4Rivers and the Forth-ERA digital observatory of the Forth Catchment. 

    Methodology and Objectives

    The AI-based models will be developed using the latest generation of deep learning approaches (such as transformer models, physics-informed losses, etc.). As detailed in the background, this project would leverage existing model frameworks (like Aardvark) and foundation models that can be specialised, and would construct domain-specific pipelines as appropriate. The fusion methodologies will be tested on a number of downstream tasks, most notably predicting values of water quantity/quality far from the sensor locations (validated by cross-validation) and forecasting. While the frameworks generated will be tested on the dataset in question, the assumption is that the models can be transferred to other localities, and testing the portability of these approaches will be part of the later stages of this project.

    Teaser Project 1: 

    The first teaser project will focus on developing an initial methodology for data fusion, based on taking an off-the-shelf deep learning approach and applying it to the dataset in question to explore its performance. This will be compared with standard statistical approaches (e.g. Kriging and hierarchical Bayesian spatiotemporal models) to understand the relative advantages and disadvantages of each approach, in terms of both prediction accuracy and computational time; a minimal example of such a baseline is sketched below. Based on these initial findings and the limitations of the approach, we will consider additional changes to the architecture, including software optimisations (such as GPU programming), or indeed the development of a different approach, which will naturally extend into a full PhD project should the student decide to pursue this.
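
    As a minimal sketch of the kind of statistical baseline referred to above, the following uses a Gaussian process (Kriging-style) model, via scikit-learn, to predict a water-quality variable, with uncertainty, at an unmonitored location from a handful of in situ sensors. The sensor layout and synthetic field are illustrative assumptions.

        import numpy as np
        from sklearn.gaussian_process import GaussianProcessRegressor
        from sklearn.gaussian_process.kernels import RBF, WhiteKernel

        rng = np.random.default_rng(42)
        sensors = rng.uniform(0, 10, size=(30, 2))            # 30 in situ sensor locations
        values = np.sin(sensors[:, 0]) + 0.1 * sensors[:, 1] + rng.normal(0, 0.05, 30)

        # RBF kernel = smooth spatial correlation; WhiteKernel = sensor noise (nugget).
        kernel = 1.0 * RBF(length_scale=2.0) + WhiteKernel(noise_level=0.01)
        gp = GaussianProcessRegressor(kernel=kernel, normalize_y=True)
        gp.fit(sensors, values)

        # Predict (with uncertainty) at an unmonitored location, as in the
        # "far from the sensor locations" task described above.
        query = np.array([[5.0, 5.0]])
        mean, std = gp.predict(query, return_std=True)
        print(f"prediction {mean[0]:.2f} +/- {std[0]:.2f}")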

    Teaser Project 2:  

    The second teaser project is strongly rooted in uncertainty quantification, i.e. understanding how certain we should be about the model’s predictions. AI-based approaches, while regularly delivering high accuracy, often lack the strong probabilistic frameworks needed for good uncertainty quantification. To address this, we will employ a mixture of approaches, from Monte Carlo-based and variational inference methods that leverage the fast inference time of AI models, to emulation-based approaches; one such Monte Carlo-style route is sketched below. Given the data fusion-based pipelines we will be developing, this will require understanding both the uncertainty induced by the model and the uncertainty in the observations themselves. Computationally this is quite intensive, and therefore part of this teaser project will be understanding this complexity and optimising it, both through computational means (e.g. GPU coding, HPC) and through statistical techniques that use computational resources most efficiently (and limit their environmental impact).
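
    A minimal sketch of the Monte Carlo-style route mentioned above: a bootstrap ensemble of small neural networks whose spread provides a predictive uncertainty band. The data, network size and ensemble size are illustrative assumptions; the project would apply analogous ideas to far larger fusion models.

        import numpy as np
        from sklearn.neural_network import MLPRegressor

        rng = np.random.default_rng(0)
        X = rng.uniform(-3, 3, size=(300, 1))
        y = np.sin(X[:, 0]) + rng.normal(0, 0.1, 300)

        ensemble = []
        for seed in range(10):                      # 10 bootstrap-resampled members
            idx = rng.integers(0, len(X), len(X))   # resample training data with replacement
            m = MLPRegressor(hidden_layer_sizes=(64, 64), max_iter=2000, random_state=seed)
            ensemble.append(m.fit(X[idx], y[idx]))

        X_test = np.linspace(-3, 3, 50).reshape(-1, 1)
        preds = np.stack([m.predict(X_test) for m in ensemble])    # (members, points)
        mean, std = preds.mean(axis=0), preds.std(axis=0)          # spread = model uncertainty
        print(mean[:3], std[:3])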

    References & Further Reading

    Allen, A., Markou, S., Tebbutt, W., Requeima, J., Bruinsma, W. P., Andersson, T. R., … & Turner, R. E. (2025). End-to-end data-driven weather prediction. Nature, 641(8065), 1172-1179. 10.1038/s41586-025-08897-0 

    Andersson, T. R. et al. (2021) Seasonal Arctic sea ice forecasting with probabilistic deep learning. Nature Communications, 12, 5124. (doi: 10.1038/s41467-021-25257-4) (PMID:34446701) (PMCID:PMC8390499) 

    Colombo, P., Miller, C., Yang, X., O’Donnell, R., & Maranzano, P. (2025). Warped multifidelity Gaussian processes for data fusion of skewed environmental data. Journal of the Royal Statistical Society Series C: Applied Statistics, 74(3), 844-865. 10.1093/jrsssc/qlaf003 

    Wilkie, C. J., Miller, C. A., Scott, E. M., O’Donnell, R. A., Hunter, P. D., Spyrakos, E., & Tyler, A. N. (2019). Nonparametric statistical downscaling for the fusion of data of different spatiotemporal support. Environmetrics, 30(3), e2549. 10.1002/env.2549 

    Forth Environmental Resilience Array (Forth-ERA) project and digital observatory, University of Stirling 

    MOT4Rivers: Monitoring, Modelling and Mitigating Pollution Impacts in a Changing World: Science and Tools for Tomorrow’s Rivers 

    SenseH2O: a scalable, integrated systems-based approach to monitoring water quality from headwaters to river outlets 

  • Detecting hotspots of water pollution in complex constrained domains and networks

    Project institution:
    Project supervisor(s):
    Dr Mu Niu (University of Glasgow), Dr Craig Wilkie (University of Glasgow), Prof Cathy Yi-Hsuan Chen (University of Glasgow) and Dr Michael Tso (UKCEH)

    Overview and Background

    Technological developments with smart sensors are changing the way that the environment is monitored. Many such smart systems are under development, with small, energy efficient, mobile sensors being trialled. Such systems offer opportunities to change how we monitor the environment, but this requires additional statistical development in the optimisation of the location of the sensors. 

    The aim of this project is to develop a mathematical and computational inferential framework to identify the best locations to deploy sensors in a complex constrained domain or network, to enable improved detection of water contamination. The proposed methods can also be applied to regression, classification and optimisation problems on a latent manifold embedded in a higher-dimensional space.

    Figure 1. Examples of complex constrained domains: chlorophyll concentrations in the Aral Sea (Wood et al., 2008).

    Methodology and Objectives

    The idea of using on-site sensors to detect water contaminants has a rich history. Since water flows at finite speed, placing sensors strategically reduces the time until detection. The mathematical analysis is often made difficult by the need to model the nonlinear dynamical systems of hydraulics within a non-Euclidean space, such as a constrained domain (a lake or river; Wood et al., 2008) or a network (a pipe network; Oluwaseye et al., 2018). It requires solving large nonlinear systems of differential equations in the complex domain and is difficult to apply to even moderate-sized problems.

    This proposed PhD project aims to develop new methods to improve environmental sampling, enabling improved estimation of water pollution and associated uncertainty that appropriately accounts for the geometry and topology of the water body. 

    Methods Used:

    Intrinsic Bayesian Optimisation (BO) on complex constrained domains and networks allows the predictions and uncertainty quantification of intrinsic Gaussian processes (GPs) (Niu et al., 2019, 2023) to direct the search for water pollution. Once a new detection is observed, the search for a hotspot can be sequentially updated.

    A key ingredient of BO is the Gaussian process (GP) prior, which captures beliefs about the behaviour of the unknown black-box function on the complex domain. The student will develop intrinsic BO on non-Euclidean spaces, such as complex constrained domains and networks, using state-of-the-art GPs on manifolds and GPs on graphs. Extending the idea of estimating covariance functions on manifolds, the project aims to estimate the heat kernel of the point cloud, allowing the incorporation of the intrinsic geometry of the data and a potentially complex interior structure.
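
    A minimal sketch of the graph analogue of this idea, under illustrative assumptions: on a network, the heat kernel exp(-tL) of the graph Laplacian L is a valid GP covariance that respects the network topology, and the resulting posterior could drive the acquisition step of BO. The six-node path graph below stands in for a river or sewage network.

        import numpy as np
        from scipy.linalg import expm

        # Adjacency of a 6-node path graph: 0-1-2-3-4-5
        A = np.zeros((6, 6))
        for i in range(5):
            A[i, i + 1] = A[i + 1, i] = 1.0
        L = np.diag(A.sum(axis=1)) - A            # graph Laplacian

        t = 0.5                                   # diffusion time (plays the length-scale role)
        K = expm(-t * L)                          # heat kernel = GP covariance matrix

        # GP posterior mean at all nodes given noisy pollution readings at nodes 0 and 3.
        obs_idx, y = np.array([0, 3]), np.array([1.2, 0.4])
        noise = 1e-2
        K_oo = K[np.ix_(obs_idx, obs_idx)] + noise * np.eye(2)
        K_ao = K[:, obs_idx]
        posterior_mean = K_ao @ np.linalg.solve(K_oo, y)
        print(posterior_mean)                     # smooth along the network, not Euclidean space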

    The application areas are water quality in lakes with complex domains (such as the Aral Sea) and pollution sources in a city’s sewage network. The methods would have the potential to inform about emergent water pollution events like algal blooms, providing an early warning system, and help to identify pollution sources. 

    Teaser Project 1 Objectives: In the first teaser project, the student will apply intrinsic GPs to water quality data, seeking to understand the complex patterns of water quality in non-Euclidean spaces (both continuous domains with complex boundaries and network domains). The student will apply existing methods to small-scale datasets, getting a feel for the methodology used in this area. This work could evolve into a PhD with a focus on developing computationally demanding methods for modelling water quality and detecting hotspots over complex domains. Parallelisation over GPUs would enable modelling across large areas, with the high data volumes typical of high spatial resolution water quality data.

    Teaser Project 2 Objectives: In the second teaser project, the student will expand their work to the spatio-temporal (or manifold-temporal) setting, incorporating both complex spatial and temporal structures to fully explain the changing nature of the water quality patterns. Again, this teaser project will involve applying existing methods to small-scale datasets. Due to the high computational complexity of spatio-temporal models, this project has the potential to evolve into a PhD with a focus on developing highly computationally efficient methods, with a focus on parallelisation on GPUs.

    The student will benefit from the extensive expertise of the supervisory team. Dr Niu specialises in statistical inference on non-Euclidean spaces, with applications in ecology and environmental science. Dr Wilkie has a background in developing spatiotemporal data fusion approaches for environmental data, focussing on satellite and in-lake water quality data. Prof Chen specialises in network modelling, statistical inference, data science, machine learning and economics. Dr Tso is an environmental data scientist with a strong computational background and a portfolio of work on water quality monitoring, including adaptive sampling.

    References & Further Reading

    1. Niu, et al. (2019). “Intrinsic Gaussian processes on complex constrained domain”, J. Roy. Statist. Soc. Series B, 81(3).
    2. Niu, et al. (2023). “Intrinsic Gaussian processes on unknown manifold with probabilistic geometry”, Journal of Machine Learning Research, 24(104).
    3. Oluwaseye, et al. (2018). “A state-of-the-art review of an optimal sensor placement for contaminant warning system in a water distribution network”, Urban Water Journal, 15(10), 985–1000.
    4. Giudicianni, et al. (2020). “Topological placement of quality sensors in water-distribution networks without the recourse to hydraulic modeling”, Journal of Water Resources Planning and Management, 146(6).
    5. Wood, S. N., Bravington, M. V. and Hedley, S. L. (2008). “Soap film smoothing”, J. Roy. Statist. Soc. Series B, 70, 931–955.

  • Firn Futures: Examining Antarctic Ice Shelf Stability with GPU-Accelerated Firn Modelling

    Project institution:
    Project supervisor(s):
    Dr Amber Leeson (Lancaster University), Dr Matt Speers (Lancaster University), Dr Katie Miles (Lancaster University) and Dr Vincent Verjans (Barcelona Supercomputing Centre)

    Overview and Background

    Antarctic ice shelves regulate the discharge of grounded ice into the ocean, and their collapse can accelerate sea level rise (Berthier et al., 2012). The Larsen C Ice Shelf (LCIS) in particular is thought to be vulnerable to surface melt and hydrofracture, processes which may lead to eventual collapse and which are controlled by the firn layer’s ability to store and refreeze meltwater. Current firn models simplify these processes to reduce computational demand (e.g. Verjans et al., 2019), but this introduces uncertainty and risks misrepresenting thresholds for saturation and collapse. This project will exploit GPU acceleration to test and improve meltwater physics in the Community Firn Model (CFM, Stevens et al., 2020) against field and satellite observations and use the developed model to simulate the evolution of the firn layer on the LCIS under predicted climate warming. By resolving these processes at scale, the project will deliver new predictions of LCIS stability under future climate forcing. 

    Methodology and Objectives

    This PhD project integrates novel field observations (Hubbard et al., 2016), satellite data, and firn modelling to address a key uncertainty in Antarctic climate science: when will the firn layer of the Larsen C Ice Shelf (LCIS) saturate, and what are the consequences for its stability? The programme begins with two exploratory projects of approximately six months each, both centred on the Community Firn Model (CFM). These projects will provide complementary experience in model physics and high-performance computation. Following this training year, the student will select one pathway to pursue in depth from Year 2 onwards, ultimately delivering a substantive contribution to predicting ice-shelf vulnerability. 

    Teaser Project 1: Advancing meltwater physics in the Community Firn Model 
    Firn modelling in high-melt environments is fundamentally limited by how liquid water processes are represented. Meltwater percolation, refreezing, and the possible development of firn aquifers are central to whether surface melt is buffered or contributes directly to destabilisation. Current CFM schemes necessarily simplify these processes, often assuming one-dimensional percolation and instantaneous refreezing, which can underestimate storage depth and persistence.

    This project will benchmark the CFM’s existing meltwater parameterisations against borehole-derived density profiles and refrozen ice layers sampled during field campaigns on LCIS, together with satellite observations of meltwater ponding (e.g. Corr et al., 2022) and surface elevation change. The student will then develop an inventory of physically more complete options, such as multi-phase flow and coupled thermodynamics, which could be implemented within the CFM framework, using GPU acceleration to reduce the computational penalty of these enhancements and enable higher spatial and temporal resolution.
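
    For orientation, a minimal sketch of the kind of simplified ‘bucket’ scheme that such benchmarking would interrogate is given below: meltwater is routed down a one-dimensional firn column and refreezes instantly wherever cold content and pore space allow. All layer properties and constants are illustrative assumptions, not CFM code.

        import numpy as np

        RHO_ICE = 917.0              # density of ice (kg m-3)
        LF, CI = 3.34e5, 2108.0      # latent heat of fusion (J kg-1), heat capacity of ice (J kg-1 K-1)

        def bucket_percolation(rho, temp, dz, melt):
            """Route surface melt (kg m-2) down the layers; refreeze instantly."""
            for k in range(len(rho)):
                if melt <= 0:
                    break
                # Refreezing is limited by cold content and by available pore space.
                cold_limit = CI * rho[k] * dz[k] * max(0.0, -temp[k]) / LF
                pore_limit = (RHO_ICE - rho[k]) * dz[k]
                refreeze = min(melt, cold_limit, pore_limit)
                rho[k] += refreeze / dz[k]                          # densify the layer
                temp[k] += refreeze * LF / (CI * rho[k] * dz[k])    # latent heat warms it
                melt -= refreeze
            return rho, temp, melt               # leftover melt = runoff or deeper storage

        rho = np.full(10, 500.0)     # layer densities (kg m-3)
        temp = np.full(10, -8.0)     # layer temperatures (deg C)
        dz = np.full(10, 0.5)        # layer thicknesses (m)
        print(bucket_percolation(rho, temp, dz, melt=20.0))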

    Teaser Project 2: GPU-accelerated simulations of firn evolution under extreme melt 
    The second project develops expertise in scalable Earth system modelling by implementing the CFM on GPU-enabled high-performance clusters. Starting from borehole-constrained initial conditions, the model will be forced with outputs from regional climate models covering the past two decades. Particular emphasis will be placed on extreme melt events associated with atmospheric rivers, which have played a central role in past ice-shelf collapse (Wille et al., 2022).

    The student will design sensitivity experiments to explore firn response to a range of forcing scenarios, including anomalously warm summers, rainfall events, and multi-year accumulation variability. By exploiting GPU acceleration, simulations will be scaled to high spatial resolution across LCIS, and large ensembles will be run to explore uncertainty. These experiments will provide new insight into the thresholds governing firn saturation and the role of rare but intense events in destabilising ice shelves. 

    Beyond Year 1: 
    At the end of Year 1, the student will select a pathway for further development. If Project 1 is chosen, the PhD will concentrate on improving the physics of meltwater flow in the CFM and integrating field and satellite observations to constrain model predictions. If Project 2 is chosen, the emphasis will be on producing GPU-accelerated projections of firn evolution and collapse risk to 2100 under multiple climate scenarios and will include an assessment of uncertainty in predictions.

    Long-term objectives 
    Whichever pathway is selected, the overarching aims of the PhD are to:

    • Advance the representation of firn processes in high-melt Antarctic environments. 
    • Develop innovative methods for combining borehole and satellite data with physically based modelling. 
    • Exploit exascale computing to enable continent-scale, ensemble firn simulations with improved physics. 
    • Provide new projections of Antarctic ice shelf vulnerability, ultimately contributing to sea-level rise assessments.

    References & Further Reading

    Berthier, E., Scambos, T. A., & Shuman, C. A. (2012). Mass loss of Larsen B tributary glaciers (Antarctic Peninsula) unabated since 2002. Geophysical Research Letters, 39, 6.

    Corr, D., Leeson, A., McMillan, M., Zhang, C., and Barnes, T.: An inventory of supraglacial lakes and channels across the West Antarctic Ice Sheet, Earth Syst. Sci. Data, 14, 209–228, https://doi.org/10.5194/essd-14-209-2022, 2022. 

    Wille, J.D., Favier, V., Jourdain, N.C. et al. (2022) Intense atmospheric rivers can weaken ice shelf stability at the Antarctic Peninsula. Commun Earth Environ 3, 90. https://doi.org/10.1038/s43247-022-00422-9 

    Hubbard, B., Luckman, A., Ashmore, D. et al. Massive subsurface ice formed by refreezing of ice-shelf melt ponds. Nat Commun 7, 11897 (2016). https://doi.org/10.1038/ncomms11897 

    Stevens, C. M., Verjans, V., Lundin, J. M. D., Kahle, E. C., Horlings, A. N., Horlings, B. I., and Waddington, E. D.: The Community Firn Model (CFM) v1.0, Geosci. Model Dev., 13, 4355–4377, https://doi.org/10.5194/gmd-13-4355-2020, 2020. 

    Verjans, V., Leeson, A. A., Stevens, C. M., MacFerrin, M., Noël, B., and van den Broeke, M. R.: Development of physically based liquid water schemes for Greenland firn-densification models, The Cryosphere, 13, 1819–1842, https://doi.org/10.5194/tc-13-1819-2019, 2019. 

  • Forests in the Exascale Era: High-resolution Modelling of Global Biomass Drivers, Loss and Recovery

    Project institution:
    Project supervisor(s):
    Dr Wenxin Zhang (University of Glasgow), Prof Peter Atkinson (Lancaster University), Dr Dave McKay (University of Edinburgh), Dr Vasilis Myrgiotis (UKCEH) and Dr Emma Robinson (UKCEH)

    Overview and Background

    Forest carbon sinks and sources are central to estimating the global carbon budget, but quantifying their biomass dynamics, especially how losses and recoveries are associated with natural and anthropogenic drivers, is a major challenge. The recent dataset of global drivers of forest loss at 1 km resolution (2001–2022) classifies loss into agriculture, logging, wildfire, infrastructure, and natural disturbances (with ~90.5% accuracy), providing a new, spatially explicit basis for attribution studies. However, pre-2001 biomass dynamics and drivers remain largely unexplored. By combining the drivers dataset with exascale-enabled simulations using JULES and Earth observation products (e.g. L-VOD, NDVI), this project aims to extend driver reconstruction back to 1991, validate biomass changes, and simulate fine-resolution biomass dynamics under forest loss and recovery globally.

    Methodology and Objectives

    This PhD proposal combines exascale computation, GPU acceleration, land surface modelling, and satellite Earth observation (EO) datasets to quantify the impacts of forest loss and recovery on above- and below-ground biomass (AGB and BGB) over the longest possible satellite-informed time series. The central modelling framework is the Joint UK Land Environment Simulator (JULES), advanced into an exascale-ready version (ExaJULES) to resolve biomass processes globally at 1 km resolution. ExaJULES will simulate carbon allocation, disturbance, and regrowth processes across AGB and BGB pools. 

    Satellite datasets provide the foundation for both driver attribution and biomass validation. The Global Drivers of Forest Loss dataset (2001–2022, Sims et al. 2024) classifies disturbances—including agriculture, logging, wildfire, infrastructure, and natural causes—at 1 km resolution. This will be combined with L-band Vegetation Optical Depth (L-VOD) for global AGB trajectories. Additional inputs such as Landsat and Sentinel-2 NDVI, MODIS and GFED fire records, FAO and ESA-CCI cropland expansion data, and ERA5-Land reanalysis will allow back-extrapolation of disturbance drivers to 1991. Handling these datasets is non-trivial: together they are estimated to require hundreds of terabytes of storage, and pre-processing steps (temporal aggregation, harmonisation of spatial resolutions, multiband image processing, and data cube construction) will at least double storage needs and incur substantial compute costs. 
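
    As a small, hedged illustration of two of these pre-processing steps (temporal aggregation and spatial harmonisation), the sketch below uses xarray on a synthetic NDVI cube; the cube dimensions and the ten-cell coarsening factor are illustrative assumptions.

        import numpy as np
        import pandas as pd
        import xarray as xr

        # A synthetic daily NDVI cube on a fine 100 x 100 grid for one year.
        time = pd.date_range("2001-01-01", periods=365, freq="D")
        ndvi = xr.DataArray(
            np.random.default_rng(0).uniform(0.2, 0.9, size=(365, 100, 100)),
            dims=("time", "y", "x"),
            coords={"time": time},
            name="ndvi",
        )

        monthly = ndvi.resample(time="1MS").mean()                    # temporal aggregation
        coarse = monthly.coarsen(y=10, x=10, boundary="trim").mean()  # e.g. 100 m -> 1 km cells
        print(coarse.shape)                                           # (12, 10, 10)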

    Computationally, the project exploits GPU-enabled exascale architectures to achieve kilometre-scale simulations with ExaJULES. Validation will involve cross-comparing satellite observations (L-VOD, GEDI canopy structure) with modelled biomass trajectories and, where possible, ground-based forest inventory data. Statistical attribution techniques (e.g., random forest classification, causal inference methods) will be used to disentangle the relative contributions of natural and anthropogenic drivers to observed biomass changes. 

    These approaches provide the foundation for two complementary teaser projects, each expandable into a full PhD pathway. One focuses on reconstructing disturbance drivers and validating biomass change, while the other advances exascale simulation of coupled AGB–BGB dynamics. 

    Teaser Project 1. Reconstructing Global Forest Loss Drivers and Validating Biomass Change 

    This project will extend knowledge of disturbance drivers beyond the satellite record by reconstructing global driver datasets from 1991–present. By integrating fire, agriculture, climate extremes, and governance proxies with the Sims et al. (2024) dataset (2001–2022), and using modern data pipeline techniques, it will deliver a continuous three-decade record of forest loss drivers. 

    The second objective is to validate AGB change. L-VOD (2002–present) and GEDI canopy structure will provide independent benchmarks for biomass loss and recovery. JULES simulations will be forced with observed disturbance regimes, and performance evaluated against EO-based benchmarks. 

    Key scientific questions include: How can driver attribution disentangle natural versus anthropogenic causes across continents? How does biomass allocation vary across regions dominated by different drivers (e.g., Amazon, Southeast Asia, boreal forests)? Can reconstructed driver datasets improve JULES/ExaJULES simulations of biomass dynamics? How do uncertainties in EO-based biomass propagate into carbon budget assessments? 

    This project provides a strong foundation for a PhD centred on historical reconstruction, EO–model fusion, and disturbance attribution. 

    Teaser Project 2. ExaJULES at 1 km – Simulating Above- and Below-ground Biomass with Exascale Computing 

    The second project develops and applies ExaJULES, a GPU-enabled exascale version of JULES, to simulate global biomass stocks and fluxes at 1 km resolution. The objective is to resolve how forest disturbance and recovery propagate from canopy to root-zone carbon pools. 

    Workflows will be developed for kilometre-scale global simulations, with opportunities to contribute to ExaJULES benchmarking and code development. Outputs will be benchmarked against L-VOD and ground inventory datasets. Particular emphasis will be placed on quantifying root–shoot allocation shifts under disturbance and recovery, which remain poorly represented in current models. 

    Key scientific questions include: How resilient is AGB–BGB coupling under diverse disturbance regimes (fire, drought, pests, land-use change)? What are the regional and global implications of biomass dynamics for the carbon cycle? How can new satellite missions (e.g., NISAR, BIOMASS) refine BGB representation? How does kilometre-scale modelling alter global carbon budget projections compared with coarse-scale runs? 

    This project naturally scales into a PhD centred on exascale modelling, GPU acceleration, and root–shoot dynamics, generating novel insights into biomass resilience and carbon–climate feedbacks. 

    References & Further Reading

    1. Curtis, P. G., Slay, C. M., Harris, N. L., Tyukavina, A., & Hansen, M. C. (2018). Classifying drivers of global forest loss. Science, 361(6407), 1108-1111. 
    2. Sims, N.C. et al. (2024). Global drivers of forest loss at 1 km resolution, 2001–2022. Environmental Research Letters. DOI: 10.1088/1748-9326/add606 
    3. Hansen, M. C., Potapov, P. V., Moore, R., Hancher, M., Turubanova, S. A., Tyukavina, A., … & Townshend, J. R. (2013). High-resolution global maps of 21st-century forest cover change. science, 342(6160), 850-853. 
    4. Chen, Y., Feng, X., Fu, B., Ma, H., Zohner, C. M., Crowther, T. W., … & Wei, F. (2023). Maps with 1 km resolution reveal increases in above-and belowground forest biomass carbon pools in China over the past 20 years. Earth System Science Data, 15(2), 897-910. 
    5. Mo, L., Zohner, C. M., Reich, P. B., Liang, J., De Miguel, S., Nabuurs, G. J., … & Ortiz-Malavasi, E. (2023). Integrated global assessment of the natural forest carbon potential. Nature, 624(7990), 92-101. 
    6. ExaJULES model, https://excalibur.ac.uk/projects/exajules/ 
    7. Global Forest Watch: https://www.globalforestwatch.org. 
    8. Zhang, Y., Ling, F., Wang, X., Foody, G.M., Boyd, D.S., Li, X., Du, Y. and Atkinson, P.M. (2021). Tracking small-scale tropical forest disturbances: fusing the Landsat and Sentinel-2 data record. Remote Sensing of Environment, 261, 112470.

  • Mechanisms for and predictions of occurrence of ocean rogue waves

    Project institution:
    Project supervisor(s):
    Dr Suzana Ilic (Lancaster University), Prof Aneta Stefanovska (Lancaster University), Mr Michael Thomas (Reliable Insights) and Dr Bryan Michael Williams (Lancaster University)

    Overview and Background

    Rogue waves, exceptionally high ocean waves whose height exceeds twice the significant wave height, are rare, short-lived events that pose serious risks to shipping, fishing, and maritime infrastructure, including offshore platforms and wind turbines. Understanding their formation and improving their prediction are essential for safe marine operations.

    Despite advances in theoretical and experimental studies, the physical mechanisms driving rogue wave formation in real seas remain poorly understood, making prediction challenging. This PhD project aims to address these gaps by analysing extensive field data, developing advanced non-linear dynamics techniques, and utilising high-performance computing. The aim is to improve understanding of rogue wave dynamics and enhance forecasting capabilities, thereby contributing to the safety and resilience of marine operations.
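
    For concreteness, the sketch below applies the rogue-wave criterion stated above (individual wave height exceeding twice the significant wave height) to a surface-elevation record using zero-upcrossing wave heights. The synthetic sea state is an illustrative assumption.

        import numpy as np

        rng = np.random.default_rng(7)
        t = np.linspace(0, 3600, 36000)                       # 1 h of record at 10 Hz
        eta = sum(a * np.sin(2 * np.pi * f * t + rng.uniform(0, 2 * np.pi))
                  for a, f in [(0.5, 0.08), (0.3, 0.11), (0.2, 0.15)])
        eta += 0.1 * rng.standard_normal(t.size)              # surface elevation (m)

        # Zero-upcrossing analysis: split the record into individual waves.
        up = np.where((eta[:-1] < 0) & (eta[1:] >= 0))[0]
        heights = np.array([eta[a:b].max() - eta[a:b].min()
                            for a, b in zip(up[:-1], up[1:])])

        hs = 4.0 * eta.std()                                  # spectral estimate of Hs
        rogues = heights[heights > 2.0 * hs]                  # the rogue-wave criterion
        print(f"Hs = {hs:.2f} m, waves = {len(heights)}, rogue candidates = {len(rogues)}")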


    Methodology and Objectives

    The PhD project will address the following questions: How can data processing and modelling be accelerated to enable non-linear analysis and modelling with higher spatial and temporal resolution? Which of the sea state parameters predicted by existing operational wave models are useful for detecting the formation of rogue waves? How do the formation and predictability of rogue waves depend on physical conditions?  

    Teaser 1: 

    This data-intensive project aims to accelerate novel time-localised analysis methods to investigate physical mechanisms underlying rogue waves and predict their occurrence.  

    O1: Exploit GPU-accelerated computing to parallelise algorithms for time-localised phase coherence and couplings between waves recorded at many spatial points, enabling scaling to higher-resolution and near-real-time analysis.

    O2: Isolate the mechanisms leading to the formation of rogue waves using algorithms developed in O1. 

    O3: Develop in-situ feature detection for automated analyses exploiting GPUs, and assess the relationship between the occurrence of rogue waves and their characteristics from time series measured under different physical conditions.

    O4: Develop a time-series-based prediction modelling approach, using the relationships identified in O2-3 and assess its ability to predict the occurrence of rogue waves. 

    Methods: 

    The numerical modelling and algorithms for time-series analysis will exploit GPU-accelerated computing; exascale will then allow near real-time practical applications. The Multiscale Oscillatory Dynamics Analysis (MODA) toolbox for non-linear and time-localised phenomena in time series (e.g. phase coherence, coupling and wave energy exchange [3,4]) will be parallelised and used to identify rogue wave mechanisms. Automated pattern analysis and feature engineering will be applied using Tangent to detect anomalous sea surface elevations and enable an easily deployable, computationally light forecasting solution using processed data. The methods will first be applied to laboratory data (e.g. [1]) and then to publicly available field measurements (e.g. the Free Ocean Wave Dataset, with more than 1.4 billion wave measurements). The newly developed prediction modelling approach will be systematically validated against measured data.
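
    As a hedged, simplified analogue of the time-localised phase coherence computed by MODA (which is wavelet-based), the sketch below estimates windowed phase coherence between two wave records from the phases of band-passed analytic signals. The signals, frequency band and window length are illustrative assumptions.

        import numpy as np
        from scipy.signal import butter, filtfilt, hilbert

        fs = 10.0                                       # sampling rate (Hz)
        t = np.arange(0, 600, 1 / fs)
        rng = np.random.default_rng(3)
        common = np.sin(2 * np.pi * 0.1 * t)            # shared 0.1 Hz wave component
        x = common + 0.5 * rng.standard_normal(t.size)  # records at two spatial points
        y = np.roll(common, 20) + 0.5 * rng.standard_normal(t.size)

        b, a = butter(4, [0.05, 0.2], btype="band", fs=fs)   # isolate the wave band
        phi_x = np.angle(hilbert(filtfilt(b, a, x)))         # instantaneous phases
        phi_y = np.angle(hilbert(filtfilt(b, a, y)))

        # Time-localised phase coherence in sliding windows: |mean unit phasor of the
        # phase difference|; 1 = perfectly locked, 0 = no phase relationship.
        win = int(60 * fs)
        dphi = np.exp(1j * (phi_x - phi_y))
        coherence = np.abs([dphi[i:i + win].mean()
                            for i in range(0, dphi.size - win, win)])
        print(coherence.round(2))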

    Teaser 2: 

    This is a data-intensive project focused on the computational optimisation of time series analyses for dynamic systems and the relationship between rogue wave properties and environmental conditions. 

    O1: Assess the current performance of the numerical tools included in MODA and Tangent in terms of their relevance for detecting the mechanisms of rogue waves and their computational efficiency. 

    O2: Optimise the algorithms of the tools identified in O1 on multiple GPUs to improve computation time and experimental throughput, enabling large-scale ensemble time-series analyses.

    O3: Develop and apply a GPU version of MODA to field measured data to isolate mechanisms that lead to the formation of rogue waves.   

    O4: Assess the relationship between the occurrence of rogue waves and concurrent ocean and atmospheric data.  

    Methods: 

    The Multiscale Oscillatory Dynamics Analysis (MODA) toolbox offers several high-order methods for time-series analysis, some based on wavelets. The high computational demands of uncertainty evaluation methods limit their use for operational purposes. Optimised algorithms, GPU acceleration and exascale facilities will enable higher resolution and practical applications. MODA will identify the mechanisms underlying rogue wave formation using field-measured time series of surface elevations (e.g. the Free Ocean Wave Dataset). Concurrent environmental data (e.g. surface ocean currents, wind and atmospheric pressure) will be collated either from field measurements or from the operational forecast models provided by meteorological offices. The correlation between the occurrence of rogue waves and environmental parameters, as well as ‘causal’ relationships between the identified mechanisms and the environmental conditions, will be investigated using the Tangent modelling engine, which can be incorporated into predictions in the future.


    References & Further Reading

    1. Luxmoore, J.F., Ilic, S. and Mori, N., 2019. On kurtosis and extreme waves in crossing directional seas: a laboratory experiment. Journal of Fluid Mechanics, 876, pp.792-817. 
    2. Mori N., Waseda, T., Chabchoub A.(eds.) (2023) Science and Engineering of Freak Waves, Elsevier (https://doi.org/10.1016/C2021-0-01205-0). 
    3. Newman, J., Pidde, A. and Stefanovska, A., 2021. Defining the wavelet bispectrum. Applied and Computational Harmonic Analysis, 51, pp.171-224. 
    4. Stankovski, T., Pereira, T., McClintock, P.V. and Stefanovska, A., 2017. Coupling functions: universal insights into dynamical interaction mechanisms. Reviews of Modern Physics, 89(4), p.045001. 
    5. Yang X., Rahmani H., Black S., Williams B. M. Weakly supervised co-training with swapping assignments for semantic segmentation. In European Conference on Computer Vision 2025 (pp. 459-478). Springer, Cham. 
    6. Jiang Z., Rahmani H., Black S., Williams B. M. A probabilistic attention model with occlusion-aware texture regression for 3D hand reconstruction from a single RGB image. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (pp. 758-767). 
    7. Jiang Z., Rahmani H., Angelov P., Black S., Williams B. M. Graph-context attention networks for size-varied deep graph matching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022 (pp. 2343-2352).

  • Mixed-precision multigrid for weather and climate applications

    Project institution:
    Project supervisor(s):
    Prof Michèle Weiland (University of Edinburgh), Dr Eike Mueller (University of Bath) and Dr Thomas Melvin (Met Office)

    Overview and Background

    Modern hardware (primarily GPUs) is evolving to make extensive use of floating-point precisions lower than 64-bit (historically the norm for simulation algorithms). This is partly because machine learning remains efficient at lower precision, but also because lower-precision computational units require less silicon area. Lower precision can deliver improved performance through better utilisation of vector units, coupled with lower demand on memory and network bandwidth. This project will investigate applying low-precision computation to weather and climate simulation codes, such as the Met Office’s new forecasting model LFRic. The focus will be on exploring performance gains in the multigrid solver through mixed-precision approaches.

    Methodology and Objectives

    Codes such as the Met Office’s next-generation numerical weather prediction application LFRic already use a geometric multigrid preconditioner in the linear solver (to improve convergence and reduce the number of communication calls between processing units) and mixed precision: the linear solver and transport scheme are routinely run as 32-bit ‘bubbles’ within the code, and it is planned that the model will run fully at 32-bit. However, such codes do not yet take advantage of the smallest precisions (e.g. 16-bit, 8-bit and 4-bit). Geometric multigrid utilises successively coarser meshes, allowing error modes to be selectively resolved by scale. The student will investigate approaches that combine mesh coarsening/refinement with precision coarsening/refinement. The student would be expected to devise approaches for this, initially with simplified model setups (e.g. the shallow-water equations), and evaluate them from both computational and accuracy perspectives before ultimately considering how this should be approached in LFRic.

    Methodology: The student will develop mixed- and low-precision multigrid implementations and perform a rigorous evaluation of both the numerical implications and the performance benefits, in particular focusing on exascale hardware architectures.

    Teaser Project 1: Numerical implications of mixed/low precision multigrid 

    Reducing the floating-point precision in computation has implications for numerical accuracy. However, for multigrid, with its coarsening and refining steps, a reduction in accuracy during the coarsening steps of the algorithm may be acceptable. The student will investigate, using a simple 2D problem, what impact lowering precision has on the overall numerical stability and quality of the computation, and how far the precision can be lowered before the numerics break down. The objectives of Teaser Project 1 are to evaluate how far numerical precision can be lowered safely in the context of multigrid (looking at both geometric and precision-based refinement) before there is a trade-off with the quality of the results.

    Teaser Project 2: Performance impact of mixed/low precision multigrid

    Modern GPUs can accelerate lower-precision floating-point operations in hardware, though the exact performance impact is hardware-specific and problem-dependent. In addition, the use of lower-precision floating-point numbers should reduce demands on the memory subsystem and improve the utilisation of vector units. The student will start by using iterative refinement in the current setup (a Richardson iteration preconditioned with a multigrid V-cycle), evaluate the performance improvements that can be achieved through the gradual lowering of floating-point precision, and quantify these improvements (e.g. runtime and memory bandwidth) as a fraction of the theoretical peak. The objectives of Teaser Project 2 are to define the upper and lower bounds of the performance improvements that can be expected from mixing levels of precision, from 64-bit down to 8- or even 4-bit.
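
    A minimal sketch of the iterative-refinement pattern described above: a cheap low-precision solve acts as the preconditioner inside a double-precision correction loop. Here a float32 LU factorisation stands in for the multigrid V-cycle, an illustrative simplification rather than the LFRic setup.

        import numpy as np
        from scipy.linalg import lu_factor, lu_solve

        rng = np.random.default_rng(0)
        n = 200
        A = rng.standard_normal((n, n)) + n * np.eye(n)   # well-conditioned test system
        b = rng.standard_normal(n)

        lu32 = lu_factor(A.astype(np.float32))            # factorise once, in low precision

        x = np.zeros(n)
        for it in range(10):
            r = b - A @ x                                 # residual computed in float64
            if np.linalg.norm(r) < 1e-12 * np.linalg.norm(b):
                break
            dx = lu_solve(lu32, r.astype(np.float32))     # low-precision correction solve
            x += dx.astype(np.float64)                    # accumulate the solution in float64

        print(it, np.linalg.norm(b - A @ x))              # converges to ~float64 accuracy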

  • Modelling threatened biodiversity at national, continental and planetary scales

    Project institution:
    Project supervisor(s):
    Dr Vinny Davies (University of Glasgow), Dr Mark Bull (University of Edinburgh), Prof Richard Reeve (University of Glasgow), Prof Christina Cobbold (University of Glasgow) and Dr Neil Brummitt (Natural History Museum)

    Overview and Background

    Understanding the stability of ecosystems and how they are impacted by climate and land use change can allow us to identify sites where biodiversity loss will occur and help to direct policymakers in mitigation efforts. Our current digital twin of plant biodiversity – https://github.com/EcoJulia/EcoSISTEM.jl – provides functionality for simulating species through processes of competition, reproduction, dispersal and death, as well as environmental changes in climate and habitat, but it would benefit from enhancement in several areas. This project will target improving the speed of model runs and the feasible scale of simulations, enabling stochastic modelling to quantify uncertainty, scalable inference of missing parameters, and more complex models. It would do this through two approaches: on the one hand, porting the code to run on GPUs for higher computational efficiency; on the other, applying techniques such as mesh refinement and partitioning so that models run at high resolution only where required by the ecosystem complexity. 

    Methodology & Objectives

    ​Teaser Project 1 Objectives: GPU: Port core EcoSISTEM code to GPU 

    This project will analyse the core CPU routines in EcoSISTEM and port them to GPU. This will use packages from the JuliaGPU ecosystem, which provide a relatively easy user interface to the NVIDIA and AMD GPUs available on EPCC's HPC system, to which the student will have access. The main branch of the EcoSISTEM code is already efficiently parallelised for CPUs, and a preliminary assessment has suggested that the porting task should be feasible within a teaser project. This teaser project can be extended in a variety of ways to a full PhD (an illustrative sketch of the porting pattern follows the two options below): 

    On the one hand, once the GPU port speed-ups have been realised, major new components can be added to EcoSISTEM. For instance, the student could investigate uncertainty quantification and parameter inference techniques within the framework. 

    On the other hand, there is a more sophisticated development (dev) branch of EcoSISTEM that is not currently well optimised but allows greater flexibility in how interactions can occur between components of the model. Porting this to GPUs will be a significantly harder task, but will allow richer interactions between ecosystem components to be modelled more easily. 
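
    Although the port itself will use the JuliaGPU ecosystem, the underlying pattern can be illustrated in Python, where CuPy acts as a drop-in replacement for NumPy: a per-cell update is expressed as whole-array operations so the same code runs on CPU or GPU. The dispersal kernel below is a toy stand-in, not EcoSISTEM code.

    import numpy as np
    try:
        import cupy as xp      # GPU path, if CuPy and a CUDA device are available
    except ImportError:
        xp = np                # otherwise fall back to the CPU

    def dispersal_step(pop, rate=0.1):
        # One explicit step of nearest-neighbour dispersal on a 2D abundance grid
        # (absorbing boundaries: individuals dispersing off-grid are lost).
        out = (1.0 - rate) * pop
        out[1:, :]  += 0.25 * rate * pop[:-1, :]
        out[:-1, :] += 0.25 * rate * pop[1:, :]
        out[:, 1:]  += 0.25 * rate * pop[:, :-1]
        out[:, :-1] += 0.25 * rate * pop[:, 1:]
        return out

    pop = xp.zeros((1024, 1024))
    pop[512, 512] = 1000.0     # seed a population in one cell
    for _ in range(100):
        pop = dispersal_step(pop)
    print("total population remaining:", float(pop.sum()))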

    Teaser Project 2 Objectives: Scalability improvements: mesh refinement and partitioning 

    Different types of ecosystem require different scales of spatial resolution to model them adequately. For example, hedgerows and rivers are linear features that may be very species-dense compared to the surrounding terrain. Currently EcoSISTEM uses a uniform mesh size, which means the whole model has the same spatial resolution. To increase the fidelity of the model in a scalable way, the spatial resolution must be increased only where needed. This teaser project would begin by capturing the requirements for non-uniform meshing and prototyping an implementation, possibly in a simple proxy code.  

    With a non-uniform mesh, scaling to many thousands of processes requires load-balancing techniques, either static or dynamic, depending on the use case. The teaser project will investigate the suitability of existing mesh partitioners and adaptive meshing techniques, and how they can interface with the existing Julia code.  
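
    As a toy illustration of the non-uniform meshing idea (Python; a synthetic density map stands in for real species data), the quadtree-style routine below subdivides a cell only where local variation is high, so a linear feature such as a river ends up finely resolved while uniform terrain stays coarse.

    import numpy as np

    def refine(density, x0, y0, size, tol, min_size, cells):
        # Recursively split a square cell while its density range exceeds tol.
        block = density[y0:y0 + size, x0:x0 + size]
        if size <= min_size or block.max() - block.min() <= tol:
            cells.append((x0, y0, size))          # keep as a single coarse cell
            return
        half = size // 2
        for dx, dy in [(0, 0), (half, 0), (0, half), (half, half)]:
            refine(density, x0 + dx, y0 + dy, half, tol, min_size, cells)

    n = 256
    density = np.zeros((n, n))
    density[:, 100:103] = 1.0                     # a "river": species-dense linear feature
    cells = []
    refine(density, 0, 0, n, tol=0.1, min_size=4, cells=cells)
    print(len(cells), "cells instead of", (n // 4) ** 2, "uniform fine cells")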

    References & Further Reading

    1. Digital twins of the natural environment  
    2. Dynamic virtual ecosystems as a tool for detecting large-scale responses of biodiversity to environmental and land-use change 
    3. Effective extensible programming: Unleashing Julia on GPUs 
    4. Strong phylogenetic signals in global plant bioclimatic envelopes 
  • Multi-scale modelling of volcanoes and their deep magmatic roots: Constitutive model development using data-driven methods

    Project institution:
    Project supervisor(s):
    Dr Ankush Aggarwal (University of Glasgow), Dr Tobias Keller (University of Glasgow) and Prof Andrew McBride (University of Glasgow)

    Overview and Background

    This PhD studentship focuses on developing GPU-accelerated models of magmatic processes that underpin volcanic hazards and magmatic resource formation. These processes span sub-millimetre mineral-fluid-melt interactions up to kilometre-scale magma dynamics and crustal deformation. Magma is a multi-phase mixture of solids, silicate melts, and volatile-rich fluids, interacting in complex thermo-chemical-mechanical ways.  

    This is a standalone PhD project that is part of a larger framework of magmatic systems research by the wider team. The project will contribute one component of a hierarchical, multi-scale modelling framework using advanced GPU-based techniques. Specifically, in this project, the PhD student will develop constitutive relationships between stresses and strains/strain-rates of various phases at the magmatic system-scale based on granular-scale mechanical simulations (available through an existing collaboration). The result will enable accurate, large-scale simulations of magma dynamics that capture the complexity of micro-scale constituents and their interactions. 

    Your work will include software development, integrating and interpreting field and experimental data sets, attending regular seminars, collaborating within the wider research team, and receiving training through ExaGEO workshops. 

    Volcanic eruptions originate from shallow crustal magma reservoirs built up over long periods. As magma cools and crystallizes, it releases fluid phases—aqueous, briny, or containing carbonates, metal oxides, or sulfides—whose low viscosity and density contrasts drive fluid segregation. This fluid migration can trigger volcanic unrest or concentrate metals into economically valuable deposits. The distribution of fluids—discrete droplets versus interconnected drainage networks—crucially depends on crystal and melt properties. Direct observations are challenging, so high-resolution, GPU-accelerated simulations provide a way to understand these complex and dynamic systems. 

    Methodology and Objectives

    [Figures: a Gaussian-process-based simulation result; a neural-network-based constitutive modelling framework.]

    Modelling volcanic systems is challenging due to the multi-scale nature of their underlying physical and chemical processes. System-scale dynamics (100 m to 100 km) emerge from interactions involving crystals, melt films, and fluid droplets or channels on micro- to centimetre scales. To link these scales, this project uses a hierarchical approach: (i) direct numerical simulations of granular-scale phase interactions, (ii) deep learning-based computational homogenisation to extract effective constitutive relations, and (iii) system-scale mixture continuum models applying these relations to problems. All components leverage GPU-accelerated computing and deep learning to handle direct simulations at local scales, train effective constitutive models, and achieve sufficient resolution at the system scale. 

    In this project the candidate will extract effective constitutive relations by computationally homogenising the micro-scale mechanical simulations (available through an existing collaboration). The effective constitutive properties will then be used in the macro-scale models to accurately capture the multi-scale effects. The project will leverage recent advances in the use of neural networks [1,2] and Gaussian processes [3,4] for constitutive model development. A range of micro-scale simulation results have already been generated to produce the data covering the different deformation regimes. These results will be used to train a deep-learning-based constitutive model. Approaches based on neural networks and Gaussian processes will be explored and compared. The trained model will then be used in macro-scale simulations, and its results will be compared to those using the constitutive relations currently assumed in the literature. Lastly, the variability resulting from this homogenisation process will be quantified and its propagation into macro-scale simulations will be assessed to ensure confidence in the results. The focus of model applications will be the proposed regime transition from disconnected bubble migration to interconnected channelised seepage of fluids from crystallising magma bodies [5]. 

    Within this project, the student will start by working on two “teaser” sub-projects to gain familiarity with different techniques and data, then choose how to further develop and focus their research. 

    Teaser Project 1 Objectives: This sub-project, conducted over the first year, will focus on neural networks for constitutive modelling. Neural networks (NNs) are the most popular choice of deep learning model. Recent works have used NNs for constitutive model development, identification, and discovery [1,2]. Alongside their flexibility in modelling wide-ranging phenomena, NNs bring a large number of tunable parameters (weights), associated uncertainty, and a requirement for large training datasets. This teaser project will explore the use of NNs for constitutive model development based on simplified one- and two-phase micro-scale systems. This will include finding a suitable architecture, training hyperparameters, and the required training dataset. A GPU-based implementation will be developed to make the training of high-dimensional neural networks feasible. This teaser project will pave the path towards a neural-network-based approach for the overall project over the next three years, wherein the initial implementation will be extended to complex micro-scale simulations modelling four phases. Additionally, in the full project, the uncertainty related to neural networks will be quantified, and the required training data will be optimised. These additions will further increase the computational cost, thus necessitating a GPU-accelerated framework. 
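
    A minimal sketch of the idea follows (Python/PyTorch; a synthetic one-dimensional stress-strain curve stands in for the micro-scale simulation outputs): a small network is fitted as a constitutive relation mapping strain to stress, and the identical training loop moves to GPU unchanged when the data grow.

    import torch
    import torch.nn as nn

    torch.manual_seed(0)
    strain = torch.linspace(0.0, 0.5, 200).unsqueeze(1)
    stress = 2.0 * strain + 5.0 * strain**3 + 0.01 * torch.randn_like(strain)  # toy "data"

    model = nn.Sequential(nn.Linear(1, 32), nn.Tanh(),
                          nn.Linear(32, 32), nn.Tanh(),
                          nn.Linear(32, 1))
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    for epoch in range(2000):
        opt.zero_grad()
        loss = nn.functional.mse_loss(model(strain), stress)
        loss.backward()
        opt.step()
    # The trained network is now a queryable effective constitutive law;
    # model.to("cuda") moves the same loop to GPU for larger datasets.
    print("final training loss:", float(loss))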

    Teaser Project 2 Objectives: This sub-project, conducted over the first year, will focus on Gaussian processes for constitutive modelling. Gaussian processes (GPs) are rigorous statistical tools that are an attractive alternative to neural networks [3]. The main advantage of GPs is that, in addition to the mean, they also capture the variation/confidence in the results, which can in turn inform which micro-scale simulations must be run to improve their accuracy. Recently, GPs have been used for constitutive model development for hyperelastic solids [4]. This teaser project will explore GPs for modelling the effective constitutive relationships of simplified one- and two-phase micro-scale systems, using the results to also select the required micro-scale simulations. Thermodynamic constraints on the constitutive model will be added by extending the GP framework [4], which will increase the training cost; thus, a GPU-based implementation will be required to make the computation feasible. If this approach is selected for the rest of the PhD, it will be extended to the fully complex micro-scale model over the next three years. Moreover, the GP approach will be leveraged to develop a robust framework for design of experiments, such that there is high confidence in the resulting constitutive properties. The design-of-experiments step brings an exponentially higher computational cost, thus necessitating a GPU-accelerated framework. 
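
    The contrast with the neural-network route can be sketched in the same toy setting (Python/scikit-learn, synthetic data): the GP's predictive standard deviation indicates where a new micro-scale simulation would most reduce uncertainty, which is the basis of the design-of-experiments idea described above.

    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF, WhiteKernel

    rng = np.random.default_rng(0)
    strain = np.linspace(0.0, 0.5, 20)[:, None]            # sparse training points
    stress = 2.0 * strain + 5.0 * strain**3 + 0.01 * rng.standard_normal(strain.shape)

    gp = GaussianProcessRegressor(kernel=RBF() + WhiteKernel(), normalize_y=True)
    gp.fit(strain, stress.ravel())

    query = np.linspace(0.0, 0.6, 100)[:, None]            # includes an extrapolation region
    mean, std = gp.predict(query, return_std=True)
    next_sim = float(query[np.argmax(std)])                # where a new simulation helps most
    print("run the next micro-scale simulation near strain =", round(next_sim, 3))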

    References & Further Reading

    [1] Linka, K., Hillgärtner, M., Abdolazizi, K. P., Aydin, R. C., Itskov, M., & Cyron, C. J. (2021). Constitutive artificial neural networks: A fast and general approach to predictive data-driven constitutive modeling by deep learning. Journal of Computational Physics, 429, 110010. 

    [2] Liu, X., Tian, S., Tao, F., & Yu, W. (2021). A review of artificial neural networks in the constitutive modeling of composite materials. Composites Part B: Engineering, 224, 109152. 

    [3] Williams, C. K., & Rasmussen, C. E. (2006). Gaussian Processes for Machine Learning. Cambridge, MA: MIT Press. 

    [4] Aggarwal, A., Jensen, B. S., Pant, S., & Lee, C. H. (2023). Strain energy density as a Gaussian process and its utilization in stochastic finite element analysis: Application to planar soft tissues. Computer methods in applied mechanics and engineering, 404, 115812. 

    [5] Degruyter, W., Parmigiani, A., Huber, C. and Bachmann, O., 2019. How do volatiles escape their shallow magmatic hearth?. Philosophical Transactions of the Royal Society A, 377(2139), p.20180017. 

  • Near-real-time monitoring of supraglacial lake drainage events across the Greenland Ice Sheet

    Project institution:
    Project supervisor(s):
    Dr Katie Miles (Lancaster University), Dr Henry Moss (Lancaster University), Prof Philipp Otto (University of Glasgow) and Dr Amber Leeson (Lancaster University)

    Overview and Background

    The drainage of supraglacial lakes plays an important role in modulating ice velocity, and thus the mass balance, of the Greenland Ice Sheet. To date, research has primarily focused on drainage events and their impacts on ice motion during the summer melt season, but recent work has shown that drainage events during winter can also affect ice dynamics. However, current approaches are limited to a single season and/or one or two satellite sensors, limiting observations, whereas year-round, multi-sensor monitoring is required to fully understand lake processes and their impacts. This PhD will utilise petabytes of available Earth Observation data and exascale computing to perform ice-sheet-scale analysis of year-round supraglacial lake drainage events on the Greenland Ice Sheet and produce scalable workflows that can be used to assess the impact of lake drainage events in other glaciological environments. 

    Methodology and Objectives

    This PhD project will advance capabilities in the detection of supraglacial lake drainage events on the Greenland Ice Sheet (GrIS) and assess the impact of these drainage events on ice dynamics. The PhD will commence with two exploratory projects (~6 months each), providing complementary experience in deploying exascale compute and performing big-data analysis. 

    Teaser Project 1: Near-real-time, automated, multi-sensor, year-round monitoring of supraglacial lake drainage 

    Current assessments of supraglacial lake drainage on the GrIS are largely restricted to a single season and/or satellite sensor, particularly during winter, where existing approaches remain constrained to low-volume pipelines that analyse single orbits and provide limited temporal and spatial coverage. However, the plethora of remotely sensed imagery now available provides the opportunity to detect supraglacial lake drainage events at high temporal and spatial resolution across the entire ice sheet in near real-time through exascale big-data analysis. This project will scale a SAR-based methodology for supraglacial lake drainage detection to ice-sheet-wide monitoring, using a high-data-volume approach to achieve near-daily temporal resolution by leveraging all available orbits and both C- and L-band SAR. The project will exploit access to exascale compute to deploy GPU-accelerated machine learning methods (e.g., convolutional networks or U-Nets) able to extract spatiotemporal patterns from large and complex volumes of multi-frequency inputs, supporting robust, scalable detection of drainage events across diverse glaciological settings. Validation and training will draw on timestamped ArcticDEM strips and ICESat-2 altimetry, ensuring reliable accuracy assessments at ice-sheet scale, using methods for (cross-)validation across space and time (e.g., Otto et al. 2024). 
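
    As an illustration of the intended classifier pattern (Python/PyTorch; random tensors stand in for real C- and L-band SAR patches and their labels), a small convolutional network of this shape can be trained and then applied patch-by-patch at scale on GPUs.

    import torch
    import torch.nn as nn

    net = nn.Sequential(
        nn.Conv2d(2, 16, 3, padding=1), nn.ReLU(),       # 2 channels: C- and L-band
        nn.MaxPool2d(2),
        nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        nn.Linear(32, 2),                                # classes: drained / not drained
    )
    device = "cuda" if torch.cuda.is_available() else "cpu"
    net = net.to(device)
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)

    patches = torch.randn(64, 2, 64, 64, device=device)  # stand-in for SAR patches
    labels = torch.randint(0, 2, (64,), device=device)   # stand-in for drainage labels
    for step in range(100):
        opt.zero_grad()
        loss = nn.functional.cross_entropy(net(patches), labels)
        loss.backward()
        opt.step()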

    Teaser Project 2: Near-real-time evaluation of the impact of supraglacial lake drainage on the GrIS 

    As supraglacial lakes form increasingly far inland under rising atmospheric temperatures, the year-round impact of their drainage events on ice dynamics is not yet well understood. Recent research has shown that winter drainage events are numerous, often occur as “cascade events”, and can result in short-term increases in ice velocity (Dean et al., under review). However, a systematic, year-round analysis of the impact of supraglacial lake drainage events on ice dynamics across the GrIS has not yet been undertaken. This project will set up a pipeline on a sub-region of the ice sheet to analyse the impact of drainage events on ice dynamics in real time, allowing later scaling up to ice-sheet scale. Access to exascale compute will enable comparison of the database of drainage events created for Project 1 with climate and glaciological data (e.g., temperature, precipitation, surface energy balance, ice surface velocity, ice thickness, and bed elevation). By applying scalable statistical methods such as changepoint analysis (offline detection), statistical process monitoring (online surveillance), and anomaly detection using deep learning, the impact of lake drainage events will be evaluated in real time and assessed over a range of timescales. Additionally, the trained DNNs from Project 1 can be used for dimensionality reduction and process monitoring based on data depths, which can indicate further sources/reasons for detected changes (Malinovskaya et al. 2024). 
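
    A minimal sketch of the online-surveillance idea (Python/NumPy; a synthetic ice-velocity series with a step change stands in for real velocity products): a one-sided CUSUM statistic, one of the simplest statistical process monitoring tools, flags a post-drainage speed-up as soon as the accumulated evidence crosses a threshold.

    import numpy as np

    rng = np.random.default_rng(1)
    v = np.concatenate([rng.normal(100, 2, 200),   # in-control velocity (m/yr)
                        rng.normal(108, 2, 100)])  # speed-up after a drainage event

    mu0, sigma = 100.0, 2.0            # in-control mean and noise level
    k, h = 0.5 * sigma, 5.0 * sigma    # reference value and decision threshold
    s = 0.0
    for t, vt in enumerate(v):
        s = max(0.0, s + (vt - mu0 - k))   # one-sided CUSUM recursion
        if s > h:
            print("speed-up detected at time step", t)
            break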

    Long-term pathway and objectives 

    At the end of Year 1, the student will select a pathway for further development. If Project 1 is chosen, the PhD will focus on leveraging additional compute resources and machine learning methods to scale the analysis to include additional, larger, and multi-modal data sources in our drainage detection methodology (such as optical imagery to enhance summer detection) and to deploy our method over other regions, such as Antarctic ice shelves, where real-time monitoring of supraglacial lake drainage events and cascades could be a useful precursor for forecasting ice-shelf disintegration (Banwell et al., 2013). If Project 2 is chosen, the PhD will focus on exploiting access to exascale compute and machine learning methods to scale the analysis to an ice-sheet-wide scale, enabling near-real-time assessment of the impact of supraglacial lake drainage events on the GrIS and potentially other glaciological environments.  

    For either pathway, the aims of the PhD are to: 

    • Advance understanding of year-round supraglacial lake drainage events on the Greenland Ice Sheet. 
    • Exploit exascale compute resources and machine learning to enable real-time detection of supraglacial lake drainage events at an unprecedented scale and assess their impacts. 
    • Produce scalable workflows that can be applied to supraglacial lake drainage events in other glaciological regions. 

    References & Further Reading

    Banwell, A. et al., (2013), Breakup of the Larsen B Ice Shelf triggered by chain reaction drainage of supraglacial lakes, Geophysical Research Letters, 40, 22, 5872-5876, doi.org/10.1002/2013GL057694 

    Christoffersen, P. et al., (2018), Cascading lake drainage on the Greenland Ice Sheet triggered by tensile shock and fracture, Nature Communications, 9, 1064, doi.org/10.1038/s41467-018-03420-8 

    Dean, C. et al. (under review), A decade of winter supraglacial lake drainage across Northeast Greenland using C-band SAR, The Cryosphere Discussions 

    Dunmire, D. et al., (2025), Greenland Ice Sheet wide supraglacial lake evolution and dynamics: insights from the 2018 and 2019 melt seasons, Earth and Space Science, 12, 2, doi.org/10.1029/2024EA003793 

    Leeson, A. et al., (2015), Supraglacial lakes on the Greenland ice sheet advance inland under warming climate, Nature Climate Change, 5, 51–55, doi.org/10.1038/nclimate2463 

    Malinovskaya, A., Mozharovskyi, P., & Otto, P. (2024). Statistical process monitoring of artificial neural networks. Technometrics, 66(1), 104-117, doi.org/10.1080/00401706.2023.2239886 

    Miles, K., et al., (2017), Toward monitoring surface and subsurface lakes on the Greenland Ice Sheet using Sentinel-1 SAR and Landsat-8 OLI imagery, Frontiers in Earth Science, 5, doi.org/10.3389/feart.2017.00058 

    Otto, P., Fassò, A., & Maranzano, P. (2024). A review of regularised estimation methods and cross-validation in spatiotemporal statistics. Statistics Surveys, 18, 299-340, doi.org/10.1214/24-SS150 

  • Scalable Deep Learning for Biodiversity Monitoring under Real-World Constraints

    Project institution:
    Project supervisor(s):
    Dr Tiffany Vlaar (University of Glasgow), Prof Colin Torney (University of Glasgow), Prof Rachel McCrea (Lancaster University), Dr Thomas Morrison (University of Glasgow) and Dr Paul Eizenhöfer (University of Glasgow)

    Overview and Background

    Technological advances have ushered in the era of big data in ecology (McCrea et al., 2023). The use of deep learning and GPUs shows promise for more effective biodiversity monitoring, which is vital for monitoring and mitigating the effects of climate change. However, many open questions remain on the biases and behaviour of deep neural networks under real-world constraints, such as unbalanced data and uncertain labels. There is a pressing need for better benchmarks to train, test, and understand these models. Further, Kaplan et al. (2020) found that the performance of deep neural networks improves with model and data size. Training large models on vast ecological datasets requires substantial GPU time, and reliable performance will greatly benefit from the potential of exascale computing.

    Methodology and Objectives

    Project 1: Scalable Biodiversity Monitoring with Deep Learning by Understanding What Data Matters
    The big data era in ecology offers incredible potential for biodiversity monitoring, but is constrained by the need to manually process vast amounts of data. A combination of deep learning and citizen science approaches offers a promising avenue for reliable and accelerated biodiversity monitoring, e.g. for counting wildlife in aerial survey images (Torney et al., 2019) and for species classification in camera-trap data (Sharpe et al., 2025). Neural networks are typically evaluated on their ability to generalise to new, unseen data (Zhang et al., 2017). In this project we will investigate which data is actually important for obtaining good generalisation performance. A potential proxy for data sample “importance” is an example difficulty score. Although we can consider different metrics and types of data later in the PhD, for the initial teaser project we will consider example difficulty through the lens of citizen science classifications on camera-trap data: if there is substantial disagreement amongst human volunteers on the classification of an image, we consider this example to be more difficult. We will investigate the role that these difficult examples play during training of deep neural networks. We will then study which, when, and how many examples can be removed from training without affecting generalisation performance, and how this is affected by the choice of model architecture and its corresponding inductive bias (or learning bias). The iterative retraining cycles will benefit greatly from GPU compute and the potential of exascale computing. The outcome of this project will offer not only routes towards increased efficiency and more sustainable AI, but also important insights into the training dynamics of deep learning models on real-world, complex ecological datasets, providing a strong foundation for the rest of the PhD.
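
    A minimal sketch of the difficulty-scoring step (Python/NumPy; toy per-image vote counts stand in for real citizen-science classifications): example difficulty is taken as the entropy of the volunteer label distribution, and the easiest fraction is then dropped from the training set.

    import numpy as np

    votes = np.array([[12, 0, 0],     # rows: images; columns: candidate species
                      [6, 5, 1],
                      [4, 4, 4],
                      [11, 1, 0]])
    p = votes / votes.sum(axis=1, keepdims=True)
    logp = np.log(p, where=p > 0, out=np.zeros_like(p))
    entropy = -(p * logp).sum(axis=1)              # high entropy = high disagreement

    order = np.argsort(entropy)                    # low entropy = easy example
    keep = order[len(order) // 4:]                 # e.g. drop the easiest 25%
    print("difficulty scores:", entropy.round(2), "-> train on images", sorted(keep))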

    Project 2: Enhancing Rare Species Classification with Generative AI
    Real-world ecological datasets often feature large class imbalances, meaning that certain species are significantly underrepresented. While deep neural networks offer potential for successful biodiversity monitoring, accurately classifying these scarcely represented species remains particularly challenging. Although deep learning models can be pre-trained to enhance performance, available pre-training data can feature strong biases, inaccuracies, and limited diversity, which can affect downstream performance (Luccioni et al., 2022). Generative models, such as diffusion models (Sohl-Dickstein et al., 2015; Song et al., 2019) and Generative Adversarial Networks (GANs) (Goodfellow et al., 2014), have the potential to generate realistic, high-quality, diverse synthetic images. In this project we will investigate whether such synthetic data can complement real ecological data to benefit training. Successful training of large generative models will greatly benefit from exascale computing resources. In this teaser project, the student will work with an existing camera-trap dataset and adapt existing deep learning techniques (e.g., Rombach et al., 2022) to generate synthetic data for this specific setting. They will then test how training on the combined dataset enhances test-time performance for rare species. Through this the student will deepen their understanding of camera-trap data, gain relevant ecological knowledge, and work towards enhancing the performance of deep learning models for biodiversity monitoring under the widespread challenge of dataset imbalance. It would be valuable to analyse whether findings generalise to other settings, including remote sensing data and different models, with support from the supervisor team.
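
    The rebalancing step can be sketched as follows (Python/PyTorch; random tensors stand in for real and diffusion-generated camera-trap images): synthetic images of the rare class are pooled with the real data, and batches are rebalanced with a weighted sampler.

    import torch
    from torch.utils.data import TensorDataset, DataLoader, WeightedRandomSampler

    real_x = torch.randn(1000, 3, 64, 64); real_y = torch.zeros(1000, dtype=torch.long)
    rare_x = torch.randn(20, 3, 64, 64);   rare_y = torch.ones(20, dtype=torch.long)
    synth_x = torch.randn(200, 3, 64, 64); synth_y = torch.ones(200, dtype=torch.long)  # generated

    x = torch.cat([real_x, rare_x, synth_x]); y = torch.cat([real_y, rare_y, synth_y])
    weights = (1.0 / torch.bincount(y).float())[y]       # inverse-frequency sample weights
    sampler = WeightedRandomSampler(weights, num_samples=len(y), replacement=True)
    loader = DataLoader(TensorDataset(x, y), batch_size=64, sampler=sampler)

    xb, yb = next(iter(loader))
    print("batch class counts:", torch.bincount(yb))     # roughly balanced despite imbalance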

    References & Further Reading

    1. McCrea et al. (2023). Realising the promise of large data and complex models. Methods in Ecology and Evolution. 14, 4-11.
    2. Kaplan et al. (2020). Scaling Laws for Neural Language Models. CoRR.
    3. Torney et al. (2019). A comparison of deep learning and citizen science techniques for counting wildlife in aerial survey images. Methods in Ecology and Evolution. 10, 779-787.
    4. Sharpe et al. (2025). Increasing citizen scientist accuracy with artificial intelligence on UK camera-trap data. Remote Sensing in Ecology and Conservation.
    5. Zhang et al. (2017). Understanding deep learning requires rethinking generalization. ICLR.
    6. Luccioni et al. (2022). Bugs in the Data: How ImageNet Misrepresents Biodiversity. AAAI.
    7. Sohl-Dickstein et al. (2015). Deep Unsupervised Learning using Nonequilibrium Thermodynamics. International Conference on Machine Learning.
    8. Song et al. (2019). Generative Modeling by Estimating Gradients of the Data Distribution. Advances in Neural Information Processing Systems.
    9. Goodfellow et al. (2014). Generative adversarial nets. NeurIPS.
    10. Rombach et al. (2022). High-Resolution Image Synthesis with Latent Diffusion Models. CVPR.
  • Scotland landscape response to past abrupt climate change: GPU-accelerated numerical simulations and model-data integration

    Project institution:
    Project supervisor(s):
    Dr Jingtao Lai (University of Glasgow), Prof Todd Ehlers (University of Glasgow), Dr Sebastian Mutz (University of Glasgow) and Dr Katie Miles (Lancaster University)

    Overview and Background

    Around 12,900 years ago, a rapid cooling of the climate triggered widespread glaciation across the Northern Hemisphere. This cold phase, known as the Younger Dryas, ended abruptly with a dramatic warming. This event provides a valuable natural experiment for understanding how Earth systems respond to sudden shifts in climate, a question highly relevant to today’s ongoing climate change. 

    Scotland preserves one of the most detailed geological and geomorphological records of Younger Dryas glaciation anywhere in the world. This project will leverage these records by developing a GPU-accelerated computational workflow that integrates numerical simulations of glaciation with glaciological, geological, and geomorphological datasets. The aim is to reconstruct and test interactions among glaciation, climate, and topography during the Younger Dryas, improving our understanding of Earth’s response to abrupt climate change. 

    Methodology and Objectives

    Although previous research has produced a wealth of observations on the Younger Dryas glaciation in Scotland, efforts to integrate them with numerical modelling remain limited. This is largely because most model-data integration methods require running a large ensemble of simulations, while traditional glaciation simulations are computationally demanding. A recently developed GPU-based ice flow model offers an opportunity to address this challenge. By applying physics-informed machine learning techniques, the model leverages the power of modern GPU hardware to efficiently simulate glaciation. This project will build on this model and focus on exploring model-data integration methods on exascale computers. 

    The overarching goal of this PhD is to develop a robust and scalable model-data integration workflow that can integrate the GPU-based model with glaciological, geomorphological, and geological datasets in Scotland. Depending on their interests, the student may also pursue further model developments, such as optimizing the workflow for efficient multi-GPU simulations or coupling it with other Earth system models, including climate and landscape evolution models. 

    Teaser Project 1: Numerical inversion of past climate conditions in Scotland 

    Although the glaciation event in Scotland broadly coincides with the global Younger Dryas period, evidence is emerging that the exact timing and magnitude of the climate shift in Scotland differed from those in other parts of northwestern Europe. A better understanding of regional climate conditions in Scotland therefore has important implications for understanding the interactions between different Earth systems during abrupt climate changes. The objective of this Teaser Project is to use numerical inversion to infer past climate conditions from the mapped ice extent of the Younger Dryas glaciation in Scotland. The project will integrate GPU-based simulations of glaciation with ice extent constraints through the Markov chain Monte Carlo (MCMC) method, a computational approach that explores many possible scenarios to determine which ones best fit the evidence. The goal is to constrain the climate conditions needed to produce glaciation consistent with field observations in Scotland. 
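
    A minimal sketch of the inversion loop (Python/NumPy; a toy scalar relation between a temperature anomaly and ice extent stands in for the GPU ice-flow model): the Metropolis variant of MCMC samples the climate parameter most consistent with a mapped extent and its uncertainty.

    import numpy as np

    rng = np.random.default_rng(0)

    def ice_extent(dT):
        # Toy forward model: a colder climate (more negative dT) gives a larger extent (km^2).
        return 4000.0 - 900.0 * dT

    observed, obs_sigma = 6700.0, 200.0            # "mapped" extent and its uncertainty

    def log_post(dT):
        # Gaussian likelihood, flat prior on dT.
        return -0.5 * ((ice_extent(dT) - observed) / obs_sigma) ** 2

    dT, samples = 0.0, []
    for _ in range(20000):
        prop = dT + 0.2 * rng.standard_normal()    # random-walk proposal
        if np.log(rng.uniform()) < log_post(prop) - log_post(dT):
            dT = prop                              # accept
        samples.append(dT)
    post = np.array(samples[5000:])                # discard burn-in
    print("posterior dT: %.2f +/- %.2f C" % (post.mean(), post.std()))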

    The long-term pathway of this Teaser Project will focus on optimizing the workflow by 1) integrating additional geomorphological and geological observations (e.g., moraine positions, sedimentary records, and chronology data), and 2) combining glaciation simulations with climate simulations to capture the interaction between ice growth/decay and climate variations.  

    Teaser Project 2: Using data assimilation to reveal the dynamic evolution of glacier systems 

    The evolution of Younger Dryas glaciation in Scotland provides a valuable natural experiment for understanding how glacier systems respond to rapid climate fluctuations. Field evidence has helped reconstruct the general pattern and timing of ice advance and retreat in some areas, but these records remain spatially patchy and only partially time-resolved. This Teaser Project aims to combine such sparse chronological observations with physically based ice flow models by using data assimilation techniques (e.g., Ensemble Kalman Filter). Data assimilation is a method that updates a glacier simulation over time by combining the model’s predictions with observations, such as exposure dating, to produce the most likely evolution of the glacier. By applying this approach, the project aims to develop a framework capable of reconstructing the timing, extent, and dynamics of glaciation in unprecedented detail.  
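
    One analysis step of the method can be sketched directly (Python/NumPy; a toy scalar glacier-length model with an uncertain melt parameter stands in for the GPU ice-flow model): the Ensemble Kalman Filter update pulls both the observed quantity and the unobserved parameter towards the data.

    import numpy as np

    rng = np.random.default_rng(2)
    n_ens = 100
    melt = rng.normal(1.0, 0.3, n_ens)                          # uncertain melt factors
    length = 50.0 - 10.0 * melt + rng.normal(0.0, 0.5, n_ens)   # forecast lengths (km)

    obs, obs_err = 38.0, 1.0                  # a dated length constraint (e.g. a moraine)
    state = np.vstack([length, melt])         # joint state: observable plus parameter
    H = np.array([[1.0, 0.0]])                # observation operator: length only

    P = np.cov(state)                                                    # 2x2 covariance
    K = P @ H.T @ np.linalg.inv(H @ P @ H.T + np.array([[obs_err**2]]))  # Kalman gain
    innovation = obs + rng.normal(0.0, obs_err, n_ens) - (H @ state).ravel()
    state = state + K @ innovation[None, :]                              # analysis ensemble
    print("melt factor: prior mean %.2f -> posterior mean %.2f"
          % (melt.mean(), state[1].mean()))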

    The long-term pathway of this Teaser Project is to create a transferable data–model integration workflow that can assimilate diverse glaciological, geological, and geomorphological datasets, ultimately improving our understanding of the dynamic response of ice flow in Scotland during the Younger Dryas. 

    ​References & Further Reading

    https://www.antarcticglaciers.org/glacial-geology/british-irish-ice-sheet/younger-dryas-loch-lomond-stadial/ 

    https://www.antarcticglaciers.org/glacial-geology/british-irish-ice-sheet/younger-dryas-loch-lomond-stadial/the-loch-lomond-stadial/ 

    Leger, T. P. M., Jouvet, G., Kamleitner, S., Mey, J., Herman, F., Finley, B. D., Ivy-Ochs, S., Vieli, A., Henz, A., & Nussbaumer, S. U. (2025). A data-consistent model of the last glaciation in the Alps achieved with physics-driven AI. Nature Communications, 16(1), 848. https://doi.org/10.1038/s41467-025-56168-3 

  • Unlocking understanding of floods and droughts through data assimilation and exascale computing

    Project institution:
    Project supervisor(s):
    Prof Jess Davies (Lancaster University), Prof Lindsay Beevers (University of Edinburgh), Dr Simon Moulds (University of Edinburgh) and Prof Gordon Blair (UKCEH)

    Overview and Background

    Floods and droughts are increasingly impacting society, ecosystems, and the environment, yet predicting when they will occur and what their effects will be remains a major scientific challenge. Soil-water interactions are at the heart of this challenge, as they are pivotal in storing and releasing water in landscapes, determining plant growth, and cycling nutrients. However, these interactions are highly complex, and we currently rely on computationally intensive process-based models to understand them and predict their influence on ecosystem services. With recent advances in satellite imagery and sensing, a wealth of soil moisture data and other relevant data products are now available that could transform these models and our understanding of the risks and impacts of floods and droughts. This studentship focuses on taking advantage of new exascale computing approaches to facilitate data assimilation, exploring how the fusion of big data with hydrological and biogeochemical soil-water process models can help unlock new insights and understanding. 

    Methodology and Objectives 

    Teaser Project 1: The role of soil water storage in drought risk 

    Objective: Estimate the contribution of soil water storage to mitigating or increasing drought risk in a case study catchment by combining remotely sensed soil moisture data, along with meteorological, hydrological, and hydrogeological data, with hydrological models. 

    Soil water holding capacity can play a significant role in buffering droughts by storing moisture and supporting groundwater recharge. However, the interactions among precipitation, soil processes, surface flow, and groundwater are complex. Using data-driven methods to explore these relationships could improve our understanding of drought propagation. 

    Soil moisture is an important component in semi-distributed or distributed hydrological models. However, it is often poorly represented, and it is not routinely updated dynamically during a simulation. If we can build data-driven models that relate precipitation to groundwater through soil-water interactions during droughts, we could improve our hydrological models by including hybrid process representations.

    In this teaser, the student will begin to explore different approaches to assimilating remote sensing and in-situ monitoring data into hydrological models, focusing on the Tweed catchment, to enhance the models' representation of soil-water-groundwater interactions during droughts.  

    Teaser Project 2: The effects of droughts on long-term soil carbon cycling   

    Objective: Improve process-based model representation of the long-term effects of droughts on plant growth and soil carbon through remote sensing data assimilation. 

    A lack of water can have large effects on plants, especially on annual crops where water conditions can severely affect the plant’s growth and survival. With changing water patterns and increasing frequency of prolonged dry periods, the effects on plant productivity are expected to be large, and there will be knock-on effects for soil carbon storage in the longer-term. 

    Remote sensing offers many data products that can provide us with data-based insights into plant productivity and soil moisture conditions. However, remote sensing of soil carbon is much more difficult, and understanding of the long-term response to changes in plant productivity still requires process-based models. 

    In this teaser, the process-based model N14CP, which simulates plant-soil carbon cycling, will be adapted to assimilate (Gross or Net) Primary Productivity (GPP/NPP) and soil moisture remote sensing data products during a known period of drought in the UK. Freely available datasets, for example from MODIS and SMAP, that match the spatial resolution of the model will provide a starting point. This model will be used to explore the long-term effects of droughts on soil carbon. 

    Shared methods and the pathway to PhD 

    • Both teasers have a common focus on droughts and involve data assimilation into process-based models. The student will explore a range of approaches, working up from direct insertion, through traditional data assimilation (e.g. Kalman filtering or particle filtering), to ML-supported approaches (e.g. combining ensemble Kalman filtering with machine learning to reduce compute times) and ML-based surrogate modelling to speed up process-based model simulation, using, for example, Recurrent Neural Network (RNN) methods such as Long Short-Term Memory (LSTM) networks, which have been shown to be promising for emulating hydrological systems (a minimal surrogate sketch follows this list).      
    • The two teasers can be developed into two full chapters focused on the use of data assimilation in determining drought risk and knock-on impacts for carbon cycling. 
    • The PhD can be further developed in a number of other directions, depending on the student's interests, by: i) expanding the focus to floods; ii) developing scaling approaches to move up to catchment and national scales; iii) exploring two-way learning between data and models; and iv) trialling real-time assimilation approaches that help move towards a digital twin.  
    • Exascale/GPU computing will be fundamental in supporting ML data assimilation approaches and hybrid model simulations. For instance, the development of a generalisable ML surrogate model capable of simulating hydrological fluxes and storage processes at the land surface requires training on large spatially and temporally explicit datasets comprising satellite imagery, model outputs, and other relevant data sources. During training, the model must be exposed to as much information as possible to accurately learn the system’s responses to various inputs. A limited dataset reduces the likelihood that the model will capture the full spectrum of system behaviours. This limitation is particularly significant in non-linear systems, such as hydrological systems, where extrapolation beyond the training range becomes unreliable. GPUs will be vital to handling data volumes needed to achieve this, enabling parallelisation of matrix operations on large training data. Greater computational capacity permits the use of larger datasets during training, thereby improving the robustness and generalisability of the surrogate model. 
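
    As a minimal sketch of the surrogate idea mentioned in the first bullet above (Python/PyTorch; a toy linear-reservoir model stands in for the process-based hydrological model), an LSTM can be trained to emulate the model's rainfall-to-runoff response.

    import torch
    import torch.nn as nn

    torch.manual_seed(0)
    rain = torch.rand(32, 100, 1)                   # (batch, time steps, features)
    store, flows = torch.zeros(32, 1), []
    for t in range(100):                            # toy process model to emulate
        store = store + rain[:, t] - 0.1 * store
        flows.append(0.1 * store)
    flow = torch.stack(flows, dim=1)                # (batch, time steps, 1)

    class Surrogate(nn.Module):
        def __init__(self):
            super().__init__()
            self.lstm = nn.LSTM(1, 32, batch_first=True)
            self.head = nn.Linear(32, 1)
        def forward(self, x):
            h, _ = self.lstm(x)
            return self.head(h)

    model = Surrogate()
    opt = torch.optim.Adam(model.parameters(), lr=1e-2)
    for epoch in range(300):
        opt.zero_grad()
        loss = nn.functional.mse_loss(model(rain), flow)
        loss.backward()
        opt.step()
    print("surrogate MSE:", float(loss))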

    The student’s research will be connected to the Floods and Droughts Research Infrastructure at UK Centre for Ecology & Hydrology, helping connect the student with relevant research and data resources: https://www.ceh.ac.uk/our-science/projects/floods-and-droughts-research-infrastructure-fdri  


Projects with a focus on Geodynamics, Geosciences and Environmental Change:

 

  • AI-Driven Satellite Embeddings for Fine-resolution Mapping and Tracking Invasive Species on Global Reclaimed Lands

    Project institution:
    Project supervisor(s):
    Dr Meiliu Wu (University of Glasgow), Dr Alex Bush (Lancaster University), Dr Wenxin Zhang (University of Glasgow) and Prof Brian Barrett (University of Glasgow)

    Overview and Background

    Reclaimed lands (e.g., post-mining and post-industrial sites) are expanding globally and are particularly susceptible to colonisation by invasive plant species, which can negatively affect restoration, biodiversity, and ecosystem functions. We propose to evaluate and extend AlphaEarth Foundations, i.e., Google's Satellite Embedding V1 (annual, 10 m by 10 m, 64-dimensional), for fine-resolution identification of invasive plant species in reclaimed areas worldwide and for tracking temporal change in species composition and spread. AlphaEarth encodes multi-sensor Earth Observation (EO) time series into consistent, analysis-ready embeddings for 2017–2024, supporting scalable classification and efficient change detection. We will combine embedding-based models with climate and reclamation histories to identify key drivers, quantify uncertainty, and align the results with ExaGEO's “exascale model & big-data coupling” platform.  

    Methodology and Objectives

    Data & Methods Used: 

    We will use Google Earth Engine's Satellite Embedding V1 as the core feature space; the embeddings are unit-length, consistent over years, and produced by AlphaEarth Foundations (Brown et al., 2025) via multi-modal assimilation across optical, radar, and LiDAR, facilitating both classification and dot-product/angle-based change metrics. We will (i) assemble global reclaimed-area masks, beginning with open global-scale mining polygons (v2) (Maus et al., 2022) and complementary sources; (ii) compile invasive plant labels from the Global Invasive Species Database (GISD) linked to Global Biodiversity Information Facility (GBIF) occurrences; (iii) develop label-efficient methods (e.g., positive-unlabelled, semi-supervised, and weak supervision with quality controls); (iv) build temporal transformers over annual embeddings for trend/change analysis; and (v) perform GPU-accelerated distributed training/inference with rigorous uncertainty quantification (e.g., deep ensembles) and domain-shift tests across regions and biomes.  
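
    Because the embeddings are unit-length, the change metric reduces to a per-pixel dot product. A minimal sketch (Python/NumPy; random unit vectors stand in for two annual Satellite Embedding V1 tiles):

    import numpy as np

    rng = np.random.default_rng(0)

    def unit(v):
        return v / np.linalg.norm(v, axis=-1, keepdims=True)

    year_a = unit(rng.standard_normal((100, 100, 64)))     # one 64-dim embedding per pixel
    year_b = year_a.copy()
    year_b[40:60, 40:60] = unit(rng.standard_normal((20, 20, 64)))  # simulated change

    similarity = np.sum(year_a * year_b, axis=-1)          # cosine = dot for unit vectors
    changed = similarity < 0.5                             # threshold is illustrative only
    print("flagged pixels:", int(changed.sum()), "(true change region: 400)")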

    Teaser Project 1  

    Suggested headings: Fine-resolution invasive species mapping on global reclaimed lands  

    Objectives:  

    1. Benchmark AlphaEarth embeddings vs. standard EO features for species-level classification on reclaimed sites (start with mining areas, then generalise), measuring macro-F1, calibration, and cross-region transfer. 
    2. Build a label-efficient pipeline that fuses GISD species lists with spatially filtered GBIF occurrences and negative sampling within reclaimed buffers; evaluate sensitivity to sampling bias. 
    3. Develop multi-scale embedding aggregation (tile/patch pooling and linear composability) for species distinguishability; compare tree-based methods vs. shallow nets on the 64-dimensional space for compute efficiency.
    4. Produce regional probability maps (e.g., Scotland, Ruhr, Appalachia, Hong Kong, Shantou, and Jiangsu) with uncertainty maps to guide validation and management.  

    How it becomes a full PhD: Global expansion across reclaimed typologies (e.g., mines, landfills, and brownfields), deeper label curation, and active learning with end-user feedback; deliver a reproducible mapping stack and evaluation framework.  

    Teaser Project 2  

    Suggested headings: Temporal dynamics and early detection of invasive spread post-reclamation 

    Objectives: 

    1. Use annual embeddings (2017–2024) to compute intra-site temporal similarity (e.g., cosine/dot-product shifts) and detect emergence or range expansion of invasive species; and integrate climate covariates with Dr Zhang’s modelling to attribute underlying drivers.  
    2. Train temporal sequence models (temporal transformers on annual 64-dimensional vectors) to predict next-year probabilities and time-to-detection, with quantified uncertainty. 
    3. Quantify management impacts by comparing trajectories across reclamation strategies (where metadata exist) and by disentangling the influence of climate anomalies vs. anthropogenic disturbance signals.
    4. Deliver global change products and a watchlist of high-risk sites for early intervention. 

    How it becomes a full PhD: Scale temporal modelling globally, generalise to additional taxa, and operationalise early-warning thresholds with end-users. 

    References & Further Reading

    Brown, Christopher F., et al. “Alphaearth foundations: An embedding field model for accurate and efficient global mapping from sparse label data.” arXiv preprint arXiv:2507.22291 (2025). 

    Maus, Victor; da Silva, Dieison M; Gutschlhofer, Jakob; da Rosa, Robson; Giljum, Stefan; Gass, Sidnei L B; Luckeneder, Sebastian; Lieber, Mirko; McCallum, Ian (2022): Global-scale mining polygons (Version 2) [dataset]. PANGAEA, https://doi.org/10.1594/PANGAEA.942325 

  • Chasing fluid pathways: GPU-enabled multiscale subduction models to unravel how subduction-driven melt dynamics determine surface deformation and topography

    Project institution:
    Project supervisor(s):
    Dr Antoniette Greta Grima (University of Glasgow), Dr Tobias Keller (University of Glasgow) and Dr Luca Parisi (University of Edinburgh)

    Overview and Background

    Subduction zones are the primary gateways through which water, carbon, and other volatiles are transported into the Earth's mantle. These fluids are central to Earth's evolution: they trigger partial melting in the mantle wedge, sustain the deep water and carbon cycles, drive arc volcanism, and ultimately help maintain Earth's long-term habitability (Tian et al., 2019). At shallower depths, fluids and melts profoundly modify the strength of the lithosphere and continental crust. They reduce viscosity, promote faulting and deformation, and localize magmatic pathways (Nakao et al., 2016). These processes not only shape surface landscapes but also govern volcanic hazards and the emplacement of economically critical mineral deposits (Faccenda, 2014). 

    Despite their importance, the mechanisms of reactive fluid transport in subduction systems remain poorly constrained. Fundamental open questions include: 

    • How do transient pulses of fluid release alter the rheology of the overriding plate and guide surface deformation? 
    • To what extent do fluid–rock interactions control the focusing of melts and the distribution of arc magmatism? 
    • Can slab dehydration events leave observable topographic or geophysical signals that serve as precursors to volcanic unrest or continental rifting? 

    Answering these questions is a formidable challenge, because the governing processes span scales from grain boundaries to tectonic plates, and from seconds to millions of years. Current CPU-based models cannot capture this range: resolving fluid pathways requires kilometre- to metre-scale resolution, while system-scale simulations demand computational domains hundreds of kilometres across. Bridging these scales dynamically has remained beyond reach. 

    This PhD project will break this barrier by developing GPU-accelerated, multi-scale models of subduction zone dynamics that explicitly couple fluid release, volatile transport, melt migration, and surface deformation. By exploiting exascale computing architectures, the project will integrate fine-scale reactive flow models with large-scale geodynamic simulations in ways not previously possible. Adaptive mesh refinement and GPU-enabled solvers will allow kilometre-scale fluid processes to be embedded directly within tectonic-scale models of subduction and topographic evolution. 

    The scientific goal is to establish how fluid transport interacts with subduction dynamics to reshape continental surfaces. By linking volatile release to lithospheric weakening, melt focusing, and measurable topographic responses, this research will provide new insight into the origins of volcanic hazards, the localisation of critical resources, and the long-term evolution of continents. The project will also deliver community-relevant GPU software and HPC workflows, contributing to ExaGEO’s mission to prepare Earth science for the exascale era. 

    Methodology and Objectives

    This project will use GPU-accelerated numerical modelling to directly couple reactive fluid transport with thermo-mechanical subduction dynamics. The approach is designed to bridge processes from the scale of fluid migration pathways within the crust to the scale of plate interactions and topographic evolution.  

    The student will: 

    • Develop new computational tools in Julia and Python to implement GPU-enabled solvers for two-phase flow and thermo-mechanical coupling. 
    • Extend the open-source ASPECT code to run efficiently on GPU architectures, e.g. using GPU-accelerated finite-element matrix-free methods and adaptive mesh refinement (AMR).
    • Integrate multi-scale models by embedding high-resolution, crustal-scale simulations of fluid migration within regional 2D/3D subduction models. 
    • Conduct systematic numerical experiments to test how fluid release influences deformation patterns, melt focusing, and surface uplift/subsidence. 
    • Benchmark and validate results against laboratory experiments, field observations, and published numerical benchmarks to ensure robustness. 

    The novelty lies in the computational design: instead of treating fluid migration and lithospheric deformation as separate problems, the project will couple them dynamically within the same simulation framework. GPU acceleration and exascale platforms make this coupling computationally feasible, enabling parameter sweeps and real-time tracking of fluid–rock interactions across scales. 

    The research will begin with two focused “teaser projects” that provide distinct skill sets and scientific insights, before converging into an integrated PhD focus.

    Teaser Project 1: GPU-Optimized Two-Phase Flow Model 

    In this project the student will implement a simplified two-phase flow model, based on Darcy–Stokes coupling, in Julia with GPU acceleration. The initial focus will be on ensuring computational performance and numerical stability, followed by validation of the solver against analytical benchmarks and published test cases to establish robustness. With this foundation in place, the accelerated framework will then be used to carry out systematic parameter studies, exploring how permeability, viscosity contrasts, and fluid mobility influence fluid transport. These experiments will identify the conditions under which reactive channelization and focused flow emerge, and the results will be used to assess how such migration behaviours modify the bulk rheology of the overriding plate. In turn, these outcomes will provide valuable boundary conditions and insights that can be transferred into larger-scale subduction models. 
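
    As a much-reduced illustration of the physics involved (Python/NumPy; a toy 2D heterogeneous Darcy problem, not the full Darcy-Stokes coupling), the sketch below relaxes div(k grad p) = 0 around a high-permeability channel and shows the fluid flux focusing into it.

    import numpy as np

    n = 64
    k = np.ones((n, n))
    k[:, n // 2 - 2:n // 2 + 2] = 100.0                        # high-permeability channel
    p = np.linspace(1.0, 0.0, n)[:, None] * np.ones((1, n))    # p=1 at top, p=0 at bottom

    for it in range(5000):                         # Jacobi relaxation of div(k grad p) = 0
        kN = 0.5 * (k[1:-1, 1:-1] + k[:-2, 1:-1])  # face permeabilities (arithmetic mean)
        kS = 0.5 * (k[1:-1, 1:-1] + k[2:, 1:-1])
        kW = 0.5 * (k[1:-1, 1:-1] + k[1:-1, :-2])
        kE = 0.5 * (k[1:-1, 1:-1] + k[1:-1, 2:])
        p[1:-1, 1:-1] = (kN * p[:-2, 1:-1] + kS * p[2:, 1:-1]
                         + kW * p[1:-1, :-2] + kE * p[1:-1, 2:]) / (kN + kS + kW + kE)

    qy = -k[1:-1, :] * (p[2:, :] - p[:-2, :]) / 2.0            # vertical Darcy flux
    print("channel flux / background flux:",
          float(qy[:, n // 2].mean() / qy[:, 5].mean()))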

    Teaser Project 2: GPU-Enabled Thermo-Mechanical Subduction Modelling 

    The second project turns to the thermo-mechanical evolution of subduction systems and will involve extending existing finite-element tools to incorporate GPU acceleration. The student will run two- and three-dimensional subduction models that incorporate visco-plastic rheology, free-surface deformation, and slab dehydration parameterised via phase diagrams in ASPECT. These simulations will be used to investigate how episodic dehydration alters stress fields, deformation patterns, and topographic response at scales from a few kilometres to hundreds of kilometres. While ASPECT is not yet GPU-enabled, it leverages the deal.II library, which does support GPUs. This allows the student to investigate different solvers, identify performance bottlenecks, and determine which aspects of the code need to be adapted to use accelerators efficiently (such as matrix-free implementations and managing data transfers between CPU and GPU), without extensively changing ASPECT's framework. If time allows, the student will have the opportunity to implement those changes in ASPECT.

    References & Further Reading

    1. Faccenda, M. (2014). Water in the slab: A trilogy. Tectonophysics, 614, 1–30. https://doi.org/10.1016/j.tecto.2013.12.020 
    2. Heister, T., Dannberg, J., Gassmöller, R., & Bangerth, W. (2017). High accuracy mantle convection simulation through modern numerical methods – II: Realistic models and problems. Geophysical Journal International, 210(2), 833–851. https://doi.org/10.1093/gji/ggx195 
    3. Keller, T. and Suckale, J., 2019. A continuum model of multi-phase reactive transport in igneous systems. Geophysical Journal International, 219(1), pp.185-222.  
    4. Nakao, A., Iwamori, H., & Nakakuki, T. (2016). Effects of water transportation on subduction dynamics: Roles of viscosity and density reduction. Earth and Planetary Science Letters, 454, 178–191. https://doi.org/10.1016/j.epsl.2016.08.016 
  • Data-Driven and Physics-Informed Hybrid Modelling of Landslide Dynamics

    Project institution:
    Project supervisor(s):
    Prof Jin Sun (University of Glasgow), Prof Andrew McBride (University of Glasgow), Dr Jingtao Lai (University of Glasgow), Prof Todd Ehlers (University of Glasgow) and Dr Eric Breard (University of Edinburgh)

    Overview and Background

    Landslides and debris flows threaten human life, infrastructure, and economies worldwide, especially as climate change drives more extreme rainfall events. Timely prediction of such mass movements is therefore crucial for disaster risk reduction. Achieving this requires combining accurate physics-based modelling of granular flows with rich observational data: for example, remote-sensing change-detection of slope deformation. Recent advances in data-driven modelling and physics-informed machine learning offer new opportunities to integrate physical laws with observational and simulation data. This project aims to establish a unified hybrid modelling framework that bridges high-fidelity particle-scale simulations and large-scale field data for improved prediction of landslide initiation and runout, thereby laying the foundation for future digital twinning of landslides.  

    Methodology and Objectives

    The overarching goal of this PhD project is to develop a hybrid modelling framework that combines data-driven learning with physics-based constitutive modelling to describe the transition between the solid-like and fluid-like behaviour, which is critical for predicting slope failure and runout, and to bridge the gap between microscale simulations and macroscale landslide observations.  

    Teaser Project (TP) 1: Discrete Element Simulations of Slope Flows

    This project focuses on performing high-fidelity discrete element method (DEM) simulations to capture the failure, flow, and deposition processes in granular slopes under different inclinations. Simulations will be conducted using the open-source LAMMPS software, with GPU acceleration implemented to enhance computational efficiency. The objective is to generate a comprehensive dataset describing particle-scale kinematics, contact forces, and evolving stress fields during slope instability. By varying slope angles and material parameters, the study will investigate the onset of failure, the transition from solid-like to fluid-like flow, and the subsequent deposition patterns. These DEM results will provide detailed micro-mechanical insights and serve as training data for subsequent data-driven model development.

    A GPU-accelerated solver will be developed to optimize LAMMPS for large-scale granular simulations. The solver will utilize domain decomposition and parallel computation to handle millions of particles efficiently. The output data—velocity, stress, strain, and microstructure—will be analysed to characterize flow regime transitions. This will enable formulation of rheological indicators linking particle-scale dynamics to continuum measures of deformation.
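
    The kind of rheological indicator meant here can be sketched briefly (Python/SciPy; for illustration the "measurements" are generated from the mu(I) law itself rather than from LAMMPS output): the inertial number I and the effective friction mu = tau/P are computed, and the mu(I) parameters are recovered by a fit.

    import numpy as np
    from scipy.optimize import curve_fit

    d, rho = 1e-3, 2500.0                          # grain diameter (m), density (kg/m^3)
    shear_rate = np.logspace(-1, 3, 12)            # s^-1, as if measured from DEM
    pressure = 1000.0                              # Pa, confining pressure
    I = shear_rate * d / np.sqrt(pressure / rho)   # inertial number

    def mu_of_I(I, mu_s, mu_2, I0):                # the mu(I) law to be recovered
        return mu_s + (mu_2 - mu_s) / (1.0 + I0 / I)

    rng = np.random.default_rng(0)
    mu_meas = mu_of_I(I, 0.38, 0.64, 0.3) + 0.005 * rng.standard_normal(I.size)

    params, _ = curve_fit(mu_of_I, I, mu_meas, p0=[0.3, 0.7, 0.1])
    print("recovered mu_s, mu_2, I0:", np.round(params, 3))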

    Teaser Project 2: Physics-Informed Machine Learning for Granular Rheology

    The second project will apply physics-informed neural networks (PINNs) or sparse regression techniques to learn rheological models for granular flow where constitutive relations are already known, such as the μ(I) rheology for steady-state flow. The objective is to test and validate the physics-informed learning methodology by comparing the discovered models against the analytical forms of these known relationships.

    Synthetic datasets will be generated using controlled numerical experiments from TP1 to train the PINNs or sparse regression models. By incorporating physical constraints such as positive energy dissipation, the learned models are expected to exhibit improved generalization and interpretability. This project will demonstrate how hybrid modelling can faithfully recover known constitutive relationships while providing a robust foundation for future discovery of new rheological forms from experimental and DEM data.
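    To make the validation idea concrete, the hedged sketch below fits the standard mu(I) law to synthetic, noisy "DEM" data under physically motivated parameter bounds. It is a minimal stand-in for the PINN or sparse-regression machinery described above; all parameter values are illustrative.

```python
# Hedged sketch: validating a learned rheology against the known mu(I) law.
# Synthetic data stand in for TP1 DEM output; parameter values are illustrative.
import numpy as np
from scipy.optimize import curve_fit

def mu_of_I(I, mu_s, mu_2, I0):
    """Standard mu(I) rheology for dense granular flow."""
    return mu_s + (mu_2 - mu_s) / (1.0 + I0 / I)

rng = np.random.default_rng(0)
I = np.logspace(-3, 0, 50)
mu_true = mu_of_I(I, 0.38, 0.64, 0.28)           # "ground truth" parameters
mu_obs = mu_true + rng.normal(0, 0.005, I.size)  # noisy observations

# Bounds encode physical constraints (e.g., mu_2 > mu_s > 0, I0 > 0)
popt, _ = curve_fit(mu_of_I, I, mu_obs, p0=[0.3, 0.7, 0.3],
                    bounds=([0, 0, 0], [1.0, 2.0, 10.0]))
print("recovered (mu_s, mu_2, I0):", np.round(popt, 3))
```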

    Together, these teaser projects establish a methodological framework for subsequent years of research, which will extend toward modelling large-scale landslides by learning with data from both DEM simulations and field observations, coupling learned rheologies with continuum-scale solvers and validating against field data. The integration of physics-based and data-driven models will ultimately enable prediction of landslide initiation and runout with improved accuracy and computational efficiency. 

    References & Further Reading

    1. Iverson, R. M. The physics of debris flows. Reviews of Geophysics 35, 245–296 (1997). 
    2. Forterre, Y. & Pouliquen, O. Flows of Dense Granular Media. Annual Review of Fluid Mechanics 40 (2008). 
    3. Chialvo, S., Sun, J. & Sundaresan, S. Bridging the rheology of granular flows in three regimes. Phys. Rev. E 85, 021305 (2012). 
    4. Raissi, M., Perdikaris, P. & Karniadakis, G. E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 378, 686–707 (2019). 
  • Towards exa-scale simulations of slabs, core-mantle heterogeneities and the geodynamo

    Project institution:
    Project supervisor(s):
    Prof Radostin Simitev (University of Glasgow), Dr Antoniette Greta Grima (University of Glasgow) and Dr Kevin Stratford (University of Edinburgh)

    Overview and Background

    Magnetic field lines in a geodynamo simulation by Silva et al. (2020), using the code of Silva and Simitev (2018).

    Scientific computing is crucial for understanding geophysical fluid flows, such as the geodynamo that sustains Earth’s magnetic field. This project will adapt an existing pseudospectral geodynamo code for magnetohydrodynamic simulations in rotating spherical geometries to GPU architectures, improving efficiency on modern computing systems and enabling simulations of more realistic regimes. This will advance our understanding of Earth’s geomagnetic field and its broader interactions, such as those with mantle heterogeneities.

    Evidence from seismology and geodynamics shows that the core-mantle boundary (CMB) is highly heterogeneous, influencing heat transport and geodynamo dynamics. By combining compressible, thermochemical convection with geodynamo simulations, this project will further investigate how deep slab properties affect the CMB heat flux, mantle heterogeneity, and the geodynamo.

    Methodology and Objectives

    Teaser project 1: What is the impact of ancient slabs on core-mantle boundary heterogeneities and the geodynamo?

    Evidence from seismology and geodynamics reveals that the lowermost mantle and the core–mantle boundary (CMB) are highly heterogeneous due to the presence of post-perovskite, large low-shear-wave-velocity provinces, and ancient, subducted slab material. CMB heterogeneity results in variable heat transport from the core and plays a key role in core and mantle dynamics, the geodynamo, and ultimately the Earth’s habitability. Previous work shows that the spatiotemporal evolution of CMB heterogeneity is closely linked to deep slab dynamics (e.g., Heron et al., 2024, 2025); however, these dynamics remain poorly understood. This teaser project will investigate the role of deep slab properties in the temporal evolution of deep mantle heterogeneity, the CMB heat flux, and the geodynamo. This will involve modelling compressible, multiphase, thermochemical convection in a 3D spherical shell, following the approach of Dannberg et al. (2024) and Heron et al. (2024, 2025), using the state-of-the-art, open-source, adaptive-mesh-refinement finite element software ASPECT (Heister et al., 2017). These models will include the subduction history of the last 1 billion years from Merdith et al. (2021) and will be supported by high-resolution 3D regional models investigating the role of end-member slab properties (e.g., weak vs. strong slabs) on CMB heterogeneity. Temporal variations in CMB heat flux from these models will then be analysed using spherical harmonics across the first 4 harmonic degrees, similar to the approach of Dannberg et al. (2024), and used as a thermal boundary condition for the geodynamo simulations. The goal is to expand teaser project 1 to investigate the influence of the deep slab on core–mantle dynamics and the implications this has for magnetic field generation and the strength and frequency of polarity reversals.
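    As a minimal, hedged illustration of the harmonic analysis step, the Python sketch below projects a synthetic heat-flux field onto spherical harmonics up to degree 4 by quadrature. The grid sizes and the field itself are placeholders, not project data.

```python
# Hedged sketch: projecting a CMB heat-flux field onto spherical harmonics
# up to degree 4, in the spirit of the analysis approach described above.
import numpy as np
from scipy.special import sph_harm  # newer SciPy exposes this as sph_harm_y

nlat, nlon = 90, 180
phi = np.linspace(0, np.pi, nlat)                          # colatitude
theta = np.linspace(0, 2 * np.pi, nlon, endpoint=False)    # longitude
TH, PH = np.meshgrid(theta, phi)

# Synthetic heat-flux anomaly: a degree-2 pattern plus noise
q = np.real(sph_harm(0, 2, TH, PH)) \
    + 0.05 * np.random.default_rng(1).normal(size=TH.shape)

dphi, dtheta = np.pi / nlat, 2 * np.pi / nlon
weights = np.sin(PH) * dphi * dtheta                       # quadrature weights

for l in range(5):                                         # degrees 0 to 4
    power = 0.0
    for m in range(-l, l + 1):
        c_lm = np.sum(q * np.conj(sph_harm(m, l, TH, PH)) * weights)
        power += abs(c_lm) ** 2
    print(f"degree {l}: power = {power:.4f}")
```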

    Teaser Project 1 Objectives:

    • Use global convection models to calculate the temporal evolution of heat flux at the CMB
    • Investigate the influence of end member slab rheologies and geometries on the heat flux heterogeneity at the CMB
    • Apply the calculated heat flux across the CMB from geodynamic models as a boundary condition to geodynamo simulations to investigate heterogeneity in magnetic field strength and the timing and frequency of magnetic field reversals
    • Use GPU architecture to couple finite element mantle convection with geodynamo simulations

    Teaser Project 2: Spectral expansion transforms in spherical geometry

    Modelling the geodynamo involves solving the coupled 3D, time-dependent, nonlinear Navier–Stokes equations, pre-Maxwell electrodynamics, and heat transfer equations for a rotating fluid. At present, the pseudo-spectral method is the most accurate and widely used numerical discretisation method in this context. The method requires applying physical-to-spectral-space transforms, which are generally in integral form and have been difficult to adapt to GPU architectures. With GPUs becoming increasingly powerful and accessible, this sub-project aims to port an existing, versatile pseudo-spectral code for magnetohydrodynamic simulations in rotating spherical geometries to GPU systems.
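    The sketch below shows, in miniature, the kind of integral transform involved: a discrete Legendre transform built from Gauss–Legendre quadrature. On a GPU this reduces to a dense matrix multiply (a CuPy array could replace the NumPy one with the same API). This is an illustrative sketch under those assumptions, not the project code.

```python
# Hedged sketch: a 1D discrete Legendre transform, the integral-form building
# block of pseudo-spectral methods. Resolution N is illustrative.
import numpy as np
from numpy.polynomial.legendre import leggauss, legval

N = 32
x, w = leggauss(N)                     # quadrature nodes and weights on [-1, 1]
f = np.exp(-x**2)                      # sample field in physical space

# Forward transform: c_n = (2n+1)/2 * sum_k w_k P_n(x_k) f(x_k)
P = np.stack([legval(x, np.eye(N)[n]) for n in range(N)])  # P[n, k] = P_n(x_k)
c = (2 * np.arange(N) + 1) / 2 * ((P * w) @ f)

# Inverse transform (synthesis) recovers the field at the nodes
f_rec = legval(x, c)
print("max reconstruction error:", np.max(np.abs(f - f_rec)))
```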

    Teaser Project 2 Objectives:

    • Investigate alternative orthogonal polynomial basis function families that can be used to expand fields in spherical geometry, including Legendre, Jones-Worland, Jacobi and Galerkin.
    • Implement alternatives in and assess/compare convergence, stability and consistency of the resulting discretisations as well as their efficiency for GPU acceleration.

    References & Further Reading

    Dannberg, J., Gassmoeller, R., Thallner, D., LaCombe, F., & Sprain, C. (2023). Changes in core-mantle boundary heat flux patterns throughout the supercontinent cycle. arXiv preprint arXiv:2310.03229.

    Roberts, P.H. & King, E.M. (2013). On the genesis of the Earth’s magnetism. Rep. Prog. Phys., 76, 096801. http://dx.doi.org/10.1088/0034-4885/76/9/096801

    Glatzmaier, G.A. (2014). Introduction to Modeling Convection in Planets and Stars: Magnetic Field, Density Stratification, Rotation. Princeton University Press. https://press.princeton.edu/books/hardcover/9780691141725/introduction-to-modeling-convection-in-planets-and-stars

    Heister, T., Dannberg, J., Gassmöller, R., & Bangerth, W. (2017). High accuracy mantle convection simulation through modern numerical methods – II: Realistic models and problems. Geophysical Journal International, 210(2), 833–851. https://doi.org/10.1093/gji/ggx195

    Heron, P.J., Dannberg, J., Gassmöller, R., Shephard, G.E., & Pysklywec, R. N. (2025). The impact of Pangaean subducted oceans on mantle dynamics: passive piles and the positioning of deep mantle plumes. Gondwana Research.

    Heron, P.J., Gün, E., Shephard, G.E., Dannberg, J., Gassmöller, R., Martin, E., Sharif, A., Pysklywec, R. N., Nance, R.D., & Murphy, J.B. (2025). The role of subduction in the formation of Pangaean oceanic large igneous provinces. Geological Society London, Special Publications, 542(1).

    Merdith, A.S., Williams, S.E., Brune, S., Collins, A.S., & Müller, R.D. (2021). Extending full-plate tectonic models into deep time: linking the Neoproterozoic and the Phanerozoic. Earth-Sci. Rev., 214, 103477. https://doi.org/10.1016/j.earscirev.2020.103477

    Silva, L. & Simitev, R. (2018). Pseudo-spectral code for numerical simulation of nonlinear thermocompositional convection and dynamos in rotating spherical shells. Zenodo, record 1311203. https://doi.org/10.5281/zenodo.1311203

  • When Mountains Meet the Sea: Simulating Landslide-Generated Tsunamis

    Project institution:
    Project supervisor(s):
    Dr Kevin Stratford (University of Edinburgh), Dr Eric Breard (University of Edinburgh), Prof Jin Sun (University of Glasgow) and Dr Arianna Gea Pagano (University of Glasgow)
    Three-phase flow simulation of a gas–particle granular collapse into water, performed with a CPU-based DEM–CFD–VoF solver. The method is limited to small-scale cases because of its high computational cost. (Image: Breard and Desjardins)

    Overview and Background

    Climate change is having tangible consequences for our environment. As glaciers retreat and rainfall intensifies, the frequency and scale of landslides and debris flows are rising worldwide. Their destructive power extends beyond the areas where these phenomena initiate: unstable masses may travel several kilometres, depending on the evolving characteristics of the solids involved and their interaction with water, and when masses plunge into large water bodies, they can unleash catastrophic tsunamis. Yet our understanding and prediction of granular flows are limited by knowledge gaps in how fluids and solids interact. Key challenges include unravelling how grain shape and breakage affect flow mobility, and how mass, momentum, and energy are transferred during violent impacts. Using the GPU-accelerated multiphase solver MFIX-Exa, this project will pioneer next-generation simulations and physics-based laws to transform landslide hazard forecasts in a changing climate. 

    Methodology and Objectives

    Teaser Project 1: When Earth Hits Water — Geophysical Flows Triggering Tsunamis 

    Objective: During the initial six-month project, the focus will be on developing and validating a simplified GPU-accelerated Volume of Fluid (VoF) module within the MFIX-Exa framework to represent two-phase air–water interactions during a solid-body impact. This will establish the numerical infrastructure and performance benchmarks needed to later include granular particles. Using canonical test cases (e.g., water entry of wedges, deformable intrusions), we will assess how impact geometry and velocity control wave generation and energy transfer. 

    This short-term work will lay the foundation for the full PhD, which will extend the solver to fully three-phase (solid–gas–liquid) conditions, include granular rheology and pore-pressure coupling, and simulate natural examples such as pyroclastic flows entering the sea. The long-term goal is to derive physics-based coupling laws that can inform exascale tsunami forecasting and hazard models. 

    Teaser Project 2: Evolving Grain Size and Shape in Geophysical Granular Flows 

    Objective: The first six months will focus on implementing and testing a basic bonded-sphere representation of irregular grains in MFIX-Exa, without breakage. The aim is to quantify how initial particle shape (aspect ratio, angularity) modifies packing density and stress transmission under simple shear. Benchmark simulations and comparisons with existing experimental datasets will be used to verify the new contact model and establish computational efficiency on GPUs. 

    This work forms the foundation for a PhD that would progressively incorporate fracture and breakage physics, enabling the grain size and shape to evolve dynamically. Later stages would explore how fragmentation alters permeability, pore-fluid pressure response, and bulk rheology in flows such as landslides, debris avalanches, and pyroclastic currents, ultimately yielding improved continuum closures for natural-hazard prediction. 
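    As a small, hedged illustration of the bonded-sphere (glued-sphere) representation, the sketch below computes the volume and aspect ratio of a chain-of-spheres grain, subtracting the lens-shaped overlaps between neighbouring spheres. The radii and spacing are illustrative assumptions.

```python
# Hedged sketch: volume and aspect ratio of a "glued-sphere" grain of the kind
# TP2 would implement in MFIX-Exa. All numbers are illustrative.
import numpy as np

r = 1.0e-3            # constituent sphere radius (m)
n = 4                 # spheres per grain, glued along a line
d = 1.2 * r           # centre spacing (< 2r so neighbours overlap; > r so
                      # non-adjacent spheres, at distance >= 2.4r, do not)

v_sphere = 4 / 3 * np.pi * r**3
# Lens (intersection) volume of two equal spheres whose centres are d apart
v_lens = np.pi * (4 * r + d) * (2 * r - d) ** 2 / 12

# Union volume of the chain: subtract the pairwise overlaps of neighbours
v_grain = n * v_sphere - (n - 1) * v_lens

length = 2 * r + (n - 1) * d                   # end-to-end grain length
aspect_ratio = length / (2 * r)
print(f"grain volume = {v_grain:.3e} m^3, aspect ratio = {aspect_ratio:.2f}")
```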

    References & Further Reading

    Svennevig, Kristian, et al. “A rockslide-generated tsunami in a Greenland fjord rang Earth for 9 days.” Science 385.6714 (2024): 1196-1205. 

    https://www.exascaleproject.org/research-project/mfix-exa/ 

    https://github.com/NREL/BDEM.git 

    Musser, J., Almgren, A. S., Fullmer, W. D., Antepara, O., Bell, J. B., Blaschke, J., … & Syamlal, M. (2022). MFIX-Exa: A path toward exascale CFD-DEM simulations. The International Journal of High Performance Computing Applications, 36(1), 40-58. 

    Lu, L., Gao, X., Shahnam, M., & Rogers, W. A. (2021). Simulations of biomass pyrolysis using glued-sphere CFD-DEM with 3-D intra-particle models. Chemical Engineering Journal, 419, 129564. 

 

Projects with a focus on Geologic Hazard Analysis, Prediction and Digital Twinning:

 

  • Are we learning the weather right? Climate-based flood catastrophe analysis using AI and exascale compute

    Project institution:
    Project supervisor(s):
    Prof Peter M. Atkinson (Lancaster University), Dr Carolina Euan (Lancaster University), Prof Simon Tett (University of Edinburgh), Prof Rob Lamb (Lancaster University), Prof Paul Young (JBA Risk Management) and Dr Niall Robinson (NVIDIA)

    Overview and Background

    Machine learning (ML) lets us generate large ensemble simulations of extreme weather events quickly and efficiently. This can help assess the risk of floods, which constitute almost half of the world’s weather-related disasters. Rather than being governed by physical laws, ML models are determined by their training data, which are at coarser resolutions than flood impact models require. Scale transformations through the climate-to-impacts processing pipeline add uncertainties and highlight the challenge of representing extreme events. Given these challenges, are ML models realistic enough for vital applications in disaster risk reduction in present and future climates? You will address this with state-of-the-art global ML models and key industry partners JBA and NVIDIA, testing the physical and statistical fidelity of extreme weather simulations for flood and climate risk. 

    Methodology and Objectives

    This PhD will use statistical machine learning to interrogate the data produced by an AI-driven weather simulation that generates large ensembles of synthetic events for flood impact analysis under different climate conditions. The full pipeline, which has global capability, uses NVIDIA’s ‘Earth-2’ ML platform to transform (i) a combination of 75 single- and pressure-level atmospheric variables at 0.25° resolution into (ii) precipitation fields, which are aggregated over river basins to (iii) drive hydrological models of flood flows at points spaced between ~1 and ~10 km apart on the river network, and finally (iv) flood impact analysis over variable-resolution spatial grids. Despite this cutting-edge risk analysis capability, there are important uncertainties and, hence, potential to improve the processing chain through increased understanding of the character of the various data layers and their inter-relations, specifically measurement processes, scaling processes, the statistical validity of distributions for extremes, and semantic interpretations and their fit to real-world phenomena. This is crucial since misrepresented processes and states can lead, for example, to prediction biases or omissions of consequential key events, which, in the global flood impact context, can mean loss of life or assets.  

    The PhD will develop and use a range of statistical and machine learning methods with which to interrogate the data in the climate-to-flood risk processing chain and suggest improvements in data representation. The student will work with JBA with input from NVIDIA to fully understand the Earth-2-platform. The PhD will then employ a combination of methods, including latent Gaussian processes (GPs) within a Bayesian inference framework to diagnose data support (that is, the spatial and temporal scales at which quantities are measured or modelled) and scaling relationships; machine learning approaches to characterise and represent dependencies between datasets; and natural language processing (NLP) coupled with ontological hierarchies to explore the meaning of, and relationships between, data layers. The representation of key atmospheric processes and large-scale phenomena – such as storm development, blocking, and energy balance – will be considered to ensure that the statistical and machine-learning analyses remain grounded in physical realism. 

    You will work with data and tools from the Earth-2 suite – the SFNO large ensembles model (Mahesh et al.), downscaling (Mardani et al.), and AFNO ‘diagnostic’ precipitation model (Kurth et al.) – alongside finer scale data, potentially including CEH-GEAR 1km UK rainfall, rain gauges, river flow gauges, and high-resolution flood maps produced by JBA. 

    Teaser Project 1: Data sampling, measurement processes and scale transformation to enhance realism of weather and flood risk simulations 

    Flood risk analysis depends crucially on the earlier parts of the processing chain that transform climate ensembles at coarse resolution into reliable flood simulations, and on their interpretation relative to asset and population distributions. You will use the power of latent GP models, coupled with GPU-accelerated compute that scales to the exascale, to disentangle measurement processes (such as spatial discretisation, data model, spatial support, and uncertainty) from the signal of interest, a key requirement for improved interoperability between datasets. You will focus on climate ensembles and their transformation through to precipitation fields and flood extents and impacts, using GPs to diagnose the measurement processes at each stage, augmented with machine learning to represent the transforms and enable their improvement. This will improve interoperability between data layers and reduce artefactual loss of information, including on extremes, during scaling and transformation.  

    Specific goals include: 

    • Specification of the practical measurement support for each data layer in the processing chain, allowing its exploitation in subsequent processing.
    • Increased understanding of why extreme events are under-represented and how to reinstate lost structure so that they can be recovered more completely.  
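    As a minimal illustration of the latent-GP change-of-support idea in Teaser Project 1, the hedged sketch below places a GP prior on a fine-scale field, observes only noisy block averages, and recovers the fine scale via the linear-Gaussian posterior. All data are synthetic and the model is deliberately simplified.

```python
# Hedged sketch: a linear-Gaussian "change of support" calculation. A fine-scale
# latent field f has a GP prior; observations y are coarse block averages
# (y = A f + noise). The posterior mean downscales y back to the fine grid.
import numpy as np

rng = np.random.default_rng(3)
x = np.linspace(0, 1, 100)                     # fine grid

# Squared-exponential GP prior covariance on the fine grid
K = np.exp(-0.5 * (x[:, None] - x[None, :])**2 / 0.05**2)
f = np.linalg.cholesky(K + 1e-9 * np.eye(100)) @ rng.normal(size=100)  # truth

# Averaging operator: 10 coarse cells, each averaging 10 fine cells
A = np.kron(np.eye(10), np.full((1, 10), 0.1))
y = A @ f + rng.normal(0, 0.01, size=10)       # coarse, noisy observations

# GP posterior mean for f given the block-averaged data
S = A @ K @ A.T + 0.01**2 * np.eye(10)
f_post = K @ A.T @ np.linalg.solve(S, y)
print("fine-scale RMSE:", np.sqrt(np.mean((f_post - f)**2)))
```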

    Teaser Project 2: Extracting representations from weather simulations to test against physical phenomena 

    Flood risk analysis depends on the feeding of ensemble datasets through a processing chain to decision-makers who are responsible for interpreting the risk information provided by the Earth-2 platform. You will take the climate ensemble datasets ‘as read’ and specify and fit models to characterise the information, and to minimise the losses of information that occur, in the processing chain. You will use the tools of ML and large language models (LLMs) to learn latent processes and represent optimal data-interoperability pathways that are both internally ‘coherent’ and maximise the utility of the information sets provided. This may require interaction with decision-makers, facilitated through JBA, and inversion of the forward chain through a learning framework (e.g., reinforcement learning to optimise data transformations for maximal information retention).    

    Specific goals include: 

    • Greater understanding of the semantic meaning within datasets that are of most interest to decision-makers.  
    • More useful characterisations of the data (e.g., extremes) that can be inverted to optimise the parameters in the processing chain.  

     

    References & Further Reading

    Context and links to technical details of the modelling stack: 

    Ashcroft, J. et al., 2025, AI weather forecasting: Can it reveal unseen flood risks? JBA Risk Management blog. 

    Descriptions of the underlying ML models and example methods: 

    Chácon-Montalván, E.A., Atkinson, P.M., Nemeth, C., Taylor, B.M., Moraga, P., 2024, Spatial latent Gaussian modelling with change of support, arXiv, https://arxiv.org/abs/2403.08514. 

    Kurth, T. et al., 2023, FourCastNet: accelerating global high-resolution weather forecasting using adaptive Fourier neural operators. In Platform for Advanced Scientific Computing Conference (PASC ’23), 2023, Davos, Switzerland. ACM, https://doi.org/10.1145/3592979.3593412. 

    Mahesh, A. et al., 2024, Huge ensembles Part I: design of ensemble weather forecasts using spherical Fourier neural operators, arXiv, https://doi.org/10.48550/arXiv.2408.03100.

    Mardani, M. et al., 2025, Residual corrective diffusion modeling for km-scale atmospheric downscaling. Commun. Earth Environ. 6, 124. https://doi.org/10.1038/s43247-025-02042-5. 

  • Developing large-scale hydrodynamic flood forecasting models for exascale GPU systems

    Project institution:
    Project supervisor(s):
    Dr Mark Bull (University of Edinburgh), Dr Maggie Creed (University of Glasgow), Prof Simon Mudd (University of Edinburgh) and Dr Declan Valters (British Geological Survey)

    Overview and Background

    Flood forecasting at regional and national scales is imperative for predicting the scale and distribution of floodwaters during extreme weather events and for mitigating the impact on the communities most at risk from flooding. The LISFLOOD family of surface water models has proved amenable to parallelisation at scale, allowing research and forecasting communities to take advantage of the previous generation of supercomputers, such as ARCHER.

    The increasing availability of high-resolution topographic and meteorological data provides an opportunity to extend the capability of the LISFLOOD modelling framework to produce large-scale or high-resolution flood forecasts at operational timescales, i.e., producing model runs with sufficient lead times to alert communities to impending flood risk from forecast extreme weather events. GPU-based exascale HPC systems provide the technological basis to develop forecast models delivering at operational timescales.

    Methodology and Objectives

    LISFLOOD is a family of hydrological models based on a 2D grid simulating rainfall runoff. The water routing across a flood basin/river catchment is based on a simplified version of the shallow water (St Venant) equations. The model is process (physics) based, and there have been several implementations (see below), usually in C or C++, using a cellular automaton approach. These have been parallelised for CPU using OpenMP and, in one spin-off project, MPI (see https://web.jrc.ec.europa.eu/policy-model-inventory/explore/models/model-lisflood/). 
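    For orientation, the hedged sketch below performs one-dimensional updates of the simplified shallow-water scheme with semi-implicit friction used by LISFLOOD-FP-type codes. The real models are 2D and implemented in C or C++; the grid, roughness, and depth values here are illustrative only.

```python
# Hedged sketch: explicit updates of an inertial-formulation shallow-water
# scheme in 1D. All values are illustrative, not model settings.
import numpy as np

dx, dt, g, n_mann = 10.0, 1.0, 9.81, 0.03
z = np.linspace(1.0, 0.0, 50)        # bed elevation (m), sloping channel
h = np.full(50, 0.1)                 # water depth (m)
q = np.zeros(49)                     # unit discharge at cell interfaces (m^2/s)

for _ in range(100):
    eta = z + h                                      # free-surface elevation
    h_flow = np.maximum(np.maximum(eta[:-1], eta[1:])
                        - np.maximum(z[:-1], z[1:]), 1e-6)
    slope = (eta[1:] - eta[:-1]) / dx
    # Momentum update with semi-implicit Manning friction
    q = (q - g * h_flow * dt * slope) / \
        (1.0 + g * dt * n_mann**2 * np.abs(q) / h_flow**(7 / 3))
    h[1:-1] += dt * (q[:-1] - q[1:]) / dx            # mass conservation

print("final mean depth:", h.mean())
```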

    The stencil-code library used in the previous CSE project, LibGeoDecomp, purports to have support for NVIDIA GPUs and CUDA. https://github.com/STEllAR-GROUP/libgeodecomp 

    Teaser Project 1 Objectives:

    • Implement the hydrodynamic core of the LISFLOOD model on GPU hardware to demonstrate proof-of-concept that the current CPU parallelised code is portable to GPU hardware. 
    • Methods for GPU parallelisation would include OpenMP offloading as an initial approach to verify the proof of concept. The project could then be extended to investigate the CUDA bindings available in the LibGeoDecomp library.
    • Profiling of the GPU ported code and identification of optimisation strategies. 

    Teaser Project 2 Objectives:

    • Gather requirements for the full workflow from data acquisition to forecast product dissemination.  
    • Prototype the workflow for small scale models and conduct experiments to identify potential bottlenecks when scaling up to full resolution.

    Development into a full PhD would involve further profiling and optimisation of the GPU code using either the LibGeoDecomp library or another suitable GPU parallelisation framework. Delivering a proof of concept for a working flood forecast model at regional scale would be a key aim of this project, demonstrating the potential for use in operational flood forecasting systems. The full PhD may therefore look at workflow tools to integrate the various stages of forecast production, such as ingestion and pre-processing of data (e.g., from rainfall forecast/nowcasting data products), model scheduling on HPC systems, and post-processing of the outputs. 

    Rendering of total flood induced erosion and deposition of riverbed material during a flash flood event in the Rye catchment, North Yorkshire, UK, used as a case-study when testing the development of the LISFLOOD model. Image source: Declan Valters, British Geological Survey

    References and Further Reading

    LISFLOOD model high-level overview: https://web.jrc.ec.europa.eu/policy-model-inventory/explore/models/model-lisflood/  

    Stencil Code for LibGeoDecomp: https://github.com/STEllAR-GROUP/libgeodecomp 

    Open Source version of the C++ code developed by Declan Valters: https://github.com/dvalters/HAIL-CAESAR 

    Overview of an earlier project that developed an experimental version of the code for multi (CPU) node using stencil code: http://www.archer.ac.uk/training/virtual/2019-12-04-lisflood/lisflood.pdf 

    Reference for the hydrodynamic model core: Coulthard, T.J., Neal, J.C., Bates, P.D., Ramirez, J., de Almeida, G.A. and Hancock, G.R., 2013. Integrating the LISFLOOD-FP 2D hydrodynamic model with the CAESAR model: implications for modelling landscape evolution. Earth Surface Processes and Landforms, 38(15), pp.1897-1906. 

  • Development of landscape evolution models and monitoring in anthropogenically influenced tropical regions

    Project institution:
    Project supervisor(s):
    Dr Amanda Owen (University of Glasgow), Dr Paul R Eizenhöfer (University of Glasgow) and Dr Mark Bull (University of Edinburgh)

    Overview and Background

    The Philippines is one of the most densely populated and economically fastest-growing regions in Southeast Asia, yet it constantly faces the threat of natural hazards from strong climatic (e.g., monsoon) and geological (e.g., earthquake and volcanic) forces, in parallel with anthropogenic alteration of the natural landscape (e.g., mining and river management). Any signals emerging from these will be communicated across the landscape primarily through its fluvial systems.  

    This project aims to develop novel live-monitoring techniques that bridge process-based modelling of the fluvial and anthropogenic systems with AI-driven data analysis to model fluvial morphology and change across Luzon. These efforts will provide state-of-the-art computational tools to inform policy makers in the Philippines to better adapt to climate change.

    Methodology and Objectives

    Digital shadow of landscape evolution of Luzon (Teaser Project 1); AI-driven prediction of geomorphic change in the Philippines (Teaser Project 2) 

    Methods Used: landscape evolution modelling and AI-enhanced remote sensing. The surface process models employed in this effort will reach unprecedented metre-scale resolution, and hence will have data volume throughputs at the terabyte level. This requires GPU capacity that takes advantage of the latest flow routing routines, adapted to parallelised computational architectures, reducing model completion times by up to two orders of magnitude relative to the current generation of landscape evolution models. This enables the use of more efficient inversion schemes and lays the groundwork for potential exascale applications in the future. 

    Teaser Project 1 Objectives: Landscape evolution models have been essential in developing our insights into the interconnectivity of Earth systems and surface processes. However, humans now have an unprecedented impact on our landscapes, moving vast quantities of material across the Earth’s surface (e.g., through mining). Teaser Project 1 will establish the necessary boundary conditions to produce a mid-resolution (500 m/cell) representation of the Quaternary landscape evolution of the island of Luzon. A particular focus will be placed on predicting, to first order, the major present-day fluvial erosional and depositional systems, including offshore basin stratigraphic records, in tropical environments. Novel ensemble Kalman inversion schemes that leverage the GPU capacity built in the project will be employed, together with parallelised flow routing routines, to achieve computationally efficient, fast parameter convergence towards present-day geomorphological conditions.     
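    To illustrate the ensemble Kalman inversion loop in miniature, the hedged sketch below calibrates a toy two-parameter forward model. In the project, the forward model G would be a GPU-accelerated landscape evolution run; all names and values here are placeholders.

```python
# Hedged sketch: iterative ensemble Kalman inversion (EKI) for model
# parameters, with a cheap stand-in forward model. Values are illustrative.
import numpy as np

rng = np.random.default_rng(4)

def G(theta):
    """Toy forward model mapping parameters to 'observed' elevations."""
    x = np.linspace(0, 1, 20)
    return theta[0] * np.exp(-theta[1] * x)

theta_true = np.array([2.0, 3.0])
y_obs = G(theta_true) + rng.normal(0, 0.05, 20)
Gamma = 0.05**2 * np.eye(20)                     # observation noise covariance

ensemble = rng.normal([1.0, 1.0], 0.5, size=(100, 2))   # prior ensemble
for _ in range(10):                                      # EKI iterations
    preds = np.array([G(t) for t in ensemble])
    t_mean, p_mean = ensemble.mean(0), preds.mean(0)
    C_tp = (ensemble - t_mean).T @ (preds - p_mean) / (len(ensemble) - 1)
    C_pp = (preds - p_mean).T @ (preds - p_mean) / (len(ensemble) - 1)
    K = C_tp @ np.linalg.inv(C_pp + Gamma)               # Kalman gain
    perturbed = y_obs + rng.multivariate_normal(np.zeros(20), Gamma, 100)
    ensemble = ensemble + (perturbed - preds) @ K.T

print("posterior mean parameters:", ensemble.mean(0).round(2))
```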

    Teaser Project 2 Objectives: A key challenge when working with remote sensing (satellite) datasets is the availability of ‘cloud-free’ images for high-temporal-resolution analysis of geomorphic change in river catchments. As a result, analyses of geomorphic change are biased towards regions with climates more conducive to cloud-free weather. Teaser Project 2 will employ an AI approach to fill spatial and temporal gaps in satellite remote sensing data caused by dense cloud coverage. The Philippines has been selected as the location to develop the method because of the dense cloud coverage associated with its tropical monsoonal climate, common in regions near the equator. This AI-curated dataset will then be used to train artificial neural networks (ANNs) to make near-future (up to 500 yr) geomorphological predictions for river catchments in the Philippine archipelago. GPU capacity will be essential for training the models.
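    As a baseline for the AI gap-filling idea, the hedged sketch below cloud-masks a synthetic per-pixel NDVI time series and fills the gaps by simple temporal interpolation, the kind of baseline a learned in-painting model would need to beat. The data and cloud fraction are illustrative.

```python
# Hedged sketch: filling cloud-masked gaps in a per-pixel NDVI time series
# by linear temporal interpolation. Data are synthetic and illustrative.
import numpy as np

rng = np.random.default_rng(5)
t = np.arange(0, 365, 5)                           # acquisition days
ndvi = 0.5 + 0.3 * np.sin(2 * np.pi * t / 365)     # seasonal vegetation signal
cloudy = rng.random(t.size) < 0.6                  # ~60% cloud cover (tropics)

observed = np.where(cloudy, np.nan, ndvi)
valid = ~np.isnan(observed)
filled = np.interp(t, t[valid], observed[valid])   # gap-filling baseline

print(f"kept {valid.sum()}/{t.size} scenes; max fill error = "
      f"{np.max(np.abs(filled - ndvi)):.3f}")
```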

    References & Further Reading

    Lachowycz, S. (2024). Utility of artificial intelligence in geoscience. Nature Geoscience, 17(10), 953-955. 

    Braun, J., & Deal, E. (2023). Implicit algorithm for threshold stream power incision model. Journal of Geophysical Research: Earth Surface, 128(10), e2023JF007140. 

    Evensen, G. (2009). The ensemble Kalman filter for combined state and parameter estimation. IEEE Control Systems Magazine, 29(3), 83-104. 

  • Earth system twin for coastal erosion in the UK

    Project institution:
    Project supervisor(s):
    Dr Zhiwei Gao (University of Glasgow), Dr Christos Anagnostopoulos (University of Glasgow), Dr Martin Hurst (University of Glasgow) and Dr Hassan Al-Budairi (QTS Group Ltd)

    Overview and Background

    Coastal erosion is one of the UK’s most pressing environmental challenges, threatening homes, critical transportation infrastructure, and natural habitats. Around 1,800 km of coastline, nearly 30% of the total, is actively eroding. Along some stretches, such as the Holderness coast in East Yorkshire, the coastline is retreating at rates exceeding 2 metres per year. In Happisburgh, Norfolk, approximately 35 homes have been lost to erosion over the past 20 years. Climate change is expected to worsen these risks, with UK sea levels projected to rise by up to 0.8 m by 2100. Under this scenario the economic costs are substantial, with damages and adaptation needs estimated at over £1 billion. Coastal communities, transport infrastructure, and iconic landscapes face increasing vulnerability, making erosion not just a scientific concern but a social and economic priority for the UK. 

    Methodology and Objectives

    Methods Used: We aim to develop an AI-driven digital twin for coastal erosion in the UK, creating a high-fidelity virtual representation of coastal change that integrates past observations with predictive modelling. With this digital twin, we aim to learn and capture both the historical record of erosion and plausible future scenarios, providing a powerful tool for scientific research, policymaking, infrastructure planning and construction, and community resilience. The foundation of the digital twin will be comprehensive data covering the past 30 years. We will draw primarily on the Copernicus programme and Sentinel satellite archives, which provide consistent Earth observation data at regional to national scales. These datasets allow detailed tracking of shoreline position, cliff retreat, and beach morphology. When combined with local aerial imagery, LiDAR surveys, and historical maps, the digital twin will offer a unique, multi-decadal record of erosion across the UK coastline. 

    Our core innovation lies in the application of machine learning and deep learning image processing techniques to extract and interpret this wealth of data. We will design algorithms for the automatic detection of coastlines, enabling the rapid identification of shoreline positions from large archives of satellite images. The results will be presented through an interactive digital map, which will be publicly accessible. In addition to documenting the past, the digital twin will incorporate predictive and explainability capabilities. A separate machine learning framework, informed by the physical processes that drive erosion, will be developed to forecast future shoreline change with uncertainty quantification. This model will use past erosion trends, climate change, storm frequency, and projected sea-level rise as inputs. Embedding the physics will make the predictive model interpretable.  

    The result will be a decision-support platform that combines interactive visualisation with predictive insight and analyses. The digital twin for coastal erosion will represent not only a scientific advance but also a practical resource for building resilience to climate change and safeguarding vulnerable coastlines. 

    Teaser Project 1 Objectives: Automatic coastline detection using machine learning. In this teaser project, we will develop a machine learning model for automatic detection of coastlines based on the recent work using VedgeSat (Muir et al., 2024), which uses vegetation edges derived from Sentinel data to identify coastline positions. The VedgeSat algorithm will be improved, trained and validated on a labelled dataset of UK coastlines. The outputs will be processed into a continuous time series of coastal change, visualised through an interactive digital map similar to the DEA Coastlines platform (https://maps.dea.ga.gov.au/story/DEACoastlines).  
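    A minimal sketch of the vegetation-edge idea behind VedgeSat-style detection is given below: compute NDVI from red and near-infrared bands and locate, per image row, the first vegetated pixel. Real workflows use georeferenced Sentinel-2 scenes and subpixel refinement; the arrays here are synthetic and the threshold is an illustrative assumption.

```python
# Hedged sketch: NDVI-based vegetation-edge extraction on a synthetic scene.
import numpy as np

rng = np.random.default_rng(6)
ny, nx = 100, 200
sea_width = np.linspace(60, 90, ny).astype(int)    # synthetic diagonal shoreline

nir = np.full((ny, nx), 0.4) + 0.02 * rng.normal(size=(ny, nx))
red = np.full((ny, nx), 0.1) + 0.02 * rng.normal(size=(ny, nx))
for i, w in enumerate(sea_width):                  # water: low NIR reflectance
    nir[i, :w], red[i, :w] = 0.05, 0.08

ndvi = (nir - red) / (nir + red + 1e-9)
edge_cols = np.argmax(ndvi > 0.2, axis=1)          # first vegetated pixel per row
print("detected edge columns (first 5 rows):", edge_cols[:5])
```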

    Teaser Project 2 Objectives: Coastal erosion prediction using machine learning. We will develop a machine learning model for coastal erosion prediction in the UK, integrating both observational datasets and the underlying physics, including climate and geological conditions and extreme weather events. The model will draw on records from the Copernicus Data Space Ecosystem (https://dataspace.copernicus.eu/explore-data) over the past 30 years. The model will be trained, tested, and improved at a site near Aberdeen, which has been studied by Dr Martin Hurst and his team for many years. 

    Each teaser project will contribute to one aspect of the bigger PhD project, in which a digital twin for coastal erosion in the UK will be established. Within this digital twin, we will have the history of past coastal erosion (Teaser Project 1) and a machine learning model for predicting future changes (Teaser Project 2).  

    References & Further Reading

    • Muir, F.M., Hurst, M.D., Richardson-Foulger, L., Rennie, A.F., & Naylor, L.A. (2024). VedgeSat: An automated, open-source toolkit for coastal change monitoring using satellite-derived vegetation edges. Earth Surface Processes and Landforms, 49(8), 2405-2423. 
    • Ju, L. Y., Xiao, T., He, J., Xu, W. F., Xiao, S. H., & Zhang, L. M. (2025). A simulation-enabled slope digital twin for real-time assessment of rain-induced landslides. Engineering Geology, 108116. 
    • Forstenhäusler, M., Külzer, D., Anagnostopoulos, C., Parambath, S., & Weber, N. (2025). STaRFormer: Semi-supervised task-informed representation learning via dynamic attention-based regional masking for sequential data. The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025), Dec 2-7, San Diego, US. https://star-former.github.io/ 
  • Exascale modelling for resilient woodland expansion

    Project institution:
    Project supervisor(s):
    Prof Andrew Baggaley (Lancaster University), Prof Jason Matthiopoulos (University of Glasgow), Prof Peter M. Atkinson (Lancaster University), Dr Nathan Brown (Forest Research), Dr Suzanne Robinson (Forest Research) and Annabel Narayanan (Action Oak)
    Observations of Oak Processionary Moth (a key invasive plant pest), coloured by observation year, overlaid on a map of Greater London.

    Overview and Background

    The UK has set ambitious targets to expand woodland cover to nearly 20% by 2050, recognising the multiple benefits of forests for carbon storage, biodiversity, and climate resilience. Delivering these targets requires more than simply planting trees: new and existing woodlands must be resilient to climate stress and safeguarded against invasive pests and pathogens. At the same time, globalisation and climate change are accelerating the spread of invasive species, threatening the long-term success of woodland expansion strategies. This project will develop next-generation, GPU-accelerated environmental models that couple tree growth dynamics, pest spread, and climate forcing within an Earth system framework. By integrating large-scale observation data, the project will generate fine-grained risk forecasts to inform national woodland policy and sustainable land-use planning. 

    Methodology and Objectives

    The project will develop a new open-source spatial modelling framework that couples a biological growth model for tree stands with a spatially explicit SIR-type epidemiological model for invasive pests and pathogens, using a fully differentiable approach. Specifically, we will adapt existing 3PG models (to represent stand-level woodland growth) into differentiable, GPU-based code. This model accounts for temperature, drought, CO₂, and management. At present, however, a major limitation is explicitly linking canopy openness and stand structure to pest and disease vulnerability. Addressing this shortcoming is one of the main aims of the project. 

    Climate projections will provide environmental forcing, while observation data from project partners at Forest Research will constrain key model parameters in both the 3PG and SIR models and their coupling. Data integration will account for changes in resolution and modality, enabling high-resolution simulations of coupled pest–climate–tree interactions at scales relevant for national policy assessments. Differentiability allows the direct use of gradient-based optimisation and Hamiltonian Monte Carlo (HMC), making it feasible to perform efficient parameter calibration and full Bayesian uncertainty quantification in a high-dimensional, spatially explicit model. It also facilitates the development of risk forecasts and scenario analyses for woodland expansion strategies.  
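    To show why differentiability matters, the hedged JAX sketch below implements a toy logistic stand-growth model and obtains exact gradients of a calibration loss with respect to its parameters, which is what enables gradient-based optimisation and HMC at scale. The model is a placeholder, not the 3PG equations.

```python
# Hedged sketch: a differentiable toy growth model in JAX. Exact parameter
# gradients come "for free" from autodiff; values are illustrative.
import jax
import jax.numpy as jnp

def simulate(params, n_years=50):
    r, K = params                       # growth rate, carrying capacity
    def step(biomass, _):
        biomass = biomass + r * biomass * (1 - biomass / K)
        return biomass, biomass
    _, trajectory = jax.lax.scan(step, 1.0, None, length=n_years)
    return trajectory

def loss(params, obs):
    return jnp.mean((simulate(params) - obs) ** 2)

obs = simulate(jnp.array([0.3, 120.0]))           # synthetic "observations"
grad_fn = jax.grad(loss)
print(grad_fn(jnp.array([0.25, 100.0]), obs))     # exact dLoss/d(r, K)
```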

    Teaser Project 1 Objectives: Woodland Expansion and Policy Support 

    This project strand focuses on developing and then applying the coupled pest-climate-tree code to key case studies of woodland management and expansion in the UK.  

    The core goal of TP1 is to fully rewrite the existing 3PG model in a vectorised, GPU-compatible format with automatic differentiation for gradient computation through the physiological sub-models. This will then be coupled to the existing pest/disease code, using profiling tools to ensure computational efficiency and scalability. Finally, we will incorporate parameter inference using HMC and train the model on pre-existing datasets for representative tree–pest systems under current and future climates. Key focuses will be a) Sitka spruce and the Elatobium aphid (using aphid monitoring data) and b) oak and multiple pests and diseases (e.g. mildew, Acute Oak Decline, and Oak Processionary Moth).  

    We will then be able to: 

    • Quantify the vulnerability of new and existing woodlands to pest/pathogen outbreaks under climate stress. 
    • Identify data-informed planting strategies that balance carbon storage, biodiversity, and woodland resistance. 
    • Generate fine-resolution maps of pest risk and climate suitability to guide woodland expansion planning at national and regional levels.

    This project offers direct policy relevance, producing modelling tools to support expansion strategies that are resilient to compound risks.

    Teaser Project 2 Objectives: Digital twinning 

    This project strand emphasises methodological advances in GPU-accelerated modelling and remote sensing technology to create a digital twin of woodland expansion. The objective is to integrate real time data streams from observations and sensors with different resolutions and modalities with the pest–climate–tree model, enabling real-time or near-real-time assimilation of key data sources, including citizen science initiatives. Specific goals include: 

    • Scaling the software developed in TP1 to harness exascale facilities for high-resolution environmental modelling. 
    • Coupling pest–climate–tree interactions to real-time multi-modal observation streams. 
    • Integrating GPU-accelerated inference schemes to constrain free parameters. 
    • Developing an accessible user interface to maximise the impact of the project. 

    References & Further Reading

    [1] J.J. Landsberg, R.H. Waring, Forest Ecology and Management, 95, 3 (1997) 
    [2] K. Reed, et al. Forest Ecology and Management, 476, 118441 (2020) 

  • Extreme weather event impacts from global warming in the UK

    Project institution:
    Project supervisor(s):
    Prof Todd Ehlers (University of Glasgow), Dr Jingtao Lai (University of Glasgow), Dr Adam Smith (University of Glasgow) and Prof Michèle Weiland (University of Edinburgh)

    Overview and Background

    Global climate and environmental change increasingly result in weather extremes that impact nature and society. These extremes include stormier climates with high-intensity precipitation events, drought, flooding, heat waves, and increased wind speeds. Such events have large economic impacts on the UK and are expected to increase in frequency in the future. Extreme weather events have diverse societal impacts, including disruption of transportation and infrastructure (e.g., energy, telecommunications), agricultural and natural ecosystem loss, and loss of human life. The primary aim of this project is to develop GPU-based, exascale-oriented software for real-time analysis of how extreme weather events impact natural systems (e.g., via floods and landslides, heat waves, wind storms, and ecosystem disturbance) and then propagate into society. You’ll start by investigating these interactions for Scotland, and then extend to the broader UK.

    You will work with a team of dynamic researchers at the University of Glasgow and the EPCC supercomputing centre to develop a new exascale-oriented application for daily to weekly prediction and analysis of extreme weather event impacts. In addition, the software you develop will form the core platform for linking with other ExaGEO projects developed by your student peers.

    The University of Glasgow and EPCC at the University of Edinburgh are in close proximity, with a regular train service between them, so you’ll have ample opportunity to interact across both institutions.

    Methodology and Objectives

    Methods Used:

    As part of this project, you will learn skills for linking weather forecast and satellite data streams onto high-resolution topography to investigate how natural systems respond to extreme forcings. From this, you will learn how to develop routines to identify areas at high risk of damage associated with extreme events. You will do this by learning about precipitation distributions related to floods and landslides, and about how temperature and wind extremes impact agriculture and natural systems. As part of these tasks, you’ll learn about the physics of surface processes and how surface water (e.g., rivers) and hillslopes evolve in response to extreme weather.

    The primary programming skills you will learn (in either Julia or Fortran) include the decomposition and parallelisation of large domains and physics-based processes onto GPU and exascale architectures, as well as how to integrate large data streams into process-based models. You’ll be joining an active research group at the University of Glasgow that works on related problems. You’ll be able to attend regular research seminars where you can discuss each other’s research and also receive assistance in software development and in learning about geomorphic processes.

    Each teaser project below is intended to take about six months and is suited to development into a publication for inclusion in your dissertation.

    Teaser Project 1: This teaser project sets the stage for the rest of your research. You will focus on the development of GPU-based software to store high-resolution (~5 m) digital topography of the UK in a way that is optimal for future extensive calculations. From this, you will work with different data streams (e.g., Met Office weather predictions and climate reanalysis data; satellite observations of vegetation and land surface change) to downscale them to the resolution of the topography. Lastly, you will integrate different ‘static’ datasets such as soil type and thickness, land use, and transportation networks and infrastructure. These datasets form the basis for understanding how hydrologic processes act across the topography and for identifying regions of extreme rainfall, heat waves, and wind speeds.

    In the process of working on this teaser project, you will interact not only with the team members, but also with other ExaGEO students working on the statistics of extreme weather event identification and on coastal erosion.

    Teaser Project 2: Extreme rainfall events result in flooding and increased landsliding. The frequency and magnitude of flooding and landsliding depend on where the precipitation falls (topography), soil type and thickness, and vegetation. The first step to understanding flood risk is the collection and flow of water across the surface. In this project you will develop parallelised algorithms to route water across the landscape and predict river discharge magnitudes and threats. The results from this project form the basis for future work on understanding how the hydrologic budget influences not only rural and urban flooding, but also the stability of hillslopes and threats to transportation and infrastructure.
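    For orientation, the hedged sketch below performs serial D8 flow accumulation on a tiny synthetic DEM; the PhD version would parallelise this routing step for ~5 m national-scale grids on GPUs. The algorithmic structure is standard, and all grid values are illustrative.

```python
# Hedged sketch: serial D8 flow accumulation on a synthetic DEM.
import numpy as np

rng = np.random.default_rng(7)
ny, nx = 50, 50
dem = np.add.outer(np.linspace(5, 0, ny), np.linspace(2, 0, nx)) \
      + 0.01 * rng.random((ny, nx))

offsets = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]
acc = np.ones((ny, nx))                         # each cell contributes itself

# Visit cells from highest to lowest so upstream area is complete first
order = np.dstack(np.unravel_index(np.argsort(-dem, axis=None), dem.shape))[0]
for i, j in order:
    drops = [(dem[i, j] - dem[i + di, j + dj], di, dj) for di, dj in offsets
             if 0 <= i + di < ny and 0 <= j + dj < nx]
    drop, di, dj = max(drops)                   # steepest-descent neighbour
    if drop > 0:
        acc[i + di, j + dj] += acc[i, j]        # route all upstream area there

print("max upstream cell count:", int(acc.max()))
```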

    Following these teaser projects, and depending on your interests, the research will evolve towards adding new components such as landslide and flood analysis, sediment transport processes and soil loss, and forecasting where and what hazards might emerge in the near future based on weather forecasts.

    Diverse career prospects are possible from this project.  Your skill set will be of interest to Earth science related government and non-governmental agencies, private sector consulting, or continued research in academia. Your skills in software development and high-performance computing also provide ample opportunities for work in other sectors.

    References & Further Reading

    To get a flavour for the type of research we do, check out:

    https://www.youtube.com/watch?v=HG20l9UBOxI

    Some related publications include (to give you a taste):

    Sharma, H. and Ehlers, T. A.: Effects of seasonal variations in vegetation and precipitation on catchment erosion rates along a climate and ecological gradient: insights from numerical modeling, Earth Surf. Dynam., 11, 1161–1181, https://doi.org/10.5194/esurf-11-1161-2023, 2023.

    Schmid, M., Ehlers, T. A., Werner, C., Hickler, T., and Fuentes-Espoz, J.-P.: Effect of changing vegetation and precipitation on denudation – Part 2: Predicted landscape response to transient climate and vegetation cover over millennial to million-year timescales, Earth Surface Dynamics, 6, 859–881, https://doi.org/10.5194/esurf-6-859-2018, 2018.

    Hobley, D. E. J., Adams, J. M., Nudurupati, S. S., Hutton, E. W. H., Gasparini, N. M., Istanbulluoglu, E., and Tucker, G. E.: Creative computing with Landlab: an open-source toolkit for building, coupling, and exploring two-dimensional numerical models of Earth-surface dynamics, Earth Surface Dynamics, 5, 21–46, https://doi.org/10.5194/esurf-5-21-2017, 2017.

  • NAME as a digital twin for explosive volcanic eruptions for streamlined response and real-time impact assessment

    Project institution:
    Project supervisor(s):
    Dr Thomas Jones (Lancaster University), Dr Frances Beckett (Met Office), Dr Sebastian Mutz (University of Glasgow) and Prof Mike James (Lancaster University)

    Overview and Background

    The Met Office is home to the London Volcanic Ash Advisory Centre (VAAC). The role of the London VAAC is to provide advice and guidance to the aviation authorities on the presence of volcanic ash in the atmosphere. Ash forecasts are generated using the Met Office’s Numerical Atmospheric-dispersion Modelling Environment (NAME), initialised with eruption source parameters and driven by meteorological data. Currently, model simulations are conducted for specific individual volcanic events, each with unique eruption source parameters (i.e., the model inputs). This is necessary for real-time eruption response but does not allow broader questions surrounding the interconnectivity between eruption characteristics, meteorological conditions, and the associated impact on airspace to be addressed. The lack of these insights hinders long-term and strategic planning for the presence of volcanic ash in the atmosphere. 

    Methodology and Objectives

    The work is organised under two themes:  

    1. Harnessing machine learning to inform risk-based decision-making for volcanic ash clouds 
    2. Using NAME as a digital twin to support impact assessments. 

    Methods Used: ensemble forecasting; Monte Carlo analysis; numerical weather prediction models; Lagrangian and Eulerian particle dispersion models (NAME); parallel computing; probability mapping; machine learning; the ORCHID GPU cluster on JASMIN; high-performance computing.  

    Teaser Project 1 Objectives: In this teaser project, you will perform probabilistic model simulations to construct a large, queryable dataset. On the order of 100,000 model runs will be conducted, varying the meteorological conditions, the source/volcano location, and the eruption source parameters (e.g., mass eruption rate, plume height, particle size distribution). You will devise an accessible visualisation of this big dataset to enable end-users to explore the impact that specific source terms have on the transport and dispersal of a volcanic ash cloud, both as total column mass loadings (appropriate for comparison with satellite data) and as ash concentrations within each flight level. Furthermore, these data will serve as a training dataset for a machine-learning model designed to run on GPUs in a parallel setting to address the time-sensitive nature of eruption response. Testing will occur on the ORCHID GPU cluster on JASMIN, and this will build towards support for real-time eruption response conducted by the London VAAC, wherein observables (e.g., satellite retrievals, plume height) can be used to query the database for likely corresponding eruption source parameters (e.g., particle size distribution), which are difficult to measure in real time. Without the carefully constructed training dataset proposed here, machine learning techniques have limited applicability to real-time London VAAC forecasts.   
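    As a hedged sketch of how such a run matrix might be constructed, the code below uses Latin hypercube sampling to spread simulations evenly across a few eruption source parameters. The parameter names and ranges are illustrative assumptions, not VAAC settings.

```python
# Hedged sketch: building a probabilistic run matrix with Latin hypercube
# sampling. Ranges below are illustrative placeholders.
import numpy as np
from scipy.stats import qmc

sampler = qmc.LatinHypercube(d=3, seed=8)
unit = sampler.random(n=100_000)

# Columns: plume height (km), log10 mass eruption rate (kg/s), median grain size (phi)
lower, upper = [2.0, 3.0, -2.0], [20.0, 8.0, 6.0]
runs = qmc.scale(unit, lower, upper)

print("first run:",
      dict(zip(["plume_km", "log10_MER", "grain_phi"], runs[0].round(2))))
```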

    Teaser Project 2 Objectives: In this teaser project, you will couple NAME model outputs (potentially the large suite of NAME outputs generated in Teaser Project 1) to several existing models/databases of critical infrastructure and services. Examples of coupling include: 

    • airspace use models to determine the number of flights affected 
    • building type and roof structure databases to determine the likelihood of roof collapse due to ash loading 
    • road and public transport network layout and usage models to determine the number of people affected and delay times due to ash ground accumulation.  

    Coupling NAME with these other models, forming a ‘digital twin’, and ensuring that the result can be parallelised and run on GPUs will enable rapid, quantitative impact assessments to be made for explosive volcanic eruptions. Then, when exploiting exascale resources, the digital twin could be used to provide evidence for informed decision making at speed. Example questions that could be addressed include: is there a relationship between the eruption start time and the area of airspace affected? Do specific weather regimes cause a more severe disruption to air traffic/ public transport/ roads? How does impact scale with mass eruption rate? 

    References & Further Reading

    https://www.metoffice.gov.uk/research/approach/modelling-systems/dispersion-model 

    https://jasmin.ac.uk  

    Madankan, R., Pouget, S., Singla, P., Bursik, M., Dehn, J., Jones, M., Patra, A., Pavolonis, M., Pitman, E.B., Singh, T. and Webley, P., 2014. Computation of probabilistic hazard maps and source parameter estimation for volcanic ash transport and dispersion. Journal of Computational Physics, 271, pp.39-59. 

    Leadbetter, S.J., Jones, A.R. and Hort, M.C., 2022. Assessing the value meteorological ensembles add to dispersion modelling using hypothetical releases. Atmospheric Chemistry and Physics, 22(1), pp.577-596. 

    Capponi, A., Harvey, N.J., Dacre, H.F., Beven, K., Saint, C., Wells, C. and James, M.R., 2022. Refining an ensemble of volcanic ash forecasts using satellite retrievals: Raikoke 2019. Atmospheric Chemistry and Physics, 22(9), pp.6115-6134. 

    Beckett, F., Barsotti, S., Burton, R., Dioguardi, F., Engwell, S., Hort, M., Kristiansen, N., Loughlin, S., Muscat, A., Osborne, M. and Saint, C., 2024. Conducting volcanic ash cloud exercises: practising forecast evaluation procedures and the pull-through of scientific advice to the London VAAC. Bulletin of Volcanology, 86(7), p.63. 

    Hayes, J.L., Wilson, T.M. and Magill, C., 2015. Tephra fall clean-up in urban environments. Journal of Volcanology and Geothermal Research, 304, pp.359-377. 

  • Smart sensing for ecological catchments

    Project institution:
    Project supervisor(s):
    Dr Craig Wilkie (University of Glasgow), Dr Lawrence Bull (University of Glasgow), Dr Stephen Thackeray (Lancaster University), Prof Claire Miller (University of Glasgow), Prof Amy Pickard (UKCEH), Dr Liam Godwin (UHI) and Prof Roxane Andersen (UHI)

    Overview and Background

    Ecological catchments are vital for sustaining the environment, agriculture, and urban development, yet in the UK only 33% of rivers and canals meet ‘good ecological status’ (JNCC, 2024). Furthermore, around 80% of the UK’s peatlands are in a dry and degraded state. We must maintain these carbon-rich ecosystems, which cover just 3% of the world’s surface while holding nearly 30% of its soil carbon (Forestry England, 2025). 

    Both ecosystems are affected by agriculture, waste, and urban and infrastructure development; therefore, monitoring across these environments is essential to mitigate degradation. While individual sensors are increasingly affordable, sensing at scale remains limited, particularly in the remote and spatially complex environments of ecological catchments.  

    This project investigates edge processing and sensing, and how they can be designed specifically to enable more effective exascale computing given distributed telemetry from ecological catchments. Methods for combining these data into systems-level sensing will be investigated, capturing interactions between ecological processes. The project combines embedded computation and machine learning with data and model interoperability across ecological systems. 

    Methodology and Objectives

    Methods used 

    Initially, the student will consider how computation at the edge and smart data collection can be designed to aid exascale computation. While exascale methods present many opportunities, they come at a high cost in terms of analytics and data storage, especially when relying on cloud services. In response, the project will design edge computation, especially preprocessing, to significantly reduce data loads (as raw data are typically high-resolution), enabling analytics in near real-time. Some skills that will be developed: 

    • Machine learning (ML): embedded/TinyML, from simple novelty detection to more complex models using embedded GPU/TPUs, programming (Python). 
    • Statistics: focus on statistical and interpretable ML with uncertainty quantification, to aid decision making. 
    • Uncertainty quantification (UQ): scalable UQ methods suitable for integration into large-scale or exascale modelling, including surrogate representations where appropriate. 
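
    As a flavour of the kind of edge preprocessing this involves, the sketch below flags anomalous windows in a sensor stream so that only informative segments need be transmitted upstream. It is a minimal illustration only: the window length, threshold and synthetic signal are placeholder choices, not project specifications.

    import numpy as np

    def novel_windows(stream, window=60, threshold=4.0):
        """Yield (start, window) pairs whose window mean is anomalous relative
        to a running baseline built from earlier, non-anomalous windows."""
        means = []                                   # history of window means
        for start in range(0, len(stream) - window + 1, window):
            w = stream[start:start + window]
            m = w.mean()
            if len(means) > 10:                      # wait for a baseline first
                mu, sd = np.mean(means), np.std(means) + 1e-9
                if abs(m - mu) / sd > threshold:
                    yield start, w                   # transmit only this window
                    continue                         # keep events out of baseline
            means.append(m)

    # Example: a 1 Hz signal with one injected event; ~1 of 100 windows is sent.
    rng = np.random.default_rng(0)
    x = rng.normal(0.0, 1.0, 6000)
    x[3000:3060] += 5.0                              # simulated pollution pulse
    print(len(list(novel_windows(x))), "of", len(x) // 60, "windows transmitted")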

    [Teaser Project 1] Edge and cloud computation  

    The project will lay the groundwork for a smart monitoring system, designed to be embedded within sensing devices (such as NVIDIA Jetson or Google’s Coral AI) by the end of the PhD. These tools will be specifically designed to aid data assimilation for coupled models that simulate catchment interactions over large areas and long timescales, which require exascale computation. Tools will include signal processing, monitoring algorithms, or more advanced machine-learning techniques. The required data collection and analytics will be scoped with project partners, and the student will investigate models/software for edge implementation and propose developments for assimilation into exascale models. This would likely involve building models in Python (TensorFlow, Keras, or JAX) and then converting them into an edge-AI device format (e.g. LiteRT), as sketched below. 
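
    A minimal sketch of that conversion workflow, assuming TensorFlow’s LiteRT (formerly TensorFlow Lite) converter; the model architecture, input shape and quantisation choice are placeholders.

    import tensorflow as tf

    # Build a small placeholder model (e.g. mapping 32 summary features from a
    # sensor window to a novelty score).
    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=(32,)),
        tf.keras.layers.Dense(16, activation="relu"),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy")

    # Convert to the LiteRT flatbuffer format for on-device inference;
    # Optimize.DEFAULT enables post-training quantisation to shrink the model.
    converter = tf.lite.TFLiteConverter.from_keras_model(model)
    converter.optimizations = [tf.lite.Optimize.DEFAULT]
    with open("novelty_model.tflite", "wb") as f:
        f.write(converter.convert())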

    Areas of focus: 

    • Development of more scalable sensing systems 
    • Monitoring of water and peatland quality indicators 
    • Underpinned by considerations of coupled systems monitoring at the exascale 

    [Teaser Project 2] Systems-level analysis of aggregate models and data 

    This project considers how one aggregates information from smart sensors to inform a whole catchment-systems analysis. It considers interconnected sensors within a catchment, and how distributed information can inform systems-level decision-making. For example, as smart sensors allow for active control, the study might consider how data collection activities, power schedules, and maintenance can be modified given the ‘bigger picture’, with developments taken forward for implementation and assessment throughout the PhD. Some relevant topics include: 

    • Adaptive experimental design 
    • Model fusion and federation 
    • Policy learning 
    • Decision analysis 

    This initial teaser project stands alone as a development from current sensor systems. However, depending on the interests of the student, there is scope beyond the teaser-project phase to combine developments from both teaser projects. 

  • Statistical Emulation Development for Landscape Evolution Models

    Project institution:
    Project supervisor(s):
    Dr Benn Macdonald (University of Glasgow), Dr Mu Niu (University of Glasgow), Dr Paul Eizenhöfer (University of Glasgow), Dr Eky Febrianto (University of Glasgow) and Dr Mark Bull (University of Edinburgh)
    Figure: Landscape evolution model of Central Nepal, including its range of input parameter types.

    Overview and Background

    Many real-world processes, including those governing landscape evolution, can be effectively described mathematically via differential equations. These equations describe how processes, e.g. the physiography of mountainous landscapes, change with respect to other variables, e.g. time and space. Conventional approaches to statistical inference involve repeated numerical solving of the equations: every time the parameters of the equations are changed in a statistical optimisation or sampling procedure, the equations must be re-solved numerically. The associated large computational cost limits advancements when scaling to more complex systems, the application of statistical inference and machine learning approaches, and the implementation of more holistic approaches to Earth System science. This leads to the need for an accelerated computing paradigm involving highly parallelised GPUs for evaluating the forward problem. 

    Beyond advanced computing hardware, emulation is becoming a popular way to tackle this issue. The idea is that the differential equations are first solved at as many parameter settings as is feasible, and the output is then interpolated using statistical techniques. When inference is subsequently carried out, the emulator’s predictions replace the differential equation solutions. Since prediction from an emulator is very fast, this avoids the computational bottleneck, and if the emulator is a good representation of the differential equation output, parameter inference remains accurate. 
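
    The sketch below illustrates the emulation idea on a toy one-parameter problem, assuming scikit-learn’s Gaussian process regressor; the ‘expensive model’ is a stand-in for a numerical landscape evolution solve.

    import numpy as np
    from sklearn.gaussian_process import GaussianProcessRegressor
    from sklearn.gaussian_process.kernels import RBF

    def expensive_model(theta):
        # stand-in for a numerical solve of the differential equations
        return np.sin(3 * theta) + 0.5 * theta

    theta_train = np.linspace(0, 2, 15).reshape(-1, 1)   # design points
    y_train = expensive_model(theta_train).ravel()       # "solver" output

    gp = GaussianProcessRegressor(kernel=RBF(length_scale=0.3), alpha=1e-8)
    gp.fit(theta_train, y_train)

    # At inference time, candidate parameters are scored with the emulator
    # (microseconds per call) instead of a fresh numerical solve; return_std
    # also gives the emulator's own uncertainty about its interpolation.
    mean, sd = gp.predict(np.array([[0.42], [1.7]]), return_std=True)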

    This work is highly relevant to Earth Science research. By facilitating the practical use of these systems through emulation, we can gain insights into Earth’s processes, for example, predicting potential triggers for natural hazards such as landslides. Other insights include: tracing locations of erosion and/or deposition for potential critical resource identification, determining principal and potentially hidden natural or anthropogenic drivers of landscape evolution globally, and gaining further understanding of how future climate change will affect Earth’s surface.

    There are a number of applied and methodological directions to which this work can be generalised. For example, since a main focus of this project is making these systems faster and more efficient for use in practice, a natural extension would be to upscale model resolution to the metre scale, allowing for finer analysis and direct assessment of natural hazard risks. Once the principal work is complete, it also provides a framework for scaling to systems with additional modules, such as ecological factors, advanced drainage flow routing components, and elements that model high-intensity/high-frequency storm events. Future work could also explore and adapt the improved emulation strategies we develop, making digital twinning more viable for use in Earth Science research, as well as in other fields of study. 

    Methodology and Objectives

    Methods Used: Gaussian process interpolation (for building the emulator), Bayesian inference (for parameter inference), geomorphological analyses, surface processes modelling. 

    Teaser Project 1 Objectives: GPU-accelerated differential equation solver. Geodynamic models in Earth Science are used to simulate a range of natural processes. Landscape evolution models specifically contain, amongst others, equations that describe surface processes such as erosion and sediment deposition, as well as rock/surface uplift and aspects of climate change. However, the numerical solver executes sequentially, rather than generating solutions in parallel. This first teaser project will commence at the beginning of the PhD project (semester 1) and will focus on familiarising the student with parallel computing via GPUs, including the optimisation of existing landscape evolution models for GPU use. At the same time, the student will take training from ExaGEO, equivalent to 20 UoG credits, in GPU programming and exascale principles. This teaser project will support the PhD project in developing robust, reliable and efficient emulators for landscape evolution models, utilising GPU power, which will allow for a denser training set and the inclusion of a broader variety of geomorphological scenarios. It will also give insight into possible GPU acceleration of the emulation process itself. 
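
    To give a flavour of how grid-based solvers map onto GPUs, the toy sketch below advances a deliberately simple uplift-plus-diffusion surface model using JAX, whose jit-compiled array code runs unchanged on CPU or GPU; the physics and parameter values are illustrative stand-ins, not the project’s actual model.

    import jax
    import jax.numpy as jnp

    @jax.jit
    def step(z, dt=100.0, dx=50.0, D=0.01, U=1e-4):
        """One explicit step of dz/dt = U + D * laplacian(z) with periodic
        boundaries; every grid cell is updated in parallel on the device."""
        lap = (jnp.roll(z, 1, 0) + jnp.roll(z, -1, 0) +
               jnp.roll(z, 1, 1) + jnp.roll(z, -1, 1) - 4 * z) / dx**2
        return z + dt * (U + D * lap)

    z = jnp.zeros((1024, 1024))          # elevation grid [m]
    for _ in range(1000):                # runs on a GPU if one is available
        z = step(z)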

    Teaser Project 2 Objectives: Emulator development. The second teaser project will look at creating an emulator for a simple mathematical model describing elevation change as a function of spatial and temporal variations in surface uplift and the efficiency of erosion. This will take place in semester 2, and the student will simultaneously undergo training from ExaGEO in statistical and numerical methods in computing, complementing the student’s research aims at this stage. The skills developed during this teaser project, in combination with those attained from Teaser Project 1, will set the student up well to develop efficient emulators for more complex landscape evolution models as the PhD project evolves.  

    The student will be well supported by the supervisory team. Dr Eizenhöfer has expertise in landscape evolution modelling and Earth System science, Dr Macdonald and Dr Niu have expertise in developing statistical methodology in the area of statistical emulation and Dr Febrianto has expertise in highly parallelised architecture for scientific computing and will be able to advise on software development and design with open-source vision, as well as aspects of the GPU software development. 

    References & Further Reading

    Rasmussen, C.E. & Williams, C.K.I. (2006). Gaussian Processes for Machine Learning. The MIT Press. ISBN 0-262-18253-X. 

    Donnelly, J., Abolfathi, S., Pearson, J., Chatrabgoun, O., & Daneshkhah, A. (2022). Gaussian process emulation of spatio-temporal outputs of a 2D inland flood model. Water Research. Volume 225. ISSN 0043-1354. 

    Clark, M. K., Royden, L. H., Whipple, K. X., Burchfiel, B. C., Zhang, X., & Tang, W. (2006). Use of a regional, relict landscape to measure vertical deformation of the eastern Tibetan Plateau. Journal of Geophysical Research: Earth Surface, 111(F3). 

    Eizenhöfer, P. R., McQuarrie, N., Shelef, E., & Ehlers, T. A. (2019). Landscape response to lateral advection in convergent orogens over geologic time scales. Journal of Geophysical Research: Earth Surface, 124(8), 2056-2078. 

    Mutz, S. G., & Ehlers, T. A. (2019). Detection and explanation of spatiotemporal patterns in Late Cenozoic palaeoclimate change relevant to Earth surface processes. Earth Surface Dynamics, 7(3), 663-679. 

    Whipple, K. X., Forte, A. M., DiBiase, R. A., Gasparini, N. M., & Ouimet, W. B. (2017). Timescales of landscape response to divide migration and drainage capture: Implications for the role of divide mobility in landscape evolution. Journal of Geophysical Research: Earth Surface, 122(1), 248-273. 

    Whittaker, A. C., & Boulton, S. J. (2012). Tectonic and climatic controls on knickpoint retreat rates and landscape response times. Journal of Geophysical Research: Earth Surface, 117(F2). 

    Yang, R., Willett, S. D., & Goren, L. (2015). In situ low-relief landscape formation as a result of river network disruption. Nature, 520(7548), 526-529. 

    Zachos, J. C., Dickens, G. R., & Zeebe, R. E. (2008). An early Cenozoic perspective on greenhouse warming and carbon-cycle dynamics. Nature, 451(7176), 279-283. 

  • Towards a volcano digital twin: Coupled models of shallow conduit processes at basaltic volcanoes

    Project institution:
    Project supervisor(s):
    Prof Mike James (Lancaster University), Dr Tobias Keller (University of Glasgow) and Dr Thomas Jones (Lancaster University)

    Overview and Background

    Basaltic volcanic systems present a wide range of hazards, ranging from the persistent release of toxic gases from lava lakes to eruptions that can inject volcanic plumes kilometres into the atmosphere. To support next-generation hazard forecasting and response, a volcano digital twin model will need to assimilate real-time monitoring data with numerical models of sub-surface magma processes. However, near the Earth’s surface, rising magma is a complex and rapidly evolving multi-phase compressible fluid, which makes modelling magma flow highly challenging and computationally expensive. This project aims to develop large-scale conduit models that are capable of being integrated into real-time decision support systems, such as a digital twin. Simulations will need to be sufficiently fast for ensemble modelling to be a viable approach for forecasting hazard change and quantifying associated uncertainties. 

    Methodology and Objectives

    Figure: A three-phase model of bubble-driven and crystal-hindered convection in a basaltic lava lake, as a simpler precursor to the simulations envisaged for this project.

    Methods Used:  

    The project will use Julia-based software packages (Chmy.jl, ParallelStencil.jl, ImplicitGlobalGrid.jl) to develop staggered-grid finite-difference models of the thermo-chemical-mechanical evolution of volcanic plumbing systems. The advantage of these Julia-based solutions is that they allow simulation codes to be written in a backend-agnostic manner, meaning the same code can run efficiently on CPUs and all major GPU architectures. Models will be custom-built on the basis of the multi-phase reaction-transport model framework of Keller & Suckale (2019). Experimental and observational constraints, as well as state-of-the-art thermodynamic models for gas-melt and melt-solid phase equilibria (e.g., VolFe, MAGEMin), will be used to calibrate the relevant mechanical and thermodynamic properties which govern flow, gas exsolution, and crystallisation reactions. 

    Teaser Project 1: Melt-gas coupling and magma convection in open basaltic conduits

    For this teaser project, you will develop a numerical model of gas loss (degassing) and magma flow within the subsurface plumbing system (i.e., conduit) of a basaltic volcano. As is true for many basaltic volcanoes worldwide, you will impose gas influx but no net magma throughput (i.e. an open, degassing system that is not generating ash plumes or lava flows). Magma density changes from gas influx at the base, gas exsolution and expansion, and gas loss at the surface will drive convection within the conduit. Simultaneously, heat loss and dehydration will drive partial crystallisation of the magma, which affects its flow properties. Model outputs will include time series of magma pressure, surface height, and gas and magma chemistry, to enable comparison with field-measurable geophysical signals (e.g. seismics), remote sensing data (e.g. gas flux and composition), and petrological observations (crystal and glass chemical compositions). You will use this coupled model to map the generation and evolution of flow instabilities and variability across the parameter space of conduit geometry, gas influx, and melt composition (affecting density, volatile solubility, and rheology). This will result in a large output database of time series, from which you will extract identifiable patterns and characteristics using machine learning methods. Over the course of the PhD, this project could be extended by additionally coupling the fluid dynamics model with its surrounding solid edifice, to enable simulation of seismic datasets that would be measurable at the surface. Together, the large dataset of model outputs across the full range of the natural parameter space could act as training data to support the automated inversion of real-time observables (e.g., seismics, gas fluxes) to the subsurface flow processes. This would provide an invaluable ‘window’ into the subsurface and support real-time eruption forecasting and response. 

    Teaser Project 2: Dynamics of basaltic magma fragmentation and gas decoupling during eruption 

    This teaser project focuses on modelling the gas-driven fragmentation process that can occur in erupting basaltic conduits, and on exploring its wider influence on conduit flow. Fragmentation evolves the flow from liquid melt as the connected phase (i.e. gas bubbles within magma) to gas as the connected phase (i.e. fluidal clots of magma within a gas stream). Magma fragmentation results in strong flow viscosity gradients that couple pressure changes into the bulk flow. You will use the model to explore how such changes may influence magma ascent at depth and could drive variability and transitions in eruption style (i.e., lava effusion vs explosive degassing), potentially resulting in large-scale, paroxysmal explosive activity. As for Teaser Project 1, a large database of model simulation outputs will enable sensitivities to be explored and any tipping points to be identified. The project can be expanded by extending the model to encompass above-ground observations, for example, to quantify the expected size and velocity distributions of magma clots or to simulate infrasound signals. Ultimately, this will enable geophysical and remote sensing observations to be inverted for dynamic hazard assessment based on near real-time conduit and eruption conditions. 

    References & Further Reading

    Birnbaum, J., Keller, T., Suckale, J. & Lev, E. (2020) Periodic outgassing as a result of unsteady convection in Ray lava lake, Mount Erebus, Antarctica, Earth Planet. Sci. Letts, 530, 115903. https://doi.org/10.1016/j.epsl.2019.115903 

    Keller, T. & Suckale, J. (2019) A continuum model of multi-phase reactive transport in igneous systems, Geophys. J. Int., 219, 185–222. https://doi.org/10.1093/gji/ggz287 

    Jones, T.J., Reynolds, C.D. & Boothroyd, S.C. (2019) Fluid dynamic induced break-up during volcanic eruptions, Nat. Commun., 10, 3828. https://doi.org/10.1038/s41467-019-11750-4 

    Pering, T.D., McGonigle, A.J.S., James, M.R., Capponi, A., Lane, S.J., Tamburello, G. & Aiuppa, A. (2017) The dynamics of slug trains in volcanic conduits: Evidence for expansion driven slug coalescence, J. Volcanol. Geotherm. Res., 348, 26–35. https://doi.org/10.1016/j.jvolgeores.2017.10.009  

    Wong, Y-Q & Keller, T. (2023) A unified numerical model for two-phase porous, mush and suspension flow dynamics in magmatic systems, Geophys. J. Int., 233, 769–795. https://doi.org/10.1093/gji/ggac481 

 

Projects with a focus on Sustainability Solutions in Engineering, Environmental, and Social Sciences:

 

  • AI Weather Prediction for Renewable Energy Forecasting

    Project institution:
    Project supervisor(s):
    Dr Xiaochen Yang (University of Glasgow), Prof Jethro Browell (University of Glasgow), Mr Dan Travers (Open Climate Fix) and Mr Jack Kelly (Open Climate Fix)

    Overview and Background

    AI weather models replace components of traditional numerical weather prediction systems with deep neural networks underpinned by GPUs. They have developed rapidly over recent years, offering potential gains in forecast skill and reduced computational demand, with leading weather centres now running AI weather models operationally. However, many research questions remain open, and validation in key application domains, including the energy sector, is lacking. This project will deploy and develop AI weather models to optimise performance for renewable energy forecasting, e.g. by focusing on relevant atmospheric parameters (near-surface winds, clouds, radiation) and directly forecasting and/or assimilating power production from wind and solar farms. This project is partnered with Open Climate Fix, an award-winning AI-first, renewable power forecasting company.

    Methodology and Objectives

    For both Teaser Projects:

    ECMWF recently proposed an AI weather model, the Artificial Intelligence Forecasting System (AIFS) [1]. The model is trained on ERA5 reanalysis data or ECMWF’s operational numerical weather prediction (NWP) analyses to produce forecasts for upper-air variables, surface weather parameters and tropical cyclone tracks. AIFS employs an encoder–processor–decoder architecture inspired by recent advances in graph and transformer networks, offering substantial improvement in computational efficiency compared with traditional NWP approaches.

    GPU requirement: According to [1], full AIFS training requires 64 NVIDIA A100 (40 GB) GPUs, while inference (forecast generation) is achievable on a single A100 GPU. Fine-tuning, which updates only a small subset of parameters, is expected to demand 4-8 GPUs; the specific GPU requirements depend on model size and precision settings.

    Teaser Project 1 Objectives: Wind speed forecasting
    Background: Wind speeds at heights corresponding to wind turbine rotors are key inputs to wind power forecasts; however, computing wind speeds at heights between 10m and 300m is challenging due to atmospheric stability and boundary layer effects.
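
    For context, the sketch below shows two standard physics baselines for extrapolating a 10 m wind speed to rotor heights: the power law and the logarithmic profile. The shear exponent and roughness length are illustrative values, and both baselines ignore exactly the stability and boundary-layer effects that make this problem hard.

    import numpy as np

    def power_law(u_ref, z_ref, z, alpha=0.14):   # alpha ~ neutral offshore shear
        return u_ref * (z / z_ref) ** alpha

    def log_law(u_ref, z_ref, z, z0=0.0002):      # z0 ~ open-sea roughness [m]
        return u_ref * np.log(z / z0) / np.log(z_ref / z0)

    u10 = 8.0                                     # measured 10 m wind speed [m/s]
    print(power_law(u10, 10.0, 120.0))            # ~11.3 m/s at 120 m
    print(log_law(u10, 10.0, 120.0))              # ~9.8 m/s at 120 m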

    Description of teaser project: This teaser project will implement the AIFS to forecast wind speeds at these heights and to benchmark its performance against physics-based models. Over a six-month period, the PhD student will undertake the following three tasks.

    1. Download the AIFS model, set it up within a GPU-enabled computing environment and reproduce the forecasting results reported in [1]. This task will require developing a thorough understanding of the model’s architecture and its efficient parallel implementation on GPUs.
    2. Acquire the UKV dataset (Met Office high-resolution data) and the MIDAS data (land surface station data). These datasets are essential for validating the AIFS forecasts and for the subsequent PhD research. This task will train the student in data handling, pre-processing and quality control.
    3. Implement AIFS to forecast wind speeds at heights between 10m and 300m. Model outputs will be compared against NWP forecasts to identify strengths and weaknesses in AIFS.

    Development into a full PhD: The subsequent PhD will adapt AIFS for forecasting atmospheric variables critical to renewable energy. The focus will be on improving spatial resolution from the current 0.25° (~27.75 km) grid spacing to 1.5 km, matching the UKV dataset, and producing location-specific forecasts. Additionally, the adaptation should incorporate physical constraints relevant to the UK’s orography and coastline. To achieve this, the student will investigate parameter-efficient fine-tuning techniques, such as adapter layers for the graph-transformer encoder and decoder and LoRA for the transformer-based processor (sketched below). There is also scope for the student to investigate probabilistic forecasting to quantify forecast uncertainty.
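
    For readers unfamiliar with LoRA, the sketch below shows the core idea on a single frozen linear layer: train only a low-rank update (alpha/r)·B·A alongside the frozen weight matrix. The rank, dimensions and framework (PyTorch) are illustrative choices; this is not the AIFS implementation.

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        """Frozen linear layer plus a trainable low-rank update (alpha/r)*B*A."""
        def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
            super().__init__()
            self.base = base
            for p in self.base.parameters():
                p.requires_grad = False            # freeze pretrained weights
            self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
            self.B = nn.Parameter(torch.zeros(base.out_features, rank))
            self.scale = alpha / rank

        def forward(self, x):
            # y = W x + (alpha/r) * B A x ; only A and B receive gradients
            return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

    layer = LoRALinear(nn.Linear(512, 512))
    trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
    print(trainable)   # 8,192 trainable parameters vs 262,656 in the full layer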

    Teaser Project 2 Objectives: Solar PV power forecasting

    Background: Accurate solar PV power forecasting is vital for maintaining grid stability and optimising renewable energy integration. However, one of the most significant challenges arises from predicting the formation, evolution and dissipation of clouds, which strongly influence solar irradiance at the Earth’s surface. Traditional forecasting pipelines often rely on predicting a suite of intermediate meteorological variables, such as temperature, humidity and cloud fraction, on a regular spatial grid before converting these forecasts into estimates of solar generation. While this multi-step approach is physically interpretable, it introduces additional uncertainty and accumulates errors at each stage. This project proposes a more direct approach that decodes the embeddings from AIFS into estimates of solar power generation, bypassing intermediate meteorological forecasting. Validation will be conducted using real solar generation data across Great Britain’s more than 300 “Grid Supply Point” regions.

    Description of teaser project: Over a six-month period, the PhD student will explore the potential of using embeddings from AIFS to predict solar energy output by completing the following three tasks.

    1. Download the pre-trained AIFS model and set it up within a GPU-enabled computing environment. This task will require developing a thorough understanding of the model’s architecture and its efficient parallel implementation on GPUs.
    2. Obtain historical solar power generation data from Great Britain’s over 300 “Grid Supply Point” regions.
    3. Train a simple fully connected neural network, with the embeddings extracted from AIFS as input, to forecast solar energy generation (see the sketch after this list). The results will be benchmarked against observed data, establishing a baseline for subsequent methodological innovation.
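
    A minimal sketch of task 3 under stated assumptions: the embeddings are treated as fixed feature vectors of an assumed dimension, the data are placeholders, and PyTorch is used as the framework.

    import torch
    import torch.nn as nn

    emb_dim, n_regions = 1024, 300        # assumed embedding size / GSP count
    model = nn.Sequential(
        nn.Linear(emb_dim, 256), nn.ReLU(),
        nn.Linear(256, n_regions),        # one output per Grid Supply Point
    )
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)

    X = torch.randn(4096, emb_dim)        # placeholder AIFS embeddings
    y = torch.rand(4096, n_regions)       # placeholder normalised generation
    for epoch in range(10):
        opt.zero_grad()
        loss = nn.functional.mse_loss(model(X), y)
        loss.backward()
        opt.step()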

    Development into a full PhD: The subsequent PhD will aim to develop end-to-end AI frameworks that forecast renewable energy generation directly from learned weather representations. This constitutes a transfer learning task, as the embeddings originally learned for meteorological forecasting will be repurposed for energy prediction. The research will progress along three directions. First, a custom, data-driven decoder will be designed to transform the embeddings produced by the AIFS model directly into solar power forecasts. Second, if the pre-trained embeddings are found to be suboptimal for energy forecasting, the quality of the embeddings will be refined by fine-tuning the encoder and/or processor of the pre-trained AIFS. Finally, the research will develop a physics-informed decoder that integrates established physical relationships between solar irradiance and meteorological conditions, thereby improving both accuracy and interpretability.

    [1] Chantry, Matthew, et al. “AIFS-ECMWF’s Data-Driven Forecasting System.” 105th Annual AMS Meeting 2025. Vol. 105. 2025.

  • Changing Ecological Role of Coral Reef Marine Protected Areas

    Project institution:
    Project supervisor(s):
    Prof Nick Graham (Lancaster University), Prof Rachel McCrea (Lancaster University), Dr David Bailey (University of Glasgow), Dr James Robinson (Lancaster University) and Prof M Aaron MacNeil (Dalhousie University)

    Overview and Background

    No-take marine protected areas (MPAs) are a key management approach for coral reef ecosystems, with decades of research, including global meta-analyses, establishing the expected ecological outcomes: higher coral cover, greater species richness of fish, and more fish biomass dominated by higher trophic levels. However, climate disturbance and human pressures are fundamentally changing the ecological foundation of coral reefs, with evidence that the ecological outcomes of MPAs are shifting as a result. This project will integrate diverse datasets to assess how the response of coral reefs to MPAs is changing at a global scale, drawing on remote sensing of ocean conditions, human pressures, reef habitats, species phylogeny, and a global coral reef database of underwater surveys from over 2,000 reef sites.

    Methodology and Objectives

    Coral reef ecosystems are rapidly transforming due to climate change and direct human pressure, leading to bottom-up, habitat-mediated shifts in community composition that interact with MPAs aimed at controlling top-down fishing pressure. This PhD will draw on diverse and complex data comprising remotely sensed global coral reef habitat (Allen Coral Atlas), climate-induced heat stress, ocean conditions (NOAA), proxies for human pressure, and species phylogeny. These datasets will be combined with benthic and fish community surveys spanning over 2,000 coral reef sites throughout the tropics, containing up to 30 years of repeated sampling. Using these data, the student will apply advances in machine learning, hierarchical modelling of species communities, and Bayesian modelling to determine and project the changing role of MPAs across the tropics.

    Teaser Project 1: Uncovering compositional shifts in coral reefs within Marine Protected Areas

    This teaser project will identify potential and realised drivers of community composition shifts in coral reef MPAs, using both a space-for-time and a temporal perspective.

    Objective 1 – quantify long-term trends in ocean temperatures, primary productivity and anthropogenic run-off in protected coral reefs, and use these trends to uncover temporal drivers of benthic community composition. The analyses will use Google Earth Engine to process global oceanographic and reef habitat datasets, and draw on machine learning approaches, such as spatial random forest, and on hierarchical Bayesian modelling.
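
    As a toy illustration of this modelling step, the sketch below fits a random forest relating synthetic gridded drivers to a benthic response, with site-grouped cross-validation as a simple guard against spatial leakage; all variables, values and the site structure are placeholders, not project data.

    import numpy as np
    from sklearn.ensemble import RandomForestRegressor
    from sklearn.model_selection import GroupKFold, cross_val_score

    rng = np.random.default_rng(1)
    n = 2000
    X = np.column_stack([
        rng.normal(28, 1.5, n),     # mean SST [deg C]
        rng.gamma(2, 2, n),         # cumulative heat stress (DHW-like)
        rng.normal(0.3, 0.1, n),    # chlorophyll-a proxy
    ])
    y = 60 - 4 * X[:, 1] + rng.normal(0, 5, n)   # synthetic coral cover [%]
    site = rng.integers(0, 50, n)                # reef site identifiers

    rf = RandomForestRegressor(n_estimators=300, random_state=0)
    # group folds by site so spatially clustered samples do not leak across folds
    scores = cross_val_score(rf, X, y, cv=GroupKFold(5), groups=site, scoring="r2")
    print(scores.mean())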

    Objective 2 – determine how reef fish community structure in MPAs is responding to changes in benthic condition (Obj 1). This objective will leverage coral reef fish phylogeny and underwater fish surveys to conduct Hierarchical Modelling of Species Communities (HMSC), incorporating species associations to ensure poorly sampled species are appropriately modelled.

    Further PhD development would investigate the underlying mechanisms of community composition change. This would involve compiling additional datasets on remotely sensed environmental stress variables and key social drivers. These large and interacting databases will require machine learning to handle nonlinearities, and the latest causal discovery methods to uncover the key underlying mechanisms leading to altered community composition and trophic structures in MPAs.

    Teaser Project 2: Projecting Marine Protected Area outcomes for coral reefs globally.

    This teaser project will draw on downscaled projections of key environmental and social drivers that influence coral reef ecology, coupled with contemporary analyses of changing MPA ecological outcomes, to project future MPA performance across scales.

    Objective 1 – Build and compile down-scaled biophysical model projections (e.g. CMIP6) of key environmental (e.g. sea surface temperature anomalies, site species temperature variation, wind and wave energy) and social (human gravity, land use) parameters for coral reef cells globally. By pairing each driver with expert expectations for reef ecological variables (e.g. coral cover), these data will be used to describe different SSP outcomes to 2100 for current coral reef MPAs.

    Objective 2 – for the environmental and social variables showing most change into the future in Objective 1, hindcasts will determine environmental and social changes experienced by >2,000 coral reef sites. Drivers will then be linked to temporal shifts in benthic composition, fish trophic structure, and key community-level processes (e.g. fish productivity), with careful exploration of uncertainty quantification, helping infer how these variables will likely change into the future.

    Further PhD development would model ecological outcomes under different SSP scenarios through to 2100, under different MPA characteristics and for different climate models (e.g. multi-model ensembles). The candidate will determine the trophic and community-scale outcomes under different scenarios, how these will vary spatially, and how uncertain they are. Finally, they will explore optimal configurations of future MPA designations to reach 30% national targets, in the context of optimising for multiple ecological outcomes.

    References & Further Reading

    Lester SE, et al. (2009) Biological effects within no-take marine reserves: a global synthesis. Mar Ecol Prog Ser 384: 33-46 https://www.int-res.com/abstracts/meps/v384/meps08029

    Graham NAJ, Robinson JPW, Smith SE, Govinden R, Gendron G, Wilson SK (2020) Changing role of coral reef marine reserves in a warming climate. Nature Communications 11: 2000 https://www.nature.com/articles/s41467-020-15863-z

    Hadj-Hammou J, et al. (2024) Global patterns and drivers of fish reproductive potential on coral reefs. Nature Communications 15: 6105 https://www.nature.com/articles/s41467-024-50367-0

    Hughes TP, et al. (2018) Spatial and temporal patterns of mass bleaching of corals in the Anthropocene. Science 359: 80-83 https://www.science.org/doi/full/10.1126/science.aan8048

    Mellin C, Brown S, Heron SF, Fordham DA (2025) CoralBleachRisk—Global Projections of Coral Bleaching Risk in the 21st Century. Global Ecology and Biogeography 34: e13955 https://onlinelibrary.wiley.com/doi/full/10.1111/geb.13955

    Cinner JE, et al. (2018) The gravity of human impacts mediates coral reef conservation gains. Proceedings of the National Academy of Sciences of the USA 115: E6116-E6125 https://www.pnas.org/doi/abs/10.1073/pnas.1708001115

  • High-fidelity exascale-enabled infrastructure for analysing the impact of wind farm wakes on wind/sea interactions

    Project institution:
    Project supervisor(s):
    Dr M. Sergio Campobasso (Lancaster University), Prof Adrian Jackson (University of Edinburgh), Dr Evgenij Belikov (University of Edinburgh), Dr Wenxin Zhang (University of Glasgow), Dr Andrea Mazzeo (Lancaster University), Dr Stefano Federico (Institute of Atmospheric Sciences and Climate) and Dr Miriam Marchante Jiménez (Orsted)
    Figure: Wind farm wake at Horns Rev (from Hasager, C.B.; Rasmussen, L.; Peña, A.; Jensen, L.E.; Réthoré, P.-E. Wind Farm Wake: The Horns Rev Photo Case. Energies 2013, 6, 696-716. https://doi.org/10.3390/en6020696).

    Overview and Background

    The extraction of energy from the wind leads to the formation of low-speed regions (wakes) behind wind farms (WFs). Wakes are particularly persistent offshore [2], and were recently shown to affect the heat exchange between sea and atmosphere, due to reduced convective heat transfer close to the sea surface [1]. With worldwide offshore wind capacity on course to exceed 2,000 GW by 2050, WF wakes may alter ocean dynamics and marine ecosystems to extents comparable to anthropogenic climate change [2]. Credibly evaluating the environmental impact of wakes requires regional- to mesoscale climate simulations with high-fidelity WF parametrisations at temporal and spatial resolutions beyond present supercomputers’ capabilities. Using Graphics Processing Unit (GPU) computing [3], this project will develop the code infrastructure to support these simulations on exascale machines, demonstrating prototype physical investigations using the developed technology.

    Methodology and Objectives

    METHODOLOGY
    Two community codes for short-to-long term climate modelling are considered: the Weather Research and Forecasting (WRF) model [4], and the Model for Prediction Across Scales (MPAS) [5]. The codes feature similar models of atmospheric physics, but use different numerical methods. WRF uses structured grids with nested domains to increase resolution in WF wake regions, whereas MPAS uses a single unstructured Voronoi grid with controllable local refinement. WRF has state-of-the-art WF parametrisations [6,7] but little reported GPU work; MPAS uses GPU acceleration but has little reported work on WF parametrisation.
    This research aims to combine the strengths of both codes to develop a reliable, exascale-scalable code for the considered problem. The choice of the baseline code for the project’s core development and demonstrations will follow the teaser projects (TPs) below, which offer hands-on training in climate modelling, wind farm aerodynamics, and distributed-memory and GPU parallel computing, and assess the codes’ strengths. Following the TPs, the student will focus on specific topics, e.g. improving the code’s overall GPU framework or optimising the parallelised WF model within an existing GPU framework, depending on the code selected.

    The TPs will share one test case, to compare the two codes’ predictive capabilities and computational performance (execution speed) without GPU acceleration. The GPU development work will be performed on Lancaster University’s HEC cluster and the Bede supercomputer [9].

    Teaser project 1 (TP1): WRF-based. To investigate and optimise the predictive capabilities of the two WF parametrisations [6,7] in WRF, analyses (TC1) of a North Sea area containing two real WFs [10] will be performed. The capabilities of both models to predict wind turbine (WT) and WF wakes will be optimised using regression methods for the models’ parameters, with lidar and satellite wind speed measurements used to steer the optimisation. Measured WT power will also be used in the process, as this parameter is affected by wakes.
    A second test-case (TC2) without WFs will be used to perform parallel profiling studies of WRF, identifying the code’s computationally most intensive parts and familiarising with its structure. These analyses will identify the code sections that would benefit most from GPU acceleration.
    TC2 will also be used to cross-compare the predictive capability of WRF and MPAS, assessed by comparing predicted near-sea-surface wind speed maps to measurements from satellites and lidars. Boundary and initial conditions for TC1 and TC2 will be taken from the ERA5 global climate reanalysis [8].

    Teaser project 2 (TP2): MPAS-based. First, TC2 will be set up and analysed without GPUs to cross-compare the computational speed and wind-speed prediction capabilities of MPAS and WRF. Then, more comprehensive TC2-based parametric analyses of MPAS performance using different numbers of CPUs and GPUs will be undertaken to study how the performance of the hybrid parallelisation depends on the CPU and GPU counts, and to determine the largest achievable acceleration and the corresponding optimal ratio of GPU to CPU counts – information paramount for exascale porting. These analyses will also familiarise the student with the MPAS structure, knowledge needed to optimally merge wind farm models with the MPAS GPU infrastructure.

    Teaser Project 1 Objectives:

    1. Familiarise with WRF: assess predictive capabilities for 3D wind fields with/without WFs; analyse/optimise the best-suited WF parametrisation.
    2. Assess computational performance and estimate potential of GPU acceleration.

    Teaser Project 2 Objectives: 

    1. Familiarise with MPAS: assess predictive capabilities of 3D wind fields; investigate performance of hybrid CPU/GPU parallelisation.
    2. Investigate optimal integration of WF model into GPU framework.

    References & Further Reading

    1) Akhtar, N. et al., Impacts of accelerating deployment of offshore wind farms on near-surface climate. Sci Rep 12, 18307 (2022). https://doi.org/10.1038/s41598-022-22868-9.
    2) Platis A. et al., First in situ evidence of wakes in the far field behind offshore wind farms. Sci Rep. 2018;8(1):2163. https://www.nature.com/articles/s41598-018-20389-y
    3) Hijma, P., et al. Optimization techniques for GPU programming. ACM Computing Surveys 55.11 (2023): 1-81, https://dl.acm.org/doi/10.1145/3570638.
    4) Powers, J. G., et al. “The weather research and forecasting model: Overview, system efforts, and future directions.” Bulletin of the American Meteorological Society 98.8 (2017): 1717-1737. (see also: Weather Research and Forecasting (WRF) model, https://www.mmm.ucar.edu/models/wrf).
    5) Skamarock, W. C., et al. “A multiscale nonhydrostatic atmospheric model using centroidal Voronoi tesselations and C-grid staggering.” Monthly Weather Review 140.9 (2012): 3090-3105. (see also: Model for prediction across scales (MPAS), https://www.mmm.ucar.edu/models/mpas).
    6) Fitch, A. C. et al., Local and Mesoscale Impacts of Wind Farms as Parameterized in a Mesoscale NWP Model, Mon. Weather Rev., 140, 3017–3038, https://doi.org/10.1175/MWR-D-11-00352.1, 2012.
    7) Volker, P. et al., The Explicit Wake Parametrisation V1.0: A Wind Farm Parametrisation in the Mesoscale Model WRF, Geosci. Model Dev., 8, 3715–3731, https://doi.org/10.5194/gmd-8-3715-2015, 2015.
    8) ERA5 Global Climate Reanalysis. https://www.ecmwf.int/en/forecasts/dataset/ecmwf-reanalysis-v5.
    9) N8 CIR, The Bede supercomputer. https://n8cir.org.uk/bede/.
    10) Orsted, Offshore wind measurement and operation data. https://orsted.com/en/what-we-do/renewable-energy-solutions/offshore-wind/offshore-wind-data.

  • High-resolution nowcasting of wind speed and power generation

    Project institution:
    Project supervisor(s):
    Prof Jethro Browell (University of Glasgow), Dr Joe O'Connor (University of Edinburgh) and Dr Tiffany Vlaar (University of Glasgow)

    Overview and Background

    Operating energy systems with a high penetration of wind power challenges conventional approaches to power system operation. Variability of the wind resource and the resulting power generation must be actively managed, underpinned by predictive analytics. There is an emerging need to forecast not only energy (average power over some time period, typically 15 to 60 minutes) but also the variability of instantaneous power production within these periods. This is challenging, as conventional weather forecasts only predict atmospheric variables, such as wind speed, at hourly resolution. This project will develop novel methods for weather and power forecasting exploiting high-performance computing for high-resolution numerical weather prediction and weather-to-power modelling, including uncertainty quantification. 

    Methodology and Objectives

    Methods Used: Numerical Weather Prediction, Neural Networks, Gradient Boosting, WRF, Generative Modelling 

    Teaser Project 1 Objectives: Generative modelling of sub-hourly wind power generation 

    This project will develop methods for within-day forecasting of wind power variability on sub-hourly time scales based on generative modelling conditioned on conventional Numerical Weather Prediction. Considerations include model architecture and GPU implementation, representation of relevant atmospheric processes, uncertainty quantification and generalisability/transferability between wind farms. GPU software development will be required for GPU-native training of the generative models, with distributed data pipelines and efficient parallelism to fully exploit large GPU clusters. 

    Steps will include: gathering and processing data from two offshore wind farms, Anholt and Westermost Rough, each comprising two years of 10-minute resolution SCADA data; gathering and processing historic NWP data from the ECMWF HRES model and/or UK Met Office UKV model (via the BADC); developing GPU software to implement one or more generative models (e.g. a variational auto-encoder, generative adversarial network, or similar) to produce high-resolution wind power forecasts conditioned on NWP (a compact sketch follows); and establishing an evaluation framework for this type of forecast information, including naïve and competitive benchmark methods. 
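
    As one concrete possibility, the compact sketch below trains a conditional VAE that samples a 10-minute-resolution power trajectory for an hour given NWP features for that hour; the dimensions, data and hyperparameters are placeholders rather than a proposed design.

    import torch
    import torch.nn as nn

    class CondVAE(nn.Module):
        def __init__(self, cond_dim=8, out_dim=6, latent=4):
            super().__init__()
            self.enc = nn.Sequential(nn.Linear(out_dim + cond_dim, 64), nn.ReLU(),
                                     nn.Linear(64, 2 * latent))
            self.dec = nn.Sequential(nn.Linear(latent + cond_dim, 64), nn.ReLU(),
                                     nn.Linear(64, out_dim), nn.Sigmoid())
            self.latent = latent

        def forward(self, y, c):
            mu, logvar = self.enc(torch.cat([y, c], -1)).chunk(2, -1)
            z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()   # reparameterise
            return self.dec(torch.cat([z, c], -1)), mu, logvar

    model = CondVAE()
    y = torch.rand(256, 6)      # six 10-min capacity factors per hour (placeholder)
    c = torch.randn(256, 8)     # NWP features for the same hour (placeholder)
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    for _ in range(200):        # loss = reconstruction + KL regularisation
        y_hat, mu, logvar = model(y, c)
        recon = ((y_hat - y) ** 2).sum(-1).mean()
        kl = -0.5 * (1 + logvar - mu ** 2 - logvar.exp()).sum(-1).mean()
        opt.zero_grad(); (recon + 0.1 * kl).backward(); opt.step()

    # At forecast time, draw many trajectories for new NWP features:
    with torch.no_grad():
        z = torch.randn(100, model.latent)
        c_new = torch.randn(1, 8).expand(100, -1)
        scenarios = model.dec(torch.cat([z, c_new], -1))   # 100 sampled hours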

    This project may develop into a PhD extending these ideas alone, for instance through development of novel neural network architectures and GPU software implementations, and/or by modelling intra-wind farm effects such as wakes, and/or in combination with high-resolution weather modelling from Teaser Project 2. In all cases, scalability to fleets (100s-1000s) of wind farms and exascale compute will be a necessary component. 

    Teaser Project 2 Objectives: High-resolution weather and wind power forecasting 

    High-resolution (in space and time) numerical weather prediction aims to resolve small and fast atmospheric processes to better describe atmospheric conditions. This project will establish a high-resolution NWP set-up targeting near-surface winds, with boundary conditions coming from conventional NWP. Methods will be developed to convert high-resolution NWP output to wind power production and variability forecasts, including uncertainty quantification. GPU software development will enable offloading of compute-intensive NWP kernels using performance-portable approaches (e.g. OpenMP, OpenACC), guided by profiling, and exascale-ready in-situ post-processing (e.g. via ADIOS2) to handle the extreme data volumes from sub-hourly NWP ensembles. 

    Steps will include: gathering and processing historic NWP data from the ECMWF HRES model and/or UK Met Office UKV model (via the BADC) and from meteorological stations; configuring a WRF model for high-resolution wind speed and direction forecasting at multiple heights, leveraging GPU acceleration; generating a new dataset of re-forecasts using the high-resolution set-up; and validating against observations from meteorological stations and wind farm data from Teaser Project 1. 

    This project may develop into a PhD extending these ideas including novel GPU implementations of WRF/similar, AI weather models, or in combination with ideas from Teaser Project 1. In all cases, scalability to fleets (100s-1000s) of wind farms and exascale compute will be a necessary component. 

     

  • Sufficiency and Carbon Efficiency of exascale computing for environmental modelling and AI

    Project institution:
    Project supervisor(s):
    Prof Adrian Friday (Lancaster University), Dr Carolynne Lord (UKCEH), Dr Kelly Widdicks (UKCEH), Dr Kirsty Pringle (University of Edinburgh) and Prof Mike Berners-Lee (Small World Consulting)

    Overview and Background

    Core research question: Exascale computing exemplifies environmental science’s growing array of ‘digital research infrastructure’ (DRI), offering the exciting promise of exploring new frontiers of knowledge via larger-scale data analyses, models, and, increasingly, the application of AI at a scale not previously possible. As ‘exascale’ implies, vast compute has vast implications for the environment, due to the operational emissions, water use, and embodied material footprints associated with its creation, its embedding into research methods, and its end of life. This raises core challenges for researchers and scientific organisations: how can we transform research software engineering practice to make optimal and sufficient use of the latest hardware (e.g. GPUs) given the legacy and complexity of scientific software; how can we engage practitioners with better tools and feedback to evaluate the implications of digital software and hardware on the environment; and to what extent can this support exploration of gains in new environmental science knowledge against competing digital environmental costs? We are specifically interested in the energy and carbon footprint of exascale computing, in how to radically lower this footprint at and beyond the point of use through better software, and in wider systemic thinking around this, such as notions of ‘sufficiency’ as exemplified by ‘Green AI’, to understand and communicate its impacts and find the right balance of environmental modelling and AI for Earth Systems.

    Why is this relevant to ExaGEO? In the time of the Anthropocene, and given the urgency of the climate crisis, it is imperative that future environmental scientists are equipped with new understandings and principles to support the responsible use of computational methods. Our prior work (ARINZRIT) has found significant gaps in software engineering practice and training, and with legacy scientific software, which have led to inefficient use of state-of-the-art HPC and accelerated hardware (e.g. GPUs), due not least to a lack of feedback on hardware use, the marginal carbon intensity of energy, apportioned embodied emissions, tools and training. This PhD, aligned with the goals of NetDrive and SDRI, will help bolster the exascale cohort with exactly this critical lens, making a positive contribution to the overarching project in terms of better ‘energy and carbon aware’ practice. This will help strengthen the cohort as environmentally responsible scientists, practitioners and green software engineers.

    Methodology and Objectives

    Methods Used:
    The PhD project will advance knowledge in this domain by extending current understanding of the lifecycle impacts of digital research infrastructure (particularly large-scale HPC) for environmental science, and by developing technical tools and practical solutions that support the environmental science community in using these infrastructures sustainably. We will explore both the efficiency and the utilisation of exascale hardware features by scientific software, especially the use of GPU vector operations, AI acceleration, and ‘carbon-efficient methods’ for achieving scientific results at the lowest environmental cost. We also wish to explore to what extent we can improve carbon literacy in green software engineering practice for scientific outcomes, the trade-offs between the environmental impacts of computation and scientific results (for instance, ‘sufficiency’), and how to support scientists to embrace these methods.

    To focus the PhD, and to align most closely with the ExaGEO ambitions, the student will explore the environmental impacts of exascale computing in relation to integrative Earth System modelling and AI. Two transdisciplinary teaser projects in this domain are outlined below, each drawing together methods from computer science, environmental data science, and qualitative social science.

    Teaser Project 1: ‘Scientific software and sustainable exascale Earth System modelling’
    Objectives:

    • Examine the energy and carbon performance of Earth and environmental science software, especially the exploitation (or lack thereof) of GPU acceleration and the underutilisation of concurrency;
    • Work with green software engineering and scientific software communities to improve software, toolchains, feedback and practices, to enable lower-carbon Earth System models and data science/AI methods;
    • Develop technical mechanisms to reduce computational waste and lower the footprint of large-scale HPC (e.g., development of sustainable job queuing systems) for different exascale computing facilities, testing and evaluating these;
    • Produce practical guidance, training and policies to accelerate better and more sustainable use of exascale computing. 

    Teaser Project 2: ‘Lean/green data science for the future of exascale Earth System modelling’
    Objectives:

    • Evaluate the purpose, outcomes and accuracy of common Earth System models, exploring the potential of AI emulation to reduce model complexity and run-time;
    • Build a benchmark framework for evaluating the use-phase performance, marginal emissions (scope 2) and embodied costs (scope 3) of Earth System modelling and AI in exascale high-performance computing (see the sketch after this list);
    • Drive more responsible and leaner use of exascale computing and digital research infrastructure with users and developers, e.g. using AI emulation of models or more efficient or frugal ML and data science alternatives to existing models (cf. ‘Green AI’; Schwartz et al., 2020);
    • Develop technical solutions (e.g., sustainable environmental software in shared repositories), offering environmental scientists more efficient and less impactful models and methods, and testing and evaluating these;
    • Produce practical guidance, training and policies for exascale-ready model and method development that embody sustainability and reduce waste and rebound effects at all stages of innovation and use.
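
    As a small illustration of use-phase feedback, the sketch below instruments a placeholder workload with the open-source codecarbon package, one existing tool for estimating energy use and operational (scope 2) emissions at the point of use; embodied (scope 3) costs would need separate accounting, and the workload and project name are placeholders.

    import numpy as np
    from codecarbon import EmissionsTracker

    tracker = EmissionsTracker(project_name="toy_earth_system_run")
    tracker.start()
    try:
        for _ in range(50):                 # placeholder model workload
            a = np.random.rand(2000, 2000)
            np.linalg.svd(a, compute_uv=False)
    finally:
        kg_co2e = tracker.stop()            # estimated operational emissions
        print(f"estimated emissions: {kg_co2e:.6f} kg CO2e")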

    References & Further Reading

    1. Pringle, K. It’s time to decarbonise digital research, Research Professional News, April 2025,
      https://www.researchprofessionalnews.com/rr-news-uk-views-of-the-uk-2025-april-it-s-time-todecarbonise-digital-research/
    2. Lord, C., Friday, A., Jackson, A., Bird, C., Preist, C., Lambert, S., Kayumbi, G. and Widdicks, K., 2025, April. The world is not enough: growing waste in HPC-enabled academic practice. In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (pp. 1-14).
    3. Freitag, C., Berners-Lee, M., Widdicks, K., Knowles, B., Blair, G.S. and Friday, A., 2021. The real climate and transformative impact of ICT: A critique of estimates, trends, and regulations. Patterns, 2(9).
    4. Widdicks, K., Lucivero, F., Samuel, G., Croxatto, L.S., Smith, M.T., Ten Holter, C., Berners-Lee, M., Blair, G.S., Jirotka, M., Knowles, B. and Sorrell, S., 2023. Systems thinking and efficiency under emissions constraints: Addressing rebound effects in digital innovation and policy. Patterns, 4(2).
    5. Mytton, D. and Ashtine, M., 2022. Sources of data center energy estimates: A comprehensive review. Joule, 6(9), pp.2032-2056.
    6. Juckes, M., Bane, M., Bulpett, J., Cartmell, K., MacFarlane, M., MacRae, M., Owen, A., Pascoe, C. and Townsend, P., 2023. Sustainability in Digital Research Infrastructure: UKRI Net Zero DRI Scoping Project final technical report.
    7. Lannelongue, L., Aronson, H.E.G., Bateman, A., Birney, E., Caplan, T., Juckes, M., McEntyre, J., Morris, A.D., Reilly, G. and Inouye, M., 2023. GREENER principles for environmentally sustainable computational science. Nature Computational Science, 3(6), pp.514-521.
    8. Schwartz, R., Dodge, J., Smith, N. A., and Etzioni, O. Green AI. Commun. ACM 63, 12 (Nov. 2020), 54–63.

  • VINE — Value of Information for Nature & Economics

    Project institution:
    Project supervisor(s):
    Dr Alex Bush (Lancaster University), Dr Katherine Simpson (University of Glasgow), Prof Richard Reeve (University of Glasgow) and Dr Ben Payne (Natural England)

    Overview and Background

    To transform societies towards a sustainable footing we must reposition how environmental impacts are considered within standard economic models. Nature Markets are emerging as a means of mobilising finance and incentivising sustainable land management and ecosystem restoration, but also risk becoming greenwash. A key challenge is how biodiversity units are measured and traded, as this underpins both ecological success and economic efficiency. The recently developed irreplaceability metric offers a promising solution by capturing ecological complexity while supporting cost-effective investments. However, uncertainties in data, climate impacts, and socioeconomic factors remain critical concerns. Addressing these challenges through integrated risk management and advanced data science offers an opportunity to design resilient and credible Nature markets that align economic and conservation objectives. 

    Methodology and Objectives

    In the last 5 years, the UK has passed some progressive environmental policies that other nations are hoping to learn from and emulate. Increasing private investment through nature markets is a core goal, and Biodiversity Net Gain (England) defines a mandatory obligation for developers to engage with ecosystem restoration. Gaps in the current legislation and policy have been widely publicised, but it remains a step in the right direction. This project will build upon the irreplaceability metric proposed by Bush et al. (2023), drawn from the literature on systematic conservation planning (SCP), to define the strategic value of landscapes. If successful, a method for systematically scoring Nature will offer a robust and transparent framework for achieving Nature-positive futures that serve everyone – and change how society perceives Nature and the services we gain from it. 

    While the original concepts could be demonstrated with basic simulations, and could adopt several simplifying assumptions, operationalising this new form of nature market will require the development of new tools that suit the volume, velocity and variability of Big Data, are flexible to users’ demands, and face the realities of sparse and incomplete environmental data. The first teaser therefore focuses on strengthening our ability to reassure stakeholders that the recommendations are robust to uncertainty. Users’ concerns over uncertainty also underpin the second teaser, which seeks to show how irreplaceability markets would create an opportunity to optimise new monitoring and data collection and reduce costs. Each will require new method development, as well as adaptation to maximise efficiency within CPU/GPU environments. 

    Methods: The project will combine simulation modelling with prototype policy scenarios using UK environmental data. Simulations allow testing of strategies under “complete knowledge” conditions, enabling rigorous evaluation of uncertainty management. Conversely, while true ecological and environmental data are incomplete, defining the spatial covariance and distribution of features helps refine the problem space, as well as improve the communication of the project outcomes.  

    Teaser Project 1 Objectives: Promises that environmental sustainability can be achieved through market means will only be trusted if decisions are transparent. Yet uncertainties emerge at many stages, including in our knowledge of ecosystems, how to restore them, profitability for landowners, and of course outcomes over long-term forecasts. Standard approaches to SCP do not integrate objectives for risk arising from future climate uncertainty, but tools exist within the risk management sector for precisely this purpose and provide a rich opportunity for innovation. This project will test how to integrate different sources of uncertainty through existing methodologies like Modern Portfolio Theory (sketched below), alongside those that preserve the benefits of the systematic approach to Nature markets. Subsequent tasks could then focus on the further challenge of scaling those solutions to Big Data, including UK-relevant datasets on land use, biodiversity, land values and climate change scenarios. 
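
    To make the Modern Portfolio Theory analogy concrete, the toy sketch below chooses weights across restoration ‘sites’ by trading expected biodiversity benefit against variance under sampled future scenarios. The data are synthetic and the optimiser is a deliberately simple projected-gradient heuristic, not a proposed method.

    import numpy as np

    rng = np.random.default_rng(7)
    n_sites, n_scenarios = 12, 500
    # biodiversity benefit of each site under sampled future scenarios, with a
    # shared "climate shock" term inducing correlation between sites
    benefits = (rng.normal(1.0, 0.3, (n_scenarios, n_sites))
                + rng.normal(0, 0.2, (n_scenarios, 1)))
    mu, cov = benefits.mean(0), np.cov(benefits.T)

    def portfolio(risk_aversion, steps=5000, lr=0.01):
        # maximise mu.w - (lambda/2) w'Cov w  s.t.  w >= 0, sum(w) = 1,
        # via a crude projected-gradient loop (adequate for a sketch)
        w = np.full(n_sites, 1 / n_sites)
        for _ in range(steps):
            w = np.clip(w + lr * (mu - risk_aversion * cov @ w), 0, None)
            w /= w.sum()
        return w

    for lam in (0.1, 5.0):                  # low vs high risk aversion
        w = portfolio(lam)
        print(lam, round(float(w @ mu), 3), round(float(w @ cov @ w), 4))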

    Teaser Project 2 Objectives: Monitoring is fundamental to any systematic approach, but diverse ecological and environmental surveys cannot be sustained at large scales, posing a potential barrier to the market. However, when sources of uncertainty are identified and a decision criterion is well defined (i.e. irreplaceability), we can act systematically to reduce the uncertainty present by understanding the Value of Information (VoI). VoI methods are commonly used in other fields but have rarely been adopted in conservation settings. This project provides a chance to demonstrate how monitoring and research can become highly organised, taking advantage of a range of novel technologies at different times to minimise regulatory costs. We propose exploring Bayesian Decision Analysis, Information-Gap Decision Theory, and Real Options Analysis to evaluate VoI-based adaptive learning strategies on synthetic environments to improve scalability, before proceeding to UK national datasets. A minimal worked example of the VoI idea follows. 
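
    The worked example below computes the expected value of perfect information (EVPI) for a two-action, two-state decision, the simplest form of the VoI calculation; all probabilities and utilities are illustrative numbers.

    import numpy as np

    p = np.array([0.4, 0.6])              # prior P(declining), P(stable)
    # utility[action, state]: net benefit of each action in each true state
    utility = np.array([[10.0, -2.0],     # intervene
                        [-8.0,  4.0]])    # do nothing

    best_prior = (utility @ p).max()                 # decide now: 2.8
    best_informed = (utility.max(axis=0) * p).sum()  # decide knowing the state: 6.4
    evpi = best_informed - best_prior
    print(f"EVPI = {evpi:.2f}")  # 3.6: an upper bound on what monitoring is worth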

    References & Further Reading

    Bush et al. (2024). Systematic nature positive markets. https://doi.org/10.1111/cobi.14216 

    Hanley and Simpson (2025), Markets in Biodiversity Offsets. https://doi.org/10.1111/1467-8489.70027 

    Bolam et al. (2019), Using the Value of Information to improve conservation decision making. https://doi.org/10.1111/brv.12471 

    Popov et al. (2022). Managing risk and uncertainty in systematic conservation planning with insufficient information. https://doi.org/10.1111/2041-210X.13725