UCL CASA The Bartlett UrbanAI 2026

Shaping
Cool Cities

Multi-source data fusion for modelling urban heat mitigation across six European cities

Gerardo Ezequiel Martin Carreno

MSc Urban Spatial Science · UCL

Multi-source data fusion: satellite, street network, and street-level views converging to map urban heat and cooling

Six cities · Five data sources · 118 features · Open data

WhyHowFindingsAction

61,000

Deaths from heat across Europe.
One summer. 2022.

Among the deadliest natural disasters in modern European history.
Streets designed to maximise winter sunlight had become heat traps.

Ballester et al. (2023) Nature Medicine

WhyHowFindingsAction

Three Gaps Preventing Action

Integration

Studies look at buildings, trees, or streets separately. Heat comes from how they interact.

Transferability

The same tree cools 8–12°C in Berlin but only 0–4°C in Athens. Same strategy, different results.

Action

Planners need the where, the why, and the how. Most models only give the first.

RQ1: What drives urban heat across contexts?

RQ2: Where should cities intervene to protect the most vulnerable?

Three research gaps: Integration, Transferability, and Action
Schwaab et al. (2021) Nat. Comms. · Camps-Valls et al. (2025) Nature Communications. · Lundberg & Lee (2017) SHAP.
WhyHowFindingsAction

Six Cities, Three Climates

Oceanic (Cfb): Amsterdam, Paris

Mediterranean (Csa): Athens, Barcelona

Transitional: Berlin Cfb/Dfb · Madrid Csa/BSk

40,344 grid cells at 30m resolution · ~6 km² per city

30m study area grids across six European cities
Stewart & Oke (2012) BAMS. · Yap et al. (2023) Urbanity, npj Urban Sustainability.
WhyHowFindingsAction

Five Data Sources, One Framework

118 features from 5 open-source tools. Zero proprietary data. Any European city can replicate this tomorrow.

Analytical framework diagram
Gorelick et al. (2017) GEE · Fujiwara et al. (2026) VoxCity · Yap et al. (2023) Urbanity · Hou et al. (2024) GlobalStreetscapes · Milojevic-Dupont et al. (2023) EUBUCCO
WhyHowFindingsAction
Framework context

Google Earth Engine

Landsat 30m · Surface temperature · Vegetation indices
Land Surface Temperature across six cities Land Surface Temperature
NDVI vegetation index across six cities NDVI Vegetation Index

Where vegetation disappears, temperatures spike. This inverse pattern is the dependent variable our model learns to predict.

Gorelick, N. et al. (2017) ‘Google Earth Engine’, Remote Sensing of Environment, 202, pp. 18-27.
WhyHowFindingsAction
Framework context

Urbanity

Street network topology · Connectivity · Ventilation pathways
Urbanity: network analysis across global cities Source: Urbanity global dataset
Our six cities: street network topology Our six cities: network topology

Amsterdam’s regular canal grid channels cooling winds. Athens’s organic fabric traps them. Network topology determines whether park cooling reaches surrounding streets.

Yap, W. et al. (2023) ‘Urbanity’, npj Urban Sustainability.
WhyHowFindingsAction
Framework context

GlobalStreetscapes

10 million street-level images · Green View Index · Pedestrian perspective
GlobalStreetscapes: millions of street-level images worldwide Source: 10M street-level images
Our six cities: street-level imagery coverage Our six cities: coverage points

Satellites see canopy from above. Street imagery sees shade from below. Divergence between the two validates the multi-source approach.

Hou, Y. et al. (2024) GlobalStreetscapes. · Li, X. et al. (2015) Green View Index.
WhyHowFindingsAction
Framework context

EUBUCCO

3D building morphology · Heights · Footprints · Canyon geometry
Building height distribution across six European cities from EUBUCCO

Canyon geometry drives nighttime heat retention. Paris’s uniform Haussmann fabric vs Athens’s irregular growth produce fundamentally different thermal dynamics.

Milojevic-Dupont, N. et al. (2023) ‘EUBUCCO v0.1’, Scientific Data, 10, 146.
WhyHowFindingsAction
Framework context

Six Cities in 3D

VoxCity 3D city model vertical cross-section VoxCity 3D model
3D voxel representation of urban morphology Voxelised urban form
3D urban cores voxelised - 6 cities
Green View Index across 6 cities Sky View Factor across 6 cities
Diffuse solar irradiance across 6 cities
Fujiwara, K. et al. (2026) ‘VoxCity’, Computers, Environment and Urban Systems, 123, 102366. · Biljecki et al. (2015) ISPRS Int. J. Geo-Inf. · Li, X. et al. (2015) Green View Index.
WhyHowFindingsAction

From Prediction to Prescription

XGBoost with SHAP explainability: the model shows its work at every stage. The planner keeps the final call.

Three-stage pipeline diagram: regression predicts temperature, classification identifies hotspots weighted by vulnerability, scenario grid search tests 648 intervention combinations
Chen & Guestrin (2016) XGBoost. · Lundberg & Lee (2017) SHAP. · Eyni et al. (2025) Scientific Reports.
WhyHowFindingsAction
RegressionClassificationScenarios

XGBoost Regression: Why This Approach?

40,344 cells · 30m resolution · 6 cities · Target: UHI anomaly (each cell’s temperature minus its city mean — not raw LST, which inflates R² to 0.97 by capturing climate, not urban form)

Why XGBoost? Heat responds non-linearly to urban form. Trees cool effectively until water stress kicks in. Sealed surfaces trap heat, but the tipping point differs by climate. We need a model that captures these thresholds and shows its reasoning.

  • Non-linear thresholds: captures regime changes linear models miss
  • SHAP explainability: every prediction decomposes into feature contributions, so the planner sees why
  • Hierarchical blend: global + climate-zone specialists learn what each city needs differently

Global R² 0.695 → Blended 0.841. Climate-zone specialisation cuts error by 48%.

Paris 0.93 Barcelona 0.93 Amsterdam 0.88 Athens 0.73 Berlin 0.73 Madrid 0.72
XGBoost regression with hierarchical climate-zone blending: global model (−0.151) + Cfb specialist (0.764) + Csa specialist (0.647) = blended R² 0.841
Chen & Guestrin (2016) XGBoost, KDD. · Lundberg & Lee (2017) SHAP values. · Oke et al. (2017) 300m advective transport scale.
WhyHowFindingsAction
RegressionClassificationScenarios

Can We Trust the Numbers?

Hot areas sit next to hot areas. Without careful testing, the model could just memorise where things are instead of learning why they’re hot. Five checks prevent that.

1

Spatial cross-validation

The model never sees its test area’s neighbours during training. 5 folds, 600m spatial blocks, stratified by heat intensity and density.

2

City-demeaned targets

Raw temperature R² = 0.97, but that mostly captures climate. Subtracting each city’s mean isolates what urban form does. R² = 0.84.

3

Scale sensitivity (MAUP)

Tested at 30m, 60m, 90m. At 60m the model loses 17% of explained variance; it falls between radiative and advective physics. 30m is the right scale.

4

Residual autocorrelation

Moran’s I confirms significant spatial clustering in predictions (I = 0.66–0.92, p<0.001), validating why spatial blocking was essential.

5

Stability & tuning

5 random seeds (σR² = 0.064). 200 Bayesian trials optimised 820 trees, depth 6, lr 0.03. Features: 165 base → 189 with spatial lags → 118 after correlation filter (|r|>0.92).

The model learns morphology, not spatial patterns.

Oke et al. (2017) Urban Climates. · Openshaw (1984) MAUP. · Anselin (1995) Moran’s I. · Roberts et al. (2017) Spatial CV. · Valavi et al. (2018) blockCV.
WhyHowFindingsAction
RegressionClassificationScenarios

Does It Work Across Contexts?

CityRMSE
Paris0.9320.49°C
Barcelona0.9260.85°C
Amsterdam0.8800.44°C
Athens *0.7300.63°C
Berlin0.7270.66°C
Madrid0.7181.53°C

The model works, but accuracy isn’t the finding. What the model reveals about what drives heat is.

* Athens R² drops with blending (−2.8%): its topographic basin overrides climate-zone correction. That’s not a failure; it’s a finding. Geography matters.

Observed vs predicted UHI spatial comparison across cities
Chen & Guestrin (2016) XGBoost, KDD. · Tanoori et al. (2024) ML for UHI benchmarks.
WhyHowFindingsAction
RegressionClassificationScenarios

What Actually Drives Urban Heat?

Energy balance diagram showing how sealed surfaces trap heat versus permeable surfaces enabling evaporative cooling

De-sealing matters 3× more than tree planting.

Trees provide shade, biodiversity, air quality. But for temperature reduction, the surface is the stronger lever.

Global SHAP feature importance beeswarm plot
Lundberg & Lee (2017) SHAP. · Schwaab et al. (2021) Nat. Comms. · Eyni et al. (2025) Scientific Reports.
WhyHowFindingsAction
RegressionClassificationScenarios

Geography Changes Everything

SHAP importance for Temperate oceanic
Climate context diagram

Geographic contingency isn’t noise.
It’s the signal.

SHAP importance for Mediterranean
Schwaab et al. (2021) Nat. Comms. · Giannopoulou et al. (2011) Athens basin. · Stewart & Oke (2012) LCZ.
WhyHowFindingsAction
RegressionClassificationScenarios

From Temperature to Risk: Classifying Hotspots

The regression tells us how hot each cell is. But planners need to know which areas are dangerously hot. That’s a classification problem: we turn continuous temperature into a binary risk flag.

  • XGBoost classification, upweighted to catch rare hotspots
  • Calibrated probabilities: when the model says 0.7, it means 70%
  • Threshold set to catch at least 60% of real hotspots
AUC 0.911
6,389 of 40,344 cells (15.8%) classified as hotspots
Athens: 30.8% vs Paris: 7.2%
Calibrated hotspot probability surfaces across six cities
Chen & Guestrin (2016) XGBoost. · Platt, J. (1999) Probabilistic outputs for SVMs. · Santamouris, M. (2020) Heat vulnerability.
WhyHowFindingsAction
RegressionClassificationScenarios

Where Heat Meets Vulnerability

Temperature prediction to hotspot classification: demographics leap to #1

When classifying hotspots, child density becomes #1. ~ higher hotspot probability in vulnerable areas.

Priority intervention zones across six cities

Priority: Heat risk (40%) + Vulnerability (35%) + Cooling potential (25%)

Santamouris (2020) Heat vulnerability. · Kovats & Hajat (2008) Annu. Rev. Public Health. · Hoffman et al. (2020) Heat inequality.
WhyHowFindingsAction
RegressionClassificationScenarios

How We Test Interventions

Counterfactual scenario grid search: 4 levers × 648 combinations → re-predict → convergence on 50% de-sealing

Process per scenario

Modify features in priority zones (top 10%)
Re-predict with blend model
Compute cooling Δ vs. baseline
Rank by cooling and cost-effectiveness

⚠ Limitations

Correlation ≠ causation: predictions are hypotheses for field validation

Feature correlation: impervious & vegetation (r≈−0.7) partially double-count cooling

All 5 objectives converge → 50% de-sealing

Counterfactual grid: itertools.product over 4 levers, bootstrapped 500×. Priority zones: 40% heat + 35% vulnerability + 25% cooling potential.
WhyHowFindingsAction
RegressionClassificationScenarios

The 50% Regime Boundary

We know where hotspots are and what drives heat. Now the prescription question: how much change is enough? We tested four levers in a full grid search: depaving (30-50%) · vegetation (+10-50%) · tree canopy (+15-50%) · albedo (+0-20%) = 648 combinations, each bootstrapped 500 times.

All optimal strategies converge on 50% de-sealing. Below this threshold, evaporative cooling pathways become viable — a threshold effect.

StrategyDepavingVegetationTreesAlbedoCooling
Maximum50%+50%+50%+20%−1.27°C
Cost-effective50%+10%+20%0%−1.20°C

95% of maximum cooling with substantially fewer resources.

SHAP dependence plot showing impervious fraction threshold at 300m Regime boundary diagram illustrating the nonlinear cooling response at 50% de-sealing threshold where evaporative cooling pathways become viable
Oke et al. (2017) Urban Climates. · Gunawardena et al. (2017) Sci. Total Environ.
WhyHowFindingsAction
RegressionClassificationScenarios

What Should Each City Do?

Same anchor everywhere: de-seal first. Supporting levers change by climate and morphology.

Athens Csa
−1.45°C
95% sealed · 78% priority

e.g. permeable paving in basin-floor neighbourhoods like Omonia

Barcelona Csa
−1.31°C
95% sealed · 25% priority

e.g. opening sealed Eixample courtyards, Superblocks-style depaving

Paris Cfb
−1.22°C
94% sealed · 44% priority

e.g. OASIS schoolyard depaving, extending Seine corridor cooling

Berlin Cfb
−1.08°C
94% sealed · 30% priority

e.g. Schwammstadt sponge-city approach in polycentric cores

Amsterdam Cfb
−0.92°C
93% sealed · 36% priority

e.g. cool surfaces + permeable paving (canals already cap water cooling)

Madrid Csa/BSk
−0.89°C
91% sealed · 21% priority

e.g. cool roofs + drought-tolerant planting (avoid water-intensive greening)

Predicted cooling from cost-effective intervention

Predicted cooling from cost-effective strategy

Gunawardena et al. (2017) Sci. Total Environ. · Giannopoulou et al. (2011) Athens basin.
WhyHowFindingsAction

Co-Benefits & Equity Safeguards

Co-benefits diagram: de-sealing produces cooler streets, fewer floods, more biodiversity, water security

The cost-effective strategy still includes +20% tree canopy. This isn't "stop planting trees". It's "invest in the surface beneath, not just the canopy above."

⚠ Green gentrification risk

Greening raises property values and can displace the people it aims to protect. Documented in Barcelona Superblocks and NYC’s High Line.

Our vulnerability weighting (35%) targets current vulnerable populations, but the model can’t prevent market dynamics. That requires policy.

The model says where. Communities decide how.

Gunawardena et al. (2017) Sci. Total Environ. · Anguelovski et al. (2019) Green gentrification. · Hoffman et al. (2020) Heat inequality. · Eyni et al. (2025) Scientific Reports.
WhyHowFindingsAction

Three Insights for Urban Heat Policy

300m neighbourhood scale 1

Heat is a neighbourhood problem

Cooling propagates 300 m under wind. Coordinated action across blocks, not plots.

De-sealing 3:1 ratio 2

Surface matters more than canopy

Permeability drives 3× more cooling than vegetation. De-sealing deserves equal investment.

Cfb vs Csa climate context 3

Every city needs its own strategy

Same tree cools 8–12°C in Berlin but 0–4°C in Athens. Geography is the signal.

Oke et al. (2017) 300m advective transport. · Schwaab et al. (2021) Nat. Comms. · Eyni et al. (2025) Diversified depaving reduces health disparities more than tree-only.
WhyHowFindingsAction

What This Framework Cannot Do

  • Daytime only: Landsat passes at 10:30 AM, not nighttime when mortality peaks
  • Correlations, not causes: the 50% threshold needs field validation before policy
  • 30m resolution: screens neighbourhoods, not individual streets
  • Uneven coverage: street-level imagery covers 29–71% by city

But: GEE features hold the top 8 SHAP positions — satellite data alone carries most of the signal. Any city with free Landsat access can run this pipeline tomorrow.

Waiting for perfect evidence while heatwaves kill thousands is itself a policy choice.

Oke et al. (2017) Diurnal UHI cycle, Urban Climates. · Cheval et al. (2024) Research gaps, Climate Risk Management. · Voogt & Oke (2003) Thermal remote sensing.

Every city can start
shaping cooler cities
today.

Open data. Open tools. Open method.
61,000 deaths demand tools that work now, with data cities already have.

References

Anguelovski, I. et al. (2019) Green gentrification. Landscape Urban Plan.

Ballester, J. et al. (2023) Heat-related mortality in Europe, summer 2022. Nature Medicine, 29(7), 1857–1866.

Ballester, J. et al. (2025) Heat-related mortality in Europe, summer 2024. Nature Medicine.

Camps-Valls, G. et al. (2025) AI for modeling and understanding extreme weather and climate events. Nature Communications, 16, 1919.

Chen, T. & Guestrin, C. (2016) XGBoost. Proc. ACM SIGKDD, 785–794.

Cheval, S. et al. (2024) Systematic review of UHI. Climate Risk Mgmt., 44.

Eyni, A. et al. (2025) Distributional outcomes of UHI reduction pathways. Scientific Reports, 15, 93896.

Fujiwara, K. et al. (2026) VoxCity: 3D city model generation. Comput. Environ. Urban Syst.

Giannopoulou, K. et al. (2011) Athens urban heat island. Climatic Change, 104(3).

Gorelick, N. et al. (2017) Google Earth Engine. Remote Sens. Environ., 202, 18–27.

Gunawardena, K. et al. (2017) Evaporative cooling in urban areas. Sci. Total Environ., 590, 758–775.

Hoffman, J. S. et al. (2020) Historical housing policies and intra-urban heat. Climate, 8(1), 12.

Hou, Y. et al. (2024) GlobalStreetscapes. ISPRS J. Photogramm.

Kovats, R.S. & Hajat, S. (2008) Heat stress and public health. Annu. Rev. Public Health, 29.

Lundberg, S. & Lee, S. (2017) SHAP values. NeurIPS, 30.

Milojevic-Dupont, N. et al. (2023) EUBUCCO. Scientific Data, 10, 146.

Oke, T.R. et al. (2017) Urban Climates. Cambridge University Press.

Santamouris, M. (2020) Heat vulnerability. Energy and Buildings.

Schwaab, J. et al. (2021) Urban trees reducing LST in European cities. Nat. Comms., 12, 6763.

Seneviratne, S.I. et al. (2021) Extreme events in a changing climate. IPCC AR6 WG1, Ch. 11.

Stewart, I.D. & Oke, T.R. (2012) Local Climate Zones. BAMS, 93(12), 1879–1900.

Yap, W. et al. (2023) Urbanity: automated modelling and analysis of multidimensional networks. npj Urban Sustainability, 3, 45.

Questions?

Open data · Open tools · Open method

Code & Data
gerardo.ezequiel.22@ucl.ac.uk
LinkedIn

Gerardo Ezequiel Martin Carreno · UCL CASA · The Bartlett