A Process-Informed Determination of Credibility Across Different Downscaling Methods
Despite the ongoing advancements in Earth system simulation, the results from Global Climate Models (GCMs) are still not refined enough to be directly applied to numerous climate impact issues. There are many techniques available to downscale GCM outputs to finer resolutions, from basic statistical adjustments to more complex methods like dynamical downscaling and machine learning. However, these methods often yield different results, making it difficult to assess their relative reliability, particularly when comparing statistical versus dynamical downscaling methods.
We consider downscaled results to be credible when the phenomena and processes generating results for a variable of interest are consistent. For instance, if it’s raining, the necessary conditions for rain, such as lift and atmospheric moisture, should be present. To evaluate various downscaling techniques and demonstrate a framework for this type of credibility assessment, we examine the occurrence of rainfall at a location in the Southern Great Plains, specifically near the DOE ARM site in Oklahoma during May, the rainiest month. In this scenario, we are looking for an atmospheric setup that produces uplift at this location and corresponds with the northward movement of moisture from the Gulf of Mexico.
By comparing the composite synoptic-scale meteorological conditions on days with and without rain from the GCM being downscaled or from the downscaling method, as appropriate, we can verify if the outcomes of downscaling GCM precipitation align with the processes that drive them. This method offers a process-informed added-value analysis strategy for all kinds of downscaling techniques, which extends beyond basic measures of statistical resemblance.
We have used two regional climate models (RegCM4 & WRF), a machine learning technique (U-Net CNN), and four statistical methods of different complexities to downscale precipitation from three distinct GCMs. By using this framework to compare them with each other and the raw GCM results, we’ve discovered that all downscaling methods can yield plausible outcomes when the GCM performs well, as they inherit its credibility. However, when a GCM’s performance is subpar, only dynamical methods can rectify regional circulation errors, unlike the other methods. Interestingly, we also found that simpler statistical methods outperform more complex non-dynamical methods when dealing with poor GCM inputs.