diff --git a/.github/workflows/cicd.yaml b/.github/workflows/cicd.yaml index 0d4b188..aa11d7b 100644 --- a/.github/workflows/cicd.yaml +++ b/.github/workflows/cicd.yaml @@ -12,7 +12,7 @@ on: branches: - "**" schedule: - - cron: "20 4 * * 0" + - cron: "20 16 * * 0" release: types: [created] diff --git a/.github/workflows/codeql.yaml b/.github/workflows/codeql.yaml deleted file mode 100644 index 0084918..0000000 --- a/.github/workflows/codeql.yaml +++ /dev/null @@ -1,17 +0,0 @@ -name: "CodeQL" - -on: - push: - branches: ["master"] - schedule: - - cron: "19 11 * * 0" - -concurrency: - group: codeql-${{ github.ref }} - cancel-in-progress: true - -jobs: - ## Discover vulnerabilities - ## - CodeQL: - uses: ./.github/workflows/_codeql.yaml diff --git a/Makefile b/Makefile index dbdc017..be02861 100644 --- a/Makefile +++ b/Makefile @@ -93,11 +93,11 @@ changelog: ## .PHONY: clean clean: - rm -rf .pytest_cache .cache \ + rm -rf .mypy_cache .pytest_cache .cache \ build/ dist/ python_cmethods.egg-info \ docs/_build \ examples/.ipynb_checkpoints .ipynb_checkpoints \ - .mypy_cache .pytest_cache + doc/_build rm -f .coverage cmethods/_version.py diff --git a/doc/src/methods.rst b/doc/src/methods.rst index a3f1e33..7eb7705 100644 --- a/doc/src/methods.rst +++ b/doc/src/methods.rst @@ -2,6 +2,10 @@ Bias Correction Methods ======================= +Please note that the formulas are meant to be applied to a single time series. +The python-cmethods package can apply these formulas to single and +multidimensional data. + Linear Scaling -------------- @@ -9,26 +13,26 @@ The Linear Scaling bias correction technique can be applied on stochastic and non-stochastic climate variables to minimize deviations in the mean values between predicted and observed time-series of past and future time periods. -Since the multiplicative scaling can result in very high scaling factors, a -maximum scaling factor of 10 is set. This can be changed by adjusting -:attr:`CMethods.MAX_SCALING_FACTOR`. +This method requires that the time series can be grouped by ``time.month``. -This method must be applied on a 1-dimensional data set i.e., there is only one -time-series passed for each of ``obs``, ``simh``, and ``simp``. +Since the multiplicative scaling can result in very high scaling factors, a +maximum scaling factor of 10 is set. This can be changed by passing the desired +value to the hidden ``max_scaling_factor`` argument. The Linear Scaling bias correction technique implemented here is based on the -method described in the equations of Teutschbein, Claudia and Seibert, Jan (2012) -*"Bias correction of regional climate model simulations for hydrological climate-change -impact studies: Review and evaluation of different methods"* +method described in the equations of Teutschbein, Claudia and Seibert, Jan +(2012) *"Bias correction of regional climate model simulations for hydrological +climate-change impact studies: Review and evaluation of different methods"* (https://doi.org/10.1016/j.jhydrol.2012.05.052). In the following the equations for both additive and multiplicative Linear Scaling are shown: **Additive**: - In Linear Scaling, the long-term monthly mean (:math:`\mu_m`) of the modeled data :math:`X_{sim,h}` is subtracted - from the long-term monthly mean of the reference data :math:`X_{obs,h}` at time step :math:`i`. - This difference in month-dependent long-term mean is than added to the value of time step :math:`i`, - in the time-series that is to be adjusted (:math:`X_{sim,p}`). + In Linear Scaling, the long-term monthly mean (:math:`\mu_m`) of the modeled + data :math:`X_{sim,h}` is subtracted from the long-term monthly mean of the + reference data :math:`X_{obs,h}` at time step :math:`i`. This difference in + month-dependent long-term mean is than added to the value of time step + :math:`i`, in the time-series that is to be adjusted (:math:`X_{sim,p}`). .. math:: @@ -36,8 +40,8 @@ for both additive and multiplicative Linear Scaling are shown: **Multiplicative**: - The multiplicative Linear Scaling differs from the additive variant in such way, that the changes are not computed - in absolute but in relative values. + The multiplicative Linear Scaling differs from the additive variant in such + way, that the changes are not computed in absolute but in relative values. .. math:: @@ -70,22 +74,26 @@ for both additive and multiplicative Linear Scaling are shown: Variance Scaling ---------------- -The Variance Scaling bias correction technique can be applied only on non-stochastic -climate variables to minimize deviations in the mean and variance +The Variance Scaling bias correction technique can be applied only on +non-stochastic climate variables to minimize deviations in the mean and variance between predicted and observed time-series of past and future time periods. -This method must be applied on a 1-dimensional data set i.e., there is only one -time-series passed for each of ``obs``, ``simh``, and ``simp``. +This method requires that the time series can be grouped by ``time.month``. + +Since the the scaling by ratio can result in very high scaling factors, a +maximum scaling factor of 10 is set. This can be changed by passing the desired +value to the hidden ``max_scaling_factor`` argument. The Variance Scaling bias correction technique implemented here is based on the -method described in the equations of Teutschbein, Claudia and Seibert, Jan (2012) -*"Bias correction of regional climate model simulations for hydrological climate-change -impact studies: Review and evaluation of different methods"* +method described in the equations of Teutschbein, Claudia and Seibert, Jan +(2012) *"Bias correction of regional climate model simulations for hydrological +climate-change impact studies: Review and evaluation of different methods"* (https://doi.org/10.1016/j.jhydrol.2012.05.052). In the following the equations of the Variance Scaling approach are shown: -**(1)** First, the modeled data of the control and scenario period must be bias-corrected using -the additive linear scaling technique. This adjusts the deviation in the mean. +**(1)** First, the modeled data of the control and scenario period must be +bias-corrected using the additive linear scaling technique. This adjusts the +deviation in the mean. .. math:: @@ -93,8 +101,8 @@ the additive linear scaling technique. This adjusts the deviation in the mean. X^{*LS}_{sim,p}(i) = X_{sim,p}(i) + \mu_{m}(X_{obs,h}(i)) - \mu_{m}(X_{sim,h}(i)) -**(2)** In the second step, the time-series are shifted to a zero mean. This enables the adjustment -of the standard deviation in the following step. +**(2)** In the second step, the time-series are shifted to a zero mean. This +enables the adjustment of the standard deviation in the following step. .. math:: @@ -144,12 +152,11 @@ The Delta Method bias correction technique can be applied on stochastic and non-stochastic climate variables to minimize deviations in the mean values between predicted and observed time-series of past and future time periods. -Since the multiplicative scaling can result in very high scaling factors, -a maximum scaling factor of 10 is set. This can be changed by adjusting -:attr:`CMethods.MAX_SCALING_FACTOR`. +This method requires that the time series can be grouped by ``time.month``. -This method must be applied on a 1-dimensional data set i.e., there is only one -time-series passed for each of ``obs``, ``simh``, and ``simp``. +Since the multiplicative scaling can result in very high scaling factors, a +maximum scaling factor of 10 is set. This can be changed by passing the desired +value to the hidden ``max_scaling_factor`` argument. The Delta Method bias correction technique implemented here is based on the method described in the equations of Beyer, R. and Krapp, M. and Manica, A. (2020) @@ -159,12 +166,16 @@ for both additive and multiplicative Delta Method are shown: **Additive**: - The Delta Method looks like the Linear Scaling method but the important difference is, that the Delta method - uses the change between the modeled data instead of the difference between the modeled and reference data of the control - period. This means that the long-term monthly mean (:math:`\mu_m`) of the modeled data of the control period :math:`T_{sim,h}` - is subtracted from the long-term monthly mean of the modeled data from the scenario period :math:`T_{sim,p}` at time step :math:`i`. - This change in month-dependent long-term mean is than added to the long-term monthly mean for time step :math:`i`, - in the time-series that represents the reference data of the control period (:math:`T_{obs,h}`). + The Delta Method looks like the Linear Scaling method but the important + difference is, that the Delta method uses the change between the modeled + data instead of the difference between the modeled and reference data of the + control period. This means that the long-term monthly mean (:math:`\mu_m`) + of the modeled data of the control period :math:`T_{sim,h}` is subtracted + from the long-term monthly mean of the modeled data from the scenario period + :math:`T_{sim,p}` at time step :math:`i`. This change in month-dependent + long-term mean is than added to the long-term monthly mean for time step + :math:`i`, in the time-series that represents the reference data of the + control period (:math:`T_{obs,h}`). .. math:: @@ -172,8 +183,9 @@ for both additive and multiplicative Delta Method are shown: **Multiplicative**: - The multiplicative variant behaves like the additive, but with the difference that the change is computed using the relative change - instead of the absolute change. + The multiplicative variant behaves like the additive, but with the + difference that the change is computed using the relative change instead of + the absolute change. .. math:: @@ -204,22 +216,21 @@ for both additive and multiplicative Delta Method are shown: Quantile Mapping ---------------- -The Quantile Mapping bias correction technique can be used to minimize distributional -biases between modeled and observed time-series climate data. Its interval-independent -behavior ensures that the whole time series is taken into account to redistribute -its values, based on the distributions of the modeled and observed/reference data of the -control period. - -This method must be applied on a 1-dimensional data set i.e., there is only one -time-series passed for each of ``obs``, ``simh``, and ``simp``. +The Quantile Mapping bias correction technique can be used to minimize +distributional biases between modeled and observed time-series climate data. Its +interval-independent behavior ensures that the whole time series is taken into +account to redistribute its values, based on the distributions of the modeled +and observed/reference data of the control period. The Quantile Mapping technique implemented here is based on the equations of -Alex J. Cannon and Stephen R. Sobie and Trevor Q. Murdock (2015) *"Bias Correction of GCM -Precipitation by Quantile Mapping: How Well Do Methods Preserve Changes in Quantiles -and Extremes?"* (https://doi.org/10.1175/JCLI-D-14-00754.1). +Alex J. Cannon and Stephen R. Sobie and Trevor Q. Murdock (2015) *"Bias +Correction of GCM Precipitation by Quantile Mapping: How Well Do Methods +Preserve Changes in Quantiles and Extremes?"* +(https://doi.org/10.1175/JCLI-D-14-00754.1). The regular Quantile Mapping is bounded to the value range of the modeled data -of the control period. To avoid this, the Detrended Quantile Mapping can be used. +of the control period. To avoid this, the Detrended Quantile Mapping can be +used. In the following the equations of Alex J. Cannon (2015) are shown and explained: @@ -230,11 +241,12 @@ In the following the equations of Alex J. Cannon (2015) are shown and explained: X^{*QM}_{sim,p}(i) = F^{-1}_{obs,h} \left\{F_{sim,h}\left[X_{sim,p}(i)\right]\right\} - The additive quantile mapping procedure consists of inserting the value to be - adjusted (:math:`X_{sim,p}(i)`) into the cumulative distribution function - of the modeled data of the control period (:math:`F_{sim,h}`). This determines, - in which quantile the value to be adjusted can be found in the modeled data of the control period - The following images show this by using :math:`T` for temperatures. + The additive quantile mapping procedure consists of inserting the value to + be adjusted (:math:`X_{sim,p}(i)`) into the cumulative distribution function + of the modeled data of the control period (:math:`F_{sim,h}`). This + determines, in which quantile the value to be adjusted can be found in the + modeled data of the control period The following images show this by using + :math:`T` for temperatures. .. figure:: ../_static/images/qm-cdf-plot-1.png :width: 600 @@ -244,8 +256,9 @@ In the following the equations of Alex J. Cannon (2015) are shown and explained: Fig 1: Inserting :math:`X_{sim,p}(i)` into :math:`F_{sim,h}` to determine the quantile value This value, which of course lies between 0 and 1, is subsequently inserted - into the inverse cumulative distribution function of the reference data of the control period to - determine the bias-corrected value at time step :math:`i`. + into the inverse cumulative distribution function of the reference data of + the control period to determine the bias-corrected value at time step + :math:`i`. .. figure:: ../_static/images/qm-cdf-plot-2.png :width: 600 @@ -258,7 +271,7 @@ In the following the equations of Alex J. Cannon (2015) are shown and explained: .. math:: - X^{*QM}_{sim,p}(i) = F^{-1}_{obs,h}\Biggl\{F_{sim,h}\left[\frac{\mu{X_{sim,h}} \cdot \mu{X_{sim,p}(i)}}{\mu{X_{sim,p}(i)}}\right]\Biggr\}\frac{\mu{X_{sim,p}(i)}}{\mu{X_{sim,h}}} + X^{*QM}_{sim,p}(i) = F^{-1}_{obs,h}\Biggl\{F_{sim,h}\left[\frac{\mu{X_{sim,h}} \cdot X_{sim,p}(i)}{\mu{X_{sim,p}(i)}}\right]\Biggr\}\frac{\mu{X_{sim,p}(i)}}{\mu{X_{sim,h}}} .. code-block:: python :linenos: @@ -286,37 +299,49 @@ In the following the equations of Alex J. Cannon (2015) are shown and explained: Detrended Quantile Mapping -------------------------- -The Detrended Quantile Mapping bias correction technique can be used to minimize distributional -biases between modeled and observed time-series climate data like the regular Quantile Mapping. -Detrending means, that the values of :math:`X_{sim,p}` are shifted to the value range of -:math:`X_{sim,h}` before the regular Quantile Mapping is applied. -After the Quantile Mapping was applied, the mean is shifted back. Since it does not make sens -to take the whole mean to rescale the data, the month-dependent long-term mean is used. +The Detrended Quantile Mapping bias correction technique can be used to minimize +distributional biases between modeled and observed time-series climate data like +the regular Quantile Mapping. Detrending means, that the values of +:math:`X_{sim,p}` are shifted by the mean of :math:`X_{sim,h}` before the +regular Quantile Mapping is applied. After the Quantile Mapping was applied, the +mean is shifted back. Since it does not make sense to take the whole mean to +rescale the data, the month-dependent long-term mean is used. This method must be applied on a 1-dimensional data set i.e., there is only one -time-series passed for each of ``obs``, ``simh``, and ``simp``. Also this method requires -that the time series are groupable by ``time.month``. +time-series passed for each of ``obs``, ``simh``, and ``simp``. This method +requires that the time series can be grouped by ``time.month``. -The Detrended Quantile Mapping technique implemented here is based on the equations of -Alex J. Cannon and Stephen R. Sobie and Trevor Q. Murdock (2015) *"Bias Correction of GCM -Precipitation by Quantile Mapping: How Well Do Methods Preserve Changes in Quantiles -and Extremes?"* (https://doi.org/10.1175/JCLI-D-14-00754.1). +Since the ratio when applying the multiplicative variant can result in extreme +factors, a maximum scaling factor of 10 is set. This can be changed by passing +the desired value to the hidden ``max_scaling_factor`` argument. -In the following the equations of Alex J. Cannon (2015) are shown (without detrending; see QM -for explanations): +The Detrended Quantile Mapping technique implemented here is based on the +equations of Alex J. Cannon and Stephen R. Sobie and Trevor Q. Murdock (2015) +*"Bias Correction of GCM Precipitation by Quantile Mapping: How Well Do Methods +Preserve Changes in Quantiles and Extremes?"* +(https://doi.org/10.1175/JCLI-D-14-00754.1). + +The following equations qre based on Alex J. Cannon (2015) but extended the +shift of :math:`X_{sim,p}(i)`: + +**Shift of value range**: + +.. math:: + + X_{sim,p}^{*DT}(i) = X_{sim,p}(i) + \Delta\mu **Additive**: .. math:: - X^{*QM}_{sim,p}(i) = F^{-1}_{obs,h} \left\{F_{sim,h}\left[X_{sim,p}(i)\right]\right\} + X_{sim,p}^{*DQM}(i) = F_{obs,h}^{-1}\left\{F_{sim,h}\left[X_{sim,p}^{*DT}(i)\right]\right\} **Multiplicative**: .. math:: - X^{*QM}_{sim,p}(i) = F^{-1}_{obs,h}\Biggl\{F_{sim,h}\left[\frac{\mu{X_{sim,h}} \cdot \mu{X_{sim,p}(i)}}{\mu{X_{sim,p}(i)}}\right]\Biggr\}\frac{\mu{X_{sim,p}(i)}}{\mu{X_{sim,h}}} + X^{*DQM}_{sim,p}(i) = F^{-1}_{obs,h}\Biggl\{F_{sim,h}\left[\frac{\mu{X_{sim,h}} \cdot X_{sim,p}^{*DT}(i)}{\mu{X_{sim,p}^{*DT}(i)}}\right]\Biggr\}\frac{\mu{X_{sim,p}^{*DT}(i)}}{\mu{X_{sim,h}}} .. code-block:: python @@ -344,22 +369,24 @@ for explanations): Quantile Delta Mapping ----------------------- -The Quantile Delta Mapping bias correction technique can be used to minimize distributional -biases between modeled and observed time-series climate data. Its interval-independent -behavior ensures that the whole time series is taken into account to redistribute -its values, based on the distributions of the modeled and observed/reference data of the -control period. In contrast to the regular Quantile Mapping (:func:`cmethods.CMethods.quantile_mapping`) -the Quantile Delta Mapping also takes the change between the modeled data of the control and scenario -period into account. - -This method must be applied on a 1-dimensional data set i.e., there is only one -time-series passed for each of ``obs``, ``simh``, and ``simp``. - -The Quantile Delta Mapping technique implemented here is based on the equations of -Tong, Y., Gao, X., Han, Z. et al. (2021) *"Bias correction of temperature and precipitation -over China for RCM simulations using the QM and QDM methods"*. Clim Dyn 57, 1425-1443 -(https://doi.org/10.1007/s00382-020-05447-4). In the following the additive and multiplicative -variant are shown. +The Quantile Delta Mapping bias correction technique can be used to minimize +distributional biases between modeled and observed time-series climate data. Its +interval-independent behavior ensures that the whole time series is taken into +account to redistribute its values, based on the distributions of the modeled +and observed/reference data of the control period. In contrast to the regular +Quantile Mapping (:func:`cmethods.CMethods.quantile_mapping`) the Quantile Delta +Mapping also takes the change between the modeled data of the control and +scenario period into account. + +Since the ratio when applying the multiplicative variant can result in extreme +factors, a maximum scaling factor of 10 is set. This can be changed by passing +the desired value to the hidden ``max_scaling_factor`` argument. + +The Quantile Delta Mapping technique implemented here is based on the equations +of Tong, Y., Gao, X., Han, Z. et al. (2021) *"Bias correction of temperature and +precipitation over China for RCM simulations using the QM and QDM methods"*. +Clim Dyn 57, 1425-1443 (https://doi.org/10.1007/s00382-020-05447-4). In the +following the additive and multiplicative variant are shown. **Additive**: @@ -370,24 +397,26 @@ variant are shown. \varepsilon(i) = F_{sim,p}\left[X_{sim,p}(i)\right], \hspace{1em} \varepsilon(i)\in\{0,1\} - **(1.2)** The bias corrected value at time step :math:`i` is now determined by inserting the - quantile value into the inverse cummulative distribution function of the reference data of the control - period. This results in a bias corrected value for time step :math:`i` but still without taking the - change in modeled data into account. + **(1.2)** The bias corrected value at time step :math:`i` is now determined + by inserting the quantile value into the inverse cumulative distribution + function of the reference data of the control period. This results in a bias + corrected value for time step :math:`i` but still without taking the change + in modeled data into account. .. math:: X^{QDM(1)}_{sim,p}(i) = F^{-1}_{obs,h}\left[\varepsilon(i)\right] - **(1.3)** The :math:`\Delta(i)` represents the absolute change in quantiles between the modeled value - in the control and scenario period. + **(1.3)** The :math:`\Delta(i)` represents the absolute change in quantiles + between the modeled value in the control and scenario period. .. math:: \Delta(i) & = F^{-1}_{sim,p}\left[\varepsilon(i)\right] - F^{-1}_{sim,h}\left[\varepsilon(i)\right] \\[1pt] & = X_{sim,p}(i) - F^{-1}_{sim,h}\left\{F^{}_{sim,p}\left[X_{sim,p}(i)\right]\right\} - **(1.4)** Finally the previously calculated change can be added to the bias-corrected value. + **(1.4)** Finally the previously calculated change can be added to the + bias-corrected value. .. math:: @@ -395,19 +424,21 @@ variant are shown. **Multiplicative**: - The first two steps of the multiplicative Quantile Delta Mapping bias correction technique are the - same as for the additive variant. + The first two steps of the multiplicative Quantile Delta Mapping bias + correction technique are the same as for the additive variant. - **(2.3)** The :math:`\Delta(i)` in the multiplicative Quantile Delta Mapping is calulated like the - additive variant, but using the relative than the absolute change. + **(2.3)** The :math:`\Delta(i)` in the multiplicative Quantile Delta Mapping + is calculated like the additive variant, but using the relative than the + absolute change. .. math:: \Delta(i) & = \frac{ F^{-1}_{sim,p}\left[\varepsilon(i)\right] }{ F^{-1}_{sim,h}\left[\varepsilon(i)\right] } \\[1pt] & = \frac{ X_{sim,p}(i) }{ F^{-1}_{sim,h}\left\{F_{sim,p}\left[X_{sim,p}(i)\right]\right\} } - **(2.4)** The relative change between the modeled data of the control and scenario period is than - multiplicated with the bias-corrected value (see **1.2**). + **(2.4)** The relative change between the modeled data of the control and + scenario period is than multiplied with the bias-corrected value (see + **1.2**). .. math::