A doubly robust estimator for continuous treatments in high dimensions

Gao, Qian; Wang, Jiale; Fang, Ruiling; Sun, Hongwei; Wang, Tong

doi:10.1186/s12874-025-02488-3

Research
Open access
Published: 13 February 2025

BMC Medical Research Methodology volume 25, Article number: 35 (2025)

Abstract

Background

Generalized propensity score (GPS) methods have become popular for estimating causal relationships between a continuous treatment and an outcome in observational studies with rich covariate information. The presence of rich covariates enhances the plausibility of the unconfoundedness assumption. Nonetheless, it is also crucial to ensure the correct specification of both marginal and conditional treatment distributions, beyond the assumption of unconfoundedness.

Method

We address limitations in existing GPS methods by extending balance-based approaches to high dimensions and introducing the Generalized Outcome-Adaptive LASSO and Doubly Robust Estimate (GOALDeR). This novel approach integrates a balance-based method that is robust to the misspecification of distributions required for GPS methods, a doubly robust estimator that is robust to the misspecification of models, and a variable selection technique for causal inference that ensures an unbiased and statistically efficient estimation.

Results

Simulation studies showed that GOALDeR generated nearly unbiased estimates when either the GPS model or the outcome model was correctly specified. Notably, GOALDeR demonstrated greater precision and accuracy than existing methods and was only slightly affected by the covariate correlation structure and the ratio of sample size to covariate dimension. Real data analysis revealed no statistically significant dose-response relationship between epigenetic age acceleration and Alzheimer’s disease.

Conclusion

In this study, we proposed GOALDeR as an advanced GPS method for causal inference in high dimensions, and empirically demonstrated that GOALDeR is doubly robust, with improved accuracy and precision compared to existing methods. The R package is available at https://github.com/QianGao-SXMU/GOALDeR.

Introduction

The advent of omics data and health care data makes it possible to draw causal conclusions from observational studies because a substantial number of covariates makes the assumption of unconfoundedness plausible [1]. The propensity score (PS) method is a common statistical tool for performing such causal inference in observational studies. The PS method was originally developed to estimate the causal effects of a binary treatment, exposure, or intervention (hereafter referred to as ‘treatment’) on an outcome [2]. Recently, extensions of PS methods to continuous treatments have been developed and are collectively known as generalized PS (GPS) methods. GPS methods focus on estimating the dose–response function (DRF), which describes the relationship between a continuous treatment and an outcome [3,4,5]. Similar to PS methods, GPS methods estimate the DRF through regression adjustment [5], matching [6], stratification [7], and inverse probability weighting (IPW) [8]. Additionally, the doubly robust approach has been proposed and has received increasing attention because it is robust to misspecification of either the GPS model or the outcome model [9, 10].

GPS is a probability density function of the treatment conditional on observed covariates [5]. The validity of GPS methods relies on the assumption that both the conditional mean and the conditional distribution of the treatment, given the covariates, must be correctly specified [8]. To relax this assumption, several balancing approaches have been proposed under a weighting or doubly robust framework [11,12,13,14,15]. The balancing approaches are focused on directly estimating weights under the balance constraints, including covariate balance; specifically, the weighted cross-moments between the treatment and each covariate are 0. Recent methods include the nonparametric covariate balancing generalized propensity score (npCBGPS) of Fong et al. [11], entropy balancing weights [12, 13], and covariate association eliminating weights of Yiu et al. [14]. Whereas these methods are appealing in terms of robustness to GPS model misspecification, the orders of the moment of both the covariates and the treatment to decorrelate must be carefully chosen. A higher moment may be helpful when there are nonlinear correlations between the treatment and the covariates, but this may violate the positivity assumption [13, 15]. To our knowledge, there is still a lack of guidance for specifying correct orders of moment, which is necessary to mitigate confounding bias. To address the issue of what moments to decorrelate, Huling et al. proposed distance covariance optimal weights (DCOWs) [15]. However, the abovementioned methods do not consider variable selection, which is another important factor influencing the performance of the estimated DRF [16,17,18,19,20,21]; therefore, their application is limited in the case of high-dimensional covariates.

GPS methods are sensitive to which covariates are balanced. For example, including instrumental variables (IVs), which predict only the treatment, in the GPS model could inflate variance without reducing bias in the estimates [16,17,18,19,20,21]. It has been well documented that an optimal GPS method should balance or control for all confounders and for prognostic covariates, which predict only the outcome [16,17,18,19,20,21]. Doing so not only removes confounding bias but also improves the efficiency of the estimates [16,17,18,19,20,21]. Hence, it is necessary to introduce variable selection techniques into GPS methods in high-dimensional contexts. In the doubly robust framework, Su et al. [22] and Colangelo et al. [23] used machine learning methods, and Antonelli et al. [24] used a Gaussian process to estimate nuisance parameters related to the GPS model and the outcome model. Unfortunately, these studies did not address the adverse influence of IVs. Under adaptive lasso-based shrinkage, our previously proposed generalized outcome-adaptive LASSO (GOAL) approach discourages the selection of IVs by strongly penalizing covariates that are not associated with the outcome [25]. The GOAL method is robust to GPS model misspecification. However, its validity depends on the assumption that the outcome model is linear.

Here, we retained the idea of variable selection from the GOAL method and proposed a generalized outcome-adaptive LASSO and doubly robust estimation (GOALDeR) method. Unlike the GOAL method, our proposed method constructs a penalty function that is independent of the outcome model. Consequently, we can estimate the DRF in the doubly robust framework [21]. In recognizing that the correlation between the treatment and confounders is a source of confounding bias [15], our method uses a distance correlation coefficient as a measure to assess covariate balance. The distance correlation coefficient is zero if and only if the variables are independent of each other [26]. With a simulation, we show that the GOALDeR method is doubly robust, provides more precise and accurate estimates than existing methods, and is scarcely affected by the covariate correlation structure and ratio of sample size to covariate dimension. We also applied the GOALDeR method to investigate potential causality between epigenetic age acceleration and Alzheimer’s disease (AD).
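To make the property of distance correlation concrete, the following is a minimal sketch of the ordinary (unconditional, unweighted) sample distance correlation in plain Python. It is not the conditional or weighted version that GOALDeR uses, and the function names are illustrative. The example pairs a symmetric grid `x` with `y = x²`: the Pearson correlation is essentially zero even though `y` is a deterministic function of `x`, whereas the distance correlation is clearly positive.

```python
import math

def centered_distances(x):
    """Double-centered matrix of pairwise distances |x_k - x_l|."""
    n = len(x)
    d = [[abs(a - b) for b in x] for a in x]
    row_mean = [sum(r) / n for r in d]  # equals the column means (d is symmetric)
    grand = sum(row_mean) / n
    return [[d[k][l] - row_mean[k] - row_mean[l] + grand for l in range(n)]
            for k in range(n)]

def dcov2(a, b):
    """Squared sample distance covariance from two centered matrices."""
    n = len(a)
    return sum(a[k][l] * b[k][l] for k in range(n) for l in range(n)) / n ** 2

def distance_correlation(x, y):
    """Sample distance correlation of two scalar samples of equal length."""
    a, b = centered_distances(x), centered_distances(y)
    denom = math.sqrt(dcov2(a, a) * dcov2(b, b))
    return math.sqrt(max(dcov2(a, b), 0.0) / denom) if denom > 0 else 0.0

def pearson(x, y):
    """Ordinary Pearson correlation, for comparison."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((u - mx) * (v - my) for u, v in zip(x, y))
    sxx = sum((u - mx) ** 2 for u in x)
    syy = sum((v - my) ** 2 for v in y)
    return sxy / math.sqrt(sxx * syy)

x = [k / 50 - 1 for k in range(101)]  # symmetric grid on [-1, 1]
y = [v * v for v in x]                # nonlinear, non-monotone dependence
print(round(pearson(x, y), 6), round(distance_correlation(x, y), 3))
```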

Generalized outcome-adaptive LASSO and doubly robust estimation

Notations and assumptions

Let $\{D_i\}_{i=1}^{n}=\{(T_i, Y_i, \boldsymbol{Z}_i)\}_{i=1}^{n}$ denote an independent and identically distributed sample drawn from a common joint distribution $f(T, Y, \boldsymbol{Z})$. Each subject $i \in \{1,\dots,n\}$ has a continuous treatment $T_i$ whose support is $\mathcal{T}\subseteq\mathcal{R}$, and an outcome $Y_i$. We characterize the causal DRF using potential outcome notation [27] and define $Y_i(t)$ as the potential outcome for subject $i \in \{1,\dots,n\}$ given treatment level $T_i=t$ ($t\in\mathcal{T}$). Our target estimand is $\mathbb{E}(Y_i(t))$. The observed $\boldsymbol{Z}_i\in\mathcal{R}^{p}$ denotes pre-treatment covariates, where p is the dimension. Each available covariate belongs to one of four mutually exclusive covariate sets:

confounders ($\boldsymbol{Z}_{c}$): covariates that contribute to both the treatment and the outcome;
prognostic covariates ($\boldsymbol{Z}_{P}$): covariates that contribute to the outcome only;
instrumental variables ($\boldsymbol{Z}_{I}$): covariates that contribute to the treatment only;
spurious covariates ($\boldsymbol{Z}_{S}$): covariates that contribute to neither the treatment nor the outcome.

Under the potential outcome framework, we established the following assumptions to identify the DRF from the observed data, and we maintained these assumptions throughout this work.

Assumption 1 (Consistency): For subject $i \in \{1,\dots,n\}$, $T_i = t$ ($t \in \mathcal{T}$) implies $Y_i = Y_i(t)$.

Assumption 2 (Positivity): The GPS, or conditional probability density function of the treatment, $f_{T\mid \boldsymbol{Z}}(T_i = t \mid \boldsymbol{Z}_i)$, is positive for any $t \in \mathcal{T}$ and for any $\boldsymbol{Z}_i \in \mathcal{R}^{p}$.

Assumption 3 (Unconfoundedness): $Y_i(t) \perp T_i \mid \boldsymbol{Z}_i$ for all $t \in \mathcal{T}$; that is, for any treatment level, the potential outcome $Y_i(t)$ is conditionally independent of the treatment given the covariates. Note that this assumption is untestable from the observed data.

Assumption 4 (Stable unit treatment value assumption): This assumption indicates that there is no interference among subjects.

Variable selection based on outcome-adaptive LASSO

We retained the idea of variable selection from the GOAL method [25] and started by assuming the GPS model as follows:

$$E(T \mid \boldsymbol{Z}) = \alpha_{0} + \sum_{j=1}^{p} Z_{j}\alpha_{j} \tag{1}$$

As mentioned in the introduction, an optimal GPS method should control for or balance covariates that are associated with the outcome (including $\boldsymbol{Z}_{c}$ and $\boldsymbol{Z}_{P}$). For a doubly robust estimator, the covariate selection mechanism should not depend on the outcome model. We borrowed the idea from the adaptive LASSO and achieved a covariate selection procedure by solving:

$$\widehat{\alpha} = \underset{\alpha}{\arg\min}\; \left\| T - \alpha_{0} - \sum_{j=1}^{p} Z_{j}\alpha_{j} \right\|_{2}^{2} + \lambda_{n} \sum_{j=1}^{p} \widehat{w}_{j} \left|\alpha_{j}\right| \tag{2}$$

where $\widehat{w}_{j}$ denotes the penalty weight, which is inversely related to the influence of covariate $Z_{j}$ on the outcome. Here, the GOALDeR method defines an outcome model-free penalty weight as $\widehat{w}_{j}=\left(\left|\operatorname{dcor}(Z_{j},Y \mid T)\right| / \max_{j}\left|\operatorname{dcor}(Z_{j},Y \mid T)\right|\right)^{-\gamma}$, where $\operatorname{dcor}(Z_{j},Y \mid T)$ is the conditional distance correlation coefficient between $Z_{j}$ and the outcome $Y$ given the treatment $T$, which measures any kind of correlation [28]. The exponent is negative so that covariates weakly associated with the outcome receive a large penalty. $\gamma>1$ is a tuning parameter. $\lambda_{n}>0$ is another tuning parameter satisfying $\lambda_{n}/\sqrt{n}\to 0$ and $\lambda_{n} n^{\gamma/2-1}\to\infty$ for consistency in variable selection, as with the GOAL method [25, 29]. In contrast, the GOAL method uses coefficients from a separate linear outcome model to create penalty weights, which means that the validity of the GOAL method depends on the correct specification of the outcome model.
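As a rough illustration of how such penalty weights steer selection, the sketch below solves the weighted LASSO problem of Eq. (2) by coordinate descent in plain Python. It rests on several assumptions: the weights use a negative exponent, so that a weak outcome association gives a large penalty; the conditional distance correlations are hypothetical numbers typed in by hand, not computed; and `penalty_weights` and `weighted_lasso` are illustrative names, not functions from the GOALDeR package.

```python
import math
import random

def penalty_weights(cdcor, gamma):
    """w_j = (|cdcor_j| / max_j |cdcor_j|) ** (-gamma); zero association -> infinite penalty."""
    top = max(abs(c) for c in cdcor)
    return [math.inf if c == 0 else (abs(c) / top) ** (-gamma) for c in cdcor]

def soft_threshold(rho, thr):
    return math.copysign(max(abs(rho) - thr, 0.0), rho)

def weighted_lasso(Z, T, weights, lam, n_iter=200):
    """Coordinate descent for ||T - a0 - Z a||^2 + lam * sum_j w_j |a_j|, with lam > 0.

    The intercept is handled by centering T and the columns of Z.
    """
    n, p = len(Z), len(Z[0])
    t_mean = sum(T) / n
    z_mean = [sum(row[j] for row in Z) / n for j in range(p)]
    cols = [[row[j] - z_mean[j] for row in Z] for j in range(p)]
    resid = [t - t_mean for t in T]
    alpha = [0.0] * p
    for _ in range(n_iter):
        for j, col in enumerate(cols):
            zz = sum(v * v for v in col)
            if zz == 0.0:
                continue
            # Inner product of Z_j with the partial residual that excludes Z_j.
            rho = sum(v * r for v, r in zip(col, resid)) + zz * alpha[j]
            new = soft_threshold(rho, lam * weights[j] / 2.0) / zz
            delta = new - alpha[j]
            if delta != 0.0:
                resid = [r - delta * v for r, v in zip(resid, col)]
                alpha[j] = new
    return alpha

# Toy data: Z[0] plays a confounder, Z[1] an instrument, Z[2] a prognostic covariate.
rng = random.Random(0)
Z = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(200)]
T = [z[0] + z[1] + rng.gauss(0, 1) for z in Z]
cdcor = [0.5, 0.05, 0.5]             # hypothetical dcor(Z_j, Y | T) values
w = penalty_weights(cdcor, gamma=2)  # roughly [1, 100, 1]
alpha = weighted_lasso(Z, T, w, lam=10.0)
print([round(a, 3) for a in alpha])
```

In this toy setting the instrument predicts the treatment as strongly as the confounder does, yet its large penalty weight drives its coefficient to zero while the confounder is retained.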

Choosing $\lambda_{n}$

We propose the dual-weight distance correlation (DWDC) as a measure for selecting the optimal $\lambda_{n}$; the rule is to choose the $\lambda_{n}$ that minimizes DWDC. Like the dual-weight correlation (DWC) in the GOAL method, DWDC is motivated by covariate balance for unbiased, efficient estimation. However, unlike DWC, which captures only linear correlations between covariates and both the treatment and the outcome, DWDC uses distance correlation to capture all types of correlations between covariates and both the treatment and the outcome:

$$DWDC = \sum_{j=1}^{p} \left|\operatorname{dcor}(Z_{j},Y \mid T)\right|^{2} \left|\operatorname{dcor}_{w^{\lambda_{n}}}(Z_{j},T)\right| \tag{3}$$

where $\operatorname{dcor}_{w^{\lambda_{n}}}(Z_{j},T)$ refers to the weighted distance correlation between covariate $Z_{j}$ and the treatment, serving as a measure of covariate balance. The smaller $\left|\operatorname{dcor}_{w^{\lambda_{n}}}(Z_{j},T)\right|$ is, the better the covariate balance achieved after weighting. Recall that $\operatorname{dcor}(Z_{j},Y \mid T)$ is the conditional distance correlation between $Z_{j}$ and $Y$ given $T$. Multiplying these two components implies that the DWDC is more affected by the imbalance of $\boldsymbol{Z}_{c}$ and $\boldsymbol{Z}_{P}$ and less affected by the imbalance of $\boldsymbol{Z}_{I}$ and $\boldsymbol{Z}_{S}$. Hence, a smaller DWDC could further encourage the selection of $\boldsymbol{Z}_{c}$ and $\boldsymbol{Z}_{P}$.
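The selection rule can be sketched in a few lines, assuming the two ingredients of Eq. (3) have already been computed elsewhere. All numbers below are hypothetical and chosen only to show the mechanics; `dwdc` and `select_lambda` are illustrative names.

```python
def dwdc(cdcor_outcome, wdcor_treatment):
    """Eq. (3): sum_j |dcor(Z_j, Y | T)|^2 * |dcor_w(Z_j, T)|."""
    return sum(abs(c) ** 2 * abs(b) for c, b in zip(cdcor_outcome, wdcor_treatment))

def select_lambda(cdcor_outcome, balance_by_lambda):
    """Return the candidate lambda whose post-weighting balance minimizes DWDC."""
    return min(balance_by_lambda,
               key=lambda lam: dwdc(cdcor_outcome, balance_by_lambda[lam]))

# Hypothetical covariates: confounder, prognostic covariate, instrument, spurious.
cdcor_outcome = [0.5, 0.4, 0.02, 0.01]

# Hypothetical weighted distance correlations with the treatment for two lambdas.
balance_by_lambda = {
    0.1: [0.05, 0.05, 0.30, 0.10],  # balances outcome-related covariates well
    1.0: [0.20, 0.15, 0.02, 0.02],  # balances the instrument and spurious covariate well
}

best = select_lambda(cdcor_outcome, balance_by_lambda)
print(best, {lam: round(dwdc(cdcor_outcome, b), 5) for lam, b in balance_by_lambda.items()})
```

Because the squared outcome association multiplies each balance term, residual imbalance in the instrument barely moves the criterion, and the candidate that balances the confounder and prognostic covariate is preferred.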

The balance weights $w^{\lambda_{n}}$ in the DWDC are estimated using the DCOWs method with the covariates selected according to Eq. (2) at the given $\lambda_{n}$, without requiring the specification of moment orders for either the covariates or the treatment to achieve decorrelation. The DCOWs method uses the weighted distance covariance between the treatment and covariates as a loss function and directly estimates balance weights under the following constraints: (1) the marginal distributions of the treatment and the covariates are preserved after weighting; (2) the weights are positive and sum to the sample size. Huling et al. showed that the balance weights estimated by DCOWs can enhance a doubly robust estimator; further details are provided in their article [15]. In contrast, the balance weights $w^{\lambda_{n}}$ in the DWC are estimated using npCBGPS [11], which requires the specification of moment orders for both the covariates and the treatment to decorrelate nonlinear relationships.

Estimating DRF using a doubly robust estimator

Based on balance weights estimated using covariates selected by optimal $\:{\lambda\:}_{n}$, the GOAL method uses the IPW method to estimate DRF. The GOAL method cannot estimate DRF using the “doubly robust” method as variable selection in the GPS model hinges on the outcome model being correct, which undermines the “doubly robust” nature of the method [21]. In contrast, the variable selection in GOALDeR is independent of the outcome model; therefore, we ultimately use the doubly robust estimator of Kennedy et al. [10] to estimate DRF. The doubly robust estimator of Kennedy et al. [10] consists of two steps. In the first step, a pseudo-outcome is constructed, and in the second step, the pseudo-outcome is regressed on the treatment to estimate DRF. The pseudo-outcome can be estimated as [15]:

$$\widehat{\theta}(T_{i}) = \frac{1}{n}\sum_{j=1}^{n} \widehat{\mu}(\boldsymbol{Z}_{j}, T_{i}) + \left(Y_{i} - \widehat{\mu}(\boldsymbol{Z}_{i}, T_{i})\right) w_{i} \tag{4}$$

where $\widehat{\mu}(\boldsymbol{Z},T)$ denotes an estimate of the outcome model $\mu(\boldsymbol{Z},T)$. Here, the Super Learner (SL) method, which combines LASSO, XGBoost, Random Forest, and support vector machines, is applied to estimate $\widehat{\mu}(\cdot)$ [30]. $w_{i}$ denotes the balance weights estimated by the DCOWs method with covariates selected by the optimal $\lambda_{n}$. Subsequently, the DRF is estimated by regressing the pseudo-outcome on the treatment using a linear or nonlinear regression model. In this work, we used a linear regression model for comparison purposes.
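The two-step estimator can be illustrated with a small deterministic example. This is only a sketch: the outcome model `mu_hat` is supplied by hand rather than fitted by Super Learner, the weights are arbitrary positive numbers summing to n rather than DCOWs output, and the second term of the pseudo-outcome averages the fitted outcome model over all subjects' covariates at the treatment level of subject i. Function names are illustrative.

```python
def pseudo_outcomes(Z, T, Y, w, mu_hat):
    """xi_i = (1/n) * sum_j mu_hat(Z_j, T_i) + (Y_i - mu_hat(Z_i, T_i)) * w_i."""
    n = len(T)
    return [
        sum(mu_hat(z_j, t_i) for z_j in Z) / n + (y_i - mu_hat(z_i, t_i)) * w_i
        for z_i, t_i, y_i, w_i in zip(Z, T, Y, w)
    ]

def ols_slope(x, y):
    """Slope of the least-squares line of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((u - mx) * (v - my) for u, v in zip(x, y))
    sxx = sum((u - mx) ** 2 for u in x)
    return sxy / sxx

# Noise-free toy data with a single covariate: Y = 2 * T + Z.
Z = [0.0, 1.0, 2.0, 3.0, 4.0]
T = [1.0, 0.0, 2.0, 1.0, 3.0]
Y = [2 * t + z for t, z in zip(T, Z)]

# Arbitrary positive weights that sum to n (stand-ins for balance weights).
w = [0.5, 1.5, 1.0, 1.2, 0.8]

# A correctly specified outcome model, written down by hand for illustration.
def mu_hat(z, t):
    return 2 * t + z

xi = pseudo_outcomes(Z, T, Y, w, mu_hat)
slope = ols_slope(T, xi)  # step 2: regress the pseudo-outcome on the treatment
print(round(slope, 6))
```

With a correct outcome model the residual term vanishes, so the slope recovers the true coefficient whatever the weights are; this is one half of the double robustness property. The other half, correct weights with a wrong outcome model, needs real balance weights and is not shown here.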

Simulations

Simulation setup

We conducted simulations to assess the performance of GOALDeR and compare it with existing approaches when there are a large number of covariates. Following our previous studies [25, 31], we developed simulations by adapting the designs of Tan et al. [32] and Shortreed et al. [29]. For each replicated dataset, p covariates for each of n individuals were drawn from a multivariate standard Gaussian distribution, with correlations between covariates of 0, 0.2, or 0.5. We generated a continuous treatment and outcome from models given by:

$$\text{GPS model:}\quad T = \sum_{j=1}^{p} m(Z_{j})\,\alpha_{j} + \zeta, \quad \zeta \sim N(0,1) \tag{5}$$

$$\text{Outcome model:}\quad Y = \eta T + \sum_{j=1}^{p} g(Z_{j})\,\beta_{j} + \xi, \quad \xi \sim N(0,1) \tag{6}$$

where $\eta = 0$ or $2$.

We used two data-generating scenarios to compare the GOALDeR method with existing methods; they are summarized in Table 1. In the first scenario, we assumed that both the GPS model and the outcome model are linear, that is, $g(Z_{j})=Z_{j}$ and $m(Z_{j})=Z_{j}$, $j=1,\dots,p$, and we conducted simulations in three settings by varying the strength of the relationship between the confounders and the outcome and treatment. We considered varying levels of confounding because the strength of the confounders affects the bias, variance, and mean-squared error (MSE) of an estimate [19]. For all three settings, the first two covariates, $Z_{1}$ and $Z_{2}$, are true confounders; the third and fourth covariates, $Z_{3}$ and $Z_{4}$, are prognostic covariates; the fifth and sixth covariates, $Z_{5}$ and $Z_{6}$, are IVs; and the other p − 6 covariates are spurious covariates. The first setting (SoSt) sets $\boldsymbol{\alpha}=(1,1,0,0,1,1,0,\dots,0)$ and $\boldsymbol{\beta}=(1,1,1,1,0,0,0,\dots,0)$. The second setting (SoWt) sets $\boldsymbol{\alpha}=(0.5,0.5,0,0,1,1,0,\dots,0)$ and $\boldsymbol{\beta}=(1,1,1,1,0,0,0,\dots,0)$. The third setting (WoSt) sets $\boldsymbol{\alpha}=(1,1,0,0,1,1,0,\dots,0)$ and $\boldsymbol{\beta}=(0.5,0.5,1,1,0,0,0,\dots,0)$. The coefficients of 1 and 0.5 for confounders are commonly used in epidemiology [31, 33,34,35].
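A minimal sketch of the linear data-generating process (Eqs. 5 and 6 with identity $m$ and $g$) for the SoSt setting is given below. One assumption goes beyond the text: correlated covariates are generated with an exchangeable (equal pairwise) correlation via a shared Gaussian factor, since the text gives only the correlation values. The function names are illustrative.

```python
import math
import random

def simulate_linear(n, p, alpha, beta, eta, rho, seed=0):
    """Draw (Z, T, Y) from the linear GPS and outcome models with N(0, 1) errors.

    Covariates have unit variance and pairwise correlation rho (assumed
    exchangeable structure, built from one shared factor).
    """
    rng = random.Random(seed)
    alpha = list(alpha) + [0.0] * (p - len(alpha))  # pad spurious covariates with 0
    beta = list(beta) + [0.0] * (p - len(beta))
    Z, T, Y = [], [], []
    for _ in range(n):
        shared = rng.gauss(0, 1)
        z = [math.sqrt(rho) * shared + math.sqrt(1 - rho) * rng.gauss(0, 1)
             for _ in range(p)]
        t = sum(a * v for a, v in zip(alpha, z)) + rng.gauss(0, 1)
        y = eta * t + sum(b * v for b, v in zip(beta, z)) + rng.gauss(0, 1)
        Z.append(z)
        T.append(t)
        Y.append(y)
    return Z, T, Y

def ols_slope(x, y):
    """Slope of the least-squares line of y on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((u - mx) * (v - my) for u, v in zip(x, y))
    sxx = sum((u - mx) ** 2 for u in x)
    return sxy / sxx

# SoSt: Z1, Z2 confounders; Z3, Z4 prognostic; Z5, Z6 instruments; rest spurious.
SOST = {"alpha": [1, 1, 0, 0, 1, 1], "beta": [1, 1, 1, 1, 0, 0]}

Z, T, Y = simulate_linear(n=5000, p=10, eta=2.0, rho=0.0, **SOST)
naive = ols_slope(T, Y)  # unadjusted regression of Y on T
print(round(naive, 3))
```

With uncorrelated covariates, Var(T) = 4 + 1 = 5 and the covariance between T and the covariate part of Y is 2, so the unadjusted slope converges to 2 + 2/5 = 2.4 rather than the true value 2. The large n and small p here are chosen only to make that confounding bias visible, and do not match the paper's (n, p) settings.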

Under the second scenario, we introduced model misspecification via a nonlinear transformation of confounders and conducted simulations under three settings by varying whether the GPS model or the outcome model was misspecified. The data-generating processes were similar to those in the simulations by Tan et al. [32] and Kang et al. [36], which explored the impact of model misspecification on DR and non-DR estimators. The first setting (CoMt) correctly specified the outcome model and misspecified the GPS model, with $m(Z_{1})=\exp(Z_{1}/2)$, $m(Z_{2})=Z_{2}/(1+\exp(Z_{1}))+10$, $m(Z_{3})=(0.04\,Z_{1}Z_{3}+0.6)^{3}$, $m(Z_{4})=(Z_{2}+Z_{4})^{2}$, $m(Z_{j})=Z_{j}$ for $j>4$, and $g(Z_{j})=Z_{j}$ for $j=1,\dots,p$. The second setting (MoCt) used a nonlinear data-generating process for the outcome and a linear one for the treatment, with $m(Z_{j})=Z_{j}$ for $j=1,\dots,p$, and $g(Z_{1})=\exp(Z_{1}/2)$, $g(Z_{2})=Z_{2}/(1+\exp(Z_{1}))+10$, $g(Z_{3})=(0.04\,Z_{1}Z_{3}+0.6)^{3}$, $g(Z_{4})=(Z_{2}+Z_{4})^{2}$, $g(Z_{j})=Z_{j}$ for $j>4$. The third setting (MoMt) used a nonlinear data-generating process for both the outcome and the treatment, combining the transformations of CoMt and MoCt. For all three settings, the coefficients were set to $\boldsymbol{\alpha}=(1,1,1,1,0,0,1,1,0,\dots,0)$ and $\boldsymbol{\beta}=(1,1,1,1,1,1,0,0,0,\dots,0)$.
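The nonlinear transformations can be written out directly. The sketch below applies them to one covariate vector and computes the noise-free part of the treatment under CoMt; `nonlinear_terms` and `linear_predictor` are illustrative names, and the input vector is an arbitrary example.

```python
import math

def nonlinear_terms(z):
    """Transform Z_1..Z_4 as in the misspecified settings; leave Z_j, j > 4, unchanged."""
    z1, z2, z3, z4 = z[:4]
    return [
        math.exp(z1 / 2),
        z2 / (1 + math.exp(z1)) + 10,
        (0.04 * z1 * z3 + 0.6) ** 3,
        (z2 + z4) ** 2,
    ] + list(z[4:])

def linear_predictor(terms, coef):
    """sum_j terms_j * coef_j, padding coef with zeros for spurious covariates."""
    coef = list(coef) + [0.0] * (len(terms) - len(coef))
    return sum(c * v for c, v in zip(coef, terms))

ALPHA = [1, 1, 1, 1, 0, 0, 1, 1]  # confounders Z1-Z4, instruments Z7-Z8
BETA = [1, 1, 1, 1, 1, 1, 0, 0]   # confounders Z1-Z4, prognostic Z5-Z6

z = [0.0, 2.0, 5.0, 1.0, 7.0, -1.0, 0.5, 0.5, 3.0]  # arbitrary example, p = 9
terms = nonlinear_terms(z)

# CoMt: treatment uses the transformed terms, outcome uses the raw covariates.
t_mean = linear_predictor(terms, ALPHA)  # noise-free part of T
g_mean = linear_predictor(z, BETA)       # noise-free covariate part of Y
print([round(v, 3) for v in terms[:4]], round(t_mean, 3), round(g_mean, 3))
```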

For other settings, including the true causal parameter in the DRF ($\eta=0$ or $2$), sample size, and the dimension of covariates, we followed Shortreed et al. [29] and our previous study [25]. For each setting, we generated 100 simulated datasets for each of two dimensionalities (n/p ratios): n = 200, p = 100 and n = 500, p = 200. We searched over the candidate $\lambda_{n}$ values $\left\{n^{-10}, n^{-5}, n^{-2}, n^{-1.25}, n^{-1}, n^{-0.75}, n^{-0.5}, n^{-0.25}, n^{0.25}, n^{0.49}\right\}$ for each dataset and chose $\gamma$ such that $\lambda_{n} n^{\gamma/2-1}=n^{2}$.
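The pairing of $\gamma$ with each candidate $\lambda_{n}$ follows from the stated constraint. Writing $\lambda_{n}=n^{a}$, the condition $\lambda_{n} n^{\gamma/2-1}=n^{2}$ gives $a+\gamma/2-1=2$, so $\gamma=2(3-a)$. The short sketch below tabulates the grid; the function names are illustrative.

```python
import math

# Exponents a of the candidate values lambda_n = n ** a listed in the text.
EXPONENTS = [-10, -5, -2, -1.25, -1, -0.75, -0.5, -0.25, 0.25, 0.49]

def gamma_for(a):
    """Solve a + gamma / 2 - 1 = 2 for gamma."""
    return 2 * (3 - a)

def tuning_grid(n):
    """Pairs (lambda_n, gamma) for a sample of size n."""
    return [(n ** a, gamma_for(a)) for a in EXPONENTS]

for lam, gamma in tuning_grid(200):
    print(f"lambda = {lam:.3e}, gamma = {gamma:.2f}")
```

Every exponent in the grid is below 0.5, which is what $\lambda_{n}/\sqrt{n}\to 0$ requires, and every resulting $\gamma$ exceeds 1.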

Table 1 Simulation scenarios. Treatment T is generated as $N(m(\boldsymbol{Z}),1)$, and outcome Y is generated as $N(\eta T+g(\boldsymbol{Z}),1)$, where $\eta=0$ or $2$

Furthermore, to investigate the impact of effect size on statistical testing, we also explored the performance of each method when the DRF parameter was set to 0.4 and 0.7. The data-generating processes are the same as those described in Table 1, with the only difference being η = 0.4 or 0.7. To examine the performance of the GOALDeR method as the sample size increases, we let p = 20 and n = 200, 500, 1000. The data-generating processes are the same as those described in Table 1, with the only difference being the values of (n, p).

Comparing methods

We compared the following methods for estimating DRF: (1) GOAL [25], whose processes are similar to those of the GOALDeR method. The main differences between the GOAL method and the GOALDeR method are described in the section ‘Generalized outcome-adaptive LASSO and doubly robust estimation’. A detailed implementation of the GOAL method can be found in the Supplementary Materials; (2) SL-DR, which estimates the DRF in the DR framework of Kennedy et al. [10] (described in the subsection ‘Estimating DRF using a doubly robust estimator’). Briefly, the SL-DR method fits the GPS model and the outcome model using the SL method to estimate the pseudo-outcome. The SL method combines the results of LASSO, XGBoost, Random Forest, and support vector machines. The balance weights used to estimate the pseudo-outcome are given by $w_{i}=f_{T}(T_{i})/f_{T\mid Z}(T_{i}\mid\boldsymbol{Z}_{i})$, where the numerator is the marginal density of the treatment and $f_{T\mid Z}(T_{i}\mid\boldsymbol{Z}_{i})$ is the GPS. In this study, we approximated both $f_{T}(T_{i})$ and $f_{T\mid Z}(T_{i}\mid\boldsymbol{Z}_{i})$ with normal densities. The R packages used to implement the GOALDeR, SL-DR, and GOAL methods are available at https://github.com/QianGao-SXMU/GOALDeR and https://github.com/QianGao-SXMU/GOAL, respectively.

Results

The results for data generated with $\eta=2$ are shown below. The remaining results are in the Supplementary Materials.

Estimation under scenarios 1 and 2 with a modest p = 20

We performed simulations to evaluate GOALDeR and compare it with existing methods. For illustrating the performance of GOALDeR as the sample size (n) increases, we plotted the distribution of the causal parameter estimates using a boxplot and the proportion of times each covariate was selected for simulation with a modest number of covariates (p = 20). For Scenario 1, where both the outcome and GPS models are linear, we present the results for SoSt (the confounders are strongly correlated with both the treatment and the outcome), with a true causal parameter equal to 2. The remaining results are provided in the Supplementary Materials. The boxplot of causal parameter estimates is presented in Fig. 1 (Supplementary Figs. S1 and S8). GOALDeR produced nearly unbiased estimates across all sample sizes, and the precision of the estimates was enhanced as n increased. In Scenario 2, when either the GPS model or outcome model was nonlinear (CoMt and MoCt), GOALDeR could still yield nearly unbiased estimates across all sample sizes (Fig. 2 and Supplementary Fig. S12) despite having unsatisfactory performance in variable selection (Supplementary Figs. S5 to S7 and S13 to S15). Under the setting where the outcome model is nonlinear and the GPS model is linear (MoCt), the variability of the estimates became smaller as the sample size increased. Not surprisingly, the estimates became biased when both the GPS and outcome models were nonlinear (MoMt; Fig. 2 and Supplementary Fig. S12).

The percentage of times each covariate was selected under Scenario 1 is shown in Fig. 3 (Supplementary Figs. S2 to S4 and S9 to S11). We present the results for SoSt, with a true causal parameter equal to 2. The remaining results are presented in the Supplementary Materials. In general, the likelihood of selecting IVs decreased sharply as n increased. For example, with uncorrelated covariates, the average proportion of selecting IVs was 30% when n = 200, decreasing to 1.5% when n = 500, and further decreasing to 0 when n = 1000. The selection of IVs and spurious covariates increased as the correlation between covariates increased. Although GOALDeR may underselect confounders that are weakly correlated with the outcome (Supplementary Figs. S4 and S11), it still yielded nearly unbiased estimates (Supplementary Figs. S1 and S8). Additionally, GOALDeR showed a similar variable selection pattern when there was a large number of covariates.

Estimation and testing under scenario 1 with a large number of covariates

In Scenario 1, we compared the accuracy and precision of causal estimates between GOALDeR, GOAL, and SL-DR under varying strengths of association between the confounders and both the treatment and the outcome. The bias of parameter estimates in the DRF was used to evaluate accuracy. The bias distribution is shown in Fig. 4, and the summary statistics for Scenario 1 are listed in Table 2. GOALDeR showed nearly unbiased estimates across all three settings (Fig. 4; Table 2). The root mean squared error (RMSE) and the empirical standard error of the estimates were used to assess precision. The precision of GOALDeR was slightly better when n = 500 than when n = 200. Interestingly, the accuracy and precision of GOALDeR were only slightly affected by the correlations between covariates (Fig. 4; Table 2). In contrast, as previously observed, the bias and variability (RMSE and empirical standard error) of GOAL increased as the correlations between covariates increased and the n/p ratio decreased. Compared with GOALDeR, SL-DR provided similar estimation accuracy, but its precision was substantially worse (Fig. 4; Table 2). This is presumably because SL-DR ignores the negative effects of IVs when fitting the GPS model.
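For readers who want to reproduce such summaries, the sketch below computes bias, empirical standard error, and RMSE from a set of replicate estimates using their usual definitions. The text does not spell out its exact formulas (for example, the divisor used for the empirical standard error), so these are assumptions, and the replicate values are made up.

```python
import math

def summarize(estimates, truth):
    """Bias, empirical standard error, and RMSE of replicate estimates."""
    m = len(estimates)
    mean = sum(estimates) / m
    bias = mean - truth
    # Sample standard deviation of the estimates across replicates.
    empirical_se = math.sqrt(sum((e - mean) ** 2 for e in estimates) / (m - 1))
    # Root mean squared deviation from the true parameter.
    rmse = math.sqrt(sum((e - truth) ** 2 for e in estimates) / m)
    return {"bias": bias, "empirical_se": empirical_se, "rmse": rmse}

replicates = [1.9, 2.1, 2.0, 2.2]  # made-up estimates of a true parameter of 2
stats = summarize(replicates, truth=2.0)
print({k: round(v, 4) for k, v in stats.items()})
```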

Table 2 Summary statistics of the performance under scenario 1 with the true parameter of DRF = 2

The standard deviation (SD) was estimated from the regression of the pseudo-outcome on the treatment for the GOALDeR and SL-DR methods, and using the sandwich-type variance estimator for the GOAL method [25]. The coverage probability of the 95% confidence interval (CI) and power were used to assess statistical testing. As shown in Table 2, for GOALDeR, the estimated SDs were lower than the empirical standard errors in most cases, resulting in the coverage of the 95% CI being less than 95% (ranging from 67 to 90%). For SL-DR, the estimated SDs were substantially lower than the empirical standard errors, and the coverage of the 95% CI (ranging from 48 to 81%) was consistently lower than that of the GOALDeR method. This implied that the regression-based SDs for the GOALDeR and SL-DR methods were underestimated. For GOAL, the estimated SDs were larger than the empirical standard errors, resulting in the coverage of the 95% CI tending to be conservative (ranging from 95 to 100%). We also estimated the SD using the bootstrap method for the GOALDeR method and found that the bootstrap SD was slightly higher than the empirical standard errors, and the corresponding coverage was around 95% in most cases.
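The link between underestimated SDs and undercoverage can be seen in a small sketch. It assumes Wald-type intervals (estimate ± 1.96 × SD) and a two-sided test of a zero effect; the text does not state how the intervals were constructed. The replicate values are made up.

```python
def coverage_and_power(estimates, sds, truth, z=1.96):
    """Share of Wald intervals covering the truth, and share rejecting a zero effect."""
    m = len(estimates)
    covered = sum(1 for e, s in zip(estimates, sds) if e - z * s <= truth <= e + z * s)
    rejected = sum(1 for e, s in zip(estimates, sds) if abs(e) > z * s)
    return covered / m, rejected / m

estimates = [1.7, 2.0, 2.3, 2.1]  # made-up replicate estimates, true value 2

# Same estimates, two sets of reported SDs: adequate versus too small.
adequate = coverage_and_power(estimates, [0.2] * 4, truth=2.0)
too_small = coverage_and_power(estimates, [0.1] * 4, truth=2.0)
print(adequate, too_small)
```

Halving the reported SD leaves power unchanged in this example but drops coverage, because the intervals around the two more extreme estimates no longer reach the true value.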

GOALDeR, GOAL, and SL-DR had similar power, which was 1 in all three settings (Table 2). Furthermore, we also explored the performance of each method when the DRF parameter was set to 0.4 and 0.7 to investigate the impact of effect size on statistical testing. The results for $\eta=0.4$ and $0.7$ are presented in Supplementary Tables S3 and S4, and were similar to those for $\eta=2$ except for the power. When the DRF parameter decreased from 2 to 0.4, the power for GOALDeR remained consistently at 1, while it slightly decreased for SL-DR and decreased markedly for the GOAL method.

Estimation and testing under scenario 2 with a large number of covariates

In Scenario 2, we assessed the double robustness of GOALDeR, GOAL, and SL-DR. The bias distribution of parameter estimates in DRF is shown in Fig. 5, and the summary statistics for Scenario 2 are listed in Table 3. GOALDeR yielded estimates that were close to the true value of 2 as long as either the outcome model or the GPS model was correctly specified, and the biases were little affected by the correlation between covariates and the n/p ratio. This indicated the double robustness of GOALDeR (Table 3; Fig. 5). In the MoCt setting, the variability (RMSE and empirical standard error) of the GOALDeR estimates became large as the correlation between covariates increased, especially when the n/p ratio was small (n/p = 200/100). The SL-DR method also tended to be doubly robust. Compared to GOALDeR, SL-DR provided estimates with smaller biases and slightly higher RMSE in the MoCt setting. However, when only the GPS model was nonlinear (CoMt), its biases and RMSE were substantially larger than those of the GOALDeR method, especially when there was a strong correlation among covariates. In contrast, GOAL became biased when the outcome model was incorrectly specified because it relies on the assumption that the outcome model is linear for variable selection. In the MoMt setting, all three approaches were biased, with SL-DR exhibiting the largest biases and RMSE.

Table 3 Summary statistics of the performance under scenario 2 with the true parameter of DRF = 2

As shown in Table 3, when one of the models was correctly specified (CoMt and MoCt), the estimated SDs of GOALDeR were less than or equal to the empirical standard errors, resulting in the coverage being less than 95% in most cases. The bootstrap SDs were higher than the empirical standard errors, and the corresponding coverage tended to be conservative (ranging from 93 to 100%). For the SL-DR method, the estimated SDs were significantly less than the empirical standard errors, and the coverage of the 95% CI (ranging from 0 to 77%) was consistently lower than that of the GOALDeR method. For the GOAL method, the estimated SDs were nearly equal to the empirical standard errors in the setting of CoMt, and the corresponding coverage was less than 95%. In the setting of MoCt, the estimated SDs of GOAL were significantly larger than the empirical standard errors, and the coverage was conservative. In the setting of MoMt, the coverage of all three methods was 0 because of the large bias.

The power of GOALDeR and SL-DR was always 1 in all three settings (Table 3). In contrast, the power of GOAL was significantly reduced when only the outcome model was incorrectly specified (MoCt). When the DRF parameter decreased from 2 to 0.4, the power for GOALDeR remained consistently at 1, while it slightly decreased for SL-DR. The coverage for the GOALDeR and SL-DR methods decreased.

Real data applications

We applied GOALDeR and SL-DR to study causal relationships between epigenetic age acceleration and AD. The results of the GOAL method have been reported in our previous study [25]. We followed steps similar to those implemented in the GOAL method to collect datasets, calculate DNA methylation (DNAm) age, and process and identify potential confounders [25]. Briefly, we downloaded seven datasets from the Gene Expression Omnibus database according to the inclusion and exclusion criteria. The accession numbers are GSE105109 [37], GSE125895 [38], GSE134379 [39], GSE59685 [40], GSE66351 [41], GSE80970 [42], and GSE109627 [43], covering four brain regions: frontal cortex (FC), temporal cortex (TC), entorhinal cortex (ERC), and cerebellum (CRB). The ‘cortical DNAm clock’ was used to estimate DNAm age, which is a measure of biological age [44]. The residuals of the regression model of chronological age on DNAm age were defined as epigenetic age acceleration. We considered chronological age and gender to be recognized risk factors for AD, and the datasets with raw data also controlled for the proportion of neuronal cells. Additionally, we regarded whole-genome CpG sites as potential covariates, as they may contain confounders and prognostic covariates or act as surrogates for these two types of covariates. Initially, we selected potential adjustment CpG sites through epigenome-wide association study (EWAS) meta-analysis for each brain region, keeping the top K CpG sites with the smallest Bonferroni-adjusted P values. The value of K for each brain region was determined as follows: K = minimum sample size in the specific brain region − (number of known covariates + 2), since GOALDeR is not directly applicable when p > n.
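The two computational steps in this paragraph, defining age acceleration as regression residuals and choosing K, can be sketched as follows. This is only an illustration: the helper names `ols_residuals` and `top_k_cpg` are hypothetical, the data are synthetic, the sample size of 60 is invented, and treating DNAm age as the response of the regression is an assumption about the intended direction (it is the common convention for age acceleration).

```python
import numpy as np

def ols_residuals(response, predictor):
    """Residuals from a simple least-squares fit of response on predictor."""
    response = np.asarray(response, dtype=float)
    predictor = np.asarray(predictor, dtype=float)
    # Design matrix with an intercept column.
    X = np.column_stack([np.ones_like(predictor), predictor])
    coef, *_ = np.linalg.lstsq(X, response, rcond=None)
    return response - X @ coef

def top_k_cpg(min_sample_size, n_known_covariates):
    """K = minimum sample size - (number of known covariates + 2)."""
    return min_sample_size - (n_known_covariates + 2)

# Synthetic ages (hypothetical values, not from the GEO datasets).
rng = np.random.default_rng(0)
chron_age = rng.uniform(60, 95, size=50)
dnam_age = 5 + 0.9 * chron_age + rng.normal(0, 3, size=50)

# Assumed direction: DNAm age regressed on chronological age.
accel = ols_residuals(dnam_age, chron_age)

# Hypothetical region: smallest dataset has 60 samples, with three known
# covariates (chronological age, gender, neuronal cell proportion).
k = top_k_cpg(min_sample_size=60, n_known_covariates=3)
print(k)
```

By construction the residuals have mean zero and are uncorrelated with the predictor, and the cap on K keeps the total number of covariates below the smallest sample size in the region.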

Table 4 shows the estimated causal DRF of the GOALDeR and SL-DR methods between epigenetic age acceleration and AD across four brain regions. For the GOALDeR method, the four brain regions showed consistent results that there was no statistically significant dose–response relationship between epigenetic age acceleration and AD (P > 0.05). For the SL-DR method, the results for the four regions were inconsistent. We therefore performed a meta-analysis with a random-effects model (Supplementary Fig. S18) because there was heterogeneity among datasets (TC: $I^{2} = 96.4\%$, $Q = 111.76$, $P < 0.0001$; FC: $I^{2} = 94.5\%$, $Q = 54.63$, $P < 0.0001$; ERC: $I^{2} = 92.8\%$, $Q = 27.64$, $P < 0.0001$; CRB: $I^{2} = 90.3\%$, $Q = 30.95$, $P < 0.0001$). The pooled odds ratios were 0.9985 (95% confidence interval: 0.9943–1.0027, P = 0.4883), 1.0006 (95% confidence interval: 0.9945–1.0067, P = 0.8550), 0.9643 (95% confidence interval: 0.8860–1.0496, P = 0.4008), and 0.9885 (95% confidence interval: 0.9510–1.0276, P = 0.5601), respectively.
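For readers unfamiliar with the heterogeneity statistics quoted above, the following sketch shows how Cochran's Q, $I^{2}$, and a pooled odds ratio could be computed from per-dataset log odds ratios and their standard errors. The DerSimonian-Laird estimator of the between-study variance is an assumption here, since the text does not state which random-effects estimator was used; the function name `random_effects_meta` and the input values are hypothetical.

```python
import numpy as np

def random_effects_meta(log_or, se, z=1.96):
    """Random-effects pooling with the DerSimonian-Laird tau^2 (assumed estimator)."""
    log_or = np.asarray(log_or, dtype=float)
    se = np.asarray(se, dtype=float)
    w = 1.0 / se**2                                  # inverse-variance weights
    fixed = np.sum(w * log_or) / np.sum(w)           # fixed-effect pooled estimate
    q = float(np.sum(w * (log_or - fixed) ** 2))     # Cochran's Q
    df = len(log_or) - 1
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0    # I^2 as a proportion
    c = np.sum(w) - np.sum(w**2) / np.sum(w)
    tau2 = max(0.0, (q - df) / c)                    # between-study variance
    w_re = 1.0 / (se**2 + tau2)                      # random-effects weights
    pooled = np.sum(w_re * log_or) / np.sum(w_re)
    pooled_se = np.sqrt(1.0 / np.sum(w_re))
    return {
        "q": q,
        "i2": float(i2),
        "tau2": float(tau2),
        "pooled_or": float(np.exp(pooled)),
        "ci_low": float(np.exp(pooled - z * pooled_se)),
        "ci_high": float(np.exp(pooled + z * pooled_se)),
    }

# Hypothetical per-dataset odds ratios and standard errors of the log odds ratios.
result = random_effects_meta(np.log([0.95, 1.02, 1.10]), [0.02, 0.03, 0.05])
print(result)
```

When Q is large relative to its degrees of freedom, $I^{2}$ approaches 1 and the random-effects weights become more equal across datasets, which widens the pooled confidence interval compared with a fixed-effect analysis.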

Table 4 Summary statistics for the datasets and results of GOALDeR and SL-DR analyses across the four brain regions

In summary, the GOALDeR and SL-DR analyses found that there was no statistically significant dose-response association between epigenetic age acceleration and AD, which is consistent with the results of the GOAL method [25]. In addition, the results of the SL-DR method showed greater variability than those of GOALDeR, which is consistent with our simulation results.

Discussion

We developed a new approach, GOALDeR, to estimate the linear or nonlinear DRF in high dimensions. Our extensive simulation studies, conducted under both correct and incorrect model specifications, indicated that GOALDeR can produce nearly unbiased estimates as long as either the outcome or the GPS model is correctly specified. It is therefore empirically doubly robust. The performance of GOALDeR is only slightly affected by the n/p ratio and by correlation among covariates. GOALDeR can also achieve statistical power and 95% CI coverage that are comparable to those of other methods.

Our simulations show that GOAL requires a linear outcome model to produce unbiased estimates, and that its accuracy and precision worsen when the covariates are correlated or the n/p ratio is small. These results are consistent with those of previous studies [25]. SL-DR requires the user to specify the conditional and marginal distributions of the treatment [22,23,24]. In our simulations, we assumed normal distributions for the treatment and the GPS both in the data-generating process and in the estimation of balance weights for SL-DR. This setting may partly contribute to the nearly doubly robust performance of SL-DR and explain why SL-DR performs less accurately and precisely than GOALDeR when the GPS model is misspecified and the outcome model is correctly specified. Additionally, the variability of the SL-DR estimates is greater than that of the GOALDeR estimates. This may be because SL-DR ignores the influence of IVs when estimating the GPS [16,17,18,19,20,21].

The outstanding performance of GOALDeR in estimation accuracy and precision may be attributed to the following: (1) GOALDeR uses a balance-based method to estimate balance weights, thereby avoiding the need to specify distributions for the treatment and GPS [15]; (2) GOALDeR uses the distance correlation coefficient to assess covariate balance, thereby avoiding the need to specify which moment orders of the covariates and the treatment to decorrelate [15]; (3) GOALDeR constructs penalty weights based on the conditional correlation between the outcome and covariates without depending on the outcome model, thereby excluding IVs and estimating the DRF within the doubly robust framework [21].

However, as with most existing methods [16, 21, 22, 25, 29], GOALDeR lacks a standard deviation estimator that guarantees a valid confidence interval. Here, GOALDeR uses the regression coefficient of the treatment in the pseudo-outcome regression to obtain inference for the DRF. The corresponding power consistently equaled 1, while the coverage of the 95% CI was often less than the nominal value, suggesting that the SDs were underestimated. This underestimation may be due to the estimated SD failing to adequately capture the variability introduced by variable selection. We also employed the bootstrap method to estimate the SD and found that (i) the bootstrap SDs were slightly higher than the empirical standard errors when both the GPS and outcome models were correctly specified, resulting in coverage probabilities around the nominal value in most cases; (ii) the bootstrap SDs were moderately higher than the empirical standard errors when only one of the GPS and outcome models was correctly specified, leading to coverage probabilities that tended to be conservative (greater than the nominal value) in most cases. Although the bootstrap method tends to improve the statistical tests, it does not completely resolve the problem of inference after variable selection [45]. Therefore, the development of a valid and widely applicable variance estimator after variable selection is a possible topic for future work [46].
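To make point (2) above concrete, the sketch below computes the sample distance correlation of Székely et al. [26] between two variables. It is an unweighted, illustrative version and is not the GOALDeR implementation, which applies the measure together with balance weights; the function name `distance_correlation` and the synthetic data are assumptions.

```python
import numpy as np

def distance_correlation(x, y):
    """Sample distance correlation (V-statistic form) between x and y."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim == 1:
        x = x[:, None]
    if y.ndim == 1:
        y = y[:, None]
    # Pairwise Euclidean distance matrices.
    a = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=2)
    b = np.linalg.norm(y[:, None, :] - y[None, :, :], axis=2)
    # Double centering: subtract row and column means, add back the grand mean.
    A = a - a.mean(axis=0, keepdims=True) - a.mean(axis=1, keepdims=True) + a.mean()
    B = b - b.mean(axis=0, keepdims=True) - b.mean(axis=1, keepdims=True) + b.mean()
    dcov2 = (A * B).mean()                          # squared distance covariance
    denom = np.sqrt((A * A).mean() * (B * B).mean())
    if denom == 0:
        return 0.0
    return float(np.sqrt(max(dcov2, 0.0) / denom))

# Synthetic example: a purely nonlinear dependence that Pearson correlation misses.
rng = np.random.default_rng(1)
x = rng.normal(size=300)
noise = rng.normal(size=300)
print(np.corrcoef(x, x**2)[0, 1])        # close to zero
print(distance_correlation(x, x**2))     # clearly positive
print(distance_correlation(x, noise))    # small for independent variables
```

Because distance correlation is zero in the population only under independence, driving it toward zero between the treatment and the covariates targets dependence of any form, rather than only the moments that a user chooses to balance.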

In summary, this study proposed a doubly robust estimator for continuous treatment and high-dimensional covariates. Within the framework of the doubly robust (DR) estimator, the proposed GOALDeR method combined a variable selection technique for causal inference to ensure unbiased and statistically efficient estimation, along with a balance-based method that was robust to misspecification of the distributions required for GPS methods. Simulation results and real data analyses provided empirical evidence that GOALDeR achieved double robustness, offering improved accuracy and precision compared to existing methods. We also provided an R package for implementing the GOALDeR method, available at https://github.com/QianGao-SXMU/GOALDeR.

Data availability

No datasets were generated or analysed during the current study.

References

Athey S, Imbens GW, Wager S. Approximate residual balancing: debiased inference of average treatment effects in high dimensions. J R Stat Soc Ser B-Stat Methodol. 2018;80(4):597–623.
Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55.
Imbens G. The role of the propensity score in estimating dose-response functions. Biometrika. 2000;87(3):706–10.
Imai K, van Dyk DA. Causal inference with General Treatment regimes. J Am Stat Assoc. 2004;99(467):854–66.
Hirano K, Imbens GW. The propensity score with continuous treatments. In: Applied Bayesian Modeling and Causal Inference from Incomplete-Data Perspectives. Chichester: John Wiley & Sons; 2004. p. 73–84.
Wu X, Mealli F, Kioumourtzoglou MA, Dominici F, Braun D. Matching on generalized propensity scores with continuous exposures. J Am Stat Assoc. 2022;119(545):757–72.
Zhang Z, Zhou J, Cao W, Zhang J. Causal inference with a quantitative exposure. Stat Methods Med Res. 2012;25(1):315–35.
Naimi AI, Moodie EE, Auger N, Kaufman JS. Constructing inverse probability weights for continuous exposures: a comparison of methods. Epidemiology. 2014;25(2):292–9.
Bang H, Robins JM. Doubly robust estimation in missing data and causal inference models. Biometrics. 2005;61(4):962–73.
Kennedy EH, Ma Z, McHugh MD, Small DS. Non-parametric methods for doubly robust estimation of continuous treatment effects. J Royal Stat Soc Ser B. 2017;79(4):1229–45.
Fong C, Hazlett C, Imai K. Covariate balancing propensity score for a continuous treatment: application to the efficacy of political advertisements. Annals Appl Stat. 2018;12(1):156–77.
Tübbicke S. Entropy balancing for continuous treatments. J Econometric Methods. 2022;11(1):71–89.
Vegetabile BG, Griffin BA, Coffman DL, Cefalu M, Robbins MW, McCaffrey DF. Nonparametric estimation of population average dose-response curves using entropy balancing weights for continuous exposures. Health Serv Outcomes Res Method. 2021;21(1):69–110.
Yiu S, Su L. Covariate association eliminating weights: a unified weighting framework for causal effect estimation. Biometrika. 2018;105(3):709–22.
Huling JD, Greifer N, Chen G. Independence weights for causal inference with continuous treatments. J Am Stat Assoc. 2023;119(546):1657–70.
Koch B, Vock DM, Wolfson J. Covariate selection with group lasso and doubly robust estimation of causal effects. Biometrics. 2018;74(1):8–17.
Wilson A, Reich BJ. Confounder selection via penalized credible regions. Biometrics. 2014;70(4):852–61.
Ertefaie A, Asgharian M, Stephens DA. Variable selection in causal inference using a simultaneous penalization method. J Causal Inference. 2018;6(1):20170010.
Brookhart MA, Schneeweiss S, Rothman KJ, Glynn RJ, Avorn J, Stürmer T. Variable selection for propensity score models. Am J Epidemiol. 2006;163(12):1149–56.
Zhu Y, Schonbach M, Coffman DL, Williams JS. Variable selection for propensity score estimation via balancing covariates. Epidemiology. 2015;26(2):e14-5.
Tang D, Kong D, Pan W, Wang L. Ultra-high dimensional variable selection for doubly robust causal inference. Biometrics. 2023;79(2):903–14.
Su L, Ura T, Zhang Y. Non-separable models with high-dimensional data. J Econ. 2019;212(2):646–77.
Colangelo K, Lee YY. Double debiased machine learning nonparametric inference with continuous treatments. arXiv Preprint. 2020. arXiv:200403036.
Antonelli J, Papadogeorgou G, Dominici F. Causal inference in high dimensions: a marriage between bayesian modeling and good frequentist properties. Biometrics. 2022;78(1):100–14.
Gao Q, Zhang Y, Liang J, Sun H, Wang T. High-dimensional generalized propensity score with application to omics data. Brief Bioinform. 2021;22(6):bbab331.
Székely GJ, Rizzo ML, Bakirov NK. Measuring and testing dependence by correlation of distances. Ann Stat. 2007;35(6):2769–94.
Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol. 1974;66(5):688.
Edelmann D, Fokianos K, Pitsillou M. An updated literature review of distance correlation and its applications to time series. Int Stat Rev. 2019;87(2):237–62.
Shortreed SM, Ertefaie A. Outcome-adaptive lasso: variable selection for causal inference. Biometrics. 2017;73(4):1111–22.
Van der Laan M, Polley E, Hubbard A. Super learner. Stat Appl Genet Mol Biol. 2007;6(1):Article 25.
Gao Q, Zhang Y, Sun H, Wang T. Evaluation of propensity score methods for causal inference with high-dimensional covariates. Brief Bioinform. 2022;23(4):22727.
Tan Z. Regularized calibrated estimation of propensity scores with model misspecification and high-dimensional data. Biometrika. 2019;107(1):137–58.
Cepeda MS, Boston R, Farrar JT, Strom BL. Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders. Am J Epidemiol. 2003;158(3):280–7.
Verbeek JH, Whaley P, Morgan RL, Taylor KW, Rooney AA, Schwingshackl L, Hoving JL, Vittal Katikireddi S, Shea B, Mustafa RA, et al. An approach to quantifying the potential importance of residual confounding in systematic reviews of observational studies: a GRADE concept paper. Environ Int. 2021;157:106868.
Schisterman EF, Cole SR, Platt RW. Overadjustment bias and unnecessary adjustment in epidemiologic studies. Epidemiology. 2009;20(4):488–95.
Kang JDY, Schafer JL. Demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data. Stat Sci. 2007;22(4):523–39.
Smith AR, Smith RG, Pishva E, Hannon E, Roubroeks JA, Burrage J, Troakes C, Al-Sarraj S, Sloan C, Mill J. Parallel profiling of DNA methylation and hydroxymethylation highlights neuropathology-associated epigenetic variation in Alzheimer’s disease. Clin Epigenetics. 2019;11(1):1–13.
Semick SA, Bharadwaj RA, Collado-Torres L, Tao R, Shin JH, Deep-Soboslay A, Weiss JR, Weinberger DR, Hyde TM, Kleinman JE. Integrated DNA methylation and gene expression profiling across multiple brain regions implicate novel genes in Alzheimer’s disease. Acta Neuropathol. 2019;137:557–69.
Brokaw DL, Piras IS, Mastroeni D, Weisenberger DJ, Nolz J, Delvaux E, Serrano GE, Beach TG, Huentelman MJ, Coleman PD. Cell death and survival pathways in Alzheimer’s disease: an integrative hypothesis testing approach utilizing-omic data sets. Neurobiol Aging. 2020;95:15–25.
Lunnon K, Smith R, Hannon E, De Jager PL, Srivastava G, Volta M, Troakes C, Al-Sarraj S, Burrage J, Macdonald R. Methylomic profiling implicates cortical deregulation of ANK1 in Alzheimer’s disease. Nat Neurosci. 2014;17(9):1164–70.
Gasparoni G, Bultmann S, Lutsik P, Kraus TF, Sordon S, Vlcek J, Dietinger V, Steinmaurer M, Haider M, Mulholland CB. DNA methylation analysis on purified neurons and glia dissects age and Alzheimer’s disease-specific changes in the human cortex. Epigenetics Chromatin. 2018;11:1–19.
Smith RG, Hannon E, De Jager PL, Chibnik L, Lott SJ, Condliffe D, Smith AR, Haroutunian V, Troakes C, Al-Sarraj S. Elevated DNA methylation across a 48‐kb region spanning the HOXA gene cluster is associated with Alzheimer’s disease neuropathology. Alzheimer’s Dement. 2018;14(12):1580–8.
Lardenoije R, Roubroeks JA, Pishva E, Leber M, Wagner H, Iatrou A, Smith AR, Smith RG, Eijssen LM, Kleineidam L. Alzheimer’s disease-associated (hydroxy) methylomic changes in the brain and blood. Clin Epigenetics. 2019;11(1):1–15.
Shireby GL, Davies JP, Francis PT, Burrage J, Walker EM, Neilson GW, Dahir A, Thomas AJ, Love S, Smith RG. Recalibrating the epigenetic clock: implications for assessing biological age in the human cortex. Brain. 2020;143(12):3763–75.
Dukes O, Vansteelandt S. How to obtain valid tests and confidence intervals after propensity score variable selection? Stat Methods Med Res. 2020;29(3):677–94.
Dukes O, Avagyan V, Vansteelandt S. Doubly robust tests of exposure effects under high-dimensional confounding. Biometrics. 2020;76(4):1190–200.

Acknowledgements

Not applicable.

Funding

This study was supported by the National Natural Science Foundation of China (grant numbers: 82204163, 82373692 and 82073674) and the Fundamental Research Program of Shanxi Province (grant number: 202203021212382).

Author information

Authors and Affiliations

Department of Health Statistics, School of Public Health, MOE Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, No.56 Xinjian South Road, Taiyuan, 030001, China
Qian Gao, Jiale Wang, Ruiling Fang & Tong Wang
Department of Health Statistics, School of Public Health, Binzhou Medical University, Yantai, China
Hongwei Sun

Contributions

T.W. conceived the idea and contributed to the interpretation of the results. T.W. and Q.G. developed the model. Q.G. implemented the software; conducted analyses of simulation and real data with assistance from J.W.; interpreted the results with assistance from R.F. and H.S.; and drafted and revised the manuscript with input from all other authors. All authors approved the final manuscript.

Corresponding author

Correspondence to Tong Wang.

Ethics declarations

Ethics approval and consent to participate

All data used in this study came from public databases, and the original studies had been approved by the relevant review boards.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Gao, Q., Wang, J., Fang, R. et al. A doubly robust estimator for continuous treatments in high dimensions. BMC Med Res Methodol 25, 35 (2025). https://doi.org/10.1186/s12874-025-02488-3

Received: 02 February 2024
Accepted: 03 February 2025
Published: 13 February 2025
DOI: https://doi.org/10.1186/s12874-025-02488-3
