# The treatment paradox `````{admonition} Executive summary :class: info A **treatment paradox** is when a prognostic factor: * (A) Has a **strong relationship** with the outcome, and * (B) When its present, **triggers an effective treatment** It will mean that the prognostic factor will appear to have **poorer performance** than it actually has, and hence leading to **underestimation of outcomes** for people with that prognostic factor. For example, may find an unexpected protective relationship between two factors that actually have a harmful relationship. A treatment paradox is relevant to causal inference studies (as its about the true relationship between the treatment and the outcome, rather than using the presence of treatment to help best predict the outcome). ````` ## What is a treatment paradox? A **treatment paradox** (or "**intervention effect**", or "**treatment use** in a validation study") is 'when a **strong prognostic factor of an adverse outcome triggers an effective treatment**. In other words, if we wanted to demonstrate the presence of a treatment paradox, we would need to meet two criteria: 1. The prognostic factor has a **strong relationship with the outcome** 2. When the prognostic factor is present, it **triggers an effective treatment** [[Uffen et al. 2021]](https://doi.org/10.1136%2Fbmjopen-2020-046518) ````{mermaid} flowchart LR; %% Define the nodes and subgraphs exp("Exposure"):::white; treat("Treatment"):::white; out("Outcome"):::white; %% Produce the figure exp --> treat; treat --> out; exp --> out; classDef white fill:#FFFFFF, stroke:#FFFFFF; classDef black fill:#FFFFFF, stroke:#000000; ```` ## Why does it matter? **Lack of generalisability** - the aim of predictive models will often be to help guide decisions to treat on future patients (i.e. they have not yet received any treatments). **Models trained on treated patients will offer poor/biased performance, underestimating risk, which can be attributed to treatment use in the validation data.**[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) The impact of treatment use depends on various factors... * Strength of effect of treatment on outcome - e.g. * If weak effect, then impact will be small[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) * When a strong prognostic factors triggers an effective treatment, it reduces incidence of the outcome, and meaning that the prognostic factor that initiated the treatment will **appear to have a poorer prognostic performance than it actually has.** This can lead to an **underestimation of adverse outcomes for people with that prognostic factor**. [[Uffen et al. 2021]](https://doi.org/10.1136%2Fbmjopen-2020-046518) This leads to a lack of **generalisability**.[[Schuit et al. 2013]](https://doi.org/10.1503/cmaj.120812) * Proportion of individuals receiving treatment * Underlying pattern of treatment use * Whether treatment is allocated randomly or based on individual risk profile or based on strict treatment guidelines - e.g. * If high-risk individuals are selectively treated, this will have a large impact - '**distribution of observed risks will become narrower**, due to the risk-lowering effects of treatment in the high-risk individuals, making it **more difficult for the model to discriminate** between individuals who will or will not develop the outcome, and the calibration in high-risk individuals will be most greatly affected'[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) ### Is this relevant to obstetrics? Absolutely! In fact, obstetric examples are quite commonly cited in papers about the treatment paradox, and there are a few papers focused on this, such as the paper by [Cheong-See et al. 2016](http://dx.doi.org/10.1111/1471-0528.13860) - '**Prediction models in obstetrics: understanding the treatment paradox and potential solutions to the threat it poses**'. They describe clinicians use "probabilitic reasoning" to identify high risk pregnancies (based on clinical history and tests) - and how there is potential benefit for data-driven prediction/prognostic models providing individualised risk estimates are relevant in helping identify pregnancies at risk of adverse outcomes, to help guide clinical management and targeted interventions - as we are hoping to explore in this project. They describe common statistical challenges for these models - and one of which is the handling of interventions in the prediction model (i.e. the treatment paradox). [[Cheong-See et al. 2016]](http://dx.doi.org/10.1111/1471-0528.13860) ### Examples **Intrapartum fever leads to metabolic acidosis, but also triggers a caesarean section.** Because of this: * We find an **unexpected negative (protective)** relationship between intrapartum fever and metabolic acidosis * If we include caesarean delivery in the model, we would expect to find a **positive (harmful)** relationship between intrapartum fever and metabolic acidosis [[Schuit et al. 2013]](https://doi.org/10.1503/cmaj.120812) ````{mermaid} flowchart LR; %% Define the nodes and subgraphs fever("Intrapartum fever"):::white; cs("Ceasarean section"):::white; met("Metabolic acidosis"):::white; %% Produce the figure fever --> cs; cs --> met; fever --> met; classDef white fill:#FFFFFF, stroke:#FFFFFF; classDef black fill:#FFFFFF, stroke:#000000; ```` **Fetal compromise would lead to poor outcomes, but it also triggers quicker delivery (e.g. caesarean)** (as in the NICE 2011 Caesarean guidance). * Due to this treatment paradox, you will find that babies **born within 30 minutes will consistently contain a higher proportion of babies in poorer condition**, but that this likely due to this disparity (treatment paradox), rather than being due to differences in speed of delivery.'[[source]](https://www.nice.org.uk/guidance/ng192/evidence/full-guideline-pdf-9071942942) ````{mermaid} flowchart LR; %% Define the nodes and subgraphs comp("Compromised"):::white; quick("Quicker delivery"):::white; poor("Poor outcomes"):::white; %% Produce the figure comp --> quick; quick --> poor; comp --> poor; classDef white fill:#FFFFFF, stroke:#FFFFFF; classDef black fill:#FFFFFF, stroke:#000000; ```` **High blood pressure leads to poor outcome, but also triggers effective treatment (antihypertensive, delivery)**. A recent model for adverse outcomes in women with pre-eclampsia did not identify high blood pressure as a predictor. This may be due to the treatment paradox, as 'women with high blood pressure are likely to receive effective interventions (antihypertensive or delivery)' - and this effective treatment means they are unlikely to develop the outcome. * 'If blood pressure is truly a predictor of high(er) risk in untreated patients, effective intervention will make it look like a poor predictor during the development of a model in treated patients. This problem frequently jeopardises obstetric modelling, altering the true predictor–outcome association and the natural outcome rate (incidence).' * 'In pre-eclampsia, a prediction model without blood pressure risks overlooking patients with high blood pressure, and may actually underestimate the outcome risks (as an important predictor is missed).' [[Cheong-See et al. 2016]](http://dx.doi.org/10.1111/1471-0528.13860) ````{mermaid} flowchart LR; %% Define the nodes and subgraphs bp("High blood pressure"):::white; adverse("Adverse outcomes"):::white; treat("Treatment (antihypertensive
or delivery)"):::white; %% Produce the figure bp --> adverse; treat --> adverse; bp --> treat; classDef white fill:#FFFFFF, stroke:#FFFFFF; classDef black fill:#FFFFFF, stroke:#000000; ```` **Other examples** (not detailed): * Epidural, pyrexia (fever), and birth outcomes * Pre-term birth, tocolytics, and birth outcomes[[Cheong-See et al. 2016]](https://doi.org/10.1111/1471-0528.13859) (tocolytics are drugs used to slow/stop contractions, typically given to women in pre-term labour) ## Treatment paradox for caesarean and HIE In our case, we have a treatment paradox as indicators of poor outcome (e.g. abnormal fetal heart rate (FHR), gestational age) will trigger an effective treatment (caesarean). Hence, if we fit a prediction model where those indicators were used to predict risk of HIE, the relationship between the indicators and outcome will be biased by the selection to receive treatment which helps prevent the outcome, e.g. * Protective relationship between abnormal FHR and HIE, even though abnormal FHR is an indicator that there is likely something wrong * Protective relationship between gestational age and HIE, even though we know later gestational age makes sentinal events that lead to hypoxia more likely, but masked by the fact that it also makes selection for caesarean section more likely ````{mermaid} flowchart LR; %% Define the nodes and subgraphs ind("Indicators of HIE
(e.g. abnormal FHR,
gestational age)"):::white; csection("Caesarean section"):::white; poor("Hypoxic ischaemic encephalopathy"):::white; %% Produce the figure ind --> csection; csection --> poor; ind --> poor; classDef white fill:#FFFFFF, stroke:#FFFFFF; classDef black fill:#FFFFFF, stroke:#000000; ```` We could demonstrate the **existence** of this paradox as they do in [Uffen et al. 2021](https://doi.org/10.1136%2Fbmjopen-2020-046518) - in which the aim of their study was to 'evaluate the existence of a treatment paradox by determining the influence of baseline qSOFA on treatment decisions within the first 24 hours after admission' - with patients with particularly values of qSOFA have higher odds of receiving intensive therapy, fast antibiotic administration, or vasopressic therapy. ## How do we deal with the treatment paradox? In their paper, [Cheong-See et al. 2016](https://doi.org/10.1111/1471-0528.13859) convene 'a panel of experts in pre-eclampsia and prognostic research, to explore the potential solutions in the development of a valid prediction model for adverse maternal or fetal outcomes' - addressing this very problem! They panel had 24 members including 'obstetricians, statisticians, clinical epidemiologists, and researchers'. It had a particular focus on the treatment paradox, as they developed a prediction model for the [PREP study (Thnagaratinam et al. 2017)](https://doi.org/10.3310/hta21180). They identified solutions of: * Standardisation of treatment * Predictor substitution * Treatment as predictor * Treatment as outcome * Propensity scores Have also added, from other sources: * Exclusion of treated ### Standardisation of treatment This is a scenario where there is complete collinearity between the predictor and the treatment, where the presence of a particular predictor will **always** guarantee the presence of a particular treatment. In this case, the predictor is deterministic of treatment (i.e. everyone with this predictor *definitely* receives treatment - complete colinearity between predictor and treatment). This requires **fully standardised care (same medication, same dose, same treatment thresholds)**. This is not realistic. Example: Management of early-onset pre-eclampsia, such as the commencement of anti-hypertensives and magnesium sulphate, this is somewhat standardised by guidelines [e.g. from the National Institute for Health and Care Excellence (NICE) in the UK], but the threshold for commencing treatment varies between clinicians and centres. Furthermore, the response from a specific antihypertensive and dosage varies between individual patients. You can however use multi-level models to account for differences between clinicians and treatment centres.[[Cheong-See et al. 2016]](https://doi.org/10.1111/1471-0528.13859) [Steer 2016](https://doi.org/10.1111/1471-0528.13860) comments though that 'such models rarely take into account all of the relevant factors (e.g. the coexistence of a modulating pathology such as an autoimmune disorder) or the social and emotional circumstances and preferences of the mother and her family.'[[Steer 2016]](https://doi.org/10.1111/1471-0528.13860) ### Predictor substitution You could **remove all the predictors upon which the decision to treat is based on**. Limitations: * Can prevent you from including meaningful predictors in the model * Other predictors may be correlated with the predictors used to make treatment decisions. [[Cheong-See et al. 2016]](https://doi.org/10.1111/1471-0528.13859) ### Treatment as predictor You could include whether they were treated as a predictor in the model.[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8)[[Schuit et al. 2013]](https://doi.org/10.1503/cmaj.120812) In practice, you won't be able to input "they have been treated or not" for the as-yet untreated patients - but you could use the model to estimate outcomes in scenarios where they are or are not treated.[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) You could just add the indicator on top of the prognostic model, keeping the original coefficients fixed.[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) However, if there is an interaction between the effectiveness of treatment and having a predictor (e.g. more effective in those with predictor), then the model will need to account for/incorporate this interaction.[[Schuit et al. 2013]](https://doi.org/10.1503/cmaj.120812) Instead therefore, the model could be entirely refitted with the addition of an indicator term for treatment, with the inclusion of interaction terms where anticipated.[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) Limitations: * Information on treatment use may not be available at the intended moment of prediction. [[Cooray et al. 2020]](http://dx.doi.org/10.1136/bmjopen-2020-038845) * Failure to correctly specify any interactions between treatment and other predictors in the validation set could mean that the effects of treatment are not completely taken into account[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) * The addition of a term for treatment to the model that is to be validated may improve the performance beyond that of the original model due to the inclusion of additional predictive information[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) * Not possible if everyone in the study had the same intervention (but in that case, it is likely that unexpected findings are not due to a treatment paradox)[[Schuit et al. 2013]](https://doi.org/10.1503/cmaj.120812) * With this approach, 'differentiating treatment from predictor effects becomes difficult. We could adjust for the interaction between ‘decision to treat’ as a predictor and each of the other prognostic factors in the model; however, when many predictors are involved, or when ‘decision to treat’ is based on multiple predictors, this approach becomes complex. In such situations, extremely large sample sizes are needed for the reliable assessment of interactions.'[[Cheong-See et al. 2016]](https://doi.org/10.1111/1471-0528.13859) Due to the limitations, Pajouheshnia et al. 2017 do not recommend this approach.[[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) ### Treatment as outcome If the treatment is likely to prevent an adverse outcome, then you can use treatment itself as the outcome, as it indicates they would've otherwise experienced an adverse outcome. Example: 'In women with early-onset pre-eclampsia, if a large proportion of women are delivered at an early preterm gestation (before 34 weeks), then delivery itself could be considered as an outcome (replacing complications that would have occurred in the absence of delivery). In the absence of a standardised protocol for decision to deliver at early preterm gestation, such an approach could help to overcome the limitations in the model as a result of delivery preventing the occurrence of an adverse outcome.'[[Cheong-See et al. 2016]](https://doi.org/10.1111/1471-0528.13859) ### Propensity scores Use propensity score (treatment probability) to account for 'multiway interactions with other predictors'. Either include in model, or use to weight contribution of participant towards the model (i.e. receiving no treatment --> less treatment effect --> more weight) Limitations: * Limited clinical applicability as requires knowledge of a propensity score for clinicians to make a decision on whether to treat [[Cheong-See et al. 2016]](https://doi.org/10.1111/1471-0528.13859) ### Exclusion of treated individuals You can simply exclude the treated individuals. Limitations: * Will 'decrease the effective **sample size**' (which could cause you to not see an effect if you don't have the power). This, for example, leads to 'the precision of estimates of both the observed:expected ratio and c-index (area under the ROC curve) decreased due to the reduction in effective sample size'. * Results in loss of information about **high risk individuals**, if treatment allocation was dependent on risk (and so very few were untreated), with the discriminative ability of the model worsening with the exclusion of high-risk individuals and consequently narrower case mix. * 'In the presence of a strong **unmeasured predictor of the outcome associated with treatment use**, exclusion of treated individuals resulted in an underestimation of the performance of the model.' [[Pajouheshnia et al. 2017]](https://doi.org/10.1186%2Fs12874-017-0375-8) ## How does this relate to causal inference? Ultimately, our interest is in understanding the true relationships between variables, which is also the focus of causal inference studies. However, it is important to be aware that in causal inference, you are focussed on understanding the causal effect of an exposure or treatment on an outcome, whilst controlling for confounders for that relationship. If you create a model that includes those confounders, the effect estimates between the confounders and the outcome are still at risk of being impacted by residual confounding or bias, as we did not go through the process for their relationship, and only for the primary relationship of interest. Therefore, some of those methods fall under the banner of "causal inference", and could be designed in such a way that you would be estimating the causal effect between an exposure and an outcome, but it does not mean that you are finding causal effects between all variables and the outcome (which is often the focus below - having a multivariable model that accounts for treatment use - that is still a prediction model). If you were however to just focus on the relationship between the treatment and the outcome, then similarities would be: * Standardisation of treatment --> Instrumental variable analysis - where the hospital/clinician (a) cause variation in treatment, (b) are unrelated to outcome - and so can be used as an instrumental variable * Treatment as predictor --> Multivariable regression - with treatment included as predictor * Propensity scores --> Inverse probability of treatment weighting (IPTW) You could also focus on just the relationship between a particular risk factor and the outcome.