This article has Open Peer Review reports available.
Impact of sex in stroke thrombolysis: a coarsened exact matching study
© Hametner et al.; licensee BioMed Central. 2015
Received: 8 September 2014
Accepted: 13 January 2015
Published: 10 February 2015
It is not established whether sex influences outcome and safety following intravenous thrombolysis (IVT) in acute stroke. As a significant imbalance exists between the baseline conditions of women and men, regression analysis alone may be subject to bias. Here we aimed to overcome this methodical shortcoming by balancing both groups using coarsened exact matching (CEM) before evaluating outcome.
From our local prospective stroke database we analyzed consecutive patients who suffered anterior circulation stroke and received IVT from 1998 to 04/2013 (n = 1391, 668 female, 723 male). Data were preprocessed by CEM, balancing for age, NIHSS, lesion side, hypertension, diabetes, atrial fibrillation, smoking, coronary heart disease, and previous stroke, which yielded a matched cohort of 502 women and 436 men (n = 938). Outcome was estimated by adjusted binomial logistic regression analysis incorporating matched weights.
No effect of sex was seen to predict good outcome (OR 1.04, CI 0.76–1.43) or mortality (OR 1.13, CI 0.73–1.73). However, female sex was a strong independent predictor of symptomatic intracerebral hemorrhage (sICH – ECASS-II definition, OR 3.62, CI 1.77-7.41) and fatal ICH (OR 4.53, CI 1.61-12.7).
In balanced groups, the two sexes showed comparable outcomes following IVT. A novel finding was the higher rate of sICH and fatal ICH in women. In this analysis we also demonstrate how CEM can reduce multivariate imbalance and thereby improve estimates, already in crude, but more importantly, in adjusted regression analysis. Further investigations of multicentre data with improved analytical approaches that yield balanced sex-groups are therefore warranted.
It is still not established whether sex has an impact on outcome in acute stroke patients who received intravenous thrombolysis (IVT). Former studies reported mainly equipoise in the 3-months outcome following IVT in women compared to men [1-3], but also a disadvantage for women was found . Two studies found a greater incidence of bleeding complications in men [1,2]. However, all these studies have a critical bias in common. As sex is a nature-determined factor, (primary) randomization is obviously not possible. In addition, if covariates are very different between the sexes, the results of regression analysis alone can be misleading [5-7]. To overcome these issues in comparing the sexes, we improved the balance within the groups in a first step by coarsened exact matching (CEM) , thereby neglecting outcome and safety variables. To account for the remaining bias in covariates and to estimate outcome, we then performed adjusted regression analysis. This two-step approach is less prone to model misspecification and even more robust than are results based on the full unmatched data set [7,9,10].
Improve multivariate balance between the sexes using coarsened exact matching (CEM) to investigate whether IVT treated women differ from IVT treated men with respect to outcome and safety.
From our local prospective stroke database we analyzed clinical and imaging data of all consecutive patients who received IVT from 1998 to 04/2013 (n = 1501). Our prospective local stroke database was managed and this study implemented according to the STrengthening the Reporting of OBservational studies in Epidemiology (STROBE) statement for reporting case–control studies . Data were collected as part of national and international quality-control programs. The retrospective analysis of the data lacks any treatment influence and therefore written informed consent and a formal ethical approval from the local ethics committee of the University of Heidelberg was waived. We excluded from further analysis 93 patients with posterior circulation stroke, 13 patients due to missing clinical follow-up, and 4 patients who died before follow-up imaging. Therefore, 1391 patients comprised the unmatched cohort. Three-month outcome was assessed either during an outpatient visit or a telephone interview using the mRS. Good outcomes were adjusted with respect to NIHSS score at presentation as previously described : Presenting NIHSS scores of 1 to 7 a mRS score 0 at follow-up, presenting NIHSS scores of 8 to 14 mRS scores of 0 or 1, and presenting NIHSS scores above 14 mRS scores of 0–2 were counted as good outcome. Time to treatment was defined as time from symptom onset to start of IVT. Symptomatic intracerebral hemorrhage (sICH) was defined according to the definition of the ECASS-II trial . Fatal ICH was defined as death caused most probably due to sICH following IVT. It has been shown recently that it is preferable to treat missing data by multiple imputation rather than listwise deletion in further processing (matching, multiple regression analysis) [14,15]. Therefore, Amelia II  for multiple imputation (m = 10) was used to further process all (n = 1391) instead of (only) 1126 patients. Covariates were imputed as follows: statin use (4.5%), antithrombotics (3.7%), oral anticoagulation (2.2%), thrombocytes (7.9%), systolic blood pressure (15%), and diastolic blood pressure (15.6%). Importantly, no nonlisted variables and no outcome variables were imputed. As recommended, each imputed data set was analyzed separately and combined at the end . Groups of baseline characteristics were compared with the Student’s T-Test, the Mann–Whitney U-Test, or the Fisher’s Exact Test, as appropriate, and accounted for matched weights on matched group comparisons. In all statistical analyses, a p-value of 0.05 was considered significant. The following variables were then preprocessed using CEM: age, NIHSS, lesion side, hypertension, diabetes, atrial fibrillation, smoking, coronary heart disease, and previous stroke. The aim of matching is not to estimate, but rather to find better balance in the multidimensional distribution of covariates of the groups. This in turn reduces the degree of dependence on the estimation model of the outcome variable and therefore diminishes bias . In detail, the CEM algorithm consists of three steps. First, desired variables of all patients are coarsened temporarily. Second, all patients of the initial cohort are sorted into strata on the basis of their coarsened variables. Third, only patients with strata containing at least one woman and one man are kept; others are discarded. Additionally, a weighting variable is generated to equalize the number of women and men in one stratum. CEM is a matching method of the class monotonic imbalance bounding . This means that reducing imbalance in the empirical distribution in one covariate has no effect on any other covariates chosen for balancing, which represents a clear advantage of CEM over other matching methods . Of course, only observed variables are accounted for in matching, and thus bias of omitted covariates cannot be eliminated. For balance checking Iacus and colleagues introduced the multivariate imbalance measure L1 . Ranging from 0 to 1 - L1 is a relative magnitude depending on the data set and the selected covariates. The more the two distributions overlap, the more L1 decreases and trends to zero. The advantage of this two-step approach, first performing a matching solution and then an outcome estimation, is that it is more robust than, for example, regression analysis alone and also insensitive to selecting outcome model specifications arbitrarily, which is a common potential bias source [7,9,10]. In a final step, outcome was estimated by binomial logistic regression incorporating matched weights. Statistical analysis was performed using R [18-20] and SPSS (SPSS Inc., 21.0 for Windows).
Baseline characteristics for sex in unmatched and matched cohort (Matched variables are marked bold)
12 (7; 17)
10 (6; 15)
12 (7; 17)
12 (6; 17)
Hemisphere, left ‡
Atrial fibrillation ‡
Current smoker ‡
Previous Stroke ‡
140 (105; 180)
140 (105; 180)
140 (105; 180)
140 (110; 180)
Systolic BP [mmHg]†
Diastolic BP [mmHg]†
121 (105; 148)
120 (105; 147)
121 (106; 144)
263 (215; 313)
233 (193; 283)
266 (216; 313)
234 (192; 277)
Adjusted* binomial logistic regression full model in the matched cohort for (1) good outcome, (2) mortality, (3) sICH, and (4) fatal ICH
OR (95% CI)
OR (95% CI)
OR (95% CI)
OR (95% CI)
Diastolic BP [mmHg]
Univariate analysis of mortality (137 women (20.5%), 84 men (11.6%), p < 0.001) in the unmatched group showed significant differences in sex, but the effect did not persist in multivariate analysis. Independently associated with mortality (93 women (18.5%), 61 men (13.9%), p = 0.064) in the matched cohort were age (OR 1.09, CI 1.06–1.12, p < 0.001), NIHSS (OR 1.10, CI 1.07–1.13, p < 0.001), TTT (OR 0.99, CI 0.99–0.99, p = 0.003), sICH (OR 20.6, CI 9.52–44.7, p < 0.001), glucose (OR 1.00, CI 1.00–1.01 p = 0.009), hypertension (OR 0.44, CI 0.21–0.90, p = 0.026), and hyperlipidemia (OR 1.99, CI 1.10–3.61, p = 0.022). Similarly to the unmatched cohort, a logistic regression model to predict mortality in the matched cohort showed no effect of sex (Table 2).
Sex was significantly associated with sICH (45 women (6.7%), 24 men (3.3%), p = 0.004) in univariate and multivariate analysis in the unmatched cohort. Multivariate analysis to predict sICH (37 women (7.4%), 11 men (2.6%), p = 0.001) in the matched cohort found female sex (OR 3.62, CI 1.77–7.41 p < 0.001) and antithrombotic treatment (OR 2.13, CI 1.04–4.37, p = 0.04) as the only independent predictors (Table 2).
Sex was significantly associated with fatal ICH (23 women (3.4%), 8 men (1.1%), p = 0.003) in univariate but not in the multivariate analysis in the unmatched cohort. Multivariate analysis to predict fatal ICH (20 women (4.0%), 5 men (1.2%) p = 0.008) in the matched cohort found female sex (OR 4.53, CI 1.61–12.7, p = 0.004) and antithrombotic treatment (OR 3.76, CI 1.26–11.2, p = 0.018) as the only independent predictors (Table 2).
To our knowledge this is the first time a balanced cohort of women and men has been used to analyse the influence of sex on outcome and safety after IVT in acute ischemic stroke. In these balanced groups 3-months outcome and mortality following IVT was comparable between the two sexes. In addition, we report the novel finding of increased bleeding complications in IVT-treated women.
We substantially tried to remove bias from our analysis of functional outcome. Saver and colleagues recently reported that it is meaningful to perform a baseline severity-adjusted endpoint analysis . This adjustment may in particular be meaningful in a sex-based analysis, since NIHSS distributions may differ between the sexes, even if they appear similar in mean. The presented results appear to be in line with previous studies, which found no differences in functional outcome between the sexes evaluating mRS ≤ 1 [1,3] and mRS ≤ 2 . However, they are not directly comparable, because of the adjustment chosen in our analysis. One single centre study also used a baseline severity adjustment evaluating mRS ≤ 2, but reported univariate results on sex only, because temperature was the main study focus .
Regarding mortality our results are in line with the post-hoc analysis of the Canadian Alteplase for Stroke Effectiveness Study , but contradict the previously largest study on this topic . Lorenzano et al. found a higher mortality in women in univariate, but just the opposite, higher mortality in men, after multivariate adjustment. In our matched cohort, already univariate analysis yielded non-significant differences between the sexes (Figure 2), which were confirmed on additional adjustment. A limiting factor for a comparison of study-cohorts here might be the difference in analysis, namely different grades of multivariate balance and the differences in adjusting confounders. One example when estimating mortality is the consideration of sICH as a confounder. On the one hand sICH was not usually included in regression analysis for mortality of previous studies, although it is an established predictor for mortality [22,23]. Inclusion of sICH in the mortality regression may be misleading because mortality is also a part of the definition of sICH according to ECASS-II (any hemorrhage leading to death). However, on the other hand in our cohort female sex was an independent predictor of sICH. Therefore we preferred to include sICH in mortality regression analysis and thus omit an important sex-related mortality bias.
Again, crude mortality estimation of the matched cohort already gains more robust results that are not changed significantly after adjustment in contrast to estimation of the unmatched cohort. This demonstrates how researchers, if CEM is applied before regression analysis, may improve their estimates and how different study models may be better comparable, even if models slightly differ, because included confounders are chosen differently (see also ).
With respect to bleeding complication following IVT, female sex turned out to be the most important predictor for sICH and fatal ICH in our matched cohort. Previous studies observed a higher rate of sICH in men, reasoning that a higher incidence of antithrombotics and the higher absolute doses of recombinant tissue plasminogen activator (rtPA) (due to body weight) in men could account for this finding . The intake of antithrombotics was lower in women in our cohort, thus favouring lower rates of sICH. Unfortunately, body weight was not consecutively registered, and therefore we can only conjecture very likely that the absolute dose of rtPA was higher in men. For the moment we are also interpreting the observed higher rate of sICH in women as being a single-centre phenomenon. However, external validation by a centre-based analysis of multicentre data including balanced sex cohorts should provide more in-depth insight regarding sex dependency on sICH.
The major strength of this study is its unique analytical approach, aiming to minimize the bias due to different covariates between the sexes. Pre-matching by CEM improves balance essentially and achieves more robust inferences than an unmatched, full data set does. An example of possible avoided bias is illustrated in Figure 2. In unmatched data, both crude and adjusted analyses either underestimate the effect (e.g., for sICH) or may give misleading results (e.g., for fatal ICH). However, facilitating a pre-matched regression analysis sex proved to be a strong independent predictor of fatal ICH in our cohort. Because CEM is a relatively novel rather than a standard approach we provide the reader with unmatched and matched outcome analysis to aim transparency and to enable a direct comparison of the results.
In our matching, we also included an often overlooked covariate: lesion side. By pathophysiological means, left and right anterior circulation strokes are reflected differently in the most commonly used score (NIHSS), with left-hemispheric strokes yielding higher scores but better outcomes . Not considering these matters can produce a critical bias in determining outcome. This is the first study comparing sexes in stroke thrombolysis to address both the bias of side of lesion and baseline severity-adjusted analysis when determining outcome.
Our study has several limitations: This is a retrospective analysis of prospectively collected data from a single centre – external validation is needed. The matching process is accompanied by an attempt to find a reasonable compromise between the optimal match and the maximum size of the cohort. With respect to sICH we had no information regarding early infarct signs and we cannot adjust for body weight and consecutive rtPA dose. Our results are limited to patients eligible for treatment with IVT. Thus factors influencing outcome after stroke like older age and higher prestroke disability as well as sociodemographic parameters were not investigated in detail. We did not refer to parameters which are known to influence outcome after stroke like pre-stroke mRS, stroke subtype, vessel occlusion, and vessel recanalization. Outcome studies in stroke may be biased due to “do not resuscitate”- orders. This objection may therefore also be true for our cohort. In addition, there was no control group without IVT treatment. Therefore, we cannot conclude an absolute effect of IVT within sexes but only between the two sexes in comparison.
In balanced groups, the two sexes show comparable outcomes following IVT. Taken together with the novel finding of higher rates of sICH and fatal ICH in women, further investigation of multicentre data in balanced groups is warranted. For observational data CEM seems to be a useful pre-processing tool to reduce bias in estimating outcome.
- Kent DM, Buchan AM, Hill MD. The gender effect in stroke thrombolysis: of CASES, controls, and treatment-effect modification. Neurology. 2008;71:1080–3.View ArticlePubMedGoogle Scholar
- Lorenzano S, Ahmed N, Falcou A, Mikulik R, Tatlisumak T, Roffe C, et al. Does sex influence the response to intravenous thrombolysis in ischemic stroke?: answers from safe implementation of treatments in stroke-international stroke thrombolysis register. Stroke. 2013;44:3401–6.View ArticlePubMedGoogle Scholar
- Kent DM, Price LL, Ringleb P, Hill MD, Selker HP. Sex-based differences in response to recombinant tissue plasminogen activator in acute ischemic stroke: a pooled analysis of randomized clinical trials. Stroke. 2005;36:62–5.View ArticlePubMedGoogle Scholar
- Elkind MS, Prabhakaran S, Pittman J, Koroshetz W, Jacoby M, Johnston KC, et al. Sex as a predictor of outcomes in patients treated with thrombolysis for acute stroke. Neurology. 2007;68:842–8.View ArticlePubMedGoogle Scholar
- Cochran WG. The effectiveness of adjustment by subclassification in removing bias in observational studies. Biometrics. 1968;24:295–313.View ArticlePubMedGoogle Scholar
- Cochran WG. Analysis of covariance: its nature and uses. Biometrics. 1957;13:261–81.View ArticleGoogle Scholar
- Rubin DB. The use of matched sampling and regression adjustment to remove bias in observational studies. Biometrics. 1973;29:185–203.View ArticleGoogle Scholar
- Iacus SM, King G, Porro G. Causal Inference Without Balance Checking: Coarsened Exact Matching. Political Analysis first published online August 23, 2011 doi:10.1093/pan/mpr013Google Scholar
- Rubin DB, Thomas N. Combining propensity score matching with additional adjustments for prognostic covariates. J Am Stat Assoc. 2000;95:573–85.View ArticleGoogle Scholar
- Ho D, Imai K, King G, Stuart E. Matching as nonparametric preprocessing for reducing model dependence in parametric causal inference. Polit Anal. 2007;15:199–236.View ArticleGoogle Scholar
- von Elm E, Altman DG, Egger M, Pocock SJ, Gotzsche PC, Vandenbroucke JP, et al. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. J Clin Epidemiol. 2008;61:344–9.View ArticleGoogle Scholar
- Saver JL, Yafeh B. Confirmation of tPA treatment effect by baseline severity-adjusted end point reanalysis of the NINDS-tPA stroke trials. Stroke. 2007;38:414–6.View ArticlePubMedGoogle Scholar
- Hacke W, Kaste M, Fieschi C, von Kummer R, Davalos A, Meier D, et al. Randomised double-blind placebo-controlled trial of thrombolytic therapy with intravenous alteplase in acute ischaemic stroke (ECASS II). Second European-Australasian acute stroke study investigators. Lancet. 1998;352:1245–51.View ArticlePubMedGoogle Scholar
- Rubin DB. Multiple Imputation for Nonresponse in Surveys. New York: J. Wiley & Sons; 1987.View ArticleGoogle Scholar
- King G, Honaker J, Joseph A, Scheve K. Analyzing incomplete political science data: an alternative algorithm for multiple imputation. Am Polit Sci Rev. 2001;95:49–69.Google Scholar
- Honaker J, King G, Blackwell M. Amelia II: a program for missing data. J Stat Software. 2011;45:1–47.View ArticleGoogle Scholar
- King G, Nielsen R, Coberley C, Pope J, and Wells A. Comparative Effectiveness of Matching Methods for Causal Inference. 2011. Copy at http://j.mp/jCpWmk
- R Development Core Team. R: A language and environment for statistical computing. In: Book R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2013.Google Scholar
- Imai K, King G, Lau O. "logit: Logistic Regression for Dichotomous Dependent Variables" in Kosuke Imai, Gary King, and Olivia Lau in "Zelig: Everyone's Statistical Software,”. 2013. http://gking.harvard.edu/zelig
- Ho D, Imai K, King G, Stuart EA. MatchIt: Nonparametric Preprocessing for Parametric Causal Inference. J Stat Software. 2011;42:1–28.View ArticleGoogle Scholar
- Ernon L, Schrooten M, Thijs V. Body temperature and outcome after stroke thrombolysis. Acta Neurol Scand. 2006;114:23–8.View ArticlePubMedGoogle Scholar
- Fiorelli M, Bastianello S, von Kummer R, del Zoppo GJ, Larrue V, Lesaffre E, et al. Hemorrhagic transformation within 36 hours of a cerebral infarct: relationships with early clinical deterioration and 3-month outcome in the European Cooperative Acute Stroke Study I (ECASS I) cohort. Stroke. 1999;30:2280–4.View ArticlePubMedGoogle Scholar
- Larrue V, von Kummer RR, Muller A, Bluhmki E. Risk factors for severe hemorrhagic transformation in ischemic stroke patients treated with recombinant tissue plasminogen activator: a secondary analysis of the European-Australasian Acute Stroke Study (ECASS II). Stroke. 2001;32:438–41.View ArticlePubMedGoogle Scholar
- Di Legge S, Saposnik G, Nilanont Y, Hachinski V. Neglecting the difference: does right or left matter in stroke outcome after thrombolysis? Stroke. 2006;37:2066–9.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.