 Research
 Open Access
 Published:
Effects of opium use on oneyear major adverse cardiovascular events (MACE) in the patients with STsegment elevation MI undergoing primary PCI: a propensity score matched  machine learning based study
BMC Complementary Medicine and Therapies volume 23, Article number: 16 (2023)
Abstract
Background
Considerable number of people still use opium worldwide and many believe in opium’s health benefits. However, several studies proved the detrimental effects of opium on the body, especially the cardiovascular system. Herein, we aimed to provide the first evidence regarding the effects of opium use on oneyear major adverse cardiovascular events (MACE) in the patients with STelevation MI (STEMI) who underwent primary PCI.
Methods
We performed a propensity score matching of 2:1 (controls: opium users) that yielded 518 opium users and 1036 controls. Then, we performed conventional statistical and machine learning analyses on these matched cohorts. Regarding the conventional analysis, we performed multivariate analysis for hazard ratio (HR) of different variables and MACE and plotted Kaplan Meier curves. In the machine learning section, we used two treebased ensemble algorithms, Survival Random Forest and XGboost for survival analysis. Variable importance (VIMP), tree minimal depth, and variable hunting were used to identify the importance of opium among other variables.
Results
Opium users experienced more oneyear MACE than their counterparts, although it did not reach statistical significance (Opium: 72/518 (13.9%), Control: 112/1036 (10.8%), HR: 1.27 (95% CI: 0.94–1.71), adjusted pvalue = 0.136). Survival random forest algorithm ranked opium use as 13th, 13th, and 12th among 26 variables, in variable importance, minimal depth, and variable hunting, respectively. XGboost revealed opium use as the 12th important variable. Partial dependence plot demonstrated that opium users had more oneyear MACE compared to nonopiumusers.
Conclusions
Opium had no protective effects on oneyear MACE after primary PCI on patients with STEMI. Machine learning and oneyear MACE analysis revealed some evidence of its possible detrimental effects, although the evidence was not strong and significant. As we observed no strong evidence on protective or detrimental effects of opium, future STEMI guidelines may provide similar strategies for opium and nonopium users, pending the results of forthcoming studies. Governments should increase the public awareness regarding the evidence for nonbeneficial or detrimental effects of opium on various diseases, including the outcomes of primary PCI, to dissuade many users from relying on false beliefs about opium’s benefits to continue its consumption.
Background
Opium was among the earliest plants used for its medicinal and recreational properties [1]. Derived from dried Papaver somniferum L. milky exudate, opium it is still ranked as the second common abused substance in the Middle East, just after tobacco [2]. This consumption trend is partly related to the proximity of this region to main production centers of opium, causing easier accessibility to this drug. Although the more conventional opium use have lost its popularity in many world regions, the family of opioids account for the highest share of disease burden related to illicit drug use worldwide [3].
Many people who use opium believe in its protective effects against diseases, including cardiovascular morbidity, and such beliefs may account for the tendency towards opium or reluctance to give up its use [4, 5]. Nevertheless, the studies oppose such claim. For instance, a metaanalysis of 41 studies found a 2.75 (95% confidence interval (CI): 2.04–3.75) increased risk of coronary artery disease (CAD) in patients who use opium [6]. Several other studies also announced opium as a risk factor for increased cardiovascular and allcause mortality [4,5,6,7,8,9,10,11,12]. Opium can exert its detrimental effects via numerous mechanisms, such as increasing inflammation, coagulation, and oxidative stress, decreasing physical activity, and adverse hormonal and metabolic changes, etc., that are further expanded in the discussion [1].
Earlier researchers studied the effects of opium on patients undergoing coronary artery bypass graft (CABG) surgery and observed the adverse outcomes of patients who use opium in this setting [13, 14]. In one study the patients who used opium had higher 5year major adverse cardiovascular events (MACE) and mortality [14], while in the other they had higher readmission rates [13]. However, no evidence exists regarding the effects of opium on the outcomes of patients undergoing primary percutaneous coronary intervention (PCI) after STsegment elevation myocardial infarction (STEMI). Only one study exists in the elective PCI settings, but found no associations between opium use and oneyear MACE. Therefore, we aimed to study oneyear outcomes of these patients using conventional statistical analysis and machine learning strategies.
Methods
Study population
We conducted a retrospective cohort study to assess the effect of opium use and cardiovascular outcome in the first year after primary PCI for STEMI patients. A total of 3466 patients who underwent primary PCI were initially included in this study, including 586 opium users and 2922 nonopium users as controls.
Patients’ data was extracted from Tehran Heart Center Primary PCI database. The percentage of missing values were evaluated after dataset’s variables modifications. Variables with missing values of more than 10% were excluded. For conventional analysis, remaining missing values were imputed by replacing the value with mode and median for the categorical and numerical values, respectively. We chose median because the base analysis demonstrated the distribution of all the numeric variables were not normal. Then, we performed 2:1 propensity score matching (PSM) yielding 518 opium users and 1036 controls and all the analyzes in this study, including both statistical and machine learning methods, were performed on these matched groups. All the analyzes were carried out using R statistical packages v4.0.4 (http://www.rproject.org/).
Baseline characteristics and propensity score matching (PSM)
To compare baseline characteristics between the opium users and control groups, student ttest and MannWhitney Utest were used for numeric variables with normal and nonnormal distributions, respectively, and Chisquare test was used for categorical variables. Numeric variables with normal distribution were reported with mean and 95% confidence interval (CI) and numeric variable with nonnormal distribution were reported with median and interquartile range (IQR). Categorical variables were reported with count and percentage. Twosided alpha value of 0.05 was considered as significant level. Supplementary Table 1 shows the betweengroup differences of baseline characteristics before and after matching. Only four variables of body mass index (BMI) (opium: Median: 27 vs. control: 27.11), triglyceride (116 vs. 125), creatinine (1.0 vs. 0.9), and hemoglobin (15 vs 15.7) remained statistically significant between the groups, but their differences were clinically insignificant.
PSM was conducted to minimize differences in baseline propensity of observations to be assigned to the independent variable of interest, opium use. PSM was conducted by the logistic regression method with 2:1 matching (2 control: 1 opium). Greedy nearestneighbor method without replacement was performed to choose nearest distance of each observation propensity score in opium user and control groups.
Variables included in the matching process were selected based on the baseline characteristics comparison results, those with statistically and clinically significant difference were included in a logistic regression model. There was a significant difference between opium users and control group in baseline prevalence of hypertension, diabetes mellitus (DM), dyslipidemia, smoking history, gender, and baseline mean of fasting blood sugar, age, and lowdensity lipoprotein (LDL) levels (Table 1).
After applying PSM, absolute standardized mean difference (SMD) plot of the variables included in PSM demonstrated perfect matching of the selected variables as all the SMDs reduced to less than 10% (Supplementary Fig. 1).
Conventional statistical analysis
Univariate cox regression analysis was performed for each of the independent variables, as follows:
where s(x) is relative risk function, λ_{0}(t) is baseline hazard, λ(tx) is hazard function λ at time of t for an observation with covariate vector x is calculated.
Variables with significant pvalue of less than 0.1 in each model and their model Wald test pvalue of less than 0.1 were selected for multivariate cox regression analysis. Opium was included in the multivariate analysis regardless of its significant level in the univariate analysis. Assessment of proportionality of hazard function was assessed by Shoenfeld’s residuals. None of the predictors violated proportionality of hazard functions.
Hazard ratio (HR) for opium use was calculated in the multivariate analysis and then, KaplanMeier (KM) curves were plotted for oneyear mace MACE and its components (allcause mortality, myocardial infarction (MI), target vessel revascularization (TVR), target lesion revascularization (TLR), and CABG).
Machine learning analysis
We conducted machine learning analysis as a sensitivity analysis to assess robustness of the results. Two infamous machine learning algorithms, Survival Random Forest and Extended Gradient Boosting for survival study (XGboost), with builtin variable importance and feature selection capability were selected. We used mlr3proba 0.4.0 version and its dependent packages (mlr3extralearner, mlr3pipelines, mlr3filter, etc.) for implementing machine learning algorithms in the R software. The advantage of these packages lies is their capability of implementing machine learning algorithms for survival studies which have different structure from classification studies because of counting time in addition to events.
Data splitting
The main dataset with missing values was randomly split into train and test parts with 80 and 20% of the total data, respectively. All hyperparameter optimization, training and benchmarking processes were conducted on the train set. Final assessment of the model’s accuracy was conducted on the test set.
Data preprocessing
Categorical predictors were transformed to numerical using numerical encoding. To prevent data leakage, missing values were replaced in training and test datasets separately by median and mode for numerical and categorical variables, respectively. Median (IQR) was chosen due to nonnormal distribution of both groups. To avoid significant multicollinearity between numeric variables, correlation matrix of independent numerical variables was assessed and a correlogram was plotted. The degree of collinearity between two variables by Pearson correlation coefficient was considered weak if 0 ≤ r < 0.3, moderate if 0.3 ≤ r < 0.7 and strong if r ≥ 0.7 [15]. Total cholesterol and LDL had significant colinearity, so total cholesterol was dropped from features (Supplementary Fig. 2).
Base learner
Decision Tree is one of the wellknown algorithms of machine learning, constructed from nodes and leaves. It divides subjects by input features according to their outcomes until best separation of observations with homogeneous survival outcomes achieves. Decision tree is the default baselearner (weaklearner) of survival random forest algorithm. Splitting rule for studies which all observation may not have complete followup, as in survival studies, would be different from classification problems.
Two main methods of splitting nodes to daughter’s nodes in survival decision trees are node purity and node distance methods. Default splitting rule of decision trees of survival random forest of mlr3proba package is “logrank” hypothesis tests which is a “node distance based” splittingrule. Briefly describing, the nullhypothesis of logrank test assumes that survival distribution and hazard functions of two separate groups of observations are identical. Here the algorithm performs logrank test in each split, comparing hazard function of two leaves.
Considering h^{A} as leaf “A” hazard function and h^{B} as leaf B hazard function, then logrank null and alternative hypothesis would be:
respectively, and assuming:
\({d}_{\tau}^A\)  No. of observed deaths in leaf A at time τ 
\({e}_{\tau}^A\)  No. of expected deaths in leaf A at time τ 
\({\upsilon}_{\tau}^A\)  Variance of the No. of deaths in leaf A at time τ 
υ_{D}  list of unique event times in both leaves 
Then, logrank statistics would be [16]:
The result of logrank test indicates degree of dissimilarity between two leaves in each split. The higher its score, the more different is hazard functions of leaves, hence more discriminative is the feature in the splitting process.
Default splitting rule of decision trees of XGboost algorithm of mlr3proba package is full likelihood deviance measures of cox model, which we used in in our study. It is based mainly on estimating cumulative hazard function of each node by Cox model, and trying to maximize full proportional hazard likelihood. As it is discussed by LeBlanc and Crowely [16], it tries to maximized reduction in onestep deviance.
As a brief description, considering following definition:
\(\overset{\sim }{T}\)as a set of terminal nodes,
S_{h} as a set of observation labels in terminal node h,
λ_{0} as hazard function,
Λ_{0}(t) as baseline cumulative hazard function,
t_{i} observation time of individual i,
δ_{i }failure indicator for individual i, which would be zero or one,
Then full likelihood score of node h given tree T would be:
Then deviance of node h would be the difference between fitted model and saturated model maximum loglikelihood values:
where \(loglikelihood\left(\overset{\sim }{\theta_h}\right)\) is the maximized loglikelihood and the baseline cumulative hazard function Λ_{0}(t) is known.
The deviance residual of node h in terms of proportional hazard function would be:
Therefore, the improvement of deviance of node h into left daughter nodes l_{node}(h) and right daughter nodes r_{node}(h) is
The algorithms perform binary splitting with all possible split of covariates to achieve maximum reduction in deviance measures in each split.
The default evaluation metrics of consecutive trees for survival XGboost algorithm in our study, was coxnloglikelihood.
Supplementary Fig. 3 illustrates two of the decision trees plotted in our study as an example.
Ensemble methods
The main advantage of decision tree is its low bias rates compared to other baselearners, but it has high variance. To reduce its variance, ensemble methods have been developed to aggregate the results of many trees and improve the prediction. We used survival random forest and XGboost (extended gradient boosting) for survival analysis as ensemble methods. Random forest utilizes a bagging (bootstrap aggregating) method and XGboost follows a gradient boosting algorithm.
Hyperparameter optimization
Important machine learning algorithms’ hyperparameters must be tuned before implementing the final model on the new test datasets. We utilized “randomsearch” tuning strategy with terminating rule defined as 50 iterations to optimize hyperparameters of each the algorithms.
Important hyperparameters for random forest algorithms were assessed for optimization and those contributing to improved model’s accuracy, were provided to the algorithm, including number of variables to choose randomly in each splits (“mtry”) and minimum number of objects in each terminal node(“nodesize”) (Table 2). In a spot check assessment of data, the outofbag error rate of survival random forest stabilized after about 250–300 trees (Supplementary Fig. 4), so we defined 1000 trees as number of trees to be generated for random forest.
For XGboost algorithm, eleven hyperparameters were assessed for optimization (including nrounds, max_depth, min_child_weight, etc.) and those contributing to improved model’s accuracy were provided to the algorithm.
To reduce the probability of data leakage and overfitting during optimization, “nested crossvalidation” with 10 inner folds and 3 outer folds was conducted to assess the improved accuracy of XGboost model by each hyperparameters (Fig. 1). For the survival random forest, simple 10 folds crossvalidation was used (Fig. 2).
Benchmarking
To compare machine learning (ML) algorithms with reliability, it is necessary to ensure that train and test datasets are the same for all of the algorithms. With Benchmarking, one can apply resampling methods on main dataset, assuring all algorithms are being implemented on exactly the same train/test set in each resampling run. Then, we extract the overall average results of resampling in each set of populations.
We included nontreebased cox proportional hazard model (Although, this time with full set of features) in benchmark resampling to enable the comparison of our two ML methods (i.e Survival Random Forest, XGboost) with traditional cox proportional hazards method used in our study. We conducted benchmark resampling using 3folds crossvalidation.
Two measurement indices for survival studies were selected, Harrell’s Cindex and Uno’s Cindex to compare the algorithms. These are somehow equivalent to area under the curve (AUC) used for classification algorithms.
Harrell’s Cindex, is an algorithm which is used to assess time to event studies performance. The main concept behind Harrell’s Cindex is that a pair of subjects with different time of event experience, so called “comparable” subjects according to time of event, would have different calculated risk of experiencing event. The less time an event occurs, the higher the risk one subject would have. Therefore, a “comparable” subject’s pair risk estimation is expected to be “concordant” to their time of events.
Harrell’s Cindex calculates ratio of “concordant” to “comparable” pairs, meaning how much the model has accurately measure the risks that are concordant to time of events [17].
In the above equation, \({T}_i^{obs},{T}_j^{obs}\) are time to event for observation i and observation j respectively, and \({M}_i^{obs},{M}_j^{obs}\) are calculated risks for observation i and observation j. The result of I(…) would be zero or one according to comparison results.
Features ranking
Results of training of the train set were used to conduct variable importance. For random forest, outofbag samples during creating trees were used for variable importance method.
For survival random forest, we utilized permutation variable importance (VIMP) as described by Breiman et al. [18], tree minimal depth methodology, and variable hunting to rank variables based on their level of importance.
Permutation importance assesses model accuracy (error rate) before and after permuting (random shuffling) of each variable; the more deterioration occurs in the model accuracy, the more important permuted variable is.
As described by Breiman et al. [18], considering following definitions:
t: one of the trees where t ∈ {1, …, ntree.}
\({\overline{B}}^{(t)}\): out of bag (oob) sample for t
X_{j}: variable j in tree t
\({\hat{y}}_i^{(t)}={f}^{(t)}\left({x}_i\right)\): predicted value of observation i before permutation of its value of X_{j}
\({\hat{y}}_{i,{\pi}_j}^{(t)}={f}^{(t)}\left({x}_{i,{\pi}_j}\right)\): predicted value of observation i after permutation of its value of X_{j}
Then variable importance of X_{j} in tree t is calculated as follows:
Then for each variable, mean variable importance score is calculated as follows:
which is the mean variable importance score of v among all trees.
For minimal depth method, a preliminary random forest is generated first, then VIMP of each variable is calculated and is used to weigh each variable. Then routine random forest run is conducted but this time instead of randomly selecting variables in each node split, they are selected with a chance that is proportional to their assigned weights. It searches subtrees which their root nodes are split by variable v, so called maximal subtrees of variable v. A closest maximal subtree root of variable v to the main tree root is called minimal depth of variable v. The smaller minimal depth, the more important the variable v in predicting the outcome.
We used 50 iterations of survival random forest for minimal depth method (using package ‘randomForestSRC’ Hemant Ishwaran, version 3.1.1).
Variable hunting (VH) method usually is implemented for highdimensional dataset (number of variables remarkably more than subjects, e.g. 10 times), our dataset was not highdimensional, but we used this method to investigate the concordance between all methods of variable importance.
VH method in randomForestSRC package (one of mlr3proba dependency) follows this sequence: A preliminary forest is created to calculate VIMP of each variable, then another forest is created by selecting variables with chance proportional to their VIMP (weight). But this time instead of “depth”, relative frequency of selecting a variable is used to determine its importance, the more the relative frequency is, the more the variable is important. We defined 50 numbers of survival random forest iterations, and one preliminary tree was created before each iteration to calculate VIMP scores [using package ‘randomForestSRC’ Hemant Ishwaran, version 3.1.1]. Again, All VIMP scores were calculated by BreimanCutler permutation.
For XGboost algorithm, we used builtin “feature importance” function of XGboost package, which calculate the relative number of a feature that selected for splitting nodes across all trees, and percentage of total gain increase in all splits of a feature.
Partial dependence plot (PDP)
We used partial dependence to assess the average marginal effect of selected top features on the target variable, MACE. For numerical features, it helps to find the pattern of relation between the features and outcome, as they are linear or nonlinear. For categorical variables it helps to compare effects of each category on the target variable.
For numerical variables, considering x_{S} as the feature(s) in the set S that we want to plot its/their relation(s) with the outcome variable, x_{C} as vector of other features used in our ML model \(\hat{f}\), partial function marginalizes ML output over various distributions of vector x_{C} variables, so the function would depends mainly on x_{S}, our variable of interest [19]:
And estimation of partial function \(\hat{f}_{x_S}\) by averaging marginal effects as [19]:
To compute the marginal effect of a categorical variable, we set the category of all observations to the category that we are interested. For example, considering hypertension as a variable with two categories of 0 and 1, we calculate PDP estimate of having and not having hypertension (i.e. 1 or 0). Then we replace hypertension status of all the observations to 1 at once, and perform prediction, and then to 0 and perform the prediction again [19].
Web application and source codes
To demonstrate applicability of our study, we developed a web application which can be used to predict first year MACE of primary PCI patients by uploading proper data file by users or realtime completing a form of features (webapp link: https://behnamhedayat.shinyapps.io/primace or https://primace.aikadeh.com). The source code of statistical and machine learning analysis and the web application are available in Supplementary Table 3.
Results
Conventional statistical analysis
Opium users experienced about 27% more MACE during oneyear after primary PCI compared to their counterparts, although that was not proved to be significant in multivariate cox regression model (Opium: 72/518 (13.9%), Control: 112/1036 (10.8%), HR: 1.27 (95% CI: 0.94–1.71), adjusted pvalue = 0.136) (Table 3). KM curves were plotted for oneyear MACE (Fig. 3) and its components (Supplementary Fig. 5), all without significant differences between the groups. Oneyear need for CABG after primary PCI was the most notably different component of MACE between the groups, although it was not significantly changed, but it suggests a trend toward more oneyear need for CABG in patients who used Opium compared to nonusers (HR: 1.56 (95% CI: 0.98–2.5), adjusted pvalue = 0.063) (Table 4).
Machine learning analysis
Random forest results
On variable importance performed on out of the box (OOB) samples, opium use had positive VIMP score and ranked 13th among other variables (Fig. 4). Opium use also ranked 13th by minimal depth method (Fig. 5); and therefore, the results in variable importance and feature selection were concordant (Fig. 6). Opium use was ranked 12th in the variable hunting method (Fig. 7). Figure 8 Partial dependence plots illustrates marginal effects of opium use and four top variables example on oneyear MACE. The plot shows that opium users had increased MACE compared with nonopium users.
NelsonAaren estimator and KM curves demonstrated nearly similar overall survival curves. Continuous ranked probability score (CRPS) and Brier score plots over time in OOB subjects, demonstrated acceptable prediction accuracy of the SRF model over time (Supplementary Fig. 6).
XGboost results
Opium ranked 12th among other variables using its builtin variable importance method on the training set (Fig. 9).
Performance analysis
Benchmarking on train dataset with nested (repeated) cross validation resampling demonstrated that random forest method outperformed cox proportional hazards (conventional analysis) and XGboost. XGboost had the lowest performance, although the performance of the methods did not differ much (Harrell’s Cindex: random forest: 63.0%, Cox proportional hazards: 61.2%, XGboost: 59.2%) (Supplementary Table 2, Supplementary Fig. 7).
On the unseen test set, random forest model achieved a Harrell’s Cindex of about 69.4%, about 7% more than the observed value in benchmarking. XGboost Harrell’s Cindex value was similar to its value on the benchmarking with about 60%. Cox proportional hazard analysis was also performed on train and test dataset with all the independent variables. Its Harrell’s Cindex was 66%.
Values of Harrell’s Cindex between survival random forest and the two other models were significantly different, while XGboost and Coxph models were not significantly different regarding their Harrell’s Cindex (Table 5). We then performed a timedependent ROC analysis and assessed ROC AUC at six and 12 months after each model. As it is evident, at these time points, survival random forest remarkably outperformed other two models (Fig. 10).
Discussion
Overall, this study suggests that opium use offers neither benefits nor strongly affirmed detrimental effect on the rate of MACE during first year following primary PCI, although our results arose possibilities of detrimental effects of opium after primary PCI. Opium use ranked 12 or 13 in the machine learning analyses and was not among the most influential risk factors (ie. top ten variables). Despite the fact that opium use demonstrated a trend towards higher oneyear MACE in the conventional statistical analysis, the difference was not significant.
In our study we tried to make opium user and control groups as homogeneous as possible in terms of baseline features, especially those features having known major contribution to cardiovascular events. One main drawback of using variable ranking by machine learning models is that such methods do not consider confounding factors and do not adjust and control possible contribution of other variables to the outcome. Hence, we cannot ascertain that opium has independent adverse effect on first year MACE following primary PCI.
This finding is important, as many opium users believe in opium’s protective effects against several diseases and rely on this factor as a motivation for continued consumption. For example, a study in Iran found that 78.3% of the opium users believe that opium has positive effects in glycemic and hypertension control [4], while no such benefits were observed in the studies [5].
No previous study examined the association between opium use and primary PCI outcomes. A retrospective cohort in our center did not find associations between opium use and oneyear MACE in males undergoing “elective” PCI, although the authors did not adjust the groups’ age as a potential confounder [20]. Mousavi et al. also found no increased inhospital and sixmonth adverse outcomes after thrombolytic therapy for STEMI in patients addicted to opium compared to controls [21]. However, several studies on stable coronary artery disease (CAD), including a metaanalysis [6], found that opium use positively correlated with the risk of developing atherosclerotic plaque and CAD, the severity, and the risk of mortality from CAD [6,7,8, 10,11,12]. Doseresponse associations were observed between opium use and the extent of atherosclerotic plaques according to Gensini’s score [10], CAD severity by clinical vessel score [11], and cardiovascular and allcause mortality [9]. Furthermore, Sadeghian et al. reported opium use as the most important risk factor for premature CAD (< 45 years) among Iranian males [12]. Regarding acute coronary events (ACS), Roayaei et al. concluded that disagreements existed if opium had adverse effects on patients’ outcomes; however, at least no studies reported protective properties for opium [1].
Unlike primary PCI, outcomes were studied in opium users undergoing CABG. Masoudkabir et al. found higher 5year mortality and MACE in patients who continued to use opium after CABG, but no such findings were observed in patients who quit opium use after their surgery [14]. Concurrently, Safaei et al. reported higher readmission rates in opium users following CABG compared to nonopium users [13].
Roayaei [1], Masoudkabir [5], and Nakhaee et al. [22] reviewed several mechanisms that opium can exert its detrimental effects on the cardiovascular system: [1] Increased inflammatory cytokines and decreased antiinflammatory mediators, [2] elevated oxidative stress, [3] increased levels of procoagulant molecules, [4] higher rates of insulin resistance and metabolic syndrome [5, 23] altered hormone levels, notably decreased testosterone, estrogen, and adiponectin levels and hyperprolactinemia, [6] increased homocysteine levels, [7] physical inactivity and sedentary life style, [8] altered pain sensation and delayed clinical presentations leading to adverse outcomes, [9] other impurities and substances, most notably lead, and [10] interference with some antiplatelet medications, including aspirin, clopidogrel, prasugrel, and ticagrelor. As mentioned, opium interferes with antiplatelets and this may increase the risk of coronary and stent thrombosis. Therefore, appropriate studies on antiplatelet dosage modification may address this issue in patients who use opium. Neovascularization and collateral formation might also play role in the effects of opium on patients with cardiovascular disorders, as these mechanisms decrease the damage from acute and subacute ischemic events to the heart [24]. Opioids probably have proangiogenic properties that may hypothetically increase collateral coronary arteries [25], but may not be important regarding several mechanisms disfavoring opium use.
As mentioned earlier, opium using had no significant effect on MACE and its components among STEMI patients despite it has shown increasing allcause mortality among post CABG patients. We hypothesize that this finding may be due to the already intensely high inflammatory state during STEMI active phase compared to less intense chronic proinflammatory effect of opium which would contribute to small portion of inflammatory milieu during STEMI. Also, relatively lower overall risk factors and lower coronary artery disease burden in STEMI patients, would represent lower chronic inflammatory state compared to those who need CABG. This makes CABG patients more likely to experience mortality due to various cause [26].
This study comes with some limitations. One is the retrospective nature of this cohort. Furthermore, we did not divide the patients to former and current opium users due to the lack of a universal definition and the design of our database. Another drawback is that machine learning methods cannot adjust for confounding variables that could alter the observed outcomes. We recommend future researchers to conduct studies that compare former and current patients who use opium. On the other hand, in our opinion, the novelty of this study and its robust statistical methods may compensate for its shortcoming.
Conclusion
Opium had neither protective effects nor strongly affirmed detrimental effect on oneyear MACE after primary PCI on patients presenting with STEMI. It was not ranked among top ten important variables in machine learning algorithms and had not significant effect in conventional statistical analysis on oneyear MACE outcome despite adjusting for other variables. Accordingly, it could emphasize that treatment strategies for patients presenting with ST elevation MI should not be different for those who are opium users vs. non users, a point that can be studied in the future and mentioned in future STEMI guidelines. On the other hand, patients who believe opium has certain health benefits and is useful after primary PCI, should be counseled about the lack of evidence for such claims and the possible adverse effects of opium.
Availability of data and materials
The datasets generated and analyzed during the current study are not publicly available due to our institutional policies, but are available from the corresponding author on reasonable request.
Abbreviations
 CI:

Confidence interval
 CAD:

Coronary artery disease
 CABG:

Coronary artery bypass graft
 MACE:

Major adverse cardiovascular events
 PCI:

Percutaneous coronary intervention
 STEMI:

STsegment elevation myocardial infarction
 PSM:

Propensity score matching
 DM:

Diabetes mellitus
 LDL:

Low density lipoprotein
 SMD:

Standardized mean difference
 IQR:

Interquartile range
 HR:

Hazard ratio
 KM:

KaplanMeier
 TVR:

Target vessel revascularization
 TLR:

Target lesion revascularization
 AUC:

Area under the curve
 VIMP:

Variable importance
 VH:

Variable hunting
 OOB:

Out of the box
 CRPS:

Continuous ranked probability score
 PDP:

Partial dependance plot.
References
Roayaei P, Aminorroaya A, VasheghaniFarahani A, Oraii A, Sadeghian S, Poorhosseini H, et al. Opium and cardiovascular health: a devil or an angel? Indian Heart J. 2020;72(6):482–90.
AminEsmaeili M, RahimiMovaghar A, Sharifi V, Hajebi A, Radgoodarzi R, Mojtabai R, et al. Epidemiology of illicit drug use disorders in Iran: prevalence, correlates, comorbidity and service utilization results from the Iranian mental health survey. Addiction. 2016;111(10):1836–47.
United Nations Office on Drugs and Crime (UNODC), World Drug Report 2021, accessed September 7, 2021 [available online at: https://www.unodc.org/unodc/en/dataandanalysis/wdr2021.html].
Azod L, Rashidi M, AfkhamiArdekani M, Kiani G, Khoshkam F. Effect of opium addiction on diabetes. The Am J Drug and aAcohol Abuse. 2008;34(4):383–8.
Masoudkabir F, Sarrafzadegan N, Eisenberg MJ. Effects of opium consumption on cardiometabolic diseases. Nat Rev Cardiol. 2013;10(12):733–40.
Nakhaee S, Amirabadizadeh A, Qorbani M, Lamarine RJ, Mehrpour O. Opium use and cardiovascular diseases: a systematic review and metaanalysis. Crit Rev Toxicol. 2020;50(3):201–12.
Masoumi M, Shahesmaeili A, Mirzazadeh A, Tavakoli M, Ali AZ. Opium addiction and severity of coronary artery disease: a casecontrol study. J Res In Med Sci : The Official J Isfahan Univ Med Sci. 2010;15(1):27–32.
Masoomi M, Ramezani MA, Karimzadeh H. The relationship of opium addiction with coronary artery disease. Int J Prev Med. 2010;1(3):182–6.
Khademi H, Malekzadeh R, Pourshams A, Jafari E, Salahi R, Semnani S, et al. Opium use and mortality in Golestan cohort study: prospective cohort study of 50,000 adults in Iran. BMJ (Clin Res ed). 2012;344:e2502.
Hosseini SK, Masoudkabir F, VasheghaniFarahani A, AlipourParsa S, Sheikh Fathollahi M, RahimiForoushani A, et al. Opium consumption and coronary atherosclerosis in diabetic patients: a propensity scorematched study. Planta Med. 2011;77(17):1870–5.
Sadeghian S, Darvish S, Davoodi G, Salarifar M, Mahmoodian M, Fallah N, et al. The association of opium with coronary artery disease. Eur J Cardiovasc Prev Rehabil. 2007;14(5):715–7.
Sadeghian S, Graili P, Salarifar M, Karimi AA, Darvish S, Abbasi SH. Opium consumption in men and diabetes mellitus in women are the most important risk factors of premature coronary artery disease in Iran. Int J Cardiol. 2010;141(1):116–8.
Safaei N. Outcomes of coronary artery bypass grafting in patients with a history of opiate use. Pak J Biol Sci. 2008;11(22):2594–8.
Masoudkabir F, Yavari N, Pashang M, Sadeghian S, Jalali A, Poorhosseini H, et al. Effect of persistent opium consumption after surgery on the longterm outcomes of surgical revascularisation. Eur J Prev Cardiol. 2020;27(18):1996–2003.
Vatcheva KP, Lee M, McCormick JB, Rahbar MH. Multicollinearity in regression analyses conducted in epidemiologic studies. Epidemiology (Sunnyvale). 2016;6(2):227.
LeBlanc M, Crowley J. Relative risk trees for censored survival data. Biometrics. 1992;48(2):411–25.
Longato E, Vettoretti M, Di Camillo B. A practical perspective on the concordance index for the evaluation and selection of prognostic timetoevent models. J Biomed Inform. 2020;108:103496.
Strobl C, Boulesteix AL, Kneib T, Augustin T, Zeileis A. Conditional variable importance for random forests. BMC Bioinformatics. 2008;9:307.
Molnar C. Interpretable machine learning. Methods. 2020;179:1–2. https://doi.org/10.1016/j.ymeth.2020.05.024.
Sharafi A, Pour Hosseini HR, Jalali A, Salarifar M, Nematipour E, Shojanasab M, et al. Opium consumption and midterm outcome of percutaneous coronary intervention in men. J Tehran Heart Cent. 2014;9(3):115–9.
Mousavi M, Kalhor S, Alizadeh M, Movahed MR. Opium addiction and correlation with early and sixmonth outcomes of presenting with ST elevation myocardial infarction treated initially with thrombolytic therapy. Am J Cardiovasc Dis. 2021;11(1):115–23.
Nakhaee S, Ghasemi S, Karimzadeh K, Zamani N, AlinejadMofrad S, Mehrpour O. The effects of opium on the cardiovascular system: a review of side effects, uses, and potential mechanisms. Subst Abuse Treat Prev Policy. 2020;15(1):30.
Yousefzadeh G, Shokoohi M, Najafipour H, Eslami M, Salehi F. Association between opium use and metabolic syndrome among an urban population in southern Iran: results of the Kerman coronary artery disease risk factor study (KERCADRS). ARYA Atheroscler. 2015;11(1):14–20.
Kobayashi K, Maeda K, Takefuji M, Kikuchi R, Morishita Y, Hirashima M, et al. Dynamics of angiogenesis in ischemic areas of the infarcted heart. Sci Rep. 2017;7(1):7156.
Mahbuba W, Lambert DG. Opioids and neovascularization; pro or anti? Br J Anaesth. 2015;115(6):821–4.
Lechner I, Reindl M, Tiller C, Holzknecht M, Fink P, Plangger J, et al. Association between inflammation and left ventricular thrombus formation following STelevation myocardial infarction. Int J Cardiol. 2022;361:1–6.
Acknowledgements
None declared
Funding
We received no funding regarding this research.
Author information
Authors and Affiliations
Contributions
All the authors read and approved the final version of the manuscript. YJ: Conception, Supervision, Data gathering. BH: Data analysis using conventional statistics, Machine learning and AIbased analysis, Writing: drafting and revising the manuscript. AK: Writing: drafting and revising the manuscript. ST: Conception, Data gathering, Writing: revising the manuscript. SMG: Data gathering. HE: Writing: revising the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
All the methods in this study have been performed in accordance with the Declaration of Helsinki. All the participants gave consent to participate in the study. The study was approved by the institutional board review at Tehran Heart Center with the ethics code of IR.TUMS.THC.REC.1399.009.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1:
Supplementary Table 1. Baseline characteristics of the study groups before and after matching.
Additional file 2:
Supplementary Table 2. Comparison of the results of two different machine learning models (Random Forest and XGboost) and cox proportional hazards (coxph) based on Uno and Harrell’s Cindex. Resampling method was crossvalidation for all the learners.
Additional file 3:
Supplementary Table 3. Source codes for the application and analyses.
Additional file 4:
Supplementary Figure 1. Absolute standardized mean difference between opium users and controls before and after propensity score matching (PSM).
Additional file 5:
Supplementary Figure 2. Correlation matrix of the independent numerical variables.
Additional file 6:
Supplementary Figure 3. Examples of plotted decision trees. Above decision tree is plotted using random forest and the bottom using extended gradient boosting (XGboost).
Additional file 7:
Supplementary Figure 4. Outofbox (OOB) error rates for MACE per number of tree.
Additional file 8:
Supplementary Figure 5. Kaplan–Meier (KM) curves for the components of oneyear MACE of the patients who underwent primary PCI after STsegment elevation MI separated by opium users and controls.
Additional file 9:
Supplementary Figure 6. Outofbag (OOB) survival plot for individuals, Brier score, and continuous ranked probability score (CRPS) plots. The top left plot illustrates Kaplan Meier (KM) plots for OOB sample of each individual, and also included aggregate KM results in and NelsonAalen estimator in green. Both methods show same survival curve. Top right plot illustrates OOB Brier score to assess accuracy of the predictions over time in quarters of patients. Less Brier score indicates better prediction. Bottom left plot shows CRPS over time, another measure of prediction accuracy. Bottom right plot shows individual subjects’ MACE outcome vs. time.
Additional file 10:
Supplementary Figure 7. Box plot of performance comparison of three algorithms (Forest plot, XGboost, cox proportional hazards) based on Harrell’s Cindex.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Jenab, Y., Hedayat, B., Karimi, A. et al. Effects of opium use on oneyear major adverse cardiovascular events (MACE) in the patients with STsegment elevation MI undergoing primary PCI: a propensity score matched  machine learning based study. BMC Complement Med Ther 23, 16 (2023). https://doi.org/10.1186/s1290602303833z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s1290602303833z
Keywords
 Machine learning
 Major adverse cardiovascular events
 Mortality,Myocardial infarction
 Opium
 Percutaneous coronary intervention