

ORIGINAL ARTICLE 

Year : 2021  Volume
: 10
 Issue : 2  Page : 317334 

Quantitative structure–activity relationship modeling of some naphthoquinone derivatives as inhibitors of pathogenic agent IDO1
Sajjad Jazayeri Farsani, Saeid Asadpour, Abolfazl Semnani, Shima Ghanavati Nasab
Department of Chemistry, Faculty of Sciences, University of Shahrekord, Shahrekord, Iran
Date of Submission  13Oct2020 
Date of Acceptance  03Oct2021 
Date of Web Publication  17Dec2021 
Correspondence Address: Dr. Saeid Asadpour Department of Chemistry, Faculty of Science, Shahrekord University, Shahrekord. Iran
Source of Support: None, Conflict of Interest: None  Check 
DOI: 10.4103/jrptps.JRPTPS_124_20
Quantitative structure–activity relationship (QSAR) was performed to analyze naphthoquinone derivatives as an inhibitor of indoleamine 2,3dioxygenase pathogen via multivariate regression (MLR) and artificial neural network. The best descriptors were picked to construct the QSAR. Two sets of exercises and experiments were also performed using Principal Component Analysis for multiple linear regression (MLR). A quantitative model was then proposed based on these analyses and the activity of the compounds based on multivariate statistical analysis was interpreted. The study finally revealed that although the MLR model can predict the activity of the compounds to some extent, the artificial neural network (ANN) model results indicate that the predictions obtained by the neural network are much better and more efficient than other models. The neural network was also used where three coefficients of correlation were used. The results uncovered that the ANN model is statistically significant and has good stability for data validation for the validation method. Share Descriptive relationships of structure and activity were also examined. Keywords: Artificial neural network (ANN), multiple linear regression (MLR), naphthoquinone derivatives, pathogenic agent, quantitative structure–activity relationship (QSAR)
How to cite this article: Jazayeri Farsani S, Asadpour S, Semnani A, Ghanavati Nasab S. Quantitative structure–activity relationship modeling of some naphthoquinone derivatives as inhibitors of pathogenic agent IDO1. J Rep Pharma Sci 2021;10:31734 
How to cite this URL: Jazayeri Farsani S, Asadpour S, Semnani A, Ghanavati Nasab S. Quantitative structure–activity relationship modeling of some naphthoquinone derivatives as inhibitors of pathogenic agent IDO1. J Rep Pharma Sci [serial online] 2021 [cited 2022 Aug 8];10:31734. Available from: https://www.jrpsjournal.com/text.asp?2021/10/2/317/332778 
Introduction   
The interaction between the immune system and the growth of tumors is complex and active. Although the host immune system has the capacity to detect and identify tumor cells, many malignancies can actively hinder the immune response.^{[1],[2]} This can be inhibited by several mechanisms, such as the recruitment of immunosuppressive cells, the removal of T cells, and the activation of inspection pathways.^{[3]} There is increasing evidence that indoleamine 2,3dioxygenase 1 (IDO1) plays a seminal role in suppressing immunity in the microscopic environment of the tumor. These studies indicate that IDO1 might be a valid therapeutic target in immunotherapy.^{[4],[5],[6]}
Many studies have highlighted the point that IDO1 is suppressed in normal human tissues including the spleen, intestine, and lung.^{[7],[8]} Also, using many types of human tumors, they are constitutively expressed by inflammatory stimuli such as interferonγ (IFNγ) and transforming growth factorβ.^{[9],[10]} Overexpression of IDO1 is abolished by tumors that destroy tryptophan and accumulate large amounts of kynurenine and its downstream metabolites in the tumor microenvironment. Tryptophan deficiency can also be mediated by mTORC1 inhibition^{[11]} and GCN2 activation,^{[12]} affecting Tcell sensitivity and production of kynurenine active pathway metabolites, leading to increased trade differentiation as a result of aryl hydrocarbon receptor (AHR) activation.^{[13]} Overexpression of IDO1 also affects dendritic cells and macrophages, conversion to normal killer cells, and production of reactive oxygen species. It, thus, enables tumor cells to suppress host immune responses.^{[13],[14],[15],[16]} New research on COVID19 has also shown that after entry into cells, coronaviruses activate AHRs by an IDO1independent mechanism, bypassing the IDO1kynurenineAhR pathway.^{[17]}
Attempts to quantitative analysis of the relationship between the structure and activity of compounds provide an understanding of the effect of structure on their activity, which may not be easy when having large amounts of data. In addition, this method makes it possible to make predictions that lead to the synthesis of new compounds similar to the expected amount of activity. The quantitative structure–activity relationship (QSAR) method covers a wide range of chemical measurements and biological tests, statistical methods, and interpretation of results.^{[18],[19],[20]} The QSAR method can be used for any molecular design purpose, including predicting the biological activity and physicochemical properties, better understanding the mechanism of action in a number of chemicals, saving and reducing product costs (drugs, pesticides, and new chemicals), and replacing the use of laboratory animals.^{[21]} QSAR modeling studies have also been performed on different types of IDO1 inhibitors.^{[22]}
Given the abovementioned points, this study aimed to develop QSAR models for the use of naphthoquinone derivatives as inhibitors of the pathogenic factor IDO1 using several statistical approaches, including Principal Component Analysis (PCA), multivariate regression (MLR), and artificial neural network (ANN). The validation method was chosen to evaluate the performance and stability of this model.
Materials and Methods   
A QSAR study consists of three parts are as follows: (1) data related to the activity or feature under study (in this study, IDO1 inhibitors) that should be modeled and predicted, (2) the descriptors on which the model is based, and (3) a mathematical or statistical method used to formulate the model such as MLR and ANN.^{[23],[24],[25]}
Data sources
In the current study, the compounds of naphthoquinone derivatives as inhibitors of the pathogenic factor IDO1 were investigated and the data were retrieved from the publication of Xiangbao Meng et al.^{[26]}[Figure 1] shows the basic structure of the naphthoquinone. Also, [Table 1] presents the values of the percent inhibition of substituted compounds studied from naphthoquinone derivatives.
Molecular descriptors
All molecules were extracted with ChemDraw software, optimized with MM2 molecular force field, and calculated with the DRAGON bundle.
Dragon software calculates a large number of descriptors from which the most effective descriptors need to be selected. In the first step, descriptors with constant and zero values were omitted because they could not show the relationship between structure and activity well. In the second step, the correlation between the descriptors and the dependent variable was established and the descriptors that had low correlation with the dependent variable were eliminated.
In the third stage, since there is a correlation between the descriptors, the descriptors whose correlation coefficient was greater than 0.95 indicate a linear correlation between them because both descriptors have identical information. Therefore, some of these descriptors as well as those that were less correlated with the dependent variable were removed.
Of all the descriptors, six were selected to predict IDO1 pathway inhibitor activity as follows:
Geary autocorrelation of lag 2 weighted by van der Waals volume (GATS2v), R maximal autocorrelation of lag 3/ weighted by Sanderson electronegativity (R3e^{+}), signal 16/weighted by mass (Mor16m), first component symmetry directional WHIM index/weighted by mass (G1m), second component symmetry directional WHIM index/unweighted (G2u), and signal 31/weighted by mass (Mor31m).
Statistical analysis
The purpose of quantitative structure–activity relationship (QSAR) analysis is to predict the biological activities of the compounds by chemical structures using models. In the QSAR analysis, quantitative descriptors and analysis results are used in a mathematical model to depict the chemical structure that describes the relationship between chemical structure and biological activity.
The target molecules were divided into two groups of training and testing using Minitab software. PCA is a highperformance statistical method for summarizing all encrypted information in the structure of compounds
Among the methods of regression analysis, MLR is typically taken as a regressionbased method for QSAR or QSPR analysis. Each variable is added to the equation before another and regression is performed. The new expression remains if an experiment confirms the significance of the equation. This regression method is useful when a large number of variables and key descriptors are unknown.^{[27]} In addition, it selects descriptors that are used as input parameters to the ANN.
MLR is generated using Excel software. Various parameters are used to evaluate the model such as correlation coefficient (R), mean square error (RMSE), and crosscorrelation correlation coefficient. ANN analysis is performed via the Matlab Toolkit in the Components Database. A number of unique ANN models were designed, manufactured, and trained. Regarding the structure of a neural network, it entails three underlying elements of the processing elements or nodes, the topology of the connections between the nodes, and the learning rule by which new information is encoded in the network. Among the different models of ANN, the forward feed distribution network was decided to be used in this study. In this type of network, neurons are set as the input layer, a hidden layer, and an output layer. Each neuron in each layer is entirely related to the neurons of a single layer, and there is no correlation among the neurons belonging to one layer.
Results and Discussion   
Dataset for analysis
A QSAR study was performed for 57 derivatives of naphthoquinone as an inhibitor IDO1 to determine a quantitative relationship between the structure and inhibitory activity.^{[26]}[Table 2] represents the values of the six descriptors.  Table 2: Values of the parameters of the derivatives of naphthoquinone investigated
Click here to view 
Correlation analysis was performed to identify the relationship between different variables. [Table 3] presents the correlation coefficients matrix for the relevance between the six descriptors selected.{Table 3}
The obtained matrix provides information on the degree of correlation between the variables. In general, the results indicate a low correlation (r < 0.5) between most variables. Although a high interrelationship was observed between GATS2v and Mor31m (r = −0.5032), a low interrelationship was observed between R3e+ and G2u (r = 0.03975).
MLR model results
Initially, six final descriptors coding 57 molecules (as mentioned above) were sent to the PCA for classification of compounds into train and test sets to validate the MLR model. In total, 45 molecules were included in the train set to construct QSAR models, whereas the remaining 12 molecules constituted the test set. Sections were randomly selected using PCA. [Figure 2] shows the results of data classification by the PCA analysis.  Figure 2: PCA analysis and score plots of the analyzed aaphthoquinone derivatives
Click here to view 
Many attempts have been made to establish an acceptable relationship between the molecular descriptors and the values of Inhibition%. However, the best relationship ultimately obtained with this method was the one related to the linear combination of the selected several descriptors, GATS2v، R3e+، Mor16m، G1m، G2u و Mor31m.
The resulting equation is as follows:
N_{train} = 45; N_{test} = 12; R^{2} = 0.569; RMSE = 13.827; Q^{2} = 0.467.
In this equation, N is the number of compounds, R^{2} is the squared correlation coefficient, RMSE is the root RMSE, Q^{2} is the crosscorrelation coefficient evaluation. A higher correlation coefficient and lower root RMSE indicate that the model is more reliable. The QSAR model expressed by is crossvalidated by its appreciable R^{2} values (R^{2} = 0.569), Q^{2} values (Q^{2} = 0.467).
The developed QSAR model reveals that inhibitors of Pathogenic agent IDO1 might be explained by a number of 3DMoRSE and 2D autocorrelations and GETAWAY and WHIM factors. Whereas the negative correlation between the WHIM descriptor (G2u) and the 3DMoRSE descriptor (Mor31m) with inhibition activity indicates that the increase in these values represents a devaluation of inhibition, a positive correlation between the2D autocorrelations (GATS2v) and GETAWAY descriptor (R3e^{+)} and 3DMoRSE descriptor (Mor16m) and WHIM descriptor(G1m) with inhibition activity indicates an increase in the inhibition value. Based on Equation (1), the mechanism of Pathogenic agent IDO1 activity for the derivatives of naphthoquinone is as follows:
 (1) The inhibitors of pathogenic agent IDO1 activity of the naphthoquinone derivatives decrease with the increase of G2u, Mor31m. Thus, these descriptors are against the Inhibitory activity of the naphthoquinone derivatives.
 (2) The inhibitors of pathogenic agent IDO1 activity of the naphthoquinone derivatives increase with the increase of GATS2v, R3e +, Mor16m, and G1m. Thus, the descriptor is directly related to the Inhibitor activity of the derivatives of the naphthoquinone.
[Figure 3] shows the correlations of the predicted and observed activities. The descriptors proposed in by MLR are, therefore, used as the input parameters in ANN. [Table 4] presents the predicted values of the inhibition percent of the train set and the test set using the MLR equation.  Figure 3: Graphical representation of calculated and observed activity by MLR
Click here to view  {Table 4}
ANN model results
Among the types of models available, ANNs can produce predictive models of QSAR between the molecular descriptors derived from the MLR model and the activity observed from the compounds. The predicted activity of the compounds was prepared via the ANN model using the properties of several compounds studied. [Figure 4] shows the degree of compliance with the anticipated and observed activities.  Figure 4: Graphical representation of calculated and observed activity by ANN
Click here to view 
The squared correlation coefficient (R^{2}) obtained from the neural network model for this set of naphthoquinone derivatives was calculated as 0.983. Given the acceptable value of this coefficient for the model, it is confirmed that ANN is a superior method for constructing quantitative structure–activity relevance model to predict the desired activity in the compounds mentioned.
In addition, the high value of this coefficient (R^{2} = 0.983) confirms that the obtained QSAR model can well predict inhibitory activity against pathogenic agent IDO1 for other similar compounds. [Table 5] represents the predicted values of inhibition percent for the training set, validation set, and test set using the ANN model.{Table 5}
To assess the predictive power of MLR and ANN models for case activity, we need to use a set of compounds that are different from the training set to create the QSAR model and are not used in model construction. The model MLR established in the computation procedure using the 45 thiazolidinedione derivatives are used to predict the activity of the remaining 12 compounds and model ANN using the 39 thiazolidinedione derivatives are used to predict the activity of the remaining 18 compounds (test = 9 and validation = 9).
The results of regression residual investigation displayed that the error dispensation of these models are unsystematic with normal distribution and have homogenous variance. The agreement observed between the predicted experimental values in [Figures 3] and [4] and the random distribution of residuals about zero mean in [Figure 5] confirms the high predictive capability of MLR and ANN modeling. Additionally, as can be seen from the figure, the distribution of percent residuals shows that the ANN approach results in fewer prediction errors across the entire data.  Figure 5: Residual versus experimental values in MLR (a) and ANN (b) models
Click here to view 
[Table 6] shows the main performance parameters of the two models. As expected, all statistical parameters for the neural network model are better than those for the MLR model.{Table 6}
In sum, we evaluated the best linear QSAR regression equations specified in this study. As expected, based on the results obtained, the quality evaluation of the MLR model shows that the ANN model is significantly more predictive than the MLR model because the results of ANN method are better than those of MLR model. Therefore, ANN establishes an acceptable relationship between the types of molecular descriptors and the activity of the compounds studied.
Conclusion   
In this study, two methods of MLR and ANN were used to predict the inhibitor of pathogenicity of IDO1 with naphthoquinone derivatives. Six types of descriptors were picked to construct the MLR model and the neural network for naphthoquinone derivatives, which include GATS2v, R3e +, Mor16m, G1m, G2u, and Mor31m. The results of the models showed that the model of the neural network is better than the MLR model. To compare the accuracy and prediction of proposed models, key statistical indicators, such as R^{2} and RMSE were presented in different models using different statistical tools and descriptors. The results of comparing the two models showed that the ANN model was significantly superior to the MLR model. The slope of the predicted line equations was close to that of the experimental value, which was closer to one and expresses the correct prediction for the naphthoquinone derivatives, with a high R^{2} value and a low RMSE. Finally, we conclude that descriptor studies using the neural network have a much better ability to predict the inhibitor of the pathogenic IDO1 agent.
Financial support and sponsorship
Nil.
Conflicts of interest
The authors declare that they have no known competing financial interests or personal relationships that might influence the work reported in this study. Also, there are no conflicts of interest.
References   
1.  Li H, Chiappinelli KB, Guzzetta AA, Easwaran H, Yen RW, Vatapalli R, et al. Immune regulation by low doses of the DNA methyltransferase inhibitor 5azacitidine in common human epithelial cancers. Oncotarget 2014;5:58798. 
2.  Vinay DS, Ryan EP, Pawelec G, Talib WH, Stagg J, Elkord E, et al. Immune evasion in cancer: Mechanistic basis and therapeutic strategies. Semin Cancer Biol 2015;35 Suppl:18598. 
3.  Munn DH, Mellor AL. IDO in the tumor microenvironment: Inflammation, counterregulation, and tolerance. Trends Immunol 2016;37:193207. 
4.  Prendergast GC. Immune escape as a fundamental trait of cancer: Focus on IDO. Oncogene 2008;27:3889900. 
5.  Katz JB, Muller AJ, Prendergast GC. Indoleamine 2,3dioxygenase in Tcell tolerance and tumoral immune escape. Immunol Rev 2008;222:20621. 
6.  Zamanakou M, Germenis AE, Karanikas V. Tumor immune escape mediated by indoleamine 2,3dioxygenase. Immunol Lett 2007;111:6975. 
7.  Platten M, Wick W, Van den Eynde BJ. Tryptophan catabolism in cancer: Beyond IDO and tryptophan depletion. Cancer Res 2012;72:543540. 
8.  Théate I, van Baren N, Pilotte L, Moulin P, Larrieu P, Renauld JC, et al. Extensive profiling of the expression of the indoleamine 2,3dioxygenase 1 protein in normal and tumoral human tissues. Cancer Immunol Res 2015;3:16172. 
9.  Pallotta MT, Orabona C, Volpi C, Vacca C, Belladonna ML, Bianchi R, et al. Indoleamine 2,3dioxygenase is a signaling protein in longterm tolerance by dendritic cells. Nat Immunol 2011;12:8708. 
10.  Taylor MW, Feng GS. Relationship between interferongamma, indoleamine 2,3dioxygenase, and tryptophan catabolism. Faseb J 1991;5:251622. 
11.  Metz R, Rust S, Duhadaway JB, Mautino MR, Munn DH, Vahanian NN, et al. IDO inhibits a tryptophan sufficiency signal that stimulates mTOR: A novel IDO effector pathway targeted by D1methyltryptophan. Oncoimmunology 2012;1:14608. 
12.  Munn DH, Sharma MD, Baban B, Harding HP, Zhang Y, Ron D, et al. GCN2 kinase in T cells mediates proliferative arrest and energy induction in response to indoleamine 2,3dioxygenase. Immunity 2005;22:63342. 
13.  Mezrich JD, Fechner JH, Zhang X, Johnson BP, Burlingham WJ, Bradfield CA. An interaction between kynurenine and the aryl hydrocarbon receptor can generate regulatory T cells. J Immunol 2010;185:31908. 
14.  Jinushi T, Shibayama Y, Kinoshita I, Oizumi S, Jinushi M, Aota T, et al. Low expression levels of microRNA1245p correlated with poor prognosis in colorectal cancer via targeting of SMC4. Cancer Med 2014;3:154452. 
15.  Shirey KA, Jung JY, Maeder GS, Carlin JM. Upregulation of IFNgamma receptor expression by proinflammatory cytokines influences Ido activation in epithelial cells. J Interferon Cytokine Res 2006;26:5362. 
16.  Song H, Park H, Kim YS, Kim KD, Lee HK, Cho DH, et al. Lkynurenineinduced apoptosis in human NK cells is mediated by reactive oxygen species. Int Immunopharmacol 2011;11:9328. 
17.  Turski WA, Wnorowski A, Turski GN, Turski CA, Turski L. Ahr and IDO1 in pathogenesis of COVID19 and the “systemic ahr activation syndrome:” a translational review and therapeutic perspectives. Restor Neurol Neurosci 2020;38:34354. 
18.  Masoomi Sefiddashti F, Haddadi H, Asadpour S, Ghanavati Nasab S. Prediction of IC50 Values of 2benzyloxy benzamide Derivatives using Multiple Linear Regression and Artificial Neural Network Methods. Iran J Math Chem 2020;11:17999. 
19.  Norouzian MA, Asadpour S. Prediction of feed abrasive value by artificial neural networks and multiple linear regression. Neural Comput Appl 2012;21:9059. 
20.  Asadpour S, Chamsaz M, Haron MJ. Application of MLR, PLS and artificial neural networks for prediction of GC/ECD retention times of chlorinated pesticides, herbicides, and organohalides. Res J Pharm Biol Chem Sci2012;3;21:85060. 
21.  Gramatica P, Consonni V, Todeschini R. QSAR study on the tropospheric degradation of organic compounds. Chemosphere 1999;38:13718. Available from: https://www.sciencedirect.com/science/article/pii/S0045653598005396 
22.  Zhang L, Lai F, Chen X, Xiao Z. Identification of potential indoleamine 2, 3dioxygenase 1 (IDO1) inhibitors by an FBGbased 3D QSAR pharmacophore model. J Mol Graph Model 2020;99:107628. Available from: https://www.sciencedirect.com/science/article/pii/S1093326320302497 
23.  Javidfar M, Ahmadi S. QSAR modelling of larvicidal phytocompounds against Aedes aegypti using index of ideality of correlation. SAR QSAR Environ Res 2020;31:71739. 
24.  Ahmadi S, Ghanbari H, Lotfi S, Azimi N. Predictive QSAR Modeling for the antioxidant activity of natural compounds derivatives based on Monte Carlo method. Mol Divers 2021;25:8797. 
25.  Ghiasi T, Ahmadi S, Ahmadi E, Talei Bavil Olyai MR, Khodadadi Z. The index of ideality of correlation: QSAR studies of hepatitis C virus NS3/4A protease inhibitors using SMILES descriptors. SAR QSAR Environ Res 2021;32:495520. 
26.  Pan L, Zheng Q, Chen Y, Yang R, Yang Y, Li Z, et al. Design, synthesis and biological evaluation of novel naphthoquinone derivatives as IDO1 inhibitors. Eur J Med Chem 2018;157:42336. 
27.  HabibiYangjeh A, DanandehJenagharad M, Nooshyar M. Application of artificial neural networks for predicting the aqueous acidity of various phenols using QSAR. J Mol Model 2006;12:33847. 
[Figure 1], [Figure 2], [Figure 3], [Figure 4], [Figure 5]
[Table 1], [Table 2]
