2D QSAR and Atom based 3D QSAR study of Tropomyosin Receptor Kinases inhibition by pyrazol derivatives

 

Arpan Adhikary, Ronak Nair, Lakshya Moukthika, Ruchi Verma*

Department of Pharmaceutical Chemistry, Manipal College of Pharmaceutical Sciences,

Manipal Academy of Higher Education, Manipal-576104, Karnataka, India.

*Corresponding Author E-mail: ruchiverma.pharma@gmail.com

 

ABSTRACT:

Quantitative Structure Activity Relationship (QSAR) studies are tools mostly used in many research areas, including drug discovery process. The tropomyosin receptor kinase (TRK) family are emerging as an important target for cancer therapeutics. The atom based 3D QSAR model and 2D QSAR model were designed and suitable models were generated useful for predicting the tetrahydropyrrolo[3,4-c]pyrazol derivatives prior to their synthesis, developed for predicting the anti-cancer activity against TRKs . The given study indicates the credibility of derived QSAR model by the determination of suitable statistical parameters as we have observed high relationship between experimental and predicted activity values showing ligand molecule larotrectinib with various possibilities of structural modifications to develop potential molecules with significant TRKs inhibitory activity and also predict the activity of any unknown derivative. The data reported by the above QSAR models provides necessary directions for the designing of new TRKs inhibitors against cancer.

 

KEYWORDS: Phosphatidylinositol-3 kinase, pharmacophore, TRKs, descriptors, QSAR.

 

 


1. INTRODUCTION: 

Our bodies are made up of countless number of cells, grouped to form tissues and it is our genes which direct our cells to show growth, division and ultimately causing cell death and thus, our body function normally. However, when there is a change in our DNA, a gene can mutate1. Mutated genes don’t work properly because the instructions in their DNA are scrambled which can cause the cells that should be resting, to divide and grow out of control which can lead to cancer.This failure of apoptosis (programmed cell death) has the potential to invade or spread to other parts of the body2. In this study, we focus on TRKs (Tropomyosin Receptor Kinases) subtype A, primarily a trans-membrane receptor protein (PDB ID: 5KVT) that regulates cell differentiation, proliferation and survival.  The family of TRKs are advancing as major target for cancer therapeutics 3. TRK stimulation leads to activation of a number of intracellular signaling cascades including, the RAS and the phosphatidylinositol-3 kinase (PI-3K) cascades which are cancer causing carcinogenic factor.

 

Larotrectinib is the expected molecule that lines up with the features of Tetrahydropyrolo derivatives in docking study and QSAR analysis is done to forecast their activities4.

 

Quantitative Structure Activity Relationship (QSAR) models are models5 that can be used to predict the physicochemical, biological and environmental properties of compounds from the knowledge of their structure6. QSAR quantifies the relationship between structure and activity on the basis of their physiochemical property7. In a QSAR we use descriptors that are eventually used, to convert chemical structures into mathematical variables; statistical methods to derive the relationships between the observations and the descriptors and the quality of the observed data8. In this article, we’ve build 3D and 2D QSAR models using TRKs inhibitors data for predicting the activity.

 

2.    METHODOLOGY:

2.1 Computational details:

PHASE 4.3 of Maestro 11.4 version software package of Schrodinger Inc., was used to generate 2D and 3D-QSAR models for tetrahydropyrrolo[3,4-c]pyrazol derivatives as inhibitors of TRKs. A dataset consisting of 42 derivatives were chosen from the mentioned Literature10 and were used in this study, consisting of minimal concentration of the compounds (IC50) which were then converted to PIC50 by applying the respective formula (PIC50= Log [1/IC50]) which are represented in. The structures were drawn with the help of 2D sketcher option in Schrodinger software.

 

2.2 Ligand preparation and molecular alignment:

The selected compounds were prepared by using LigPrep tool with force field- OPLS_2005 assigned at pH state 7.4  and then ligand alignment was carried out with respective alignment tool 10 provided by the maestro software. The structures were aligned in such a way that they superimpose one other which helps in observing variations in them. It is a key step in order to make a precise and accurate 3D QSAR model 11.

 

2.3 Datasets:

From the total 42 molecules, 26 of these were selected for training set by keeping its division 75% with PLS factor as five and 7 were selected as test set based on the rule that the activity of the training dataset should range from the most potent to the least potent one 12. PHASE provides a set of built-in pharmacophoric features such as hydrophobic group, negatively ionizable, positively ionizable, Electron withdrawing and hydrogen bond donor presented in-. The model was prepared using these features to generate sites for all the compounds which can be used to create partial least-squares (PLS) factors of 3D-QSAR model 13.

 

2.3 PLS (Partial Least Square) analysis and validation:

The predictive value of the models was evaluated by leave one- out (LOO) to generate cross validated-  r2cv value(regression), SD(Standard deviation), F( variance ratio), P(significance level of F), RMSE (root mean square error),Q2value (predicted activity) and Pearson-R (relation between the predicted and observed activity for the test set) 14. 3D-QSAR models are accepted if they fulfill the conditions - r2cv>0.5 and Q2> 0.6

 

2.4 External validation:

A Q2 is mostly useful but not satisfactory factor for the validation of model. In several cases it was observed that model with good values of r2cv and r2 were found out to be disappointing 15. Although a model gives good prediction on the basis of test set data, it’s still not assured that it will always predict fine new data set of compounds. Thus, an external test with r2pred (predictive r2) is suggested for prediction.


 

Table 1: Molecules with structures for QSAR study with their measured biological activity, predicted activity and dataset.

Compound no.

Structures

IC50 value (µM)

PIC50 value (µM)

Datasets

17a

 

 

1.24

 

5.9066

 

Training

17b

 

 

1.10

 

 

5.9586

 

Training

17c

 

 

1.12

 

5.9508

 

Training

17d

 

 

0.98

 

6.0088

 

Training

17e

 

 

1.36

 

5.8665

 

Training

17f

 

 

0.94

 

6.0269

 

Test

17g

 

 

0.67

 

6.1739

 

        _

17h

 

 

0.47

 

6.3279

 

Training

17i

 

 

0.60

 

6.2218

 

Training

17j

 

 

0.45

 

6.3468

 

Test

17k

 

 

1.18

 

5.9281

 

 _

17l

 

 

0.19

 

6.7212

 

Training

17m

 

 

0.20

 

6.6990

 

_

17n

 

 

0.48

 

6.3188

 

Test

17o

 

 

0.68

 

6.1675

 

_

17p

 

 

2.32

 

6.6345

 

_

17q

 

 

0.52

 

6.2840

 

Training

17r

 

 

0.94

 

6.0269

 

Test

17s

 

 

0.88

 

6.0555

 

Training

17t

 

 

 

16.36

 

 

4.7862

 

 

_

17u

 

 

1.00

 

6.0000

 

Training

17v

 

 

6.96

 

5.1574

 

_

19a

 

 

 

0.46

 

 

6.3372

 

 

Training

19b

 

 

 

 

0.44

 

 

 

6.3565

 

 

 

Training

19c

 

 

 

0.18

 

 

6.7447

 

 

Training

19d

 

 

 

0.21

 

 

6.6778

 

 

-

19e

 

 

 

 

0.35

 

 

 

6.4559

 

 

 

Training

19f

 

 

 

0.95

 

 

6.0223

 

 

Training

19g

 

 

 

0.71

 

 

6.1487

 

 

Training

19h

 

 

 

0.029

 

 

7.5376

 

 

Training

19i

 

 

 

0.014

 

 

7.8539

 

 

Training

19j

 

 

 

0.069

 

 

7.1612

 

 

Training

19k

 

 

 

0.096

 

 

7.0177

 

 

Test

19l

 

 

 

0.017

 

 

7.7696

 

 

Training

19m

 

 

 

 

0.017

 

 

 

7.7696

 

 

 

Training

19n

 

 

 

0.095

 

 

7.0223

 

 

_

19o

 

 

0.27

 

6.5686

 

Training

19p

 

 

0.12

 

6.9208

 

Training

19q

 

 

0.42

 

6.3768

 

Test

19r

 

 

0.27

 

6.5686

 

Test

19s

 

 

0.21

 

6.6778

 

Training

19t

 

 

0.15

 

6.8239

 

Training

Larotrectinib

 

 

0.0034

 

8.4685

 

Training

 


3. RESULTS AND DISCUSSION:

Atom Based 3D QSAR analysis:

The atom-based 3D QSAR model was developed on the basis of their biological activity and their relation to its structural attributes. The graphical representation of the relation between activity and predicted activity of test and training set is shown in. While creating this model we employed a dataset of 42 molecules for which ligand preparation was performed with OPLS_2005 force field at pH 7.4. Then 75% division was kept for the training set with OPLS factor as five 16. Scatter plot was constantly checked for best fit line molecules and those molecules which had shown deviation were left out, by which a  test and a training set was built and after completion of model development, the left out molecules were then inserted into the model for predicting its activity. The scatter plot for both test and training set are shown in Figure . The above model was selected due to its high Q2 (0.8296) and appropriate r2 (0.9797).The atom based QSAR statistics for the selected model with 5 PLS factors are shown at.Due to its high cross validation coefficients this model tends to be precise and efficient.

 

 

 

3D QSAR visualization:

The 3D QSAR study was visualised in fields of Hydrophobic interactions, Negative Ionic interactions, Positive ionic interactions, electron withdrawing) and other factors such that the dark blue(good activity) and red colour(bad activity) regions indicates the positive and negative thresholds on the activity. The visualisations of the QSAR model are shown in 2 17. This visualisation gives us the insight about the structural properties present in the ligand molecule and their relation to its biological activity with also the effect of functional groups substitution in the molecule.


 

Table 2. Atom based (QSAR) statistics with five best common PLS factors

Factors

SD

r2

 r2cv

r2pred

F

P

RMSE

Q2

Pearson-R

1

0.2926

0.8357

0.6698

0.2359

127.2

2.68e-11

0.16

0.7512

0.9259

2

0.1989

0.9271

0.7898

0.4190

152.7

2.24e-14

0.15

0.7886

0.9191

3

0.1592

0.9552

0.8192

0.5747

163.6

1.18e-15

0.16

0.7506

0.8870

4

0.1392

0.9673

0.8163

0.6536

162.6

5.35e-16

0.15

0.7850

0.9029

5

0.1122

0.9797

0.8090

0.7091

202.9

4.94e-17

0.13

0.8296

0.9160

 


 

Dataset plot for experimental

 

Test dataset plot


 

 

Training dataset plot  data versus predicted data

Figure 1: Scatter plot for Atom Based 3D QSAR showing relation between activity observed and the predicted activity

Table 3: QSAR results of pharmacophoric features

Factors

H-bond donor

Hydrophobic interaction

Negative ionic

Positive ionic

Electron-withdrawing

1

0.032

0.718

0.003

0.023

0.224

2

0.028

0.715

0.003

0.033

0.221

3

0.031

0.682

0.007

0.041

0.239

4

0.028

0.678

0.010

0.039

0.245

5

0.028

0.723

0.008

0.038

0.203


3.1 Hydrophobic/ non-polar visualization:

 

 

(19i)

3.2 Electron withdrawing interaction visualization:

 

 

(19i)

 

 

(Larotrectinib)

 

 

(17e)

3. 3 Hydrogen bond donor interaction:

 

(Larotrectinib)

 

 

(17e)

Figure 2. Atom Based 3D QSAR visualization images

 

2D QSAR analysis and visualization:

Here, we used 2D Canvas which provides seven fingerprint models which are as follows: atom pair, atom triplet, dendritic, molprint, radial, linear and topological models 18. It is associated with Kernel based PLS (KPLS) in order to generate a precise QSAR model that gives information about the overall favorable and unfavorable attributes of the given dataset. The same set of test and training used in atom-based 3D QSAR was utilized here presented in.

 

Among all models, the linear model was found out to be most suitable which helps in predicting the activity more precisely with the training validating set- r2 and standard deviation (SD) values as 0.9858 and 0.0919 respectively and test validating set Q2 and RMSE values as 0.9087 and 0.09906 respectively with 5 KPLS factors. The statistical data of the Fingerprint based 2D QSAR model are presented at 19. The 2D QSAR fingerprint visualization of the linear model can be seen in. It is shown that the red and blue color regions represent the positive and negative activity effects. The scatter plots for 2D QSAR is shown in .

 

Table 4: Data statistics of the 2D QSAR model

 

Training

Test

KPLS factors

SD

r2

RMSE

Q2

1

0.3913

0.6902

0.1596

0.7629

2

0.2201

0.9061

0.07249

0.9427

3

0.1665

0.9486

0.1143

0.8784

4

0.1207

0.9742

0.1061

0.8952

5

0.0919

0.9858

0.09906

0.9087

 

 

Scatter plot for both training and test set


   

Training set plot                                                                                          Test set plot

Figure 3. Scatter plots for Fingerprint Based 2D QSAR

 

       

(Larotrectinib)                                                                               (19i)

Figure 4: Linear fingerprint 2D QSAR model visualization

 


4 CONCLUSION:

The atom based 3D QSAR model and 2D QSAR model were designed and suitable models were generated useful for predicting the tetrahydropyrrolo[3,4-c]pyrazol derivatives prior to their synthesis, developed for predicting the anti-cancer activityagainst TRKs . The given study indicates the credibility of derived QSAR model by the determination of suitable statistical parameters as we have observed high relationship between experimental and predicted activity values showing ligand molecule larotrectinib with various possibilities of structural modifications to develop potential molecules with significant TRKs inhibitory activity and also predict the activity of any unknown derivative. The data reported by the above QSAR models provides necessary directions for the designing of new TRKs inhibitors against cancer.

 

5. CONFLICT OF INTEREST STATEMENT:

The authors declared no conflict of interest.

 

6. ACKNOWLEDGEMENTS:

Authors would like to acknowledge Manipal College of Pharmaceutical Sciences for the facilities to carry out the research work.

 

 

7. REFERENCES

1.      Zhao M, Wang L, Zheng L, Zhang M, Qiu C, Zhang Y, Donshu D, Bing. 2D-QSAR and 3D-QSAR Analyses for EGFR Inhibitors. Biomed Res Int. 2017:4649191 https://doi.org/10.1155/2017/4649191.

2.      Verma R, Bairy I, Tiwari M, Bhat GV, Shenoy GG. In silico studies, synthesis and anticancer activity of novel diphenyl ether-based pyridine derivatives. Mol Divers. 2019;23(3):541–554: https://doi.org/10.1007/s11030-018-9889-1

3.      N. Noolvi M, M. Patel H, Bhardwaj V. A Comparative QSAR Analysis of Quinazoline Analogues as Tyrosine Kinase (erbB-2) Inhibitors. Med Chem. 2011;7(3):200–212. https://doi.org/10.2174/157340611795564213

4.      Goudzal A, Hadaji E, Bouachrine M, El Hamdani H, Ouammou A. QSAR and docking molecular models to predict anti-cancer activity on a series of azacalix [2] arene [2] pyrimidine derivatives as anticancer agents. Mater Today Proc. 2020;51(2).https://doi.org/10.1016/j.matpr.2020.08.002

5.      Ul-Haq Z, Effendi JS, Ashraf S, Bkhaitan MM. Atom and receptor based 3D QSAR models for generating new conformations from pyrazolopyrimidine as IL-2 inducible tyrosine kinase inhibitors. J Mol Graph Model. 2017;74:379–395. https://doi.org/10.1016/j.jmgm.2017.03.016

6.      Pourbasheer E, Amanlou M. 3D-QSAR analysis of anti-cancer agents by CoMFA and CoMSIA. Med Chem Res. 2014;23(2):800–809. https://doi.org/10.1007/s00044-013-0676-3

7.      Kasmi R, Hadaji E, Chedadi O, El Aissouq A, Bouachrine M, Ouammou A. 2D-QSAR and docking study of a series of coumarin derivatives as inhibitors of CDK (anticancer activity) with an application of the molecular docking method. Heliyon. 2020;6(8):e04514.  https://doi.org/10.1016/j.heliyon.2020.e04514

8.      Singh KD, Karthikeyan M, Kirubakaran P, Nagamani S. Pharmacophore filtering and 3D-QSAR in the discovery of new JAK2 inhibitors. J Mol Graph Model [Internet]. 2011;30:186–197. http://dx.doi.org/10.1016/j.jmgm.2011.07.004

9.      Wu T, Zhang C, Lv R, Qin Q, Liu N, Yin W, et al. Design, synthesis, biological evaluation and pharmacophore model analysis of novel tetrahydropyrrolo[3,4-c]pyrazol derivatives as potential TRKs inhibitors. Eur J Med Chem. 2021;223:113627. https://doi.org/10.1016/j.ejmech.2021.113627

10.   Ma J, Zhang H, Zhang X, Lei M. 3D-QSAR studies of D3R antagonists and 5-HT1AR agonists. J Mol Graph Model [Internet]. 2019;86:132–141. https://doi.org/10.1016/j.jmgm.2018.10.013

11.   Wang M, Li W, Wang Y, Song Y, Wang J, Cheng M. In silico insight into voltage-gated sodium channel 1.7 inhibition for anti-pain drug discovery. J Mol Graph Model. 2018;84:18–28.  https://doi.org/10.1016/j.jmgm.2018.05.006

12.   Sadeghian-Rizi S, Sakhteman A, Hassanzadeh F. A quantitative structure-activity relationship (QSAR) study of some diaryl urea derivatives of B-RAF inhibitors. Res Pharm Sci. 2016;11(6):445–453. https://doi.org/10.4103/1735-5362.194869

13.   Dermeche K, Tchouar N, Belaidi S, Salah T. Qualitative Structure-activity relationships and 2D-QSAR modeling of TNF-α inhibition by thalidomide derivatives. J Bionanoscience. 2015;9(5):395–400. http://doi.org/10.1166/jbns.2015.1320

14.   Kristam R, Parmar V, Viswanadhan VN. 3D-QSAR analysis of TRPV1 inhibitors reveals a pharmacophore applicable to diverse scaffolds and clinical candidates. J Mol Graph Model. 2013;45:157–172. http://dx.doi.org/10.1016/j.jmgm.2013.08.014

15.   Zhao Y, Gu W, Li Y. Molecular design of 1,3,5,7-TetraCN derivatives with reduced bioconcentration using 3D-QSAR modeling, full factorial design, and molecular docking. J Mol Graph Model. 2018;84:197–214. http://doi: 10.1016/j.jmgm.2018.07.006

16.   Jiménez Villalobos TP, Gaitán Ibarra R, Montalvo Acosta JJ. 2D, 3D-QSAR and molecular docking of 4(1H)-quinolones analogues with antimalarial activities. J Mol Graph Model [Internet]. 2013;46:105–124. http://dx.doi.org/10.1016/j.jmgm.2013.10.002

17.   Poojita K, Fathima F, Ray R, Kumar L, Verma R. 3D Atom based QSAR model of DprE1 inhibitors as Anti-tubercular Agents. Res J Pharm Technol. 2021;14(11):5903–10. https://doi.org/10.52711/0974-360X.2021.01026

18.   Amata E, Marrazzo A, Dichiara M, Modica MN, Salerno L, Prezzavento O, et al. Comprehensive data on a 2D-QSAR model for Heme Oxygenase isoform 1 inhibitors. Data Br. 2017;15:281–299. https://doi.org10.1016/j.dib.2017.09.036

19.   Poojita K, Fathima F, Ray R, Kumar L, Verma R. Atom based 3D QSAR and Fingerprint based 2D QSAR of Novel Molecules as MmpL3 receptor inhibitors for Mycobacterium tuberculosis. Res J Pharm Technol. 2021;14(12):6321–6329. https://doi.org/10.52711/0974-360X.2021.01093

 

 

 

 

 

Received on 09.09.2017            Modified on 18.04.2022

Accepted on 05.08.2023           © RJPT All right reserved

Research J. Pharm. and Tech 2023; 16(10):4681-4690.

DOI: 10.52711/0974-360X.2023.00761