112.

Barbosa E. G., Martins J. P. A., Pasqualoto K. F. M., Ferreira M. M. C., "LQTAgrid: an open source package to generate 4D-QSAR descriptors". Porto de Galinhas, PE, Brazil, 09-13/11/2008: The 4th Brazilian Symposium on Medicinal Chemistry (BrazMedChem2008): Systems Chemical Biology, CD-ROM Online, (2008) No. 171.  Poster R0348-1.


Brazilian Chemical Society (SBQ). Division of Medicinal Chemistry. 4th Brazilian Symposium on Medicinal Chemistry

LQTAgrid: an open source package to generate 4D-QSAR descriptors

Barbosa, E.G.; Martins, J.P.A.; Pasqualoto, K. F. M.; Ferreira, M. M. C. - jpmartins@iqm.unicamp.br

Laboratório de Quimiometria Teórica e Aplicada - LQTA, Instituto de Química, UNICAMP, Campinas-SP, Brasil

Keywords: Molecular Dynamics, 4D-QSAR, Gromacs, LQTAgrid.
 

Introduction

3D-QSAR formalisms, such as comparative molecular field analysis (CoMFA),1 use a set of compounds to generate 3D descriptors for building partial least squares (PLS) models, and provide relevant information for developing ligand-based drug design. Hopfinger and co-workers2 reported an independent-receptor (IR) methodology where multiple conformations of each ligand obtained from molecular dynamics (MD) simulations are considered in the construction of IR 3D-QSAR models. Aiming to combine the advantages of both methods, CoMFA and IR 4D-QSAR, an open source package of programs was developed, named LQTAgrid.
Initially, the open source program GROMACS3 is employed to create a conformational profile (CP) of each ligand in the training set, from MD simulations having explicit solvent. Then, the CP of the ligands are aligned in a 3D virtual box or grid and the van der Waals and electrostatic energy contributions are calculated, using probes, to generate the 4D-QSAR descriptors matrix (LQTAgrid program). The construction of multivariate QSAR models can be performed according to the user's software preferences.
 

Results and Discussion

To validate the methodology proposed, the following two sets considering distinct classes were chosen: 44 inhibitors of p38 kinase4 (set 1) and 47 glucose analogue inhibitors of glycogen phosphorylase5 (set 2).
A previous variable selection was carried out, using the Pirouette package and the OPS algorithm6, which was developed in our research group. Reasonable QSAR models employing PLS and leave-one-out crossvalidation were obtained. The best QSAR model generated with set 1 presented the following statistical parameters values: q2 = 0.70; r2 = 0.83; and, standard error of calibration (SEC) of 0.26 and standard error of validation (SEV) of 0.30, with 3 latent variables (LV), which were statistically more significant than the values reported in ref. 4 [q2 = 0.55; r2 = 0.91; SEC = 0.19]. The values of q2 and r2 reported in ref. 4 are indicative of overffiting. Regarding set 2, the best QSAR model presented the values of statistical measures comparable to those from the original paper. LQTAgrid (q2 = 0.76; r2 = 0.80); ref. 5 (q2 = 0.83 and r2 = 0.87), SEV was 0.63 using 5 LV. Those QSAR models were validated applying Y-randomization and leave-N-out (N = 1 to 10) methodologies.
The descriptors selected in the best QSAR models can be graphically visualized (hot spots) in Figure 1. Favorable and unfavorable energy contributions (electrostatic and van der Waals) to the biological activity are defined based on the sign of the PLS regression coefficients. Those contributions correspond to possible ligand-receptor interactions, as well as favorable ligand occupations (places for adding functional groups that would increase the biological activity, for example). Figure 1 shows the graphical visualization of the 4D descriptors selected in the best QSAR model for a ligand from set 2.

Figure 1. Descriptors graphical representation considering the best QSAR model (set 2)
 

Conclusions

The methodology presented generates 4D descriptors, which after a variable selection, provide reliable and robust QSAR models. The collaborative license of the open source LQTAgrid program will allow its use for the construction of descriptor matrices in 4D-QSAR analyses.
 

Acknowledgements

The authors are grateful to CAPES, FAPESP and CNPq for the financial support.
 

1 Cramer, R.D.; Patterson, D.E.; Bunce J.D. J. Am. Chem. Soc. 1988, 110, 5959;
2 Albuquerque M.G.; Hopfinger, A.J.; Barreiro E. J.; de Alencastro R. B. J. Chem. Inf. Model. 1997, 38, 925;
3 Christen, M.; Hunenberger, P. H.; Bakowies, D.; Baron, R.; Burgi, R.; Geerke, D. P.; Heinz, T. N.; Kastenholz, M. A.; Krautler, V.; Oostenbrink, C.; Peter, C.; Trzesniak D. J. Comput. Chem. 2005, 26, 1719;
4 Ravindra, G.K.; Achaiah, G.; Sastry, G.N. Euro. J. Med. Chem. 2008, 43, 830.;
5 Venkatarangan, P.; Hopfinger, A. J. Chem. Inf. Comput. Sci. 1999, 39, 1141.
6 Teófilo, R.F.; Martins, J.P.A.; Ferreira, M.M.C. J. Chem. 2008, xx.
 
 
 
 

4th Brazilian Symposium on Medicinal Chemistry - BrazMedChem2008