Acessibilidade / Reportar erro

Prediction of Gas Chromatographic Retention Indices of Coumarins

Abstracts

Quantitative structure-retention relationships (QSRR) were used in this study to relate the chromatographic retention of different substituted coumarins to molecular structure. Different structural parameters were selected, such as topological, geometric, electronic, quantum-chemical and physico-chemical descriptors, in order to find an equation that fitted the chromatographic retention of these compounds. The method proposed by Dimov that classifies the descriptors in different groups in agreement with their values of correlation coefficients was analysed. Significant correlation equations were obtained with the following molecular descriptors: the total surface area (A T), the electrotopological state index (S(-o-)) of the oxygen in position 1 of coumarin, and the highest occupied molecular orbital energy (E HOMO), showing that the experimental retention, using stationary phases with low polarity, was related with the shape and electronic factors of the solutes. The models found have a good predictive ability as established by cross-validation r²cv values and thus, can be used to aid in the elucidation of the structure or the chromatographic retention of similar coumarins.

QSRR; retention indices; coumarins; molecular descriptors; multiple linear regression


O estudo de correlação entre estrutura-retenção (QSRR) foi usado para relacionar a retenção cromatográfica de diferentes cumarinas substituídas as suas características estruturais. Diferentes parâmetros estruturais foram selecionados tais como: descritores topológicos, geométricos, eletrônicos, químico-quânticos e físico-químicos para encontrar uma equação que melhor explique o comportamento cromatográfico destes compostos. O método proposto por Dimov que classifica os descritores em diferentes grupos de acordo com seus valores de coeficientes de correlação foi utilizado neste estudo. As melhores equações de correlação linear múltipla foram obtidas com os seguintes descritores: área total (A T), índice do estado eletrotopológico (S-O-) e orbital molecular ocupado de maior energia (E HOMO), mostrando que a retenção experimental, usando fase estacionária de baixa polaridade está relacionada principalmente com fatores relacionados a forma e também eletrônicos. Os modelos selecionados apresentaram uma boa predictabilidade, a qual foi estabelecida através dos valores do coeficiente de correlação de validação cruzada (r²cv). As equações encontradas podem ser utilizadas para auxiliar na elucidação da estrutura ou retenção cromatográfica de cumarinas similares às estudadas.


Article

Prediction of Gas Chromatographic Retention Indices of Coumarins

Míriam de Freitas Soares a , Franco Delle Monache b , Vilma Edite Fonseca Heinzen a * , and Rosendo A. Yunes b

a Laboratório de Estrutura-Atividade, Departamento de Química, CFM,

Universidade Federal de Santa Catarina, 88040-900 Florianópolis - SC, Brazil

b Instituto di Chimica Recettori Molecole Biologicamente Attive del C.N.R.,

Università Cattolica S. Cuore, Largo F. Vito 1,00168 Rome, Italy

O estudo de correlação entre estrutura-retenção (QSRR) foi usado para relacionar a retenção cromatográfica de diferentes cumarinas substituídas as suas características estruturais.

Diferentes parâmetros estruturais foram selecionados tais como: descritores topológicos, geométricos, eletrônicos, químico-quânticos e físico-químicos para encontrar uma equação que melhor explique o comportamento cromatográfico destes compostos.

O método proposto por Dimov que classifica os descritores em diferentes grupos de acordo com seus valores de coeficientes de correlação foi utilizado neste estudo.

As melhores equações de correlação linear múltipla foram obtidas com os seguintes descritores: área total (AT), índice do estado eletrotopológico (S-O-) e orbital molecular ocupado de maior energia (EHOMO), mostrando que a retenção experimental, usando fase estacionária de baixa polaridade está relacionada principalmente com fatores relacionados a forma e também eletrônicos.

Os modelos selecionados apresentaram uma boa predictabilidade, a qual foi estabelecida através dos valores do coeficiente de correlação de validação cruzada (r2cv). As equações encontradas podem ser utilizadas para auxiliar na elucidação da estrutura ou retenção cromatográfica de cumarinas similares às estudadas.

Quantitative structure-retention relationships (QSRR) were used in this study to relate the chromatographic retention of different substituted coumarins to molecular structure. Different structural parameters were selected, such as topological, geometric, electronic, quantum-chemical and physico-chemical descriptors, in order to find an equation that fitted the chromatographic retention of these compounds.

The method proposed by Dimov that classifies the descriptors in different groups in agreement with their values of correlation coefficients was analysed.

Significant correlation equations were obtained with the following molecular descriptors: the total surface area (AT), the electrotopological state index (S(-o-)) of the oxygen in position 1 of coumarin, and the highest occupied molecular orbital energy (EHOMO), showing that the experimental retention, using stationary phases with low polarity, was related with the shape and electronic factors of the solutes.

The models found have a good predictive ability as established by cross-validation r2cv values and thus, can be used to aid in the elucidation of the structure or the chromatographic retention of similar coumarins.

Keywords: QSRR, retention indices, coumarins, molecular descriptors, multiple linear regression

Introduction

The quantitative structure-retention relationships (QSRR)1, establishes the relationship between a chemical structure and its chromatographic retention value. The chromatographic retention is due to the integral effect of molecular interactions between the analyte and both mobile and stationary phase molecules. To define the structural descriptors best related to the characteristic of a particular solute to undergo both polar and dispersive interactions is an important step to understanding the process of chromatographic separation.

Using the QSRR approach, structural descriptors such as topological, geometric and electronic or physico-chemical descriptors can be proposed for molecules with similar structural features, and a subset of them can be selected in order to describe and predict the gas chromatographic retention indices2. Numerous authors have attempted to correlate and predict retention indices of a variety of compounds by use of molecular structural information or physico-chemical and quantum-chemical parameters3-20.

In this study, our aim was to select the best descriptors related to the chromatographic retention of different substituted coumarins (some of them are interesting natural products) using the QSRR, to broaden the approach of Dimov et al.3,4 which takes into account only hydrocarbons (alkylbenzenes, alkylnaphthalenes, isoalkanes and alkenes).

Dimov et al.3,4 suggested that the descriptors should be divided in groups according to correlation coefficient values for the simple linear regression. After this, the descriptors of different groups would be related through multiple regression to obtain the best equations. These equations would be formed by two groups of descriptors: basic and tuning. The basic descriptors are those that permit the calculation of the retention index values closest to the experimental values. The tuning descriptors are those that further tune the calculated value to experimental ones.

Experimental

Samples and standards

The different coumarins studied were obtained by extraction in the Instituto di Chimica Recettori Molecole Biologicamente Attive del C.N.R., Università Cattolica S. Cuore. The 6-methylcoumarin, 7-methylcoumarin, 4-hydroxycoumarin, coumarin and 3,4-dihydrocoumarin were obtained commercially from the Aldrich Chemical Company Inc. The analytical standards of hydrocarbons (n-alcanes, nC12 to nC22) were supplied by Aldrich Chemical Company Inc.

Method of chromatographic analysis

The gas chromatographic retention data for 20 substituted coumarins chromatographed on two stationary phases with low polarities are indicated in Table 1. The sample solutions were prepared with acetone, and to some of them methanol was added. Samples of 1 mL were injected into a gas chromatograph Shimadzu (Model GC-14A) equipped with a flame ionisation detector, connected to a recorder (Shimadzu Model C-R6A). The carrier gas was hydrogen at a flow-rate of 2.0 mL min-1 in all instances. Two fused silica capillary columns of LM 1 no. 360696 (L&M São Carlos, SP, Brazil) a polydimethylsiloxane liquid phase with 0.25 mm i.d. and of CBP5 PN 221-28636-25 (Shimadzu Co., Kyoto, Japan) a polydimethylphenylsiloxane 5% phenyl, bonded liquid phase with 0.22 mm i.d. were utilised. Both the columns were 25 m long and with 0.25 mm film thickness. The injector and detector temperatures were 250 °C and 280 °C, respectively. The retention times were measured at different isothermal column temperatures: 170 °C, 190 °C, 210 °C, 220 °C and 230 °C. The experimental retention time (tR) was obtained using the average of three experimental determinations. The standard deviations are less than 0.01%.

The dead time (tM) was determined by the Method of Sharles and Vermon21. The measurements were always taken before each determination of the retention time.

The Kovàts retention indices were determined through the adjusted retention times (tR - tM), according to the equation of Kovàts22, using n-alcanes as standards.

Method of calculation of molecular descriptors

The structural descriptors used in the present study were topological and quantum-chemical indices and physico-chemical properties.

Several extensive reviews have been published which give detailed descriptions of the theory and method of calculation of all topological indices used in the present investigation23-28.

The Wiener indices (W)23, the molecular connectivity indices of different orders (1X, 2X, 3Xp, 4Xp, 5Xp, 3Xc, 4Xpc,1Xv, 2Xv, 3Xvp, 4Xvp, 5Xvp, 3Xvc and 4Xvpc)24, indices of molecular shape, kappa values (1K, 2K, 3K)25, indices of electrotopological state (Si)26,27 and topological state indices (Ti)28, were calculated by the method of Kier and Hall, utilizing the Molconn-X computer program for molecular topological analysis29.

The topological state index Ti, of an atom, gives information on the topological environment of the atom according to the presence of the other atoms in the molecule. The values of Ti = T(=O), are related to the oxygen of the carbonyl in the basic structure of coumarins. T(-O-) indices are related to the oxygen of position 1 in the coumarin. They distinguish also all the compounds studied (see Table 2).

The electrotopological state index (Si) gives information about the electronic and topological state of an atom according to the presence of other atoms in the molecule. It was considered that S(-O-), related to the oxygen of position 1 in the coumarin molecule, distinguished all the compounds studied (Table 2).

The molecules were drawn using the PcModel version 3.01 computer program, and the figures were exported to be geometrically energy-minimised using the PM3 quantum mechanical method. Subsequently, the energy-optimised molecular structure was used for generating a number of molecule parameters utilising the semi-empirical molecular orbital Mopac program30.

The following quantum mechanical molecular properties were chosen for the study: the highest occupied molecular orbital energy (EHOMO), and the lowest unoccupied molecular orbital energy (ELUMO). Other derived properties obtained were: the total surface area (AT), saturated area (AS), unsaturated area (AUNS), polar area (AP), molecular volume (VMOLEC), molar volume (VMOLAR), total energy (ET), molar refractivity (RM). In addition, the following physical properties of the coumarins were also selected: molecular mass (MM) and boiling point (Tb). Some of the values are given in Table 2.

The best regression equations were selected on the basis of the multiple correlation coefficient (r), r-squared, standard deviation (SD), the overall F value and the F-remove for individual descriptors (F-to-remove: 4.00) in the stepwise selection. Finally, in order to estimate the predictive power of the model, cross-validation, following the leave-one-out scheme was performed32,33. The reliability of the model was tested in a cross-validation with determination of r2cv32.

All the calculations of single and multiple linear regression analyses were carried out using the Statgraf and Origin computer programs.

Results and Discussion

The basic structure of mono-substituted coumarins with the corresponding substituents studied are given below:

The values of the chromatographic retention indices (RI) of different coumarins studied, using the stationary phases LM-1 (RILM-1) and CBP5 (RICBP5) at different isothermal temperatures of columns 170-210 °C, are given in Table 1.

The retention times were determined at three different temperatures within the linear range of the curve. The correlation coefficients of these curves are in the range of 0.9992 - 0.9998 in CBP5 and 0.9962 - 0.9991 in LM-1.

Some molecular descriptors values are given in Table 2.

According to the r values, the molecular descriptors were classified in agreement with the method proposed by Dimov et al.3,4, into four groups depending on their individual significance for the retention studied. These groups are identified as follows: substantial, important, likely and specific.

In the present study, none of the available descriptors were enclosed in the group of the substantial and important ones, that is no r was obtained above 0.80. This classification is also shown in Table 3.

In the multiple linear regression, the contributors to the retention in agreement with the method proposed by Dimov et al.3,4 were divided into two groups, basic and tuning:

where RIcalc represents the calculated retention. The Bi parameters include those solute descriptors which allow the calculation of the RIcalc value closest to the experimental retention (RIexp). The parameters Tjare solute descriptors, which tune the RIcalc value to that of RIexp. Owing to the restriction of statistically-permitted intercorrelations, there is normally only one basic descriptor to be used. The problem is choice which basic descriptor forms the best combination with tuning contributor(s).

Different combinations of the variables representing each of two terms described above were tried to obtain the equation of best fit, which would relate the RI values of the reference compounds to their molecular structure. Multiple linear equations with two variables produced correlation coefficients with low values (the best was r = 0,8498 with AT and S(-o-) in LM-1).

The series of compounds studied can be described by an equation comprising three types of terms: the size of the molecule, and the topological and electronic factors. The best equations were obtained with AT, 1K, W, 3Xp and VM as the first term of the equation (basic descriptors), reflecting the bulk properties of the molecule. These basic descriptors have high colinearity to each other. However, only one of them was used in each equation. The second and third terms (tuning descriptors) were represented by S(-o-) related with the topological and electronic factors and EHOMO related with the ionisation potential of the molecule. The colinearities between them were also checked.

The best multiple linear regression Eqs. 2 - 3 with three variables with LM-1 and CBP5 stationary phases (170 °C) were:

The best simple regressions were obtained with descriptors related to the shape, size, area, volume and molecular refractivity of the substance studied. In the multiple linear regression the electronic descriptors were introduced.

The total area was the first parameters introduced into the multiple linear equations as it was found to be the most significant (see the values of F, Eqs. 2-3). The positive sign of the coefficient in the Eqs. 2-3 indicates that retention indices increase with the total area values as is observed in Table 2. The second parameter introduced in the equations (the second most significant according to the values for F, Eqs. 2 and 3) was the electrotopological index (Si) related to the oxygen atom in the position 1 of the coumarins. The nature of the substituents in the molecule influences this index. Substituents with electron-donating effect contribute to increase the value of the index and those with electron-withdrawing effect decrease it. The negative sign of the coefficient in Eqs. 2 and 3 indicates that retention indices increase as the electrotopologic indices (S-o-) in the molecule decrease. In agreement with the experimental retention indices, it can be observed that in a general way, compounds with more electronegative substituents have larger retention values.

The third parameter introduced in the multiple regression equations according to its significance (see the values of F in the Eqs. 2 and 3) was the energy of the highest occupied molecular orbital of the solute, EHOMO. The introduction of this parameter, that is less significant, resulted in an increase of the multiple correlation coefficients (r). EHOMO is related to the energy necessary to remove an electron from a molecule. The sign of the coefficient in the Eqs. 2-3 is positive, thus the negative values of EHOMO that correspond to more electron-donating molecules, contribute to decrease the chromatographic retention. This is, in accordance with the value of S(-o-), thus EHOMO should correct the values of S(-o-), considering that EHOMO is a molecule global value and not localised in an atom as S(-o-).

The validation of the model was checked through the values of r2cv (coefficient of validation of the capacity of prediction of the model) which was better for the Eqs. 2 and 3, where the total area was the most significant parameter.

The calculated retention indices with the Eqs. 2 and 3, residuals and standard residuals are given in Table 4.

The correlation between experimental retention indices (LM-1 and CBP5) and those calculated by Eqs. 2 and 3 is shown in Fig. 1.


It is observed in Fig. 1 that two points corresponding to compounds 2 and 15 (Table 4) exhibit a significantly large s. When the correlation is made without these outliers the correlation coefficient is improved, r = 0.9647 for the ILM-1 and r = 0.9642 for ICBP5.

The interactions between the molecules of the solute and those of the two stationary phases used in this work are similar. These are observed by the simple linear correlations obtained between the experimental retention indices (or calculated) of the two stationary phases, whose correlation coefficient is excellent.

The angular coefficient of this Eq. 4 demonstrates that there is not appreciable discrimination between the phases. However, the intercept shows that the values of ICBP5 are smaller than that of ILM-1. This fact could be explained by different diffusion rates in the stationary phases or other dynamic factors.

Conclusions

This study demonstrates that the methodology used previously by Dimov to model GC retention characteristics of the hydrocarbons (alkylbenzenes, alkylnaphthalenes, isoalkanes and alkenes) can also be applied to model the retention indices of substituted coumarins, although the correlations were not as good as the first ones.

The descriptors that are able to describe the phenomena that took place in the process of the retention of the studied compounds are related with the shape and the electronic characteristics of the molecules.

The models found have good predictive ability as established by cross-validation r2cv values and thus can be used to help in the elucidation of molecular structures or prediction of chromatographic retention indices of similar compounds.

Acknowledgement

The authors thank CNPq (Brazil) for financial support.

  • 1. Kaliszan R. Quantitative Structure-chromatographic Retention Relationships; John Wiley & Sons; New York, 1987.
  • 2. Sutter, J.M.; Peterson, T.A.; Jurs, P.C. Anal. Chim. Acta 1997, 342, 113.
  • 3. Dimov, N.; Osman, A. Anal. Chim. Acta 1996, 323, 15.
  • 4. Dimov, N.; Osman, A.; Mekenyan, Ov.; Papazova, D. Anal. Chim. Acta 1994, 298, 303.
  • 5. Heinzen, V.E.F.; Yunes, R.A. J. Chromatogr 1992, 598, 243.
  • 6. Heinzen, V.E.F.; Yunes, R.A. J. Chromatogr 1993, 654, 83.
  • 7. Arruda, A.C.; Heinzen, V.E.F.; Yunes, R.A. J. Chromatogr 1993, 630, 251.
  • 8. Heinzen, V.E.F.; Yunes, R.A. J. Chromatogr. A 1996, 719, 462.
  • 9. Salabert-Salvador, M.T.; Garcia-March, F.J.;Pérez-Giménez F. et al. Chromatographia 1995, 41, 702.
  • 10. Righezza, M.; Hassani, A.; Meklati, B.Y.; Chrétien, J.R. J. Chromatogr. A 1996, 723, 77.
  • 11. Garcia-March, F.J.; Antón-Fos, G.M.; Pérez-Giménez, F.; Salabert-Salvador, M.T.; Cercós-del-Pozo, R.A.; Julian-Ortiz, J.V. J. Chromatogr.A 1996, 719, 45.
  • 12. Zhang, X.; Lu, P. J. Chromatogr. A 1996, 731, 187.
  • 13. Peng, C.T. J. Chromatogr. A 1994, 687, 189.
  • 14. Li, J.; Carr, P.W. J. Chromatogr. A 1994, 659, 367.
  • 15. Hilal, S.H.; Carreira, L.A. J. Chromatogr. A 1994, 662, 269.
  • 16. Kalliszan, R. Anal. Chem 1992, 64, 619A.
  • 17. Mihalic, Z.; Trinajstic, N. J. Chem. Educ 1992, 69, 701.
  • 18. Kier, L.B. In Physical Chemical Properties of Drugs, Yalkowsky, S.H.; Sinkula, A.; Valvani, S.C., eds., Marcel Dekker; N. Y., Ch. 9, p. 277, 1980.
  • 19. Hansch, H; Leo, A. Exploring QSAR. Fundamentals and Applications in Chemistry and Biology; Washington, ACS, 1995.
  • 20. Santos, H.F.; Almeida, W.B. J. Molec. Struct. 1995, 335, 129.
  • 21. Sharples, W.E.; Vermon, F. J. Chromatogr 1978, 161, 83.
  • 22. Tarjan, G.; Nyiredy, Sz.; Gyor, M. et al. J. Chromatogr 1989, 472, 1.
  • 23. Wiener, H. J. Am. Chem. Soc. 1947, 69, 17 and 2636.
  • 24. Kier, L.B.; Hall, L.H. Molecular Connectivity in Structure-Activity Analysis; Wiley; New York, 1986.
  • 25. Kier, L.B. Quant. Struct.-Act. Relat 1987, 6, 8.
  • 26. Kier, L.B.; Hall, L.H. Pharm. Res 1990, 7, 801.
  • 27. Hall, L.H.; Mohney, B.; Kier, L.B. J. Chem. Inf. Comput. Sci. 1991, 31, 76.
  • 28. Hall, L.H.; Kier, L.B. Quant. Strut.-Act. Relat. 1990, 9, 115.
  • 29. Hall, L.H.; Kier, L.B. Molconn-X, a Program for Molecular Topology Analysis; Users Guide 1. ed. Quincy; Massachusetts, 1991.
  • 30. Stewart, J.J.P. MOPAC, version 6.0, J. Seiler Research Laboratory, US Air Force Academy, Colorado Springs. CO80840-6528, USA,1990.
  • 31. Makovskaya, V.; Dean, J.R.; Tomlinson, W.R.; Comber, M. Anal. Chim. Acta 1995, 315, 193.
  • 32. Muresan, S.; Bologa, C.; Mracec, M.; Chiriac, A.; Jastorff, B.; Simon, Z.; Naráy-Szabó, G. J. Molec. Struct 1995, 42, 161.
  • 33. Wagener, M.; Sadowski, J.; Gasteiger, J. J. Am. Chem. Soc 1995, 117, 7769.

Publication Dates

  • Publication in this collection
    23 Oct 2002
  • Date of issue
    1999

History

  • Received
    05 May 1998
Sociedade Brasileira de Química Instituto de Química - UNICAMP, Caixa Postal 6154, 13083-970 Campinas SP - Brazil, Tel./FAX.: +55 19 3521-3151 - São Paulo - SP - Brazil
E-mail: office@jbcs.sbq.org.br