Quantitative structure-activity relationship

	WikiDoc Resources for Quantitative structure-activity relationship
Articles
Most recent articles on Quantitative structure-activity relationship Most cited articles on Quantitative structure-activity relationship Review articles on Quantitative structure-activity relationship Articles on Quantitative structure-activity relationship in N Eng J Med, Lancet, BMJ
Media
Powerpoint slides on Quantitative structure-activity relationship Images of Quantitative structure-activity relationship Photos of Quantitative structure-activity relationship Podcasts & MP3s on Quantitative structure-activity relationship Videos on Quantitative structure-activity relationship
Evidence Based Medicine
Cochrane Collaboration on Quantitative structure-activity relationship Bandolier on Quantitative structure-activity relationship TRIP on Quantitative structure-activity relationship
Clinical Trials
Ongoing Trials on Quantitative structure-activity relationship at Clinical Trials.gov Trial results on Quantitative structure-activity relationship Clinical Trials on Quantitative structure-activity relationship at Google
Guidelines / Policies / Govt
US National Guidelines Clearinghouse on Quantitative structure-activity relationship NICE Guidance on Quantitative structure-activity relationship NHS PRODIGY Guidance FDA on Quantitative structure-activity relationship CDC on Quantitative structure-activity relationship
Books
Books on Quantitative structure-activity relationship
News
Quantitative structure-activity relationship in the news Be alerted to news on Quantitative structure-activity relationship News trends on Quantitative structure-activity relationship
Commentary
Blogs on Quantitative structure-activity relationship
Definitions
Definitions of Quantitative structure-activity relationship
Patient Resources / Community
Patient resources on Quantitative structure-activity relationship Discussion groups on Quantitative structure-activity relationship Patient Handouts on Quantitative structure-activity relationship Directions to Hospitals Treating Quantitative structure-activity relationship Risk calculators and risk factors for Quantitative structure-activity relationship
Healthcare Provider Resources
Symptoms of Quantitative structure-activity relationship Causes & Risk Factors for Quantitative structure-activity relationship Diagnostic studies for Quantitative structure-activity relationship Treatment of Quantitative structure-activity relationship
Continuing Medical Education (CME)
CME Programs on Quantitative structure-activity relationship
International
Quantitative structure-activity relationship en Espanol Quantitative structure-activity relationship en Francais
Business
Quantitative structure-activity relationship in the Marketplace Patents on Quantitative structure-activity relationship
Experimental / Informatics
List of terms related to Quantitative structure-activity relationship

Overview

Quantitative structure-activity relationship (QSAR) is the process by which chemical structure is quantitatively correlated with a well defined process, such as biological activity or chemical reactivity.

For example, biological activity can be expressed quantitatively as in the concentration of a substance required to give a certain biological response. Additionally, when physiochemical properties or structures are expressed by numbers, one can form a mathematical relationship, or quantitative structure-activity relationship, between the two. The mathematical expression can then be used to predict the biological response of other chemical structures.

QSAR's most general mathematical form is:

Activity = f(physiochemical properties and/or structural properties)

SAR and SAR paradox

The basic assumption for all molecule based hypotheses is that similar molecules have similar activities. This principle is also called Structure-Activity Relationship (SAR). The underlying problem is therefore how to define a small difference on a molecular level, since each kind of activity, e.g. reaction ability, biotransformation ability, solubility, target activity, and so on, might depend on another difference. A good example was given in the bioisosterism review of Patanie/LaVoie.^[1]

In general, one is more interested in finding strong trends. Created hypotheses usually rely on a finite number of chemical data. Thus, the induction principle should be respected to avoid overfitted hypotheses and deriving overfitted and useless interpretations on structural/molecular data.

The SAR paradox refers to the fact that it is not the case that all similar molecules have similar activities.

Applications

Chemical

One of the first historical QSAR applications was to predict boiling points.^[2]

It is well known for instance that within a particular family of chemical compounds, especially of organic chemistry, that there are strong correlations between structure and observed properties. A simple example is the relationship between the number of carbons in alkanes and their boiling points. There is a clear trend in the increase of boiling point with an increase in the number carbons and this serves as a means for predicting the boiling points of higher alkanes.

A still very interesting application is the Hammett equation, Taft equation and pKa prediction methods.

Biological

The biological activity of molecules is usually measured in assays to establish the level of inhibition of particular signal transduction or metabolic pathways. Chemicals can also be biologically active by being toxic. Drug discovery often involves the use of QSAR to identify chemical structures that could have good inhibitory effects on specific targets and have low toxicity (non-specific activity). Of special interest is the prediction of partition coefficient log P, which is an important measure used in identifying "drug-likeness" according to Lipinski's Rule of Five.

While many quantitative structure activity relationship analyses involve the interactions of a family of molecules with an enzyme or receptor binding site, QSAR can also be used to study the interactions between the structural domains of proteins. Protein-protein interactions can be quantitatively analyzed for structural variations resulted from site-directed mutagenesis.^[3].

It is part of the machine learning method to reduce the risk for a SAR paradox, especially taking into account that only a finite amount of data is available (see also MVUE). In general all QSAR problems can be divided into a coding^[4] and learning^[5] part.

Data mining

For the coding usually a relatively large number of features or molecular descriptors is calculated, which can lack structural interpretation ability. In combination with the later applied learning method or as preprocessing step occurs a feature selection problem.

A typical data mining based prediction uses e.g. support vector machines, decision trees, neural networks for inducing a predictive learning model.

3D-QSAR

3D-QSAR refers to the application of force field calculations requiring three-dimensional structures, e.g. based on protein crystallography or molecule superposition. It uses computed potentials, e.g. the Lennard-Jones potential, rather than experimental constants and is concerned with the overall molecule rather than a single substituent. It examines the steric fields (shape of the molecule) and the electrostatic fields based on the applied energy function.^[6]

The created data space is then usually reduced by a following feature extraction (see also dimensionality reduction). The following learning method can be any of the already mentioned machine learning methods, e.g. support vector machines.^[7]

In the literature it can be often found that chemists have a preference for partial least squares (PLS) methods, since it applies the feature extraction and induction in one step.

Molecule mining

Molecule mining approaches, a special case of structured data mining approaches, apply a similarity matrix based prediction or an automatic fragmentation scheme into molecular substructures. Furthermore there exist also approaches using maximum common subgraph searches or graph kernels.^[8] ^[9]

Fragment based (group contribution)

It has been shown that the logP of compound can be determined by the sum of its fragments. Fragmentary logP values have been determined statistically. This method gives mixed results and is generally not trusted to have accuracy of more than +/- 0.1 units.^[10]

Applicability Domain

As the use of (Q)SAR models for chemical risk management increases steadily and is also used for regulatory purposes (in the EU: Registration, Evaluation, Authorisation and Restriction of Chemicals), it is of crucial importance be able to assess the reliability of predictions. The chemical descriptor space spanned by a particular training set of chemicals is called Applicability Domain. It offers the opportunity to assess whether a compound can be reliably predicted.

References

↑ G. A. Patani, E. J. LaVoie, Bioisosterism: A Rational Approach in Drug Design. Chem. Rev., 1996, 96, 3147-3176. doi:10.1021/cr950066q
↑ D. Bonchev, D.H. Rouvray: Chemical Graph Theory: Introduction and Fundamentals. Gordon and Breach Science Publishers, 1990, ISBN 0-85626-454-7.
↑ E. K. Freyhult, K. Andersson, M. G. Gustafsson, Structural modeling extends QSAR analysis of antibody-lysozyme interactions to 3D-QSAR,J. Biophys., 2003, 84, Template:ISSN. PMID 12668435
↑ R. Todeschini, V. Consonni, Handbook of Molecular Descriptors, Wiley-VCH, 2000. ISBN 3527299130
↑ R.O. Duda, P.E. Hart, D.G. Stork, Pattern Classification, John Wiley & Sons, 2001. ISBN 0-471-05669-3
↑ A. Leach, Molecular Modelling: Principles and Applications, Prentice Hall, 2001. ISBN 0-582-38210-6
↑ Schölkopf, B., K. Tsuda and J. P. Vert: Kernel Methods in Computational Biology, MIT Press, Cambridge, MA, 2004.
↑ Gusfield, D., Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology, Cambridge University Press, 1997. ISBN 0-521-58519-8
↑ C. Helma (ed.), Predictive Toxicology, CRC, 2005. ISBN 0-8247-2397-X
↑ S. A. Wildman, G. M. Crippen, Prediction of Physicochemical Parameters by Atomic Contributions, J. Chem. Inf. Comput. Sci.}, 1999, 39, 868-873. doi:10.1021/ci990307l

External links

Template:Medicinal chemistry

Template:Jb1

Template:WikiDoc Sources

[1] G. A. Patani, E. J. LaVoie, Bioisosterism: A Rational Approach in Drug Design. Chem. Rev., 1996, 96, 3147-3176. doi:10.1021/cr950066q

[2] D. Bonchev, D.H. Rouvray: Chemical Graph Theory: Introduction and Fundamentals. Gordon and Breach Science Publishers, 1990, ISBN 0-85626-454-7.

[3] E. K. Freyhult, K. Andersson, M. G. Gustafsson, Structural modeling extends QSAR analysis of antibody-lysozyme interactions to 3D-QSAR,J. Biophys., 2003, 84, Template:ISSN. PMID 12668435

[4] R. Todeschini, V. Consonni, Handbook of Molecular Descriptors, Wiley-VCH, 2000. ISBN 3527299130

[5] R.O. Duda, P.E. Hart, D.G. Stork, Pattern Classification, John Wiley & Sons, 2001. ISBN 0-471-05669-3

[6] A. Leach, Molecular Modelling: Principles and Applications, Prentice Hall, 2001. ISBN 0-582-38210-6

[7] Schölkopf, B., K. Tsuda and J. P. Vert: Kernel Methods in Computational Biology, MIT Press, Cambridge, MA, 2004.

[8] Gusfield, D., Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology, Cambridge University Press, 1997. ISBN 0-521-58519-8

[9] C. Helma (ed.), Predictive Toxicology, CRC, 2005. ISBN 0-8247-2397-X

[10] S. A. Wildman, G. M. Crippen, Prediction of Physicochemical Parameters by Atomic Contributions, J. Chem. Inf. Comput. Sci.}, 1999, 39, 868-873. doi:10.1021/ci990307l

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

Quantitative structure-activity relationship

Contents

Overview

SAR and SAR paradox

Applications

Chemical

Biological

Data mining

3D-QSAR

Molecule mining

Fragment based (group contribution)

Applicability Domain

See also

References

External links

Navigation menu

Quantitative structure-activity relationship

Overview

SAR and SAR paradox

Applications

Chemical

Biological

Data mining

3D-QSAR

Molecule mining

Fragment based (group contribution)

Applicability Domain

See also

References

External links

Navigation menu

Search