Archive for the 'tips and hints' Category

eHiTS 2009 as a Blind Docking Tool

Tuesday, December 1st, 2009

<meta content="OpenOffice.org 2.4 (Unix)" name="GENERATOR" /> <style type="text/css"> <!-- @page { size: 8.5in 11in; margin: 0.79in } P { margin-bottom: 0.08in } --> </style></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif">As the molecular docking paradigm solidifies its status as a significant tool for drug discovery, chemists explore additional applications of the methods in ways that sometime stretch the existing algorithms to their limits. Most docking programs, including eHiTS, have not been designed or optimized to perform blind docking. In structure based drug discovery, the user is typically expected to define, at some level of accuracy, the binding pocket in the target of interest. The binding site is determined either based on known binding modes of ligands as found in crystal structures of complexes, or based on an educated hypothesis. There are cases, however, in which assumptions about the possible locations of binding hot spots are difficult or should be avoided altogether. This is the case, for example, when the existence of secondary binding sites is suspected, or when one would like to screen active ligands and other compounds on a range of targets to estimate the possibility for drug side-effects, toxicity, and other types of biological activities.</font></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif">The standard eHiTS usage requires a rough definition of the binding pocket. This is done through the clip file. This file should contain at least two sets of coordinates (or two spatial points) that are located in the designated binding pocket. eHiTS then draws a box around those points, expands it to some extent in all directions and places the search grid inside that box. Then, the box is “flooded” with a virtual fluid to detect all the cavities which will define the binding surface. This is a highly automated process, but it still relies on that user-defined clipping. Commonly the native ligand, amino acids from the binding pocket, or a few atoms from either are chosen as a clip file. If eHiTS is run with the -complex option, the native ligand is inferred as the clipping coordinates. However, eHiTS could be used without any clipping. In this case, the entire receptor will be considered for docking. The whole protein will be flooded, and sufficiently deep clefts will be searched on its surface. The final space in which docking will be performed is defined by the interconnected pockets found on the target. The search grid in such scenarios is typically large, and extensive sampling is required. Nevertheless, the computational efficiency of the eHiTS algorithm allows good sampling in reasonable timescales.</font></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif">Several eHiTS users expressed specific interest in blind docking in recent months, and therefore we decided to evaluate eHiTS’ performance in this context. We used the set that was used in an earlier blind docking evaluation (</font><a target="_blank" title="Reference 1" href="http://autodock.scripps.edu/resources/science/hetenyi2006b.pdf"><font color="#000080"><u><font face="Liberation Serif, Times New Roman, serif">Hetenyi and van der Spoel, </font></u></font><font color="#000080"><u><font face="Liberation Serif, Times New Roman, serif"><strong>2006 [1]</strong></font></u></font></a><font face="Liberation Serif, Times New Roman, serif">). We focused on the 43 complexes used in the paper and have not attempted to use the apo structures. 3 codes (1B70, 1FIW and 1QIZ) were left out because of uncertainty regarding the exact structure used in the paper for docking. The default accuracy (3) was used throughout the study. The average blind docking time was 9 minutes per receptor for this set.</font></p> <p align="justify" style="margin-bottom: 0.08in"><meta content="text/html; charset=utf-8" http-equiv="CONTENT-TYPE" /><title /><meta content="OpenOffice.org 2.4 (Unix)" name="GENERATOR" /> <style type="text/css"> <!-- @page { size: 8.5in 11in; margin: 0.79in } P { margin-bottom: 0.08in } --> </style></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif"><strong>Results</strong></font><font face="Liberation Serif, Times New Roman, serif">:</font></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif">77.5% of the cases gave at least one conformation under 2 </font><font face="Liberation Serif, Times New Roman, serif">Å in the top 10 poses. In the other cases, one accumulative docking round using poses from the first round as clip files produced successful binding modes in the top 5 poses. The top rank pose is in most cases in the correct binding pocket, offering a good starting point for pose refinement.</font></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif">The table <a target="_blank" title="eHiTS 2009.1 blind docking results in table form" href="http://www.simbiosys.ca/blog/2009-11-30-blind_docking_blog_results_table.html">here</a> details the results for the specific codes. The Job# column describes whether the results were obtained with a single blind docking run, or with 2 cycles. The Rank# and RMSD columns indicate the rank of the first pose under 2 Å and its RMSD from the crystallographic conformation. The last two columns indicate the top-rank and closest poses RMSDs.</font></p> <p><meta content="text/html; charset=utf-8" http-equiv="CONTENT-TYPE" /><title /><meta content="OpenOffice.org 2.4 (Unix)" name="GENERATOR" /> <style type="text/css"> <!-- @page { size: 8.5in 11in; margin: 0.79in } P { margin-bottom: 0.08in } --> </style></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif">The blind docking of phenol into insulin (1MPJ) is shown in Picture1 below. The crystallographic pose is shown in cyan, and sample poses are shown in “hot spots” detected during docking. Those poses can be used to clip the receptor in accumulative docking runs in which the sampling is finer, and the binding pockets are better modelled. It should be noted that this code generates an unusually big number (5) of hot spots. In most cases in the set we observed three, two and often one hot spot, manifesting the detection of the correct binding pocket.</font></p> <p><img align="middle" title="1MPJ" alt="1MPJ" src="http://www.simbiosys.ca/blog/images/2009-11-30-picture1.jpg" /></p> <p><em>Picture1: Phenol binding to Insulin.Several potential binding pockets are detected for this small ligand.</em></p> <p>1NGP (N1G9 FAB fragment) is a case where the majority of poses are generated far from the native ligand. Picture 2 below shows that most of the poses are located in the big cavity between chains L and H of the crystal structure. Several poses, however, reproduce the x-ray binding mode (in cyan) with close to 1 Å RMSD.</p> <p><img align="middle" alt="1NGP" title="1NGP" src="http://www.simbiosys.ca/blog/images/2009-11-30-picture2.jpg" /></p> <p><em>Picture2: 2-(4-hydroxy-3-nitrophenyl)acetic acid docked into N1G9 FAB fragment. The majority of poses are located in the big cavity between chains L and H.</em></p> <p align="justify" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif"><strong>Conclusion</strong></font><font face="Liberation Serif, Times New Roman, serif"><strong>s:</strong></font></p> <p>The above results clearly demonstrate the viability of eHiTS as a blind docking tool. In all cases the correct binding pocket has been identified in the top 32 solutions, and in most cases good poses under 2 Å and even 1 Å were found at the top of the generated poses. The conformations may be further refined by clipping the receptor for subsequent runs, and by working at higher accuracies. As always in eHiTS, the jobs are extremely easy to setup with a simple command line, and with no required preparation for the receptor or the ligand. This, and the speed of the calculations make eHiTS a high throughput blind docking solution.</p> <p align="justify" style="margin-bottom: 0.08in"><meta http-equiv="CONTENT-TYPE" content="text/html; charset=utf-8" /><title /><meta name="GENERATOR" content="OpenOffice.org 2.4 (Unix)" /><font face="Liberation Serif, Times New Roman, serif"> <style type="text/css"> <!-- @page { size: 8.5in 11in; margin: 0.79in } P { margin-bottom: 0.08in } H2 { margin-bottom: 0.08in } H2.western { font-family: "Liberation Sans", "Arial", sans-serif; font-size: 14pt; font-style: italic } H2.cjk { font-family: "DejaVu Sans"; font-size: 14pt; font-style: italic } H2.ctl { font-family: "DejaVu Sans"; font-size: 14pt; font-style: italic } --> </style></font></p> <h2 class="western"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif">Reference:</font></font></h2> <ol><font face="Liberation Serif, Times New Roman, serif"> <font face="Liberation Serif, Times New Roman, serif"> <font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif" /></font> <font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif" /></font></font></font> <font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif" /></font></font></font></font></font></font> <font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"></p> <li>Hetenyi, C. Van der Spoel, D.: ”Blind docking of drug-sized compounds to proteins with up to a thousand residues.”;<br /> 2006 Feb 20;580(5):1447-50. Epub 2006 Jan 31.</li> <li><a target="_blank" title="eHiTS 2009.1 blind docking results in table form" href="http://www.simbiosys.ca/blog/2009-11-30-blind_docking_blog_results_table.html">Blind docking results for eHiTS 2009.1</a>, using the test set of Hetenyi et.al.[1]</li> <p></font></font></font></font></font></font></font></ol> <p><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"> </font></font></font></font></p> <p><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"> </font></font></font></font></p> <p align="left" style="margin-bottom: 0.08in"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"> </font></font></font></font></p> <p><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"> </font><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif"><font face="Liberation Serif, Times New Roman, serif">by Orr Ravitz</font></font></font></font> </font> </font></font> </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/software-products/" title="View all posts in Software products" rel="category tag">Software products</a>, <a href="http://www.simbiosys.com/blog/category/science/" title="View all posts in Science" rel="category tag">Science</a>, <a href="http://www.simbiosys.com/blog/category/technology/" title="View all posts in Technology" rel="category tag">Technology</a>, <a href="http://www.simbiosys.com/blog/category/science/tips-and-hints/" title="View all posts in tips and hints" rel="category tag">tips and hints</a> | <a href="http://www.simbiosys.com/blog/2009/12/01/ehits-2009-as-a-blind-docking-tool/#respond" title="Comment on eHiTS 2009 as a Blind Docking Tool">No Comments »</a></p> </div> <div class="post"> <h3 id="post-83"><a href="http://www.simbiosys.com/blog/2009/11/24/useful-scripts-available-as-free-download-on-the-simbiosys-website/" rel="bookmark" title="Permanent Link to Useful scripts available as free download on the SimBioSys website">Useful scripts available as free download on the SimBioSys website</a></h3> <small>Tuesday, November 24th, 2009</small> <div class="entry"> <p>We recently posted on the CCL (Computational Chemistry List, see link <a title="CCL posting by Aniko Simon, on Nov 3rd 2009" target="_blank" href="http://www.ccl.net/chemistry/resources/messages/2009/11/03.003-dir/index.html">here</a>), that there are some useful scripts for the molecular modelling Linux / Unix community available for free on our website. Some of these are specific for eHiTS users, some are more general-purpose. They are all available free of charge here: <a target="_blank" href="http://www.simbiosys.ca/download/scripts/index.html">http://www.simbiosys.ca/download/scripts/index.html</a></p> <p>Bookmark the above site, as we’ll keep updating it with more and more scripts. </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/software-products/" title="View all posts in Software products" rel="category tag">Software products</a>, <a href="http://www.simbiosys.com/blog/category/news/" title="View all posts in News" rel="category tag">News</a>, <a href="http://www.simbiosys.com/blog/category/technology/" title="View all posts in Technology" rel="category tag">Technology</a>, <a href="http://www.simbiosys.com/blog/category/science/tips-and-hints/" title="View all posts in tips and hints" rel="category tag">tips and hints</a> | <a href="http://www.simbiosys.com/blog/2009/11/24/useful-scripts-available-as-free-download-on-the-simbiosys-website/#respond" title="Comment on Useful scripts available as free download on the SimBioSys website">No Comments »</a></p> </div> <div class="post"> <h3 id="post-56"><a href="http://www.simbiosys.com/blog/2008/11/10/synthetic-complexity-and-accessibility-choosing-the-correct-tool-to-the-task-in-de-novo-triage/" rel="bookmark" title="Permanent Link to SYNTHETIC COMPLEXITY AND ACCESSIBILITY: CHOOSING THE CORRECT TOOL TO THE TASK IN DE NOVO TRIAGE.">SYNTHETIC COMPLEXITY AND ACCESSIBILITY: CHOOSING THE CORRECT TOOL TO THE TASK IN DE NOVO TRIAGE.</a></h3> <small>Monday, November 10th, 2008</small> <div class="entry"> <p>There are many instances in which one wants to assess the synthetic accessibility of a set of compounds. It is crucial to choose the right tool for the task. If one has just 10-15 compounds to assess then using ARChem route designer would be the correct choice. ARChem gives one an assessment of retrosynthetic pathways from consideration of 10-million+ rules derived from `clustering’ extended reaction cores derived from the literature. Often, however, one wants to survey the synthetic accessibility of a large database of `virtual compounds’ ranging from with hundreds to hundreds of thousands of compounds.</p> <p>This often arises in the contexts of de novo design in fragment based design paradigms. De novo design based on fragment docking and tethering entails: 1) ‘docking’ low molecular-weight/simple-topology fragments into the binding pockets of protein/receptor targets and then 2) finding linkers that can effectively tether the fragments, followed by 3) energy minimization of the assembled de novo compounds in the protein/receptor pocket. Typically one characterizes polar (hydrogen bond donor/acceptor and charged) components and hydrophobic regions of the binding pocket. These chemical features in the binding site landscape constitute features that may be exploited in the molecular design process to optimize the affinity of a virtual ligand to a target.</p> <p>The docking of even a few 10-20 fragments to 3-4 binding site interaction centers e.g. 2 hydrogen bond donor/acceptors and a hydrophobic region leads to a large number of scored interactions for the fragments. Add to this a set of 5-10 different tethering components to tie those fragments together rapidly leads to a combinatorial explosion. For such an approach to be useful one must do triage on the compounds generated from the de novo design procedure.</p> <p>SPROUT has been a pioneering tool in the field of de novo design and places emphasis on the pragmatic incorporation of tools to do `triage’ as one designs in the binding pocket of the target of interest. While one can do initial triage of the compounds one assembles from docked fragments based on predicted binding interaction scores (affinity), this is often not the correct approach. The reason, of course, is that typically the small fragments or small compounds tethered from just 1-2 fragments may only have 10-100 uM affinity. With addition of more fragments to the initially designed de novo leads, the affinity may improve substantially. For this reason it is wise to `prune’ first based on synthetic accessibility of the denovo leads.</p> <p><img align="left" alt="COMPOUNDTRIAGE" title="COMPOUNDTRIAGE" src="http://simbiosys.ca/blog/images/NOVEMBER_FIGS.jpg" /><br /> Figure 1 below shows 2 hydrogen bond acceptor sites and a hydrophobic region recognized by the native ligand in a b-secretase crystal structure 2OHP. Selection of several polar fragments and tethering elements in a 4-hour SPROUT run resulted in 93843 compounds (Figure 2) favorably interacting with those 3-sites recognized by Compound 3 in the manuscript reporting this crystal structure (J.Med.Chem. 50: 1124, 2007). SPROUT enables one to rapidly derive a total interaction score for each of these compounds but what we really want to do in the design process is to examine the ease with which one can synthesize the compounds designed. De novo design of compounds that are not synthetically accessible is a meaningless enterprise.</p> <p>There are two approaches that are tractable for such predictions from SimBioSys and KeyModule. The first is a tool embodied in SPROUT and in a new product TOPOMAX that assesses synthetic `complexity’ (Boda and Johnson J. Med. Chem., 2006, 49:5869-5879). The synthetic complexity approach is based on the compilation of a large number of compounds synthesized over the years and a detailed analysis of there substitution pattern in rings and chains. The concept is really an informatic principle that given sufficient sampling of a large synthetic compound space that the observed frequency of occurrence of particular structural substitution and topological patterns should infer the synthetic accessibility of a de novo compound. Boda and Johnson showed in their 2006 manuscript the manner in which compilation and use of a large synthetic complexity database allowed them to rapidly score compounds and that, in fact, that the synthetic complexity score had a good correlation with computations of synthetic accessibility (Figure 11 J. Med. Chem., 2006, 49:5869-5879) using medchem encoded PATCHEM rules in CAESA.</p> <p>How and why is that useful? Figure 3 shows the synthetic complexity scores computed within SPROUT for 93843 compounds! The entire computation required just 35-minutes of CPU time on a single processor. SPROUT allows one to do this in the course of a de novo run. Figure 4 shows a small 262 compound sample of the present beta-secretase lead-hop run where we plot the CAESA batch analysis of the compounds synthetic accessibility and the SPROUT synthetic complexity. The correlation is not perfect but it is clear that there is, generally, speaking a monotonic trend.</p> <p>This spectrum of tools allows one to pare down the number of virtual compounds using a combination of synthetic complexity and interaction scores to hundreds and thousands of virtual ligands employing SPROUT (and or TOPOMAX) with confidence that the complexity triage is indeed linked to the `ease of synthesis’. One then can then obtain a synthetic accessibility score and complete retrosynthetic pathways to known starting materials for a subset of compounds in a matter of minutes using CAESA batch.</p> <p>This example problem highlights the importance of using the correct tool to the task. De Novo design commonly encompasses combinatorial explosion if the user does not perform triage on candidates based on both synthetic complexity(accessibility) and predicted binding. SimBioSys has a range of tools to gauge synthetic accessibility: CAESA/CAESA-batch, and docking tools with good Score-log(Kd) predicticity: eHiTS and eHiTS Lightning. The goal of SimBioSys tools is always to achieve speed without compromising accuracy. Look for a new technical note on this topic under Science:White papers on our web site next week!</p> <p>Posted by Dan Harris </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/science/" title="View all posts in Science" rel="category tag">Science</a>, <a href="http://www.simbiosys.com/blog/category/science/tips-and-hints/" title="View all posts in tips and hints" rel="category tag">tips and hints</a> | <a href="http://www.simbiosys.com/blog/2008/11/10/synthetic-complexity-and-accessibility-choosing-the-correct-tool-to-the-task-in-de-novo-triage/#respond" title="Comment on SYNTHETIC COMPLEXITY AND ACCESSIBILITY: CHOOSING THE CORRECT TOOL TO THE TASK IN DE NOVO TRIAGE.">No Comments »</a></p> </div> <div class="post"> <h3 id="post-47"><a href="http://www.simbiosys.com/blog/2008/09/29/use-of-ehits-crossdocking-approaches-to-survey-the-ligandtarget-landscape/" rel="bookmark" title="Permanent Link to Use of eHiTS CrossDocking Approaches to Survey the Ligand/Target Landscape">Use of eHiTS CrossDocking Approaches to Survey the Ligand/Target Landscape</a></h3> <small>Monday, September 29th, 2008</small> <div class="entry"> <p><meta content="text/html; charset=utf-8" http-equiv="CONTENT-TYPE" /><title /><meta content="OpenOffice.org 2.0 (Linux)" name="GENERATOR" /><meta content="20080929;4451400" name="CREATED" /><meta content="20080929;10390700" name="CHANGED" /> <style type="text/css"> <!-- @page { size: 8.5in 11in; margin: 0.79in } P { margin-bottom: 0.08in } --> </style></p> <p align="justify" style="margin-bottom: 0in">Virtual high throughput screening requires the use of approximate approaches to distinguish the complimentarity of ligands to target. Often this falls to the domain of 2D approaches such as fingerprint, topological, and shape screens, given the computational efficiency of such approaches. High throughput docking, employing the 3D structures of ligand and targets, however, provides a facile first approach to glean principles for ligand recognition of targets. The endpoint might be screening a database against a single target to determine which ligands are plausible lead compounds to be explored with other computational approaches (e.g. free energy computation and estimation of binding free energies) or recommendation for synthesis or testing <em>in vitro</em> high throughput assays. Alternatively, one might wish to know how such compounds ’score’ against an array of targets that one <u>doesn’t</u> want to “hit” with a novel lead candidate. Such an approach may be important in eliminating unwanted side-effects/toxicity</p> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in">Before using eHiTS for such a screening enterprise, it is useful to perform a benchmark crossdocking screen using a set of crystal structures of ligand-receptor complexes. Each of the ‘native’ ligands extracted from each of the co-complexes is then docked against the panel of other targets. The matrix below shows an example of such a hypothetical screen to examine how each of the native ligands of 13 targets score against targets in the same or distinct pharmacological families. Each cell entry corresponds to the eHiTS docking score of the ligand extracted from the top row complex against the target extracted from the left column complex. The diagonal elements (bold-font) show the best docking score for (self docking of) each ligand against its own crystal structure target. The blue font entries in each column illustrate that while quite (most) often the ‘native ligand’ for a target crystal structure is amongst the best scoring (within 1-2 eHiTS scoring units) it is not always the best score. Quite often ‘native’ ligands for other targets will have significant affinity for your target of interest (and will plausibly have good eHiTS docking scores) and this is both the challenge of drug design in attempts to control specificity and selectivity for a panorama of target proteins/receptors.</p> <p align="justify" style="margin-bottom: 0in"><img align="middle" alt="Table 1" title="Table 1" src="http://simbiosys.ca/blog/images/Table.jpg" /></p> <p align="justify" style="margin-bottom: 0in">One is looking for several facets in such a preliminary screen. One expects that each of the native ligands found in a co-complex crystal structure of course does dock to the native target. One expects, in most instances, that the docking score for each native ligand for its ‘natural’ target will be near the minimum score compared to other ligands (typically within a few scoring units). The figures below shows in what portion of the cases the native ligand is among the best N scoring ligands. It is of course possible that ‘native’ ligands derived from other target complexes will score well against your target of interest, but that kind of ‘crosstalk’ is precisely the type of information that one wants to glean from such a low cost computer ‘experiment.’ One also wants to know how use of different crystal structures with alternative conformations of binding site residues affects the docking scores. While a docking score is not a binding free energy, eHiTS docking scores often give at least monotonic score correlations with lnKd’s in instances where the scoring function calibration has included interaction types of the nature of your target of interest.</p> <p align="justify" style="margin-bottom: 0in"><img align="middle" src="http://simbiosys.ca/blog/images/PIECHART.jpg" /></p> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in"><meta content="text/html; charset=utf-8" http-equiv="CONTENT-TYPE" /><title /><meta content="OpenOffice.org 2.0 (Linux)" name="GENERATOR" /><meta content="20080929;4451400" name="CREATED" /><meta content="20080929;10390700" name="CHANGED" /> <style type="text/css"> <!-- @page { size: 8.5in 11in; margin: 0.79in } P { margin-bottom: 0.08in } --> </style></p> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in"><img align="middle" src="http://simbiosys.ca/blog/images/CUMULATIVE.jpg" /></p> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in">What about the timescale for such screens? For eHiTS 6.2 on Intel platforms versus the new Lighting release? In a recent benchmark I found it to be ~3.5 minutes/ligand (using 4-Xeon 3.2GHz nodes) versus an average of 30 seconds per ligand on Cell processor in a Sony Playstation 3 (PS3). Note that this corresponds to docking at the highest accuracy. For screening of large libraries of molecules, a lower level of accuracy is recommended with a typical docking time of less than 10 seconds per ligand on the PS3.</p> <p align="justify" style="margin-bottom: 0in"> <p align="justify" style="margin-bottom: 0in">While the timescale of such 3D approaches has not been competitive with 2D methods, the advances in porting eHiTS to the Cell will make this approach feasible for high throughput analysis with quantitative estimation of both docking-score/IC50 correlations and accurate pose prediction.</p> <p align="justify" style="margin-bottom: 0in"> <p style="margin-bottom: 0in">Blog post by Dan Harris</p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/science/" title="View all posts in Science" rel="category tag">Science</a>, <a href="http://www.simbiosys.com/blog/category/science/tips-and-hints/" title="View all posts in tips and hints" rel="category tag">tips and hints</a> | <a href="http://www.simbiosys.com/blog/2008/09/29/use-of-ehits-crossdocking-approaches-to-survey-the-ligandtarget-landscape/#respond" title="Comment on Use of eHiTS CrossDocking Approaches to Survey the Ligand/Target Landscape">No Comments »</a></p> </div> <div class="post"> <h3 id="post-41"><a href="http://www.simbiosys.com/blog/2008/06/19/ehits-and-score-low-rmsdlnkdic50-correlations/" rel="bookmark" title="Permanent Link to eHiTS and Score - Low RMSD/lnKd(IC50) Correlations.">eHiTS and Score - Low RMSD/lnKd(IC50) Correlations.</a></h3> <small>Thursday, June 19th, 2008</small> <div class="entry"> <p align="left">Most flexible ligand/rigid protein and flexible ligand/flexible protein docking approaches do predict poses reproducing known structural solutions amongst their top ranking scored poses. But when assessing diverse top scoring poses for ligands in a screening exercise I find it time consuming, even with employing some auxiliary pharmacophoric information, deciding which poses amongst my top scoring poses are the most pharmacologically relevant.</p> <p>Let me simplify that statement! What I would really like is to have confidence that if I took my `top 5-10 poses’ that there would be a much higher likelihood of finding the biologically relevant pose in that group than the `next ten’. Moreover, we all would like our pose scores to bear some resemblance to IC50/Kd rankings of our screened ligands, be they agonists, inverse agonists, or inhibitors at the pharmacological endpoint!</p> <p>I am a guy that early on naively believed that docking solutions relying on physics based approaches, for example carefully developed charge sets (1-6-12 potentials) could get me there. After all, I have done numerous MD simulations and employed MMPBSA approaches to estimate binding free energies to good effect(Proteins 55:895-914). But this worked well, if I had the right ligand pose(!) and if either the enthalpic components dominated or my crude estimate of entropic terms was adequate. The bottom line is that most docking approaches while doing OK on pose prediction do not, to date, give you good Score-Based Good/Bad RMS separation or give you much confidence in using docking scores to `rank’ prospective ligands for synthesis(J. Med. Chem, 49:5912-5931).</p> <p>eHiTS is an informatics based approach (J.Mol. Graphics Mod. 26: 198-212) and what I have learned is that it is powerful in providing me with two major items on `my Christmas wish list’:<br /> 1.Good Score-RMS correlation (good scores have low RMSD), and<br /> 2.Good correlation with ln(IC50)/ln(Kd)!</p> <p>What I have learned this month is that if you `train a customized scoring function’ for my pharmacological protein target by using ~5 co-crystallized complexes I can achieve both endpoints on my Christmas wish list. That is powerful.</p> <p>How about an example of this?<br /> Nicotinic Acetylcholine receptors (nAChR’s) are an important class of proteins amongst a superfamily of ligand `gated’ (allosterically modulated) ion-channels. One of the surrogate proteins having binding motifs and pharmacology analogous to nAChRs is the acetylcholine binding protein (AchBP). We have begun investigations on this class of proteins given it is a challenging problem of ligand recognition via conserved aromatic motifs in the binding pocket via the cation interactions with a `box’ of Trp/Tyr residues. The upper left hand panel of the figure below shows three classic cationic ligands, acetylcholine, nicotine, and carbamylcholine(CCE). All of these ligands bind with considerable affinity to the binding pocket containing Trp/Tyr aromatic residues with the cationic center interacting with Trp/Tyr residues. Considerable experimental and computational evidence suggests that the pi-cation interactions are a substantial contribution to the binding free energy of these ligands to AChBP. The right uppermost panel shows this interaction motif for CCE in the crystal structure 1UV6. The lower left panel shows eHiTS superimposed docked poses of nicotine(NIC) and carbamylcholine (CCE) interacting with the surrounding pi electron system. While these poses have low scores for the default `out-of the box scoring function’, the informatics based scoring function did not involve `training’ including this class of proteins, and the top plot (labeled `untrained’) shows that there is no correlation of `score-regime’ with low RMSD poses (relative to the crystallographic pose). “What happens if you train a scoring function specific to this class of proteins as regards score separation of low (good) RMSD and high RMSD poses?” The lower plot in the right lower panel tells the story. One obtains: Score Based Separation (Correlation) of your low RMSD from your high (RMSD) poses. Which scoring function would you want to use for virtual screening of new leads for this class of protein? I think you know the answer to that one. You would want to use the one where you had confidence that your top scoring poses were the ones with probable low RMSD to the actual pharmacological pose.<br /> Come back for the 2nd part of the story eHiTS Score-lnKd based correlations in my next blog.</p> <p>Posted by DLH</p> <div style="text-align: center"><img width="385" height="427" title="EHITS_SCORE_RMSD_SEPARATION" alt="EHITS_SCORE_RMSD_SEPARATION" src="http://simbiosys.ca/blog/images/4PANEL_FIGURE.jpg" /></div> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/science/" title="View all posts in Science" rel="category tag">Science</a>, <a href="http://www.simbiosys.com/blog/category/science/tips-and-hints/" title="View all posts in tips and hints" rel="category tag">tips and hints</a> | <a href="http://www.simbiosys.com/blog/2008/06/19/ehits-and-score-low-rmsdlnkdic50-correlations/#respond" title="Comment on eHiTS and Score - Low RMSD/lnKd(IC50) Correlations.">No Comments »</a></p> </div> <div class="navigation"> <div class="alignleft"></div> <div class="alignright"></div> </div> </div> <div id="sidebar"> <ul> <li> <a href="http://www.simbiosys.com"><img src="images/SimBioSysLogo_name_long.gif" width="200" ></a> </li> <li> <form method="get" id="searchform" action="http://www.simbiosys.com/blog/"> <div><input type="text" value="" name="s" id="s" /> <input type="submit" id="searchsubmit" value="Search" /> </div> </form> </li> <li> <form style="border:1px solid #ccc;padding:3px;text-align:center;" action="http://www.feedburner.com/fb/a/emailverify" method="post" target="popupwindow" onsubmit="window.open('http://www.feedburner.com/fb/a/emailverifySubmit?feedId=1550225', 'popupwindow', 'scrollbars=yes,width=550,height=520');return true"><p>Enter your email address:</p><p><input type="text" style="width:140px" name="email"/></p><input type="hidden" value="http://feeds.feedburner.com/~e?ffid=1550225" name="url"/><input type="hidden" value="SimBioSys Blog" name="title"/><input type="hidden" name="loc" value="en_US"/><input type="submit" value="Subscribe" /><p>Delivered by <a href="http://www.feedburner.com" target="_blank">FeedBurner</a></p></form> </li> <!-- Author information is disabled per default. Uncomment and fill in your details if you want to use it. <li><h2>Author</h2> <p>A little something about you, the author. Nothing lengthy, just an overview.</p> </li> --> <li> <p>You are currently browsing the archives for the tips and hints category.</p> </li> <li class="pagenav"><h2>Pages</h2><ul><li class="page_item"><a href="http://www.simbiosys.com/blog/about/" title="About SimBioSys">About SimBioSys</a></li> </ul></li> <li><h2>Archives</h2> <ul> <li><a href='http://www.simbiosys.com/blog/2010/09/' title='September 2010'>September 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/08/' title='August 2010'>August 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/07/' title='July 2010'>July 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/06/' title='June 2010'>June 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/03/' title='March 2010'>March 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/01/' title='January 2010'>January 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2009/12/' title='December 2009'>December 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/11/' title='November 2009'>November 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/10/' title='October 2009'>October 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/07/' title='July 2009'>July 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/06/' title='June 2009'>June 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/05/' title='May 2009'>May 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/03/' title='March 2009'>March 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/02/' title='February 2009'>February 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/01/' title='January 2009'>January 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2008/12/' title='December 2008'>December 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/11/' title='November 2008'>November 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/10/' title='October 2008'>October 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/09/' title='September 2008'>September 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/08/' title='August 2008'>August 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/07/' title='July 2008'>July 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/06/' title='June 2008'>June 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/05/' title='May 2008'>May 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/04/' title='April 2008'>April 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/03/' title='March 2008'>March 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/02/' title='February 2008'>February 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/01/' title='January 2008'>January 2008</a></li> </ul> </li> <li><h2>Categories</h2> <ul> <li><a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts filed under General Discussion">General Discussion</a> (21) </li> <li><a href="http://www.simbiosys.com/blog/category/news/" title="View all posts filed under News">News</a> (54) </li> <li><a href="http://www.simbiosys.com/blog/category/science/" title="View all posts filed under Science">Science</a> (35) </li> <li><a href="http://www.simbiosys.com/blog/category/software-products/" title="View all posts filed under Software products">Software products</a> (33) </li> <li><a href="http://www.simbiosys.com/blog/category/technology/" title="View all posts filed under Technology">Technology</a> (21) </li> <li class="current-cat"><a href="http://www.simbiosys.com/blog/category/science/tips-and-hints/" title="Tells us tips and hints on how to best use eHiTS, and other SimBioSys tools, to maximize your ROI (return on invested time, money, energy etc.). It’s a must-read!">tips and hints</a> (5) </li> <li><a href="http://www.simbiosys.com/blog/category/uncategorized/" title="View all posts filed under Uncategorized">Uncategorized</a> (1) </li> </ul> </li> </ul> </div> <hr /> <div id="footer"> <!-- If you'd like to support WordPress, having the "powered by" link someone on your blog is the best way, it's our only promotion or advertising. --> <p> SimBioSys Blog is proudly powered by <a href="http://wordpress.org/">WordPress</a> <br /><a href="feed:http://www.simbiosys.com/blog/feed/">Entries (RSS)</a> and <a href="feed:http://www.simbiosys.com/blog/comments/feed/">Comments (RSS)</a>. <!-- 18 queries. 0.525 seconds. --> </p> </div> </div> <!-- Gorgeous design by Michael Heilemann - http://binarybonsai.com/kubrick/ --> </body> </html>