<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.11" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Correct protonation state for docking</title>
	<link>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/</link>
	<description>Addressing the challenges of computational drug discovery</description>
	<pubDate>Sat, 04 Feb 2012 18:45:03 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.11</generator>

	<item>
		<title>by: ChemSpider Blog &#187; Blog Archive &#187; Protonation States and Their Effect on Docking</title>
		<link>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1477</link>
		<pubDate>Thu, 22 May 2008 06:54:10 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1477</guid>
					<description>[...] Zsolt came searching ChemSpider for the amino acid structures and found the complexities in terms of charged/stereo forms. But his postinf regarding the &#8220;Correct Protonation State for Docking&#8221; was an education in itself. If you are engaged in docking experiments at all this is likely a must read. For the rest of us neophytes it&#8217;s education!  addthis_url = 'http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fprotonation-states-and-their-effect-on-docking.html'; addthis_title = 'Protonation+States+and+Their+Effect+on+Docking'; addthis_pub = ''; [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] Zsolt came searching ChemSpider for the amino acid structures and found the complexities in terms of charged/stereo forms. But his postinf regarding the &#8220;Correct Protonation State for Docking&#8221; was an education in itself. If you are engaged in docking experiments at all this is likely a must read. For the rest of us neophytes it&#8217;s education!  addthis_url = &#8216;http%3A%2F%2Fwww.chemspider.com%2Fblog%2Fprotonation-states-and-their-effect-on-docking.html&#8217;; addthis_title = &#8216;Protonation+States+and+Their+Effect+on+Docking&#8217;; addthis_pub = &#8216;&#8217;; [&#8230;]
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: zsolt</title>
		<link>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1447</link>
		<pubDate>Tue, 20 May 2008 15:30:56 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1447</guid>
					<description>Andy and Danni,
I have added an update to the post in response to your comments. Bottomline is, we can generate the right poses and the right protonation states in eHiTS already -- among other (incorrect) poses with their protonation states. What we need is an accurate QM-supported scoring and ranking of the solutions -- we are working on this, stay tuned...

ZZ</description>
		<content:encoded><![CDATA[<p>Andy and Danni,<br />
I have added an update to the post in response to your comments. Bottomline is, we can generate the right poses and the right protonation states in eHiTS already &#8212; among other (incorrect) poses with their protonation states. What we need is an accurate QM-supported scoring and ranking of the solutions &#8212; we are working on this, stay tuned&#8230;</p>
<p>ZZ
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Danni Harris</title>
		<link>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1423</link>
		<pubDate>Mon, 19 May 2008 19:30:59 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1423</guid>
					<description>I agree with all above and echo Andy's concerns about the problem complexity" 

In addition to the complexity of deducing the 'correct' complementary protonation states of ligand and polar/acidic amino acid sidechains in the binding site cavity one has in some instances:

1)variable conformations of acidic/polar residues in the substrate/inhibitor binding site (e.g. E297 in CYP2C5 crystal structures :  1dt6, 1n6b, 1nr6) and variable water occupancies proximate to those acid residues,

2) connectivity of the ligand and surrounding polar/acidic binding side residues to bulk solvent water via extended hydrogen bond networks (sometimes low-barrier hydrogen bond networks),

3) variable electrostatic influence of enzymatically active species--&#62; that differ with stage in the enzymatic cycle (e.g. P450s).

Bottom line: Even when you can frame it with QM/MM and include environment you have to frame the issue for the correct structure ---(amino acids + H2O's) and this may vary with both ligand pose and ligand identity. So doing what you are doing 'on the fly' ZZ is  a big step. The next step 'filtering' based on thermodynamic/energetic criteria will be a significant challenge.

That being said---let's keep struggling with the problem! :)</description>
		<content:encoded><![CDATA[<p>I agree with all above and echo Andy&#8217;s concerns about the problem complexity&#8221; </p>
<p>In addition to the complexity of deducing the &#8216;correct&#8217; complementary protonation states of ligand and polar/acidic amino acid sidechains in the binding site cavity one has in some instances:</p>
<p>1)variable conformations of acidic/polar residues in the substrate/inhibitor binding site (e.g. E297 in CYP2C5 crystal structures :  1dt6, 1n6b, 1nr6) and variable water occupancies proximate to those acid residues,</p>
<p>2) connectivity of the ligand and surrounding polar/acidic binding side residues to bulk solvent water via extended hydrogen bond networks (sometimes low-barrier hydrogen bond networks),</p>
<p>3) variable electrostatic influence of enzymatically active species&#8211;&gt; that differ with stage in the enzymatic cycle (e.g. P450s).</p>
<p>Bottom line: Even when you can frame it with QM/MM and include environment you have to frame the issue for the correct structure &#8212;(amino acids + H2O&#8217;s) and this may vary with both ligand pose and ligand identity. So doing what you are doing &#8216;on the fly&#8217; ZZ is  a big step. The next step &#8216;filtering&#8217; based on thermodynamic/energetic criteria will be a significant challenge.</p>
<p>That being said&#8212;let&#8217;s keep struggling with the problem! <img src='http://www.simbiosys.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Andy Good</title>
		<link>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1422</link>
		<pubDate>Mon, 19 May 2008 16:15:07 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1422</guid>
					<description>Agree completely. This is further exemplified by extending the disuccsion on serine proteases and the catalytic triad histidine. Not only are multiple neutral states possible, in the case of HCV protease the histidine is capable of protonating to form a salt bridge with an acidic ligand. This speaks further to your point regarding protein protonation states being influenced on the fly by different ligands.
Whether one can predict these states on the fly is open to debate. Perhaps it would be possible to probe each ambiguous residue with acidic / neutral / basic groups using QM or the like to determine ahead of time which states are energetically acessible before the on the fly screen process begins. It seems unlikely that a purely empircal approach without such an initial callibration would be very accurarate, though I stand ready to be corrected :-)</description>
		<content:encoded><![CDATA[<p>Agree completely. This is further exemplified by extending the disuccsion on serine proteases and the catalytic triad histidine. Not only are multiple neutral states possible, in the case of HCV protease the histidine is capable of protonating to form a salt bridge with an acidic ligand. This speaks further to your point regarding protein protonation states being influenced on the fly by different ligands.<br />
Whether one can predict these states on the fly is open to debate. Perhaps it would be possible to probe each ambiguous residue with acidic / neutral / basic groups using QM or the like to determine ahead of time which states are energetically acessible before the on the fly screen process begins. It seems unlikely that a purely empircal approach without such an initial callibration would be very accurarate, though I stand ready to be corrected <img src='http://www.simbiosys.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: zsolt</title>
		<link>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1383</link>
		<pubDate>Sat, 17 May 2008 16:38:14 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1383</guid>
					<description>Egon, yes, you used the right phrase "educated guesses", even QM methods will do just that if applied in isolation to the ligand and receptor. My point is, there are tough cases, like HIV-1 and kinases that tend to switch protonation states (both ligand and receptor) upon binding, so for a given ligand what is correct for binding to one receptor is incorrect for binding to another. That is why preparation methods (no matter how sophisticated) are not good enough. You need to analyze each binding pose AFTER docking and make the decision then. But if the docking engine does not work with ambivalent states then it will never place the ligand in a colliding (proton-to-proton or lone-pair to lone-pair) pose so you never find the correct pose (unless you run multiple docking with all possibilities enumerated - very expensive).

As for standard benchmark to test on, use the DUD dataset: http://dud.docking.org
Of course, that will still not tell you the right protonation state for each pose, but at least you know which ligands are supposed to bind and which ones (although similar) are not.
It would be very good to have a test set of PDB complex structures with correctly assigned protonations for some HIV-1 and kinase series. I do not know about the existence of such (PDB-bind has structures and binding affinities but the protonation states were not treated well).
One set I know that has been at least eye-balled is the Astex diverse set:
http://www.ccdc.cam.ac.uk/products/life_sciences/gold/validation/astex_diverse/
Although it is not focused on the protonation problem.
ZZ.</description>
		<content:encoded><![CDATA[<p>Egon, yes, you used the right phrase &#8220;educated guesses&#8221;, even QM methods will do just that if applied in isolation to the ligand and receptor. My point is, there are tough cases, like HIV-1 and kinases that tend to switch protonation states (both ligand and receptor) upon binding, so for a given ligand what is correct for binding to one receptor is incorrect for binding to another. That is why preparation methods (no matter how sophisticated) are not good enough. You need to analyze each binding pose AFTER docking and make the decision then. But if the docking engine does not work with ambivalent states then it will never place the ligand in a colliding (proton-to-proton or lone-pair to lone-pair) pose so you never find the correct pose (unless you run multiple docking with all possibilities enumerated - very expensive).</p>
<p>As for standard benchmark to test on, use the DUD dataset: <a href="http://dud.docking.org" rel="nofollow">http://dud.docking.org</a><br />
Of course, that will still not tell you the right protonation state for each pose, but at least you know which ligands are supposed to bind and which ones (although similar) are not.<br />
It would be very good to have a test set of PDB complex structures with correctly assigned protonations for some HIV-1 and kinase series. I do not know about the existence of such (PDB-bind has structures and binding affinities but the protonation states were not treated well).<br />
One set I know that has been at least eye-balled is the Astex diverse set:<br />
<a href="http://www.ccdc.cam.ac.uk/products/life_sciences/gold/validation/astex_diverse/" rel="nofollow">http://www.ccdc.cam.ac.uk/products/life_sciences/gold/validation/astex_diverse/</a><br />
Although it is not focused on the protonation problem.<br />
ZZ.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Egon Willighagen</title>
		<link>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1377</link>
		<pubDate>Sat, 17 May 2008 13:15:29 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/17/correct-protonation-state-for-docking/#comment-1377</guid>
					<description>One can make educated guesses of what the protonation state is, one can also apply some QM-based approach to calculate pKa's... Clearly, it is important... I don't think the CDK currently has an algorithm to do this... 

Is there a gold standard; a good training set to set approaches on? As you already said, crystallography is not going to help. Databases with experimental pKa database are often proprietary... Any suggestions?</description>
		<content:encoded><![CDATA[<p>One can make educated guesses of what the protonation state is, one can also apply some QM-based approach to calculate pKa&#8217;s&#8230; Clearly, it is important&#8230; I don&#8217;t think the CDK currently has an algorithm to do this&#8230; </p>
<p>Is there a gold standard; a good training set to set approaches on? As you already said, crystallography is not going to help. Databases with experimental pKa database are often proprietary&#8230; Any suggestions?
</p>
]]></content:encoded>
				</item>
</channel>
</rss>

