<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.11" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: Crystal structure errors &#8212; in CSD too</title>
	<link>http://www.simbiosys.com/blog/2008/06/13/crystal-structure-errors-in-csd-too/</link>
	<description>Addressing the challenges of computational drug discovery</description>
	<pubDate>Thu, 09 Feb 2012 21:51:18 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.11</generator>

	<item>
		<title>by: ChemSpider Blog &#187; Blog Archive &#187; When a Scientific Blog Posting, Data Licensing and Open Data Access Come Together</title>
		<link>http://www.simbiosys.com/blog/2008/06/13/crystal-structure-errors-in-csd-too/#comment-1493</link>
		<pubDate>Thu, 19 Jun 2008 12:37:05 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/06/13/crystal-structure-errors-in-csd-too/#comment-1493</guid>
					<description>[...] The past couple of days has seen an interesting exchange going on over on the SimBioSys blog. Zsolt Zsoldos is someone I respect, not only for his passion for his science but also for his want to educate others in the challenges of what he does in developing software. I believe his blog post entitled &#8220;Crystal Structure Errors in CSD too&#8221; was an honest attempt to tell people to be &#8220;careful&#8221; when using data from databases. I don&#8217;t care whether the database is ChemSpider, PubChem, the CAS Registry or any of the other databases available via free access of commercial transaction, they ALL have errors. It is inevitable. Zsolt&#8217;s attempt to highlight that such errors exist was done, I believe, with pedagogical intent. [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] The past couple of days has seen an interesting exchange going on over on the SimBioSys blog. Zsolt Zsoldos is someone I respect, not only for his passion for his science but also for his want to educate others in the challenges of what he does in developing software. I believe his blog post entitled &#8220;Crystal Structure Errors in CSD too&#8221; was an honest attempt to tell people to be &#8220;careful&#8221; when using data from databases. I don&#8217;t care whether the database is ChemSpider, PubChem, the CAS Registry or any of the other databases available via free access of commercial transaction, they ALL have errors. It is inevitable. Zsolt&#8217;s attempt to highlight that such errors exist was done, I believe, with pedagogical intent. [&#8230;]
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: J</title>
		<link>http://www.simbiosys.com/blog/2008/06/13/crystal-structure-errors-in-csd-too/#comment-1491</link>
		<pubDate>Mon, 16 Jun 2008 13:39:18 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/06/13/crystal-structure-errors-in-csd-too/#comment-1491</guid>
					<description>These comments are interesting - not because they reveal anything that a small molecule crystallographer doesn't know: More because they reveal that modellers have expectations of information that are a bit naive.

I'm a long time user of the CSD and, as a small molecule crystallographer, I understand the caveats behind crystallographic data. There are errors in the published crystal structures, and not all of them will get spotted during peer review or data curation. CSD users would be well advised to try to understand them and factor them in to their work. I particularly turn your attention to Points 3 and 4 below ...


Point 1.
 
H-positions are sometimes hard to resolve in small molecule studies, and need to be treated warily in crystal structures. Ok - the entry QUICNA01 is a neutron study, so one would expect them to be better, but disordering is an issue.

One should always look at both the 2D and 3D structural information when working with crystal structures. If you look at the 2D representation in the CSD for QUICNA01 it is correct.


Point 2.

Undiagnosed disorder/symmetry can lead to problems: There are structural studies in the CSD where the crystallographer has missed a disorder, or missed some symmetry. AACRUB is an example of missed symmetry - and when you look at the study you see rather dubious bond lengths and angles, due to correlations in the refinement co-variance matrix.

Quite often, when this sort of thing happens, a later study will then correct the error: see AACRUB01 in this case.

Missed dis-orderings and symmetry are hard to spot, note: This is by far the most likely thing to trip up a modeller who 'just wants the coordinates'.


Point 3. 

Newer structural studies are more likely to be more reliable than older studies due to enormous improvements in equipment and software to undertake the studies. I think, in the case of QUICNA01 this is very pertinent. The structure was published in 1974 .... Ok - if this is the only structure then you may have to use it but ....


Point 4.

If there are several similar studies of a structure, they end up in a CSD refcode family. In the case of QUICNA01 you also have some later studies - namely QUICNA02 QUICNA03 QUICNA10 QUICNA11 QUICNA12 QUICNA13  

QUICNA10, QUICNA11, QUICNA12 and QUICNA13  are all later studies of the structure, and they *all* have the proton to which you refer, since they are 'deuterated' compounds which will resolve better in neutron studies.

Now - you might quite reasonably say 'but how do I know which one to pick?' - There is this study

http://www.ccdc.cam.ac.uk/free_services/best_representative/

Though admittedly for QUICNA note that the choice is inconclusive based on the 4 lists given: I think the hydrogen list may not account for deuteration.</description>
		<content:encoded><![CDATA[<p>These comments are interesting - not because they reveal anything that a small molecule crystallographer doesn&#8217;t know: More because they reveal that modellers have expectations of information that are a bit naive.</p>
<p>I&#8217;m a long time user of the CSD and, as a small molecule crystallographer, I understand the caveats behind crystallographic data. There are errors in the published crystal structures, and not all of them will get spotted during peer review or data curation. CSD users would be well advised to try to understand them and factor them in to their work. I particularly turn your attention to Points 3 and 4 below &#8230;</p>
<p>Point 1.</p>
<p>H-positions are sometimes hard to resolve in small molecule studies, and need to be treated warily in crystal structures. Ok - the entry QUICNA01 is a neutron study, so one would expect them to be better, but disordering is an issue.</p>
<p>One should always look at both the 2D and 3D structural information when working with crystal structures. If you look at the 2D representation in the CSD for QUICNA01 it is correct.</p>
<p>Point 2.</p>
<p>Undiagnosed disorder/symmetry can lead to problems: There are structural studies in the CSD where the crystallographer has missed a disorder, or missed some symmetry. AACRUB is an example of missed symmetry - and when you look at the study you see rather dubious bond lengths and angles, due to correlations in the refinement co-variance matrix.</p>
<p>Quite often, when this sort of thing happens, a later study will then correct the error: see AACRUB01 in this case.</p>
<p>Missed dis-orderings and symmetry are hard to spot, note: This is by far the most likely thing to trip up a modeller who &#8216;just wants the coordinates&#8217;.</p>
<p>Point 3. </p>
<p>Newer structural studies are more likely to be more reliable than older studies due to enormous improvements in equipment and software to undertake the studies. I think, in the case of QUICNA01 this is very pertinent. The structure was published in 1974 &#8230;. Ok - if this is the only structure then you may have to use it but &#8230;.</p>
<p>Point 4.</p>
<p>If there are several similar studies of a structure, they end up in a CSD refcode family. In the case of QUICNA01 you also have some later studies - namely QUICNA02 QUICNA03 QUICNA10 QUICNA11 QUICNA12 QUICNA13  </p>
<p>QUICNA10, QUICNA11, QUICNA12 and QUICNA13  are all later studies of the structure, and they *all* have the proton to which you refer, since they are &#8216;deuterated&#8217; compounds which will resolve better in neutron studies.</p>
<p>Now - you might quite reasonably say &#8216;but how do I know which one to pick?&#8217; - There is this study</p>
<p><a href="http://www.ccdc.cam.ac.uk/free_services/best_representative/" rel="nofollow">http://www.ccdc.cam.ac.uk/free_services/best_representative/</a></p>
<p>Though admittedly for QUICNA note that the choice is inconclusive based on the 4 lists given: I think the hydrogen list may not account for deuteration.
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Ashutosh</title>
		<link>http://www.simbiosys.com/blog/2008/06/13/crystal-structure-errors-in-csd-too/#comment-1490</link>
		<pubDate>Sat, 14 Jun 2008 06:12:15 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/06/13/crystal-structure-errors-in-csd-too/#comment-1490</guid>
					<description>Gerhard Kleywegt is a very entertaining and funny speaker. His recent talk at an OpenEye meeting was both hilarious and informative. It's good to be mindful of errors in crystal structures as you indicated. While you note correctly that x-ray structures should not be too far from the global minimum, since global minima are force-field dependent, it's hard to judge exactly how far they are from the solution global minimum. Plus, gas phase force-field global minima are often compromised and overstabilized by electrostatic interactions.</description>
		<content:encoded><![CDATA[<p>Gerhard Kleywegt is a very entertaining and funny speaker. His recent talk at an OpenEye meeting was both hilarious and informative. It&#8217;s good to be mindful of errors in crystal structures as you indicated. While you note correctly that x-ray structures should not be too far from the global minimum, since global minima are force-field dependent, it&#8217;s hard to judge exactly how far they are from the solution global minimum. Plus, gas phase force-field global minima are often compromised and overstabilized by electrostatic interactions.
</p>
]]></content:encoded>
				</item>
</channel>
</rss>

