Archive for the 'General Discussion' Category

SimBioSys’ Announces Distribution Network in Japan

Tuesday, November 4th, 2008

Over the past few months we have seen an increasing demand for our software products in various parts of the world. The reputation of our eHiTS docking software in particular has gathered a lot of attention in the past couple of years and, with our recent developments in terms of eHiTS on a Cell processor (eHiTS Lightning) we have decided to furnish sales and support directly through local distributors. We now distribute across the majority of mainland Europe through our distributor ChemCad and we now have distributors in Japan.

We are happy to announce that Cybernet Systems Co., Ltd. (http://www.cybernet.co.jp/lifescience/) will distribute our full portfolio of software solutions to life science research scientists throughout Japan. In addition Argo Graphics Inc. (http://www.argo-graph.co.jp/) will distribute eHiTS Lightning on IBM’s Cell platform. Cybernet is ideally placed within the Japanese marketplace to take our products to their existing Life Science customers. Argo Graphics Inc. is ideally placed to take our eHiTS Lightning as a combined hardware and software solution to their Life Science IT customers. We look forward to a long and successful relationship with both Cybernet and Argo Graphics.

SimBioSys will be at the eCheminfo meeting Oct 14-17 in Bryn Mawr, PA, USA

Sunday, October 5th, 2008

We will be attending the eCheminfo Autumn Community of Practice meeting Oct 14-17, 2008, Bryn Mawr, PA, USA.

Zsolt Zsoldos, Chief Scientific/Technical Officer of SimBioSys will be presenting a talk on “The eHiTS scoring function”, while

Orr Ravitz, Applications Scientist, will be presenting a poster on “Utilizing training techniques in eHiTS improve score-RMSD and score-IC50 correlations in in silico high throughput screening”.

Looking at the events and the abstracts, this should be a very interesting meeting. Full list of the “Docking & Scoring” event is posted here:

http://barryhardy.blogs.com/cheminfostream/2008/09/docking-and-sco.html

If you are going to the event and would like to set aside time to discuss or demo our products please contact us at info[at]simbiosys.ca

posted by Aniko

Orr Ravitz joined SimBioSys

Tuesday, September 2nd, 2008

<meta name="GENERATOR" content="OpenOffice.org 2.4 (Unix)" /> <style type="text/css"> <!-- @page { size: 8.5in 11in; margin: 0.79in } P { margin-bottom: 0.08in } --> </style></p> <p style="margin-bottom: 0in"><font size="2" style="font-size: 11pt">Hi,</font></p> <p style="margin-bottom: 0in"><font size="2" style="font-size: 11pt">Allow me to Introduce myself. My name is Orr Ravitz, and I recently joined SimBioSys as an applications scientist. I was fortunate to join the company in a time of significant developments: in the next few months we will launch improved versions of our drug discovery tools, eHiTS, eHiTS Lightning, and LASSO, we are in the process of introducing to the market our unique synthetic route designer, ARChem, and we also have several innovative scientific concepts in the pipeline. The fact that this is happening while the pharmaceutical industry is undergoing dramatic restructuring, offers SimBioSys an opportunity to make a huge impact on the process of drug discovery and drug development.</font></p> <p style="margin-bottom: 0in"><font size="2" style="font-size: 11pt"><br /> My professional path has been, in a sense, a small-scale analogy to the evolution of computational chemistry as many of us envision it. I started in basic research, working with pen and paper on mathematical problems in quantum dynamics. I then switched to applying computational quantum methods to study spectral tuning mechanisms in rhodopsin, and rhodopsin isoforms. And in my current incarnation, I am participating in the grand endeavour of developing and delivering tools for computer aided drug design. This gradual transition from working on scientific problems from a pure academic perspective, to working on scientific products with very concrete applications is both intriguing and challenging.</font></p> <p style="margin-bottom: 0in"><font size="2" style="font-size: 11pt">It is no secret that we are in trying times for the pharmaceutical industry. In spite of tremendous amounts of public and private money spent on R&D, we see very few new drugs entering the market each year. It may be argued that this is partly because of the business models of the pharmaceutical companies, but, to a large extent, this is a genuine manifestation for our (much) less-than-perfect ability to translate our scientific knowledge, understanding and capabilities to practical solutions to acute problems. This is where SimBioSys plays a crucial role.</font></p> <p style="margin-bottom: 0in"><font size="2" style="font-size: 11pt">SimBioSys is providing cutting edge solutions (see picture below)<img alt="Orr provides cutting edge solutions - as a fire fighter" title="Orr provides cutting edge solutions - as a fire fighter" src="http://www.simbiosys.ca/images/orr_blog_photo.jpg" /> </font></p> <p style="margin-bottom: 0in"><font size="2" style="font-size: 11pt">to reduce the costs and enhance the capabilities of drug discovery and drug development. I strongly believe that computational tools are going to become an increasingly dominant means to improve the scientific and financial performance of pharmaceutical companies. SimBioSys, with a team that offers a well balanced combination of computer scientists and chemists, is in a position to become one of the leaders in the area. I hope to be able to add my own perspective to this group of talented people, and I further hope to maintain an open dialogue with our clients, and with people of the chemistry community and learn how SimBioSys can better serve the noble cause of finding cures for diseases and improving the well being of people.</font></p> <p style="margin-bottom: 0in">posted by Orr </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a>, <a href="http://www.simbiosys.com/blog/category/news/" title="View all posts in News" rel="category tag">News</a> | <a href="http://www.simbiosys.com/blog/2008/09/02/orr-ravitz-joined-simbiosys/#respond" title="Comment on Orr Ravitz joined SimBioSys">No Comments »</a></p> </div> <div class="post"> <h3 id="post-40"><a href="http://www.simbiosys.com/blog/2008/06/18/public-apology-to-ccdc/" rel="bookmark" title="Permanent Link to Public apology to CCDC">Public apology to CCDC</a></h3> <small>Wednesday, June 18th, 2008</small> <div class="entry"> <p>My previous post about errors in crystal structures have triggered strong reactions from CCDC (not only response post, direct email, but also email to my former PhD supervisor in the UK asking him for remedy and explanation). Apparently, they have interpreted my post as an attack on the quality of their services. Let me clarify first, that I have never intended to imply anything negative or derogatory about the CCDC services or software. My sincere apologies if my post came across that way. All I wanted to do is raise awareness in the docking/scoring community that small molecule crystallographic data is not free of errors. My understanding is, that the data deposited in CSD has been determined by thousands of people all over the world and published in <a title="IUCr" href="http://www.iucr.org/iucr-top/iucr/gicsi/positionpaper.html">various scientific journals</a>, while CCDC aggregates the data and creates a comprehensive, validated and value-added database known as the Cambridge Structural Database (CSD), and the complete CSD System (CSDS) includes the CSD itself and associated software for search, visualisation and analysis of stored information. I acknowledge that CCDC provides a valuable service to the community and any error in the data is not their fault.</p> <p>They have also sent us a “friendly reminder” that since our license to CSD has expired, according to the signed agreement we are not allowed to retain or use any data downloaded form CSD, not even any derived information or data. As I already stated in the update added to the previous blog entry, we have ceased using any data derived from CSD to comply with the license. I have even removed the image of the molecule from the post (since that can also be considered as derived data). We have not incorporated any data into our software. As I mentioned in the previous post, we had the intention to improve our scoring function with statistics collected from CSD (while we had the license during 2007), but it did not prove to be useful, therefore we abandoned that approach and continue to use publicly available PDB crystal structure data — which has been used for all released version of the software. We have not renewed our (rather pricey) license for 2008 for this reason.<br /> One lesson I learned from this exchange is the importance of <a title="open data" href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=906">Open Data</a> for scientific advancement (some scientists believe that <a title="research data" href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1145#comment-232476">research data must be free</a>), e.g. such that is available from <a title="CrystalEye" href="http://wwmm.ch.cam.ac.uk/crystaleye/">CrystalEye</a>. When even non-profit organizations (registered as charity) use draconian license agreements protecting data created and published by others, then fully commercial entities (like pharmaceutical companies) must be guarding their own data even stronger. It makes it difficult to make scientific progress if a single blog mention of an error in a data entry invites the wrath of the company who sells services on the data. It is ironic that the links expressing the need for open data and the open repository happens to point to a web site within the same University where CCDC resides.<br /> ZZ </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a> | <a href="http://www.simbiosys.com/blog/2008/06/18/public-apology-to-ccdc/#comments" title="Comment on Public apology to CCDC">6 Comments »</a></p> </div> <div class="post"> <h3 id="post-32"><a href="http://www.simbiosys.com/blog/2008/06/04/chemical-software-quality-3-polar-surface-area/" rel="bookmark" title="Permanent Link to Chemical software quality 3 - polar surface area">Chemical software quality 3 - polar surface area</a></h3> <small>Wednesday, June 4th, 2008</small> <div class="entry"> <p>This is my third post in the ongoing debate on chemical software quality. To give quick look-up table for those who join the discussion late, let me throw in some pointers in a time-line:</p> <ol> <li>Egon Willighagen wrote a blog about unit testing in CDK: <a title="cdk diff 1" href="http://chem-bla-ics.blogspot.com/2008/06/finding-differences-between.html">Finding differences between IChemObjects</a></li> <li>Peter Murray Rust responded: <a rel="bookmark" title="Permanent Link: The Blue Obelisk - Egon’s diff is boring" href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1130">The Blue Obelisk - Egon’s diff is boring</a> making some general comments about chemical software quality</li> <li>Egon continued his unit testing story: <a href="http://chem-bla-ics.blogspot.com/2008/06/finding-differences-between_03.html">Finding differences between IChemObjects #2</a></li> <li>Egon responded to PMR (2): <a href="http://chem-bla-ics.blogspot.com/2008/06/good-scientists-pimp-there-research-was.html">Good Scientists Pimp there Research (was: Damn, I’m boring…)</a></li> <li>I responded to PMR (2, also citing 1,3,4): <a title="Permanent Link: Research and software testing" rel="bookmark" href="http://www.simbiosys.ca/2008/06/03/research-and-software-testing/">Research and software testing</a></li> <li>PMR responded to my post (5): <a title="Permanent Link: Quality in chemical software - a debate" rel="bookmark" href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1133">Quality in chemical software - a debate</a></li> <li>I responded to PMR (6):<a title="Permanent Link: Quality in chemical software - the debate continues" rel="bookmark" href="http://www.simbiosys.ca/2008/06/03/quality-in-chemical-software-the-debate-continues/"> Quality in chemical software - the debate continues</a></li> <li>PMR responded to (7): <a rel="bookmark" title="Permanent Link: Quality is emerging in chemical software" href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1134">Quality is emerging in chemical software</a></li> <li>Egon responded to (2,5,6,7) in his post: <a href="http://chem-bla-ics.blogspot.com/2008/06/recovering-full-mass-spectra-from-gc-ms.html">Recovering full mass spectra from GC-MS data</a></li> </ol> <p>In post (9) Egon points out that annual competition results and benchmark results that I referred to in (7) have very little connection with unit testing and basic software quality — i.e. detecting, fixing and avoiding bugs — which is the focus of unit testing discussed in (1,3,5). I agree and that is exactly why I focused on that definition of software quality in my first post (5) in this debate. PMR has accepted my defense of closed source software quality in the software engineering sense (”<em>I am prepared to believe that a company is able to reproduce its own results internally and I suspect that the quality is better than it was 10 years ago.</em>“) and jumped onto a different definition of chemical software quality in (6) - one that has to do with assessing the scientific value of the answers provided by the software. This is what I addressed in (7), so I am not confusing the quality addressed by unit testing and the competitions or benchmarks, and I hope it is equally clear to everyone else that these are two very distinct issues. It seems everybody is agreement that unit testing is crucial, and now we have a common understanding that it has always been (traditionally) applied in the commercial chemical software world. Now I will respond to the new points raised by PMR’s post (8):</p> <blockquote><p><em>PMR: By a tradition of quality I mean that there is a communal understanding that quality matters. Although quality is a wide term it is often difficult to discuss unless it is measured.</em></p></blockquote> <p>Indeed, and we already touched on two very different meanings of the word as I elaborated above.</p> <blockquote><p><em>PMR: Leaving aside the stochastic aspect - which we agree on (and which makes quality assessment much harder) my concern is not whether a given calculation is reproducible when confined to a manufacturers platform, but whether the results have been assessed as meaningful. Now I agree that this is not easy, but unless the manufacturers develop interoperable standards then the quality of the result is only assessable by public assessment, requiring standard data sets and standard results. I gave the example of “(total) polar surface area” which should, in principle, be computable reproducibly by all manufacturers. But only if it is defined in a manner that all agree upon. Otherwise we have as many different values as there are manufacturers. And I would content that - unless each has a clear defintions of the lagorithm and the proerty calculated - this is a lack of quality.</em></p></blockquote> <p>Well the question: “what is the (total) polar surface area of a molecule ?” really belongs to the computing non-observable category. I would say it is about as well defined as the question: “what is <em>the</em> favourite color ?” Of course, there is no single “correct” answer for either one. You need to specify the question far more precisely if you want to get a meaningful answer. First, which <a title="molecular surfaces" href="http://www.simbiosys.ca/sprout/eccc/cangaroo.html">surface</a> are you talking about ? The <a title="vdWSA" href="http://en.wikipedia.org/wiki/Van_der_Waals_surface">van der Waals surface</a>, the <a title="ASA" href="http://en.wikipedia.org/wiki/Accessible_surface_area">solvent accessible surface</a>, the <a title="SES" href="http://www.scripps.edu/~sanner/html/msms_home.html">solvent excluded</a> (aka<a title="Connolly surface" href="http://www.netsci.org/Science/Compchem/feature14.html"> Connolly</a>) surface or any of the <a title="isosurface" href="http://en.wikipedia.org/wiki/Isosurface">iso-surfaces</a>, e.g. electron density iso-surface at any given cut-off ? If you choose one of the former three, what radius values should be used to define the atom spheres ? Each force field has a different set of vdw radii. If you choose electron density iso-surface, what cut-off value should be chosen ? All of these choices will significantly alter the “correct answer” to the question. Then how do you define what part of the surface is considered polar ? For vdwSA or SAS it maybe defined based on atom type, like O, N being polar, hydrophobic carbons being apolar. But what about aromatic carbons next to a nitrogen, or the carbon of a charged group, like carboxylate, should that be considered polar or not ? Or should polarity be defined by computing the net charge effect of all atom based partial charges for every single point of the surface and sum that up via a surface integral ? How do you assign partial charge values for that ? Or use a QM method to compute the charges at surface points ? What level of theory to use ? All these questions and choices are outside the scope of software correctness or quality. There is a correct answer for each variation of the question and each set of parameters chosen. Once the choices and parameters are fixed, then you can ask how accurately a given software computes the polar surface area for the given specification. So, simply this “property” isn’t a single property but a whole range. Oh, and don’t forget the conformation, because a lot of interesting molecules are flexible and the PSA will depend on what conformation you use to compute it. The lowest energy conformation may not be the most relevant if you are interested in bio-activity against a specific target, you need to know the bioactive conformation. So before we can even begin to address the question of software quality metrics, we need to define the problems precisely. Otherwise, you may get a totally different result from every piece of software package and all of them can be accurate.</p> <blockquote><p><em>PMR: I have not - and will not - claim that the Open Source movement in chemistry is of higher quality than closed source.</em></p></blockquote> <p>This statement is easy to refute by a verbatim quote from <a title="PMR(2)" href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1130">PMR’s post (2)</a>:</p> <blockquote><p><em>PMR: So the Blue Obelisk is emerging as the main area which takes quality in chemical software and chemical data seriously. More organisations are taking Open Source seriously. I met a chemical software company last week - no names - who is seriously looking at Open Source and thinking of integrating its competitors’ products. Perhaps not RSN, but they are looking at it.</em></p> <p><em>And when they do they will find the Blue Obelisk is the <strong>only</strong> place for software and data quality. </em></p></blockquote> <p>Notice the word <strong>only</strong> (emphasis is mine) in the last sentence. It is clear that it is an even stronger statement than claiming OS to be higher quality than closed, it implies there is no quality in closed source chemical software at all. Incidentally, this is the statement that “inspired” me to enter this debate in the first place.</p> <blockquote><p><em>PMR: I said there was no tradition of quality. As a result of your post I will moderate this statement slightly.</em></p></blockquote> <blockquote><p><em>PMR: I agree this, but note that many of these are very recent. So I would be prepared to say that in certain fields a tradition of quality metrics is starting to emerge. Almost all of these relate to docking into proteins and are driven, at least in part, by the tradition of competitions in proteins such as CASP which has for many years been involved in predicting protein structure.</em></p> <p><em> So I wish them well and will now exclude docking (but not QSAR) from my remarks.</em></p></blockquote> <p>Thank you very much, Peter. I am glad you changed your view about docking. This area has been the main focus of my research and development for the past 6 years and I believe we have very good results measured with sound scientific quality metrics. Mind you, some vendors in the past have used <a title="zride" href="http://www.simbiosys.ca/blog/2008/05/26/cognate-docking-accuracy-measurement-vs-pre-optimized-pose/">rather questionable metrics</a> to report good results, and I have explained in that post how such metric can lead to ridiculous results.<br /> ZZ </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a>, <a href="http://www.simbiosys.com/blog/category/science/" title="View all posts in Science" rel="category tag">Science</a> | <a href="http://www.simbiosys.com/blog/2008/06/04/chemical-software-quality-3-polar-surface-area/#respond" title="Comment on Chemical software quality 3 - polar surface area">No Comments »</a></p> </div> <div class="post"> <h3 id="post-26"><a href="http://www.simbiosys.com/blog/2008/05/28/floating-point-errors/" rel="bookmark" title="Permanent Link to Floating point errors">Floating point errors</a></h3> <small>Wednesday, May 28th, 2008</small> <div class="entry"> <p>On the <a title="CCL" href="http://www.ccl.net/">CCL</a> mailing list <span style="color: #00681c" class="EP8xU">Sina Türeli</span> has posted this question:</p> <p style="margin-left: 40px">“I am working on a project related to proteins and my precision for coordinates is there digits after the dot. When I do operations like rotation around a dihedral, the dihedrals which shouldn’t change change at about 0.01 angles and so. I am afraid though that is not much it might accumulate over time. So do you have any suggestions for reducing floating integer errors? Would’t be feasible if I turned lets say the coordinate 21.567 to 21567 and do my operations? Or maybe even 215670? “</p> <p>I have spent a lot of time fighting similar problems so I know how annoying this can be. To understand what is hapenning, look at how floating point numbers are stored and operated upon according to the <a title="IEEE 754 wiki" href="http://en.wikipedia.org/wiki/IEEE_754">IEEE 754 standard</a>. Because of the binary representation, our decimal fractional numbers do not match exactly to float/double numbers. Even though, the 23 fraction bits in a floating point binary number map roughly to 6 decimal digit precision, it still does not mean that all 3 digit decimal numbers can be represented precisely. On the other hand, integers can be stored exactly, so that gives basis to the idea to store the number 21.567 as 21567 or 215670. This would work for storing and also for applying some basic arithmetic (addition, subtraction and multiplication) to numbers accurately without any error. However, division starts a problem and any trigonometric functions or sqare root function blows up the problem to be much worse than what you have with floating point numbers. Those functions produce <a title="irrational number wiki" href="http://en.wikipedia.org/wiki/Irrational_number">irrational numbers</a> i.e. they cannot be represented by a division of two integers. Unfortunately, rotations are typically defined by angles and the coordinate transformation requires the sine and cosine of the angle — depending on the task, sometimes the transformation required can be expressed in other ways, e.g. by quaternions, and sometimes we can compute transformations by simpler arithmetic if the goal is to transform some atoms to specific positions, e.g. an overlay (rather than rotation by a given angle). So, in short using integer fixed point representation will not solve the problem of coordinate drifting errors of 3D transformations, especially if rotations by a given angle is required. On the other hand, it can solve simpler problems.<br /> How bad is the floating point problem and what can be solved by fixed point integer arithmetic ? Let me give you an example: you learned in school that a+b+c = c+a+b. Well, this simple rule breaks even for single digit precision floating point numbers! Consider the following simple C code (you can download the <a title="sumorder.c" href="http://www.simbiosys.ca/blog/examples/sumorder.c">source</a>, or a linux <a title="sumorder" href="http://www.simbiosys.ca/blog/examples/sumorder">binary</a>):</p> <pre><tt>#include int main() { float a, b, c, d, s; for (a = 0.1; a < 9.9; a += 0.2) { for (b=0.4; b < 9.9; b += 0.1) { for (c=0.7; c < 9.9; c += 0.3) { s = a + b; s += c; d = c + a; d += b; if ( s != d ) { printf( "a=%f, b=%f, c=%f, a+b+c=%f, c+a+b=%f hex: a=0x%08x b=0x%08x c=0x%08x s=0x%08x d=0x%08xn", a, b, c, s, d, *(int *) &a,*(int *) &b,*(int *) &c,*(int *) &s, *(int *) &d ); } } } } }</tt></pre> <p>Sorry for the lack of indentation, I could not convince WordPress to keep it pre-formatted.</p> <p>You can see that if the summation in two orders were the same for all tested cases it would never print anything. However, when you run it, you get plenty of examples (34585 cases on my computer, about one third of all cases tested) printed where the rule breaks. Here are the first few examples you get:</p> <pre><tt>a=0.100000, b=0.400000, c=3.700000, a+b+c=4.200000, c+a+b=4.199999 hex: a=0×3dcccccd b=0×3ecccccd c=0×406ccccb s=0×40866666 d=0×40866665 a=0.100000, b=0.400000, c=7.600002, a+b+c=8.100002, c+a+b=8.100001 hex: a=0×3dcccccd b=0×3ecccccd c=0×40f33337 s=0×4101999c d=0×4101999b a=0.100000, b=0.500000, c=2.500000, a+b+c=3.100000, c+a+b=3.100000 hex: a=0×3dcccccd b=0×3f000000 c=0×401fffff s=0×40466666 d=0×40466665</tt></pre> <p>As you can see from the hexadecimal version, the difference is only in the last 1 bit. Nevertheless, it is enough to throw off the result. Imagine, if you sum up score components, sort them and select only the best N solutions. Suddenly, you may keep or lose a solution depending on the order of summation. Now, that is scary…This problem <strong>cannot be solved by using double precision</strong>, but it can be solved simply by using fixed point integer representation. However, fixed point does not help for the rotation problem I am afraid.</p> <p>ZZ </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a>, <a href="http://www.simbiosys.com/blog/category/technology/" title="View all posts in Technology" rel="category tag">Technology</a> | <a href="http://www.simbiosys.com/blog/2008/05/28/floating-point-errors/#comments" title="Comment on Floating point errors">5 Comments »</a></p> </div> <div class="post"> <h3 id="post-21"><a href="http://www.simbiosys.com/blog/2008/05/13/conformance-problems-odf-and-ooxml/" rel="bookmark" title="Permanent Link to Conformance problems: ODF and OOXML">Conformance problems: ODF and OOXML</a></h3> <small>Tuesday, May 13th, 2008</small> <div class="entry"> <p>Apparently, the wikipedia page I linked to in my previous post about ODF supporting software is overly optimistic according to <a title="ODF conformance problems" href="http://ptsefton.com/2008/05/13/claims-about-odf-support-are-typically-meaningless.htm#comment-1847">Peter Sefton’s blog</a>. He demonstrates that only OpenOffice.org and StarOffice works properly with ODF while others have serious problems even with very basic formating. There is also a very useful <a title="converter table" href="http://ptsefton.com/2008/05/13/more-on-odf-and-ooxml.htm">converter table</a> posted by Peter.</p> <p>OK, so that brings the ODF conformance count down to 2, however, this is still 2 more than the number of applications that conform to the OOXML standard, which is exactly zero at this moment according to <a title="OOXML test" href="http://www.griffinbrown.co.uk/blog/PermaLink,guid,3e2202cd-59a3-4356-8f30-b8eb79735e1a.aspx">these</a> <a title="OOXML test on zdnet" href="http://news.zdnet.co.uk/software/0,1000000121,39388229,00.htm">tests</a>. So, the race is on, the result is 2:0 so far with ODF in the lead <img src='http://www.simbiosys.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p> <p>ZZ. — A proud member of the ODF “cheer squad” </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a>, <a href="http://www.simbiosys.com/blog/category/technology/" title="View all posts in Technology" rel="category tag">Technology</a> | <a href="http://www.simbiosys.com/blog/2008/05/13/conformance-problems-odf-and-ooxml/#comments" title="Comment on Conformance problems: ODF and OOXML">2 Comments »</a></p> </div> <div class="post"> <h3 id="post-20"><a href="http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/" rel="bookmark" title="Permanent Link to What is wrong with OOXML">What is wrong with OOXML</a></h3> <small>Sunday, May 11th, 2008</small> <div class="entry"> <p>Peter MR has <a title="petermr about OOXML" href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1091">voiced his opinions on his blog</a> about the use of OOXML for archiving chemistry documents:</p> <blockquote><p>The reason I currently like OOXML is that we can make it work and that we have material in Word that we can use. I’ll be demoing it publicly in a week’s time (more later). If we had material in ODT we’d use that, but we don’t.</p> <p>…</p> <p>My worry about Open Office (which emits ODT) is that I don’t yet believe that has reached a state where I could evangelize it without it falling over or being too difficult to install.</p></blockquote> <p>I would much rather recommend the ODF (ODT) format, which is a truly open <a href="http://www.oasis-open.org/specs/index.php#opendocumentv1.0">ISO standard</a> (approved on May 1st 2006). OpenOffice.org is only one of the tools that can generate it, there are <a title="wiki list of ODF compatible sw" href="http://en.wikipedia.org/wiki/List_of_applications_supporting_OpenDocument">several others</a> as well as various converters (e.g.<a title="SUNS office plugin" href="http://www.sun.com/software/star/odf_plugin/index.jsp"> SUN’s MS Office plugin</a>, <a title="clever age translator" href="http://sourceforge.net/projects/odf-converter">Clever Age ODF translator</a>) available for MS Word users.<br /> His point is that it is still better to use OOXML than the binary doc format of MS Word. I do not agree with this point, I think OOXML is just as bad as the binary Word doc, for these reasons:</p> <ol> <li>It is a <a title="OOXML patent encumbered" href="http://holloway.co.nz/can-other-vendors-implement-ooxml.html">single vendor format with patent encumbered binary extensions</a> — so it might as well be called proprietary. OOXML <a title="GPL incompatibility" href="http://www.groklaw.net/articlebasic.php?story=20080221184924826">cannot be implemented by open source software</a> due to incompatibilities with the GPL.</li> <li>The <a title="NB comments on OOXML" href="http://iso-vote.com/comments.html">national bodies have raised over a thousand unique objections</a> about technical details of the format during the ISO process (see also <a href="http://www.dis29500.org/">the wiki collection</a>), less than<a title="what happened at BRM" href="http://www.groklaw.net/article.php?story=20080328090328998"> 20 percent of which has been discussed</a> during the <a title="BRM" href="http://www.jtc1sc34.org/repository/0932.htm">Ballot Resolution Meeting</a> and most of those was not resolved to the satisfaction of the opponents. You can find a good collection of remaining problems <a title="OOXML problems" href="http://www.grokdoc.net/index.php/EOOXML_objections">here</a></li> <li>It has been accepted as a standard via blatant <a title="Gates asks president" href="http://www.groklaw.net/article.php?story=20080327104739103">manipulation</a>, <a title="ballot stuffing" href="http://www.noooxml.org/forum/t-47722/last-minute-committee-stuffing-in-romania">ballot stuffing</a>, <a title="yes without vote" href="http://www.groklaw.net/comment.php?mode=display&sid=20080327104739103">corruption</a> in various levels, see some of <a title="OOXML ISO history" href="http://holloway.co.nz/blog/?p=10">the history here</a> and <a title="ISO 29600 history" href="http://www.tideway.com/community/blog-post/how-to-buy-a-standard-in-10-days/">here</a>. More irregularities: <a title="polish rule" href="http://www.noooxml.org/forum/t-49455/polish-chairwomen-distributes-microsoft-propaganda">Poland’s new rule: no vote equals yes</a>, <a title="no as yes" href="http://www.groklaw.net/article.php?story=20080324121844682">Cuba’s No vote counted as yes</a>, Microsoft friendly <a title="yes men" href="http://www.noooxml.org/forum/t-48345/belgium-also-stuffed-with-microsoft-busi%20ness-partners">“yes-men” invaded Belgium’s</a> Technical Committee, Denmark voted <a title="50% consensus" href="http://www.ds.dk/4225">yes by consensus while 50% oppose</a>d, interesting <a title="croatia" href="http://www.groklaw.net/article.php?story=2008032913190768">vote counting in Croatia</a> (14 No + 3 Yes = Yes), how the <a title="Filippines" href="http://www.groklaw.net/article.php?story=20080402003610230">Philippines changed</a> their vote from no to yes.</li> <li>ISO has violated the WTO rules by allowing a duplicative standard to an existing one (ODF), <a title="WTO rule violation" href="http://ec.europa.eu/idabc/en/document/7546/469">according to Tineke Egyedi</a>, president of the European Academy for Standardisation.</li> <li>OOXML reinvents the wheel, ignoring and replacing mature standards like SVG, MathML, XForms and even XML. The most prominent example is the neglection of MathML where OOXML defines its own formula markup language (OOMML).</li> <li>OOXML requires <strong>undisclosed copyrighted material from Microsoft Office.</strong> The previous problem of Border Style art being undisclosed was acknowledged and fixed on February 22nd 2008 however Part 4 2.18.94 ST_TextEffect (Animated Text Effects) describes VML art that is not included in the specification.</li> <li><strong>OOXML does not provide the Binary to XML mapping which is required to fully represent the existing corpus of user documents.</strong> No other application supporting OOXML will be able to faithfully or fully recreate the look of Microsoft’s legacy binary documents. Although the binary Office document specifications have been posted by Microsoft (15 Feb 2008), no standardized mappings were offered during the BRM, as requested by the US, United Kingdom, Brazil, and Malaysia, amongst others.</li> <li><strong>Markets cannot rely on ISO standards with calculation errors.</strong> Spreadsheet formulas still result in calculation errors. Although the CEILING function was recognised to have a legacy bug and fixed during the BRM, there exist more mathematical inaccuracies in OOXML’s spreadsheet function. The FLOOR function has been identified to have a similar mathematical inaccuracies for negative numbers. This is a problem that needs to get carefully studied. We recall that Intel faced a consumer scandal and losses when their new Pentium chip was found to have a calculation error. The Y2K problem, a standardization issue, resulted in billions of investment for damage control.</li> <li><strong>Macro functionality is not properly defined.</strong> Section 2.16.5.41 defines a “MACROBUTTON” field that allows the definition of a button in the document that will trigger a macro. But little is said about how the macro is stored, bound, what API’s are available, or what the security model is for this feature. ECMA’s disposition (approved in batch by the BRM without discussion or opportunity for objection), was something quite different and unsatisfactory. ECMA simply added: “The mechanism by which the command specified by text in field-argument-1 is located and/or executed by an application is <strong>“implementation-defined”.</strong> Unfortunately, with this addition, not only is it impossible to have cross-platform interoperability of this feature, it is unlikely that vendors will be able to implement a reasonable security policy to detect, scan or block macros included in documents.</li> <li>There are additional 850+ technical problems raised during the ISO process and has not been resolved, I will not list all of them here <img src='http://www.simbiosys.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li> </ol> <p>In a single sentence: OOXML is nothing more than a marketing check-box for Microsoft, so that they can now claim to have an open ISO standard document format, but in reality it is neither open, nor standard by any rational definition of the words.</p> <p>ZZ. </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a>, <a href="http://www.simbiosys.com/blog/category/technology/" title="View all posts in Technology" rel="category tag">Technology</a> | <a href="http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comments" title="Comment on What is wrong with OOXML">4 Comments »</a></p> </div> <div class="post"> <h3 id="post-13"><a href="http://www.simbiosys.com/blog/2008/04/24/feedback-from-zsolts-acs-talk-and-pre-acs-benchmarks/" rel="bookmark" title="Permanent Link to Feedback from Zsolt’s ACS talk and Pre-ACS Benchmarks.">Feedback from Zsolt’s ACS talk and Pre-ACS Benchmarks.</a></h3> <small>Thursday, April 24th, 2008</small> <div class="entry"> <p><meta content="text/html; charset=utf-8" http-equiv="Content-Type" /><meta content="Word.Document" name="ProgId" /><meta content="Microsoft Word 9" name="Generator" /><meta content="Microsoft Word 9" name="Originator" /></p> <p><img align="left" title="CELL SPEEDUP" alt="CELL SPEEDUP" src="http://www.simbiosys.ca/blog/images/CELL_SPEEDUP.jpg" /></p> <link rel="File-List" /><!--[if gte mso 9]><xml> <w:WordDocument> <w:View>Normal</w:View> <w:Zoom>0</w:Zoom> <w:DoNotOptimizeForBrowser/> </w:WordDocument> </xml><![endif]--><style> <!-- /* Font Definitions */ @font-face {font-family:Helvetica; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-charset:0; mso-generic-font-family:auto; mso-font-format:other; mso-font-pitch:variable; mso-font-signature:3 0 0 0 1 0;} @font-face {font-family:"MS Mincho"; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-alt:"MS 明朝"; mso-font-charset:128; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:fixed; mso-font-signature:1 134676480 16 0 131072 0;} @font-face {font-family:"\@MS Mincho"; panose-1:0 0 0 0 0 0 0 0 0 0; mso-font-charset:128; mso-generic-font-family:roman; mso-font-format:other; mso-font-pitch:fixed; mso-font-signature:1 134676480 16 0 131072 0;} /* Style Definitions */ p.MsoNormal, li.MsoNormal, div.MsoNormal {mso-style-parent:""; margin:0in; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:11.0pt; mso-bidi-font-size:12.0pt; font-family:Helvetica; mso-fareast-font-family:"Times New Roman"; mso-bidi-font-family:"Times New Roman"; font-weight:bold;} p.MsoPlainText, li.MsoPlainText, div.MsoPlainText {margin:0in; margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:10.0pt; font-family:"Courier New"; mso-fareast-font-family:"Times New Roman"; font-weight:bold;} @page Section1 {size:8.5in 11.0in; margin:1.0in 1.25in 1.0in 1.25in; mso-header-margin:.5in; mso-footer-margin:.5in; mso-paper-source:0;} div.Section1 {page:Section1;} --> </style> <!--StartFragment--> <p class="MsoPlainText"><span style="font-size: 12pt; font-family: Arial; font-weight: normal">Zsolt gave a great talk at ACS with lots of enthusiastic feedback. At the SimBioSys booth he discussed, in detail, the informatics underlying his accurate scoring function. Ryan Lilien (CS, prof and MD from UofT) pointed out that several speakers in the session mentioned the importance of including a scoring term for ligand burial depth, a facet that Zsolt pointed out had been in eHiTS for some time. David Kombo of Targacept came by to comment and discuss the underlying science of the scoring function. His view was echoed by several others, that while accuracy and enrichment metrics presented were impressive, the port to the cell is what really got them ‘hot’. Why was that? <u><span style="color: black">Look at the speedups ZZ is getting PRE-ACS</span></u><span style="color: black"> </span>in the figure below. <span style="color: black">At the time of the ACS meeting overall Zsolt was getting about 36X speed up on the QS20 and 13X on the PS3 for total flexible ligand docking to 20-targets</span><span style="color: #3366ff">.</span> He’s since then made significant improvements <u>nearly doubling those speedups for a problem of even greater complexity</u>. Come to his poster at BIO-IT and to Booth 319 to view the latest results. <em>Want to rapidly dock against the proteome with supercomputer/cluster performance, do family training for robust correlations with IC50’s, and do it even on a $400 platform? Stay tuned for eHiTS Lightening on the Cell. Accurate Results in Lead Discovery As Fast as a Lightning Strike.</em> </span></p> <p><!--EndFragment--> </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a>, <a href="http://www.simbiosys.com/blog/category/news/" title="View all posts in News" rel="category tag">News</a> | <a href="http://www.simbiosys.com/blog/2008/04/24/feedback-from-zsolts-acs-talk-and-pre-acs-benchmarks/#respond" title="Comment on Feedback from Zsolt’s ACS talk and Pre-ACS Benchmarks.">No Comments »</a></p> </div> <div class="post"> <h3 id="post-12"><a href="http://www.simbiosys.com/blog/2008/04/22/introducing-zsolt-zsoldos-csocto-of-simbiosys/" rel="bookmark" title="Permanent Link to Introducing Zsolt Zsoldos, CSO/CTO of SimBioSys">Introducing Zsolt Zsoldos, CSO/CTO of SimBioSys</a></h3> <small>Tuesday, April 22nd, 2008</small> <div class="entry"> <p>Hi, I’m Zsolt Zsoldos, the Chief Scientific/Technical Officer here at SimBioSys. I know the blog has been going for a few months now and I am a bit late to the party, but there has been a reason for that. I’ve been focused on meeting a particular deadline, <a target="_blank" title="Bio-IT" href="http://www.bio-itworldexpo.com/">Bio-IT in Boston</a> next week. At Bio-IT we will be showing, for the first time, eHITS lightning. This is the port of our eHITS software to the Cell processor. I won’t describe this in detail here since we’ve done it in our <a target="_blank" title="eHiTS lightning white paper" href="http://www.simbiosys.ca/science/white_papers/eHiTS_on_the_Cell.pdf">White Paper</a> already. I also spoke about it at the ACS – my presentation is online <a target="_blank" title="ACS 2008 spring talk" href="http://www.simbiosys.ca/science/presentations/2008-acs-04/presentations/eHiTS_Lightning_2008.pdf">here</a>. Take a look at slide 27 in that presentation to see the speed up achieved to date. Things have progressed since that report and I will post more details after the Bio-IT. If you happen to be coming to Bio-IT please do stop by at booth #319 to visit and see eHiTS lightning running live on a Sony Playstation3.</p> <p>Z.Z. </p> </div> <p class="postmetadata">Posted in <a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts in General Discussion" rel="category tag">General Discussion</a>, <a href="http://www.simbiosys.com/blog/category/news/" title="View all posts in News" rel="category tag">News</a> | <a href="http://www.simbiosys.com/blog/2008/04/22/introducing-zsolt-zsoldos-csocto-of-simbiosys/#comments" title="Comment on Introducing Zsolt Zsoldos, CSO/CTO of SimBioSys">1 Comment »</a></p> </div> <div class="navigation"> <div class="alignleft"><a href="http://www.simbiosys.com/blog/category/general-discussion/page/3/">« Previous Entries</a></div> <div class="alignright"><a href="http://www.simbiosys.com/blog/category/general-discussion/">Next Entries »</a></div> </div> </div> <div id="sidebar"> <ul> <li> <a href="http://www.simbiosys.com"><img src="images/SimBioSysLogo_name_long.gif" width="200" ></a> </li> <li> <form method="get" id="searchform" action="http://www.simbiosys.com/blog/"> <div><input type="text" value="" name="s" id="s" /> <input type="submit" id="searchsubmit" value="Search" /> </div> </form> </li> <li> <form style="border:1px solid #ccc;padding:3px;text-align:center;" action="http://www.feedburner.com/fb/a/emailverify" method="post" target="popupwindow" onsubmit="window.open('http://www.feedburner.com/fb/a/emailverifySubmit?feedId=1550225', 'popupwindow', 'scrollbars=yes,width=550,height=520');return true"><p>Enter your email address:</p><p><input type="text" style="width:140px" name="email"/></p><input type="hidden" value="http://feeds.feedburner.com/~e?ffid=1550225" name="url"/><input type="hidden" value="SimBioSys Blog" name="title"/><input type="hidden" name="loc" value="en_US"/><input type="submit" value="Subscribe" /><p>Delivered by <a href="http://www.feedburner.com" target="_blank">FeedBurner</a></p></form> </li> <!-- Author information is disabled per default. Uncomment and fill in your details if you want to use it. <li><h2>Author</h2> <p>A little something about you, the author. Nothing lengthy, just an overview.</p> </li> --> <li> <p>You are currently browsing the archives for the General Discussion category.</p> </li> <li class="pagenav"><h2>Pages</h2><ul><li class="page_item"><a href="http://www.simbiosys.com/blog/about/" title="About SimBioSys">About SimBioSys</a></li> </ul></li> <li><h2>Archives</h2> <ul> <li><a href='http://www.simbiosys.com/blog/2012/01/' title='January 2012'>January 2012</a></li> <li><a href='http://www.simbiosys.com/blog/2011/12/' title='December 2011'>December 2011</a></li> <li><a href='http://www.simbiosys.com/blog/2011/11/' title='November 2011'>November 2011</a></li> <li><a href='http://www.simbiosys.com/blog/2011/09/' title='September 2011'>September 2011</a></li> <li><a href='http://www.simbiosys.com/blog/2011/08/' title='August 2011'>August 2011</a></li> <li><a href='http://www.simbiosys.com/blog/2011/04/' title='April 2011'>April 2011</a></li> <li><a href='http://www.simbiosys.com/blog/2011/03/' title='March 2011'>March 2011</a></li> <li><a href='http://www.simbiosys.com/blog/2011/01/' title='January 2011'>January 2011</a></li> <li><a href='http://www.simbiosys.com/blog/2010/10/' title='October 2010'>October 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/09/' title='September 2010'>September 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/08/' title='August 2010'>August 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/07/' title='July 2010'>July 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/06/' title='June 2010'>June 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/03/' title='March 2010'>March 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2010/01/' title='January 2010'>January 2010</a></li> <li><a href='http://www.simbiosys.com/blog/2009/12/' title='December 2009'>December 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/11/' title='November 2009'>November 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/10/' title='October 2009'>October 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/07/' title='July 2009'>July 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/06/' title='June 2009'>June 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/05/' title='May 2009'>May 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/03/' title='March 2009'>March 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/02/' title='February 2009'>February 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2009/01/' title='January 2009'>January 2009</a></li> <li><a href='http://www.simbiosys.com/blog/2008/12/' title='December 2008'>December 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/11/' title='November 2008'>November 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/10/' title='October 2008'>October 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/09/' title='September 2008'>September 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/08/' title='August 2008'>August 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/07/' title='July 2008'>July 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/06/' title='June 2008'>June 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/05/' title='May 2008'>May 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/04/' title='April 2008'>April 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/03/' title='March 2008'>March 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/02/' title='February 2008'>February 2008</a></li> <li><a href='http://www.simbiosys.com/blog/2008/01/' title='January 2008'>January 2008</a></li> </ul> </li> <li><h2>Categories</h2> <ul> <li class="current-cat"><a href="http://www.simbiosys.com/blog/category/general-discussion/" title="View all posts filed under General Discussion">General Discussion</a> (22) </li> <li><a href="http://www.simbiosys.com/blog/category/news/" title="View all posts filed under News">News</a> (64) </li> <li><a href="http://www.simbiosys.com/blog/category/science/" title="View all posts filed under Science">Science</a> (42) </li> <li><a href="http://www.simbiosys.com/blog/category/software-products/" title="View all posts filed under Software products">Software products</a> (38) </li> <li><a href="http://www.simbiosys.com/blog/category/technology/" title="View all posts filed under Technology">Technology</a> (21) </li> <li><a href="http://www.simbiosys.com/blog/category/science/tips-and-hints/" title="Tells us tips and hints on how to best use eHiTS, and other SimBioSys tools, to maximize your ROI (return on invested time, money, energy etc.). It’s a must-read!">tips and hints</a> (5) </li> </ul> </li> </ul> </div> <hr /> <div id="footer"> <!-- If you'd like to support WordPress, having the "powered by" link someone on your blog is the best way, it's our only promotion or advertising. --> <p> SimBioSys Blog is proudly powered by <a href="http://wordpress.org/">WordPress</a> <br /><a href="feed:http://www.simbiosys.com/blog/feed/">Entries (RSS)</a> and <a href="feed:http://www.simbiosys.com/blog/comments/feed/">Comments (RSS)</a>. <!-- 20 queries. 0.401 seconds. --> </p> </div> </div> <!-- Gorgeous design by Michael Heilemann - http://binarybonsai.com/kubrick/ --> </body> </html>