<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.11" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: What is wrong with OOXML</title>
	<link>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/</link>
	<description>Addressing the challenges of computational drug discovery</description>
	<pubDate>Sat, 04 Feb 2012 19:04:48 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.11</generator>

	<item>
		<title>by: PT&#8217;s blog &#187; Blog Archive &#187; Claims about ODF support are typically meaningless</title>
		<link>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1282</link>
		<pubDate>Tue, 13 May 2008 07:53:31 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1282</guid>
					<description>[...] Claims about ODF support are typically meaningless   I know I&#8217;m repeating myself a bit. But as you know there&#8217;s a Wikipedia page about applications that support the Open Document Format and it gets quoted and linked to. A lot.I linked to Peter Murray Rust yesterday, and one of the commenters on his blog also talks about the number of implementations.OpenOffice.org is only one of the tools that can generate it, there are several others as well as various converters (e.g. SUN’s MS Office plugin, Clever Age ODF translator) available for MS Word users.But folks, as of 2008-05-13 mid afternoon Toowoomba time, that Wikipedia page is not much help to people who might want to, like you know work on real documents. GoogleDocs, for example will throw away your styles if you happen to care about them. And why would you? It&#8217;s a web two-dot-oh world now what do we need with styles?I&#8217;m going to get around to editing that page, but I&#8217;m not an expert so I have held off.I just know that it&#8217;s not useful. Lots of things on that list don&#8217;t work with my ODF documents all created with applications derived from OpenOffice.org. As part of my homework for when I do engage the page I went to the spec.What does the ODF spec (v1.1) say about conformance? You need only read the first sentence of this extract, which I have highlighted for your convenience.The OpenDocument specification does not specify which elements and attributes conforming application must, should, or may support. The intention behind this is to ensure that the OpenDocument specification can be used by as many implementations as possible, even if these applications do not support some or many of the elements and attributes defined in this specification. Viewer applications for instance may not support all editing relates elements and attributes (like change tracking), other application may support only the content related elements and attributes, but none of the style related ones.Even typical office applications may only support a subset of the elements and attributes defined in this specification. They may for instance not support lists within text boxes or may not support some of the language related element and attributes.So you don&#8217;t have to do anything in particular to claim to support ODF. Maybe just allowing the top level element would be enough?We clearly need to add some detail to the Wikipedia page about who supports what specifically.My working definition of support for ODF (the text format is actually what I care about) is people using our word processing templates to edit files interoperably with Writer and some other application. And you know what? There&#8217;s nothing outrageous in any of our templates, but the only applications that work are ones that are based on the original Star Office code base that spawned OpenOffice.org. This is unsurprising. The file format was built around OpenOffice.org. Lots of people point out that nobody but Microsoft will be able to build an office suite that supports OOXML. People, this is the same process at work.Let me tell you about a quick experiment I did. I looked up the bit in the ODF spec about lists. Apparently you can have a thing called a list-header at the start of the list. A list header is a special kind of list item. It contains one or more paragraphs that are displayed before a list. The paragraphs are formatted like list items but they do not have a preceding number or bullet. The list header is represented by the list header element.So I made a .odt using NeoOffice, put in a list using the default style List 1 saved it, unzipped it and added a list-header to the start of the list, then rezipped it and opened in NeoOffice. Hmm, well it does display, but the Writer interface seems to know nothing about list headers. The only way to create one seems to be outside of the application. Having a feature like this creates some very serious weirdness. You can load up a document with multiple paragraphs in the list header and they are preserved but if you try to add one then you just get a normal list item.Now I know this is not quite in line with what I&#8217;m saying about the file format being largely derived from OpenOffice.org. I don&#8217;t know the history of this element but OpenOffice.org doesn&#8217;t support it in any useful way. Looks like something added by a standards committee raised on SGML with a clear idea of what makes a &#8216;good&#8217; document format and not much consideration about what makes a usable word processor interface but that&#8217;s just a guess.Lets talk about the formatting I got from my experiment. Is this what the spec really means? See how the list header (Header 1) is indented? Not formatting I can imagine using. But it seems to be what the spec says. “The paragraphs are formatted like list items but they do not have a preceding number or bullet.”How many of the ODF cheer squad have read the standard? Dealt with document interop issues? Me, I have only glanced at the OOXML spec and so I don&#8217;t go around telling people about how bad it is, but on the few occasions I have looked at ODF and tested support for various things in Writer I have found problems or lack of support in OOo. The good thing with OpenOffice.org, of course is that you can tell Sun about the bug and it will likely get fixed sooner or later, particularly if you can rally enough supporters to vote.The bottom line is that if you want to work with ODT you have to check out whether the other applications are going to support the bit you want to use. I will get around to changing that wikipedia page to be more useful. [...]</description>
		<content:encoded><![CDATA[<p>[&#8230;] Claims about ODF support are typically meaningless   I know I&#8217;m repeating myself a bit. But as you know there&#8217;s a Wikipedia page about applications that support the Open Document Format and it gets quoted and linked to. A lot.I linked to Peter Murray Rust yesterday, and one of the commenters on his blog also talks about the number of implementations.OpenOffice.org is only one of the tools that can generate it, there are several others as well as various converters (e.g. SUN’s MS Office plugin, Clever Age ODF translator) available for MS Word users.But folks, as of 2008-05-13 mid afternoon Toowoomba time, that Wikipedia page is not much help to people who might want to, like you know work on real documents. GoogleDocs, for example will throw away your styles if you happen to care about them. And why would you? It&#8217;s a web two-dot-oh world now what do we need with styles?I&#8217;m going to get around to editing that page, but I&#8217;m not an expert so I have held off.I just know that it&#8217;s not useful. Lots of things on that list don&#8217;t work with my ODF documents all created with applications derived from OpenOffice.org. As part of my homework for when I do engage the page I went to the spec.What does the ODF spec (v1.1) say about conformance? You need only read the first sentence of this extract, which I have highlighted for your convenience.The OpenDocument specification does not specify which elements and attributes conforming application must, should, or may support. The intention behind this is to ensure that the OpenDocument specification can be used by as many implementations as possible, even if these applications do not support some or many of the elements and attributes defined in this specification. Viewer applications for instance may not support all editing relates elements and attributes (like change tracking), other application may support only the content related elements and attributes, but none of the style related ones.Even typical office applications may only support a subset of the elements and attributes defined in this specification. They may for instance not support lists within text boxes or may not support some of the language related element and attributes.So you don&#8217;t have to do anything in particular to claim to support ODF. Maybe just allowing the top level element would be enough?We clearly need to add some detail to the Wikipedia page about who supports what specifically.My working definition of support for ODF (the text format is actually what I care about) is people using our word processing templates to edit files interoperably with Writer and some other application. And you know what? There&#8217;s nothing outrageous in any of our templates, but the only applications that work are ones that are based on the original Star Office code base that spawned OpenOffice.org. This is unsurprising. The file format was built around OpenOffice.org. Lots of people point out that nobody but Microsoft will be able to build an office suite that supports OOXML. People, this is the same process at work.Let me tell you about a quick experiment I did. I looked up the bit in the ODF spec about lists. Apparently you can have a thing called a list-header at the start of the list. A list header is a special kind of list item. It contains one or more paragraphs that are displayed before a list. The paragraphs are formatted like list items but they do not have a preceding number or bullet. The list header is represented by the list header element.So I made a .odt using NeoOffice, put in a list using the default style List 1 saved it, unzipped it and added a list-header to the start of the list, then rezipped it and opened in NeoOffice. Hmm, well it does display, but the Writer interface seems to know nothing about list headers. The only way to create one seems to be outside of the application. Having a feature like this creates some very serious weirdness. You can load up a document with multiple paragraphs in the list header and they are preserved but if you try to add one then you just get a normal list item.Now I know this is not quite in line with what I&#8217;m saying about the file format being largely derived from OpenOffice.org. I don&#8217;t know the history of this element but OpenOffice.org doesn&#8217;t support it in any useful way. Looks like something added by a standards committee raised on SGML with a clear idea of what makes a &#8216;good&#8217; document format and not much consideration about what makes a usable word processor interface but that&#8217;s just a guess.Lets talk about the formatting I got from my experiment. Is this what the spec really means? See how the list header (Header 1) is indented? Not formatting I can imagine using. But it seems to be what the spec says. “The paragraphs are formatted like list items but they do not have a preceding number or bullet.”How many of the ODF cheer squad have read the standard? Dealt with document interop issues? Me, I have only glanced at the OOXML spec and so I don&#8217;t go around telling people about how bad it is, but on the few occasions I have looked at ODF and tested support for various things in Writer I have found problems or lack of support in OOo. The good thing with OpenOffice.org, of course is that you can tell Sun about the bug and it will likely get fixed sooner or later, particularly if you can rally enough supporters to vote.The bottom line is that if you want to work with ODT you have to check out whether the other applications are going to support the bit you want to use. I will get around to changing that wikipedia page to be more useful. [&#8230;]
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: zsolt</title>
		<link>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1263</link>
		<pubDate>Mon, 12 May 2008 15:15:07 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1263</guid>
					<description>Egon, I understand that Peter is pushing for the "Right Thing (TM)", and I completely agree with the intentions. I just don't agree with the compromise he is willing to make, i.e. to work with the binary doc format or the XML dump of the same (OOXML) which still has all the technical problems, binary blobs, patent problems, application and platform dependence.

In response to your questions:
1. Yes, so far 'we' have failed to deliver semantic authoring tools, but we can still work on them. As for the existing documents, my company SimBioSys has a tool to get the chemical semantics out of PDF files: http://www.simbiosys.ca/clide/index.html

2. No, we shouldn't settle, of course! :-) People use it not by choice, but because the monopoly pushes it on to them. We have to work hard to get semantic documents into Open Standard documents. If we roll over and let the monopoly win, then even if we get semantics into documents we will have to keep paying through the nose to use and access our own data!

ZZ</description>
		<content:encoded><![CDATA[<p>Egon, I understand that Peter is pushing for the &#8220;Right Thing &#8482;&#8221;, and I completely agree with the intentions. I just don&#8217;t agree with the compromise he is willing to make, i.e. to work with the binary doc format or the XML dump of the same (OOXML) which still has all the technical problems, binary blobs, patent problems, application and platform dependence.</p>
<p>In response to your questions:<br />
1. Yes, so far &#8216;we&#8217; have failed to deliver semantic authoring tools, but we can still work on them. As for the existing documents, my company SimBioSys has a tool to get the chemical semantics out of PDF files: <a href="http://www.simbiosys.ca/clide/index.html" rel="nofollow">http://www.simbiosys.ca/clide/index.html</a></p>
<p>2. No, we shouldn&#8217;t settle, of course! <img src='http://www.simbiosys.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  People use it not by choice, but because the monopoly pushes it on to them. We have to work hard to get semantic documents into Open Standard documents. If we roll over and let the monopoly win, then even if we get semantics into documents we will have to keep paying through the nose to use and access our own data!</p>
<p>ZZ
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Egon Willighagen</title>
		<link>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1257</link>
		<pubDate>Mon, 12 May 2008 09:09:40 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1257</guid>
					<description>Peter, most visually, has tried to get foots on the ground with the datument concept. There clearly was not enough critical mass to get such things going. The audience is happy with 'hamburgers' (OOXML,PDF,ODF), and has currently zero incentive to do things properly... Even LaTeX has very limited means of semantic markup of chemical information. HTML does allow that, and has been focus of at least some of the work Peter has done.

Something I thought was rather promosing, semantically embedding chemical data in RSS feeds (CMLRSS paper in JCIM), has seen little uptake outside the author group.

While not happy about the choice (I don't use word, so will not be able to comment on these efforts of Peter), I do understand his wish to push semantics into the authoring world. See further comments on this tradeof in Peter's blog:

http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1091#comments

Let me pop the big question here too then:

- did 'we' fail to deliver a semantic authoring tool for chemistry?
- and, should we settle for OOXML because that's what people use?

From your blog item, you're not in favor of settling... but what would you consider a good strategy to get better tools developed and used?

Do you think the Blue Obelisk [1] would manage to attract enough critical mass from the KOffice/OpenOffice communities to work on the ICE equivalent Peter mentioned? Should we try to bootstrap this?

1.http://www.blueobelisk.org/</description>
		<content:encoded><![CDATA[<p>Peter, most visually, has tried to get foots on the ground with the datument concept. There clearly was not enough critical mass to get such things going. The audience is happy with &#8216;hamburgers&#8217; (OOXML,PDF,ODF), and has currently zero incentive to do things properly&#8230; Even LaTeX has very limited means of semantic markup of chemical information. HTML does allow that, and has been focus of at least some of the work Peter has done.</p>
<p>Something I thought was rather promosing, semantically embedding chemical data in RSS feeds (CMLRSS paper in JCIM), has seen little uptake outside the author group.</p>
<p>While not happy about the choice (I don&#8217;t use word, so will not be able to comment on these efforts of Peter), I do understand his wish to push semantics into the authoring world. See further comments on this tradeof in Peter&#8217;s blog:</p>
<p><a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1091#comments" rel="nofollow">http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1091#comments</a></p>
<p>Let me pop the big question here too then:</p>
<p>- did &#8216;we&#8217; fail to deliver a semantic authoring tool for chemistry?<br />
- and, should we settle for OOXML because that&#8217;s what people use?</p>
<p>From your blog item, you&#8217;re not in favor of settling&#8230; but what would you consider a good strategy to get better tools developed and used?</p>
<p>Do you think the Blue Obelisk [1] would manage to attract enough critical mass from the KOffice/OpenOffice communities to work on the ICE equivalent Peter mentioned? Should we try to bootstrap this?</p>
<p>1.http://www.blueobelisk.org/
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: Matthew Holloway</title>
		<link>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1247</link>
		<pubDate>Mon, 12 May 2008 00:13:34 +0000</pubDate>
		<guid>http://www.simbiosys.com/blog/2008/05/11/what-is-wrong-with-ooxml/#comment-1247</guid>
					<description>I agree :)</description>
		<content:encoded><![CDATA[<p>I agree <img src='http://www.simbiosys.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />
</p>
]]></content:encoded>
				</item>
</channel>
</rss>

