<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Scobleizer - Latest Comments in Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.disqus.com/</link><description>Tech enthusiast, video blogger, media innovator, fanatical about startups at Rackspace, home of fanatical support for Internet entrepreneurs.</description><atom:link href="https://scobleizer.disqus.com/microsoft_word_generates_clean_html_for_blogs/latest.rss" rel="self"></atom:link><language>en</language><lastBuildDate>Sat, 08 Nov 2008 17:26:57 -0000</lastBuildDate><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639965</link><description>&lt;p&gt;Another Word cleanup tool (free and online, or available in PHP to download and implement in any project):&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.webmastersherpa.com/content/useful-code/cleanup/" rel="nofollow noopener" target="_blank" title="http://www.webmastersherpa.com/content/useful-code/cleanup/"&gt;http://www.webmastersherpa....&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Regards,&lt;br&gt;Jeff&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jeff Blum</dc:creator><pubDate>Sat, 08 Nov 2008 17:26:57 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639964</link><description>&lt;p&gt;I know this is a bit off topic...but if you want a doc converted to HTML try this:&lt;br&gt;&lt;a href="http://www.documentsfortheweb.com/free/" rel="nofollow noopener" target="_blank" title="http://www.documentsfortheweb.com/free/"&gt;http://www.documentsforthew...&lt;/a&gt;&lt;br&gt;Fred&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">fred</dc:creator><pubDate>Wed, 25 Jun 2008 16:57:29 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639963</link><description>&lt;p&gt;Copy paste - even using styles and the new blogformat option is all messed up in Word 2007.&lt;/p&gt;&lt;p&gt;I can take any Word 97-2003 document and copy past directly to the web - discussion boards etc and all formatting is preserved without stupid [FONT] tags inserted all over the text that is pasted.&lt;/p&gt;&lt;p&gt;But try doing that in Word 2007! It just doesn't work. About the best I can get from it is letting 2007 removing all my work put into formatting the text -- and then paste it.&lt;/p&gt;&lt;p&gt;What a pain!&lt;/p&gt;&lt;p&gt;How can I get the simple copy-paste-that-works feature of Word 97-2003?&lt;/p&gt;&lt;p&gt;Bob&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Cliverty</dc:creator><pubDate>Sun, 09 Mar 2008 00:59:54 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639962</link><description>&lt;p&gt;I wasted a week, first trying to convert a Word 2007 document (.docx) or Word "Compatibility Mode" (.doc) with images to a webpage by saving it as .htm, or by writing the document as HTML and saving it as .htm.  All I got was HTML text on the webpage! Then my son told me Word-to-HTML sucks, just write the HTML in Wordpad, save as .txt, FTP it to the server and rename it there to .htm.  Works fine.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Tom Cooper</dc:creator><pubDate>Fri, 25 Jan 2008 14:45:37 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639961</link><description>&lt;p&gt;Well, I tried that feature and I was amazed. If they can get it right, can't they at least introduce a new save-as feature? Like "save-as-blog-html" so at least we can use that for those blogs that Word does not support.&lt;br&gt;BTW, it seems they've ported that to (or maybe it was ported from) Microsoft Windows Live Writer. Now, I think this is a very cool tool. Works nicely with WordPress after some minimal changes to web config. I've documented it in my blog though.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">unicynic</dc:creator><pubDate>Tue, 27 Nov 2007 22:51:49 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639960</link><description>&lt;p&gt;I just posted about Word 2007 as a blog wriing tool. The content is in Thai, though. It is not as good as Writer from the blog writer perspective. Still have some problem to upload pictures to wordpress, problem with Category, no additional support for Tag, etc. But as a simple Blog writing tool, I think it is quite good. No need to learn new thing.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Khun T</dc:creator><pubDate>Thu, 15 Feb 2007 04:17:36 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639959</link><description>&lt;p&gt;beautiful online information center. greatest work... thanks&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">thorp</dc:creator><pubDate>Tue, 20 Jun 2006 16:07:17 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639958</link><description>&lt;p&gt;Very needed information found here, thank you for your work&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">carvel</dc:creator><pubDate>Tue, 20 Jun 2006 16:07:17 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639957</link><description>&lt;p&gt;Robert: That's what I was trying to find out, thanks very much for letting me know.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">mcm</dc:creator><pubDate>Mon, 22 May 2006 20:18:40 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639956</link><description>&lt;p&gt;For me, this is HUGE. A big part of my job is converting your average Word-produced business documents into something our intranet application won't choke on. And it always chokes on the invalid markup generated by Word. Even the "Save As Web" command produces something so far from valid XHTML it's scary. That's before you deal with the nexted divs &amp;amp; font tags...ugh... nightmarish. Especially when business owners don't understand why you can't just copy and paste from Word.&lt;/p&gt;&lt;p&gt;For a while I was using Textism Word Cleaner tool, and that worked well. Looking forward to this new feature... hopefully they'll extend it to the rest of the Office suite (at least the ones that support "Save as web page".&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Jarrod</dc:creator><pubDate>Mon, 22 May 2006 19:49:40 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639955</link><description>&lt;p&gt;Couldn't agree more.  I use Word for my daily work (I'm primarily a writer) but if I cut and paste text from Word into my blog (&lt;a href="http://www.badlanguage.net" rel="nofollow noopener" target="_blank" title="http://www.badlanguage.net"&gt;http://www.badlanguage.net&lt;/a&gt;) or my flying website (&lt;a href="http://www.modernpilot.com" rel="nofollow noopener" target="_blank" title="http://www.modernpilot.com"&gt;http://www.modernpilot.com&lt;/a&gt;) it comes with a complete set of Microsoft-specific tags that I don't want or need.  The easiest way to wash them out is - get this - to paste the text into Notepad and then copy it from there.  How low-tech is that?  It's encouraging to know that there's someone inside Microsoft who understands this daily frustration.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Matthew Stibbe</dc:creator><pubDate>Sat, 20 May 2006 10:06:51 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639954</link><description>&lt;p&gt;mcm: Wordpress already is supported because Wordpress supports the Metaweblog API.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Robert Scoble</dc:creator><pubDate>Sat, 20 May 2006 01:50:26 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639953</link><description>&lt;p&gt;Will it ever be likely that common open source blogging tools like Wordpress will be supported?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">mcm</dc:creator><pubDate>Sat, 20 May 2006 01:15:00 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639952</link><description>&lt;p&gt;This is good news but which version(s) of XHTML will Word render - 1.0 transitional, 1.1 strict or 2.0?&lt;/p&gt;&lt;p&gt;I for one think this is the strongest reason to upgrade my version of Outlook and Word so long as the metaweblog API and Atom publication support allow me to  blog to a variety of blogging tools and not just Spaces.&lt;/p&gt;&lt;p&gt;I wonder if Microsoft could go further and fully support CSS for template formatting in Word and JavaScript for macros.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Sam Sethi</dc:creator><pubDate>Fri, 19 May 2006 02:33:24 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639951</link><description>&lt;p&gt;You can get an amen, Darren. I'll even throw in an Alleluia. I cannot count the hours I have wasted dorking around with terribly formatted Word output.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">farlane</dc:creator><pubDate>Thu, 18 May 2006 16:36:58 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639950</link><description>&lt;p&gt;Regarding post #24 - pretty good analysis!&lt;/p&gt;&lt;p&gt;I just checked windows media player ver 10&lt;br&gt;and it seems to allow mp3 up to 320 Kbps.&lt;br&gt;And the max for Windows Media Audio is 192Kbps.&lt;br&gt;There is a lossless option that goes up to 940Kbps.&lt;/p&gt;&lt;p&gt;I happen to believe that you are right about previous versions. So hopefully other things will continue in this direction (of competative sotware).&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Elias</dc:creator><pubDate>Thu, 18 May 2006 13:14:35 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639949</link><description>&lt;p&gt;Paul,&lt;br&gt;Just like Dream Weaver, our blog editor (Zoundry), also has to do "special work" to clean up MS Word generated content. The Word's "Save as Filtered HTML" helps, but we found most users simply copy and paste from MS Word (can't blame them).&lt;/p&gt;&lt;p&gt;In our case, (in addition for validation), we have to go to great pains to clean up (at least we try) the content so that we have a well formed xml document (which we need to use as a in-memory memory model).&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Pidge</dc:creator><pubDate>Thu, 18 May 2006 11:41:21 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639948</link><description>&lt;p&gt;Are you they aren't just saying that "normal people don't care about HTML quality" because they can't openly tell you the truth - that they've been told to generate munged output that only works in IE as an attempt at browser lock-in?&lt;/p&gt;&lt;p&gt;If that's not the purpose, it's certainly an effect.&lt;/p&gt;&lt;p&gt;I think there's a certain level of doublethink going on at Microsoft. In the past, you could at least be open internally about anti-competitive moves like integrating IE into the OS (Brad Silverberg's famous comment about the pressure to do "unnatural and losing things to 'protect' Windows" and fear that "trying to win the Internet using Windows is a losing strategy"), but now there's too much risk of a conversation like that coming to light via subpoena (as Brad's did).&lt;/p&gt;&lt;p&gt;There are still so many little details of Windows and MS software that are clearly attempts to achieve lock-in (why is the MP3 ripping support in my Media Player limited to only 56kbps quality, if not to steer me into using WMA?). Someone's taken the decision to pursue a lock-in strategy there, but how is this stuff discussed? Face-to-face in a sauna to avoid bugging? By exchanging memos on rice paper?&lt;/p&gt;&lt;p&gt;Not only is lock-in annoying, but in an age of software distribution via HTTP, it's going to get increasingly ineffective. When I can download iTunes and VLC, why would I stick with crappy WMP? Users might not fully understand &lt;i&gt;why&lt;/i&gt; they can't turn up the MP3 quality slider, but they'll certainly have learnt that it's better to try and find an alternative that works properly than try to fix the Microsoft solution.&lt;/p&gt;&lt;p&gt;There's a conversation that's worth having within Microsoft. Lock-in is lame and it doesn't work, so why don't you eradicate it? Does Hotmail have free POP access for everyone yet, like Gmail does? Does *every* app have an Export button as well as an Import button (and is the output in an open format like OPML? and is it human readable or hopelessly munged and riddled with CDATA?)&lt;/p&gt;&lt;p&gt;If MS apps are truly offering the best experience, and not just relying on bundling and lock-in, then they should let users get their data out in useful formats. The Word team has taken a step in the right direction.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Ben</dc:creator><pubDate>Thu, 18 May 2006 09:47:11 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639947</link><description>&lt;p&gt;Tell those program managers "your competitors won't be able to boast that their product can clean up Word HTML". I've got Dreamweaver Ultradev 4 and it has a special function to do just that.&lt;/p&gt;&lt;p&gt;I've had to sort out a web page created using a dynamic edit control where the user had pasted Word text into the WYSIWYG edit box. He cared that it looked a mess by the time it was rendered on a web page.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Paul Morriss</dc:creator><pubDate>Thu, 18 May 2006 09:38:38 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639946</link><description>&lt;p&gt;Thanks for bringing that up!  When blogging/building my show notes for Podcheck, I copy-paste from my Outlook e-mails.  I have hacked countless PHP and java scripts on my blog site to strip out those lame "mso:normal blah; blah:lame size:12pt style:css:unecessary:crap-12 mso:makes-me:so-mad" tags.  Thank you Scoble!&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Scott Fletcher</dc:creator><pubDate>Thu, 18 May 2006 09:26:45 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639945</link><description>&lt;p&gt;Saving the web page as an HTML (Filtered) page usually takes away a lot of the crappy html... still not as clean as I'd like it though, but only a million times better than saving it as you normally would.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">twisterjosh</dc:creator><pubDate>Thu, 18 May 2006 08:57:10 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639944</link><description>&lt;p&gt;Seriously, though, why would I want to use a word processing program to post to a blog? Any blog software worth its weight in salt has an interface for quickly posting, many of them have WYSIWYG interfaces now.  If your blog supports the metaweblog API, which I can safely suppose from the screenshots is a requirement of using Word like this anyway, it almost definitely has customizable posting capabilities already.&lt;/p&gt;&lt;p&gt;I saw this, and I suppose it's nice to add a new feature beyond a shell that blends into the Windows environment, but isn't this just the Word team adding features for the sake of adding features?  Were there people clamoring for this?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Adam S</dc:creator><pubDate>Thu, 18 May 2006 08:36:23 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639943</link><description>&lt;p&gt;at #7. Am I the only person to find it funny that &lt;a href="http://www.badpage.info" rel="nofollow noopener" target="_blank" title="http://www.badpage.info"&gt;http://www.badpage.info&lt;/a&gt; doesn't even validate against it's own tests? :P&lt;/p&gt;&lt;p&gt;Mat&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Mat Steeples</dc:creator><pubDate>Thu, 18 May 2006 08:01:31 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639942</link><description>&lt;p&gt;I remember when I built my first websites, back in 2000, I used to use Word 97 with the HTML editor as it was simple and easy to use for someone inexperienced in HTML like myself.&lt;/p&gt;&lt;p&gt;It's nice to hear that Word can yet have a role in web publishing in the modern web - while maybe there are better tools out there, few have the sheer accessibility of Word.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Brian</dc:creator><pubDate>Thu, 18 May 2006 04:27:07 -0000</pubDate></item><item><title>Re: Microsoft Word generates clean HTML for blogs?</title><link>http://scobleizer.com/2006/05/17/microsoft-word-generates-clean-html-for-blogs/#comment-9639941</link><description>&lt;p&gt;Can I get an amen? Thank the Lord for clean HTML.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">dbarefoot</dc:creator><pubDate>Thu, 18 May 2006 01:09:27 -0000</pubDate></item></channel></rss>