June 11, 2003

Posting through NewsGator - Part 3

Greg Reinacker posted a response to my NewsGator posting problem on the NewsGator support forum.

So after some digging, it looks like the underlying source of the problem is that the Outlook editor is passing through HTML data which is not necessarily completely valid XML, and is definnitely not valid XHTML. It's an interesting question here, but I think the interface contract between NewsGator and the plugin is that it should really receive at least valid XML, rather than just any old HTML (say with unclosed <br> tags), and preferably proper XHTML. I exchanged some mails with Matt Berther who wrote the MT plugin, and we both agreed it was probably best for this "data cleanup" to be done in one single place rather than having to be repeated in every plugin.

As Greg says, it's going to take some time to build in code to clean up all possible data problems, but having a clear understanding of the problem and what's happening is always a good first step.

Entry categories: Weblog
Posted by Jorgen Thelin at June 11, 2003 09:48 AM - [PermaLink]
Traceback List
Update on RSS 2.0 Schema
Excerpt: I have had a some comment, feedback and queries about the RSS 2.0 Schema I created previously. Randy Charles Morin asked whether I intend to maintain this schema, to which the answer is yes. Sean Gerety asked which regexps I still need. Here's the lis...
Weblog: TheArchitect.co.uk - Jorgen Thelin's weblog
Tracked: June 25, 2003 08:39 PM
Glitches Posting from Outlook with NewsGator
Excerpt: My last postings flushed out a few more glitches with using the Outlook / NewsGator and the Moveable Type posting plugins: Any HTML-specific entity values such as &nbsp; need to be removed, otherwise the RSS / XML produced from the posting will be ...
Weblog: TheArchitect.co.uk - Jorgen Thelin's weblog
Tracked: June 25, 2003 08:43 PM
Sounds OK. As for building it, the guys working on BlogX already built one that does a very decent clean up of the generated HTML into valid XHTML. I built my own IBlogExtension implementation that used that on top of Matt's plugin and I'm fairly happy with it! Posted by: Tomas Restrepo on June 11, 2003 10:51 AM
The IBlogExtension contract does require valid xml; however, the HTML which makes up the description is escaped, so pretty much anything goes. I have a layered plug-in ready which uses HtmlTidy to clean up the HTML before it gets sent to the plug-in. This will probably be rolled into the next version of NewsGator, but in the meantime I'd like a few folks to be working with it to make sure it's working in every case we can throw at it. I can't find a way to email you...can you send a note to support@newsgator.com, and we'll send it over to you to try? Posted by: Greg Reinacker on June 11, 2003 03:29 PM