This post originated from an RSS feed registered with .NET Buzz
by Brad Wilson.
Original Post: Blog Importer
Feed Title: The .NET Guy
Feed URL: /error.aspx?aspxerrorpath=/dotnetguy/Rss.aspx
Feed Description: A personal blog about technology in general, .NET in specific, and when all else fails, the real world.
When I moved to agileprogrammer.com, I needed to bring over my content from my old blog engine. I've done this once before (when I moved from quality.nu); last time I didn't catch all the broken links, so some of the content stayed broken for years.
This time around, I decided to leverage all these idle CPUs to do the work for me. A big motivation for me was to be able to fix not only the links on the current site, but also the older broken ones from quality.nu. The importer allowed me to make repeated tests against a local install of .Text until such time as I was happy that I'd found everything I could, and moved the minimum required content to the new server.
I combined the import task with my self-mandate to learn Ruby this year, and wrote the code in Ruby. It turned out very compact and relatively effective. I wrote a an article roughly describing the process of developing the importer. My newness means I can't promise it's the best Ruby code you'll ever see, but it turned out good enough to share. It's general purpose, and can be used to push RSS formatted content into a blog that supports the metaWebLog API.
If you want a full version of the source code, please contact me.