Crawling HTML Validator to RSS

Posted 22 years ago by Kay

In a comment in yesterday’s Validator to RSS in CF post, someone calling themselves “bward” suggested I modify the code to make it validate an entire site.

Believe me, I’ve thought about it. For a start, it would need to run on a schedule and write the RSS to a file, as “validation on demand” would take too long. That’s straightforward enough. Also, I’d need to spider the site to get the pages to validate.

Hmmm… I wonder does the W3C Link Validator have XML output? Maybe that’s something to investigate. Or I could scrape the output. The RSS feed could then return link validation errors as well as HTML validation errors. More hmmm…

Wow, it’s not like I don’t have a million things to do anyway! Among other things, I’m in the process of moving the site from Domain Host in Melbourne to PerthWeb’s own hosting service, WebClick. Don’t get me wrong – Erez and Donna at Domain Host have been awesome these last couple of years, but seeing as we’re launching our own hosting service I thought I might as well take advantage of it. And I’m thinking of upgrading to Fuseblog 2 while I’m at it. Busy busy busy!

Kay lives here

working with the web

Crawling HTML Validator to RSS