This post originated from an RSS feed registered with .NET Buzz
by Scott Hanselman.
Original Post: Target: Referral Spam in dasBlog
Feed Title: Scott Hanselman's ComputerZen.com
Feed URL: http://radio-weblogs.com/0106747/rss.xml
Feed Description: Scott Hanselman's ComputerZen.com is a .NET/WebServices/XML Weblog. I offer details of obscurities (internals of ASP.NET, WebServices, XML, etc) and best practices from real world scenarios.
I've pretty much solved the comment-spam problem
(only one person has voiced their distaste so far), but a recent perusal of my logs
and older posts turned up a ridiculous amount of referral spam.
This is when someone hits a post on your site with a changed/hacked HTTP Referer
header, the header that says where they came from. If your blog adds this referrer to
the page, as most do, you've just linked to Hot Gay Sex (not that there's anything wrong
with Hot Sex between consenting adults : ) ) or whatever by their actions.
The story goes that when Google comes around, it sees that you've linked to the
spammers, and they get Google Juice via the PageRank system.
Not only is this potentially offensive to my readers, it also obscures the posts and
comments when they are filled with referrals.
Potential Solutions:
Stop printing out referrals on my pages.
Personally, I like to see them, and I think they provide value to the reader so they
can see other places with information of interest. It also promotes cross-linking
between my peer blogs.
Modify dasBlog to NOT add icky referrals.
This would be ideal. However, it will likely be in version 1.7 in some way, either
via James Snape's whitelist solution (I think a whitelist removes the point of
referrals, and I'd greatly prefer a keyword-based blacklist) or some other technique.
I've avoided running a "private build" of dasBlog so far (as evidenced by my care
in creating the CAPTCHA solution without recompiling) and I'd like to continue as such.
Clean the .xml files occasionally with a process
This is quick, easy, can be automated, and will work in the short term for me as I
await dasBlog 1.7.
Here's what I did. Use at your own risk, back up your /content directory, and know
that this will only have to run on your "*.dayextra.xml" files from dasBlog.
No error handling, no warranty, but it worked for me. Enjoy.
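For readers who just want the general shape of such a cleaning pass, here is a minimal
Python sketch. It is not the tool from this post. The element names `Tracking` and
`PermaLink`, and the `is_spam` callback, are assumptions about how a dayextra file is
laid out, so look at one of your own files before trusting it.

```python
import glob
import os
import shutil
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip any '{namespace}' prefix ElementTree puts on a tag."""
    return tag.rsplit("}", 1)[-1]


def clean_tree(root, is_spam):
    """Remove <Tracking> elements whose <PermaLink> text is flagged by is_spam.

    Both element names are assumptions, not confirmed dasBlog schema.
    Returns the number of elements removed.
    """
    removed = 0
    # Snapshot parents and children so we never mutate a list while iterating it.
    for parent in list(root.iter()):
        for child in list(parent):
            if local_name(child.tag) != "Tracking":
                continue
            link = next(
                (e.text or "" for e in child if local_name(e.tag) == "PermaLink"),
                "",
            )
            if is_spam(link):
                parent.remove(child)
                removed += 1
    return removed


def clean_content_dir(content_dir, is_spam):
    """Back up and rewrite every *.dayextra.xml file that contains spam."""
    for path in glob.glob(os.path.join(content_dir, "*.dayextra.xml")):
        tree = ET.parse(path)
        if clean_tree(tree.getroot(), is_spam):
            shutil.copy2(path, path + ".bak")  # keep the untouched original
            tree.write(path, encoding="utf-8", xml_declaration=True)
```

A call might look like `clean_content_dir("content", lambda url: "some-word" in url.lower())`.
Note that ElementTree may rewrite namespace prefixes when it saves a file, which is
one more reason to keep the `.bak` copies until the blog has been checked.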
WARNING: The words I put in the .config file are ;-delimited and are unquestionably
offensive. Not only do they include most of George Carlin's words, but they also
include "bloglines" and "artima" because they don't provide any value in my
referral list.
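To make the ;-delimited keyword list concrete, here is a small Python sketch of how
such a setting could be parsed and matched against a referrer URL. The function names
are hypothetical and the matching rule (case-insensitive substring) is an assumption,
not necessarily what my tool does. Only "bloglines" and "artima" come from the real
list; the third word is a placeholder.

```python
def parse_blocked_words(raw):
    """Split a ';'-delimited setting into lowercase words, dropping blanks."""
    return [w.strip().lower() for w in raw.split(";") if w.strip()]


def is_blocked(referrer_url, blocked_words):
    """True if any blocked word appears anywhere in the referrer URL."""
    url = referrer_url.lower()
    return any(word in url for word in blocked_words)


# "placeholder-word" stands in for the offensive entries in the real list.
words = parse_blocked_words("bloglines; artima;;placeholder-word;")
print(is_blocked("http://www.bloglines.com/myblogs", words))  # True
print(is_blocked("http://example.org/some-post", words))      # False
```

Plain substring matching is blunt: a short blocked word will also match innocent URLs
that happen to contain it, so longer, more specific entries are the safer choice.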