The Artima Developer Community
Sponsored Link

Agile Buzz Forum
The Problem with Blacklist Filters

0 replies on 1 page.

Welcome Guest
  Sign In

Go back to the topic listing  Back to Topic List Click to reply to this topic  Reply to this Topic Click to search messages in this forum  Search Forum Click for a threaded view of the topic  Threaded View   
Previous Topic   Next Topic
Flat View: This topic has 0 replies on 1 page
James Robertson

Posts: 29924
Nickname: jarober61
Registered: Jun, 2003

David Buck, Smalltalker at large
The Problem with Blacklist Filters Posted: Jan 14, 2009 5:46 AM
Reply to this message Reply

This post originated from an RSS feed registered with Agile Buzz by James Robertson.
Original Post: The Problem with Blacklist Filters
Feed Title: Cincom Smalltalk Blog - Smalltalk with Rants
Feed URL: http://www.cincomsmalltalk.com/rssBlog/rssBlogView.xml
Feed Description: James Robertson comments on Cincom Smalltalk, the Smalltalk development community, and IT trends and issues in general.
Latest Agile Buzz Posts
Latest Agile Buzz Posts by James Robertson
Latest Posts From Cincom Smalltalk Blog - Smalltalk with Rants

Advertisement

Over in the UK, they are trying to filter out illegal content at the ISP level. The problem? The blacklist filters cast way, way too big a net over things:

According to multiple customers of Demon Internet - now owned by Brit telecom Thus - the London-based ISP is blocking access to all sites stored in the archive. When they query the Wayback Machine, hoping to retrieve archived pages, customers are met with generic "not found" error pages. But judging from their urls, these pages are generated by a web filter based on the blacklist compiled by the Internet Watch Foundation, a government-backed organization charged with policing online pornography.

This is where well intentioned - but too simple - schemes go awry. I've seen this kind of thing myself. I have a simple minded filter for comments on the blogs here, and it's been known to block legitimate comments based on accidental matches against poorly chosen keywords. Basically, when you decide to filter, you have to decide what level of false positive you're willing to put up with. Sure, Baysian filters do a better job - but heck, even there, I have to continually go in and check the junk folder. For awhile, my mail client had decided that everything our company President sent was spam. None of these systems are perfect.

Technorati Tags: ,

Read: The Problem with Blacklist Filters

Topic: Smalltalk Daily 1/12/09: Writing a JPG File Previous Topic   Next Topic Topic: Parsing Expression Grammars

Sponsored Links



Google
  Web Artima.com   

Copyright © 1996-2019 Artima, Inc. All Rights Reserved. - Privacy Policy - Terms of Use