This post originated from an RSS feed registered with Java Buzz
by News Manager.
Feed Title: Cafe au Lait
Feed URL: http://www.cafeaulait.org/today.rss
Feed Description: Cafe au Lait is the preeminent independent source of Java information on the net. Unlike many other Java sites, Cafe au Lait is beholden neither to specific companies nor to advertisers. At Cafe au Lait you'll find many resources to help you develop your Java programming skills, including daily news summaries, FAQ lists, tutorials, course notes, examples, exercises, book reviews, user groups and more.
Does anyone know of a reliable open source library for parsing robots.txt files in Java? I found a couple of open source projects on SourceForge, but neither had released anything. This is part of the standard library in Python and available on CPAN for Perl, so there must be something. Perhaps it's hidden inside some other project like Lucene or Nutch? If you know of one, please drop me a line. Otherwise, I may have to resort to writing it myself.
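
For a sense of what writing it yourself might involve, here is a rough sketch, not a vetted library. The class name `SimpleRobotsTxt` and its methods are hypothetical, made up for illustration. It assumes plain path-prefix matching with no `*` or `$` wildcards, treats the longest matching rule as the winner (with `Allow` winning ties), matches user agents by case-insensitive substring, and ignores every field other than `User-agent`, `Allow`, and `Disallow`. Fetching the file and handling HTTP errors are left out entirely.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical minimal robots.txt reader; a sketch, not a complete implementation. */
final class SimpleRobotsTxt {

    /** One Allow or Disallow line. */
    private static final class Rule {
        final boolean allow;
        final String pathPrefix;

        Rule(boolean allow, String pathPrefix) {
            this.allow = allow;
            this.pathPrefix = pathPrefix;
        }
    }

    // Lower-cased user-agent token -> rules of its group, in declaration order.
    private final Map<String, List<Rule>> groups = new LinkedHashMap<>();

    private SimpleRobotsTxt() {
    }

    /** Parses the text of a robots.txt file that has already been downloaded. */
    static SimpleRobotsTxt parse(String content) {
        SimpleRobotsTxt robots = new SimpleRobotsTxt();
        List<String> currentAgents = new ArrayList<>();
        boolean lastWasAgent = false;
        for (String rawLine : content.split("\\r?\\n|\\r")) {
            // Drop comments, then split "field: value" on the first colon.
            int hash = rawLine.indexOf('#');
            String line = (hash >= 0 ? rawLine.substring(0, hash) : rawLine).trim();
            int colon = line.indexOf(':');
            if (colon < 0) {
                continue;
            }
            String field = line.substring(0, colon).trim().toLowerCase(Locale.ROOT);
            String value = line.substring(colon + 1).trim();
            if (field.equals("user-agent")) {
                // Consecutive User-agent lines share one group of rules.
                if (!lastWasAgent) {
                    currentAgents = new ArrayList<>();
                }
                lastWasAgent = true;
                String agent = value.toLowerCase(Locale.ROOT);
                if (!agent.isEmpty()) {
                    currentAgents.add(agent);
                    robots.groups.computeIfAbsent(agent, k -> new ArrayList<>());
                }
            } else if (field.equals("allow") || field.equals("disallow")) {
                lastWasAgent = false;
                if (value.isEmpty()) {
                    continue; // an empty value restricts nothing
                }
                for (String agent : currentAgents) {
                    robots.groups.get(agent).add(new Rule(field.equals("allow"), value));
                }
            }
            // Any other field (Sitemap, Crawl-delay, ...) is ignored in this sketch.
        }
        return robots;
    }

    /** Returns true if the given crawler may fetch the given path. */
    boolean isAllowed(String userAgent, String path) {
        Rule best = null;
        for (Rule rule : rulesFor(userAgent)) {
            if (!path.startsWith(rule.pathPrefix)) {
                continue;
            }
            int len = rule.pathPrefix.length();
            if (best == null || len > best.pathPrefix.length()
                    || (len == best.pathPrefix.length() && rule.allow)) {
                best = rule;
            }
        }
        // No matching rule means the path is not restricted.
        return best == null || best.allow;
    }

    // First specific group whose token appears in the crawler name, else the "*" group.
    private List<Rule> rulesFor(String userAgent) {
        String ua = userAgent.toLowerCase(Locale.ROOT);
        for (Map.Entry<String, List<Rule>> entry : groups.entrySet()) {
            if (!entry.getKey().equals("*") && ua.contains(entry.getKey())) {
                return entry.getValue();
            }
        }
        List<Rule> fallback = groups.get("*");
        return fallback != null ? fallback : new ArrayList<>();
    }
}
```

Given a file that disallows `/private/` for `*`, a call such as `SimpleRobotsTxt.parse(text).isAllowed("MyCrawler/1.0", "/private/data.html")` would return `false`, while a path outside `/private/` would return `true`. A production parser would also need to deal with wildcards, percent-encoding, and size limits, which is a good reason to prefer an existing library if one turns up.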