The Artima Developer Community

Legacy Java Answers Forum
August 2001



This page contains an archived post to the Java Answers Forum made prior to February 25, 2002. If you wish to participate in discussions, please visit the new Artima Forums.


How to strip off all HTML tags in a Java Object?

Posted by Salil V Gangal on August 04, 2001 at 7:20 PM


I need to strip all the HTML tags from a web page that is read from the WWW. For example, I want to read the web page into a Java object, perhaps like this:

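try {
    URL u = new URL("");  // java.net.URL; the address was left blank in the post
    Object o = u.getContent();
} catch (IOException e) {  // java.io.IOException also covers MalformedURLException
    e.printStackTrace();
}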

Now I want to strip off all the HTML tags in Object o. How can I do this?

Salil V Gangal
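
One possible approach, offered only as a minimal sketch: read the page as text and delete everything between '<' and '>' with a regular expression. The class name TagStripper and the methods readAll and stripTags are hypothetical names made up for this illustration; they are not from the original post or from any library. The sketch assumes the page has already been obtained as a Reader (for a real page, that would typically mean wrapping u.openStream() in an InputStreamReader rather than relying on the Object returned by getContent()). It is deliberately naive: it leaves entities such as &amp; untouched, keeps the text inside script and style elements, and can be confused by a '>' inside an attribute value or comment.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of the original post.
class TagStripper {
    // Matches anything from '<' up to the next '>', including across line breaks.
    private static final Pattern TAG = Pattern.compile("<[^>]*>");

    // Reads every character from the reader into one String.
    static String readAll(Reader in) throws IOException {
        StringBuilder sb = new StringBuilder();
        char[] buf = new char[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            sb.append(buf, 0, n);
        }
        return sb.toString();
    }

    // Removes tag-like spans; entities such as &amp; are left as they are.
    static String stripTags(String html) {
        return TAG.matcher(html).replaceAll("");
    }

    public static void main(String[] args) throws IOException {
        // A StringReader stands in for the network here (an assumption of this
        // sketch); a real page could be read from u.openStream() instead.
        Reader page = new StringReader(
                "<html><body><p>Hello <b>world</b></p></body></html>");
        System.out.println(stripTags(readAll(page)));
    }
}
```

Where the markup is messy, a real HTML parser is likely the safer choice; the JDK ships one in the javax.swing.text.html.parser package, whose callback receives the text between tags.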

  • spoofing Rutul August 17, 2001 at 12:03 PM (0)
  • spoofing Rutul August 17, 2001 at 12:02 PM (0)


Copyright © 1996-2009 Artima, Inc. All Rights Reserved.