The Artima Developer Community
Sponsored Link

Legacy Java Answers Forum
August 2001

Advertisement

Advertisement

This page contains an archived post to the Java Answers Forum made prior to February 25, 2002. If you wish to participate in discussions, please visit the new Artima Forums.

Message:

How to strip off all HTML tags in a Java Object ?

Posted by Salil V Gangal on August 04, 2001 at 7:20 PM

Friends,

I have a requirement where I want to strip off all the HTML tags from a web-page, which is read from the WWW. Say as an example, I want to read web-page http://www.yahoo.com in a Java object perhaps by using ===>


URL u = new URL("http://www.yahoo.com");
try {
Object o = u.getContent();
...
...
...
}
...
...
...

Now I want to strip of all the HTML tags which are in Object o. How to do this ?

Regards,
Salil V Gangal



Replies:
  • spoofing Rutul August 17, 2001 at 12:03 PM (0)
  • spoofing Rutul August 17, 2001 at 12:02 PM (0)

Sponsored Links



Google
  Web Artima.com   
Copyright © 1996-2009 Artima, Inc. All Rights Reserved. - Privacy Policy - Terms of Use - Advertise with Us