Don Hill
Posts: 70
Nickname: ssswdon
Registered: Jul, 2002
|
|
Re: How do I Extrract strings from HTML page like Price of a book ?
|
Posted: Sep 16, 2002 10:40 AM
|
|
If I was doing this, I would want to turn the html into valid XML, this can be done by using a product named "Tidy". Most of the mapping tools on the market use this tool.
After you have run the html against tidy you will have valid XML, this will allow you to use a Document to process the html or in this case it will be a DOM Document. As stated below you can never guarentee that the html will be the same as websites html change all the time.
HTH
> How can I use parse HTMl to extract text strings from > a webpage like price, isbn,title of book. > I'm trying to develop a web spider? > please help
|
|