curl-library
parsing html/xml, lynx -dump, APIs, Expat
From: James Wettenhall <wettenhall_at_wehi.edu.au>
Date: Thu, 15 May 2003 00:13:39 +1000 (EST)
Date: Thu, 15 May 2003 00:13:39 +1000 (EST)
Hi,
Just replying to my recent post: I'm wanting to save a
webpage as text in the way that lynx -dump does, but
preferably using an API, rather than a system call.
Thanks for the responses.
I found an XML parser library which looks pretty good:
http://sourceforge.net/projects/expat/
I think it requires strict XML, but that's OK, because I've
discovered that the HTML pages I was trying to convert to
text can be generated in XML instead of HTML.
Regards,
James
-------------------------------------------------------
Enterprise Linux Forum Conference & Expo, June 4-6, 2003, Santa Clara
The only event dedicated to issues related to Linux enterprise solutions
www.enterpriselinuxforum.com
Received on 2003-05-14