curl-library
RE: Libcurl Help Required
Date: Fri, 27 Jan 2006 12:57:55 +0530
I had used tidy only for parsing html long back. We used VC 6 to compile
it. Don't know of any other html parsing libs.
Thanks
Dinesh
________________________________
From: curl-library-bounces_at_cool.haxx.se
[mailto:curl-library-bounces_at_cool.haxx.se] On Behalf Of Kulbhushan
Sharma
Sent: Friday, January 27, 2006 12:33 PM
To: libcurl development
Subject: Re: Libcurl Help Required
Hi,
Thanks for the Reply.
We searched on the Net and we found a Library "LibTidy" To Parse the
HTML.
Do You have any idea that weather this Library can be used on Windows
and with which Compiler.
Or, If any alternate Library is available please tell me.
Waiting for your reply.
Kulbhushan Sharma
On 1/27/06, Rathi, Dinesh <drathi_at_informatica.com> wrote:
Libcurl would allow you to make HTTP(S) requests. However in order to
build a crawler you would also need to parse the html to extract the
outgoing links from that web page. You would need some html parsing
library to do that. However if you just have the list of URLs to fetch,
that shud be straightforward.
Thanks
Dinesh
________________________________
From: curl-library-bounces_at_cool.haxx.se
[mailto:curl-library-bounces_at_cool.haxx.se] On Behalf Of Kulbhushan
Sharma
Sent: Friday, January 27, 2006 11:22 AM
To: curl-library_at_cool.haxx.se
Subject: Libcurl Help Required
Hi Everyone,
I am a New Joiny to the world of Libcurl Users.
Can anyone please tell me that weather I can build a "Webcrawler" Using
Libcurl. What all I needed.
Please Reply Soon.
Kulbhushan
Received on 2006-01-27