cURL / Mailing Lists / curl-library / Single Mail

curl-library

RE: Libcurl Help Required

From: Rathi, Dinesh <drathi_at_informatica.com>
Date: Fri, 27 Jan 2006 12:57:55 +0530

I had used tidy only for parsing html long back. We used VC 6 to compile
it. Don't know of any other html parsing libs.

 

Thanks

Dinesh

 

________________________________

From: curl-library-bounces_at_cool.haxx.se
[mailto:curl-library-bounces_at_cool.haxx.se] On Behalf Of Kulbhushan
Sharma
Sent: Friday, January 27, 2006 12:33 PM
To: libcurl development
Subject: Re: Libcurl Help Required

 

Hi,

 

Thanks for the Reply.

 

We searched on the Net and we found a Library "LibTidy" To Parse the
HTML.

 

Do You have any idea that weather this Library can be used on Windows
and with which Compiler.
 

Or, If any alternate Library is available please tell me.

 

Waiting for your reply.

 

Kulbhushan Sharma
 

 

 

 

On 1/27/06, Rathi, Dinesh <drathi_at_informatica.com> wrote:

Libcurl would allow you to make HTTP(S) requests. However in order to
build a crawler you would also need to parse the html to extract the
outgoing links from that web page. You would need some html parsing
library to do that. However if you just have the list of URLs to fetch,
that shud be straightforward.

 

Thanks

Dinesh

 

________________________________

From: curl-library-bounces_at_cool.haxx.se
[mailto:curl-library-bounces_at_cool.haxx.se] On Behalf Of Kulbhushan
Sharma
Sent: Friday, January 27, 2006 11:22 AM
To: curl-library_at_cool.haxx.se
Subject: Libcurl Help Required

 

Hi Everyone,

 

I am a New Joiny to the world of Libcurl Users.

 

Can anyone please tell me that weather I can build a "Webcrawler" Using
Libcurl. What all I needed.

 

Please Reply Soon.

 

Kulbhushan

 
Received on 2006-01-27