cURL / Mailing Lists / curl-users / Single Mail

curl-users

cURL starting questions

From: Jason Todd Slack-Moehrle <mailinglists_at_mailnewsrss.com>
Date: Sat, 18 Apr 2009 21:25:24 -0700

Hi All,

I have some starting cURL questions that I am hoping to gain insight
about.

I want to start at Dmoz.org and follow links for entertainment (like
concerts, art gallery events, etc) and examine the link to see if I
should get data back about it and from it.

My questions:

1. Can cURL start at a given URL and examine every link (based upon my
criteria)?

2. If I find a link that has certain keywords that I find of interest,
can I hit that link of interest and get information from that page?

3. How do I get the information about the link of interest and its
content of interest into a MySQL database? (I know ColdFusion and
MySQL and PHP). I think what I am asking is how do I get back to my
database from a crawler?

4. I bought Webbots, spiders and screen scrapers in PHP and so far it
is interesting, but I am wondering what best practices are..

Am I making any sense?

-Jason

-------------------------------------------------------------------
List admin: http://cool.haxx.se/cgi-bin/mailman/listinfo/curl-users
FAQ: http://curl.haxx.se/docs/faq.html
Etiquette: http://curl.haxx.se/mail/etiquette.html
Received on 2009-04-19

This message: [ Message body ]
Next message: Ralph Mitchell: "Re: cURL starting questions"
Previous message: Daniel Stenberg: "Re: Bogus --libcurl output"
Next in thread: Ralph Mitchell: "Re: cURL starting questions"
Reply: Ralph Mitchell: "Re: cURL starting questions"

Contemporary messages sorted: [ by date ] [ by thread ] [ by subject ] [ by author ] [ by messages with attachments ]