curl-and-python

Re: Recursive file download

From: Jayne <corvine_at_gmail.com>
Date: Sat, 10 Nov 2007 21:50:39 +0000

BeautifulSoup is the best Python HTML parser out there that I know of. I'd
certainly recommend looking at that if you're going the route of pulling
links out of pages. http://www.crummy.com/software/BeautifulSoup/

On Nov 10, 2007 6:38 AM, <Ar18_at_comcast.net> wrote:

> I know this is two posts in a row, but they are separate questions, and I
> think that it would make it easier to have them as separate emails/posts.
> Anyways, on to the question. :)
>
> What's the best way to implement recursive file downloading in curl? I
> didn't check whether pycurl does it (licensing is still an issue, so that
> option is out).
>
> Do I need to roll my own solution?
> For example, do a search for the <a tags.... and extract the urls or
> find a Python sgml/html parser and extract the urls it finds
>
> or are there several more robust solutions already available in Python?
> _______________________________________________
> http://cool.haxx.se/cgi-bin/mailman/listinfo/curl-and-python
>

_______________________________________________
http://cool.haxx.se/cgi-bin/mailman/listinfo/curl-and-python
Received on 2007-11-10