Re: Recursive file download

From: Jayne <>
Date: Sat, 10 Nov 2007 21:50:39 +0000

BeautifulSoup is the best Python HTML parser out there that I know of. I'd
certainly recommend looking at that if you're going the route of pulling
links out of pages.

On Nov 10, 2007 6:38 AM, <> wrote:

> I know this is two posts in a row, but they are separate questions, and I
> think that it would make it easier to have them as separate emails/posts.
> Anyways, on to the question. :)
> What's the best way to implement recursive file downloading in curl? I
> didn't check whether pycurl does it (licensing is still an issue, so that
> option is out).
> Do I need to roll my own solution?
> For example, do a search for the <a tags.... and extract the urls or
> find a Python sgml/html parser and extract the urls it finds
> or are there several more robust solutions already available in Python?
> _______________________________________________

Received on 2007-11-10