curl-and-python
Re: Recursive file download
Date: Sun, 11 Nov 2007 01:32:46 +0000
Thanks for the tip. :) If you have any other suggestions, feel free to share them. :)
Also, I'd like to follow up with two more questions:
1) Are there any alternatives to curl that are written entirely in Python?
2) Getting really off-topic: are there any Python proxy scripts that support the Icecast protocol? I'm looking for something like Polipo, but with Icecast support. (Realistically, where should I even ask a question like this?)
Thanks again,
Kevin
-------------- Original message ----------------------
From: Jayne <corvine_at_gmail.com>
> BeautifulSoup is the best Python HTML parser out there that I know of. I'd
> certainly recommend looking at that if you're going the route of pulling
> links out of pages. http://www.crummy.com/software/BeautifulSoup/
>
> On Nov 10, 2007 6:38 AM, <Ar18_at_comcast.net> wrote:
>
> > I know this is two posts in a row, but they are separate questions, and I
> > think that it would make it easier to have them as separate emails/posts.
> > Anyways, on to the question. :)
> >
> > What's the best way to implement recursive file downloading in curl? I
> > didn't check whether pycurl does it (licensing is still an issue, so that
> > option is out).
> >
> > Do I need to roll my own solution?
> > For example, search for the <a> tags and extract the URLs, or
> > find a Python SGML/HTML parser and extract the URLs it finds?
> >
> > Or are there more robust solutions already available in Python?
> > _______________________________________________
> > http://cool.haxx.se/cgi-bin/mailman/listinfo/curl-and-python
> >
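For reference, the roll-your-own approach asked about above (find the <a> tags, extract the URLs, follow them) can be sketched with nothing but the Python standard library. This is only a rough illustration under some assumptions, not a finished downloader: the names LinkExtractor, extract_links, crawl and fetch_with_urllib are made up for this example, every fetched body is treated as HTML text, and only links under the starting URL are followed. BeautifulSoup would replace the LinkExtractor part and cope better with badly broken markup.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Make the link absolute and drop any "#fragment" part.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                self.links.append(absolute)


def extract_links(html, base_url):
    """Return the absolute URLs of all <a href=...> links in html."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


def fetch_with_urllib(url):
    """Hypothetical default fetcher: download url and decode it as text."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, fetch=fetch_with_urllib, max_depth=2):
    """Breadth-first walk from start_url, at most max_depth link hops deep.

    Only links whose URL starts with start_url are followed.
    Returns a sorted list of every URL discovered, including start_url.
    """
    seen = {start_url}
    frontier = [start_url]
    for _ in range(max_depth + 1):
        next_frontier = []
        for url in frontier:
            for link in extract_links(fetch(url), url):
                if link.startswith(start_url) and link not in seen:
                    seen.add(link)
                    next_frontier.append(link)
        frontier = next_frontier
    return sorted(seen)
```

Passing the fetch function in as an argument keeps the link-walking logic separate from the HTTP layer, so the same loop would work whether the pages come from urllib, pycurl, or a local cache. A real recursive downloader would also write each response to disk and skip parsing non-HTML files.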
_______________________________________________
http://cool.haxx.se/cgi-bin/mailman/listinfo/curl-and-python
Received on 2007-11-11