curl-users

Re: Quality of the download page

From: Ralph Mitchell <ralphmitchell_at_gmail.com>
Date: Wed, 21 Feb 2007 14:15:51 -0600

On 2/21/07, wen sun <ey_lverduo_at_yahoo.com> wrote:
> Hi, All:
> I am working on a project which needs to download a whole website. I found
> that the quality of the downloaded pages decreases with the depth of the
> page within the site. That is, for the homepage, or for pages linked
> directly from it, the downloaded page looks exactly like the original. But
> when following links to deeper pages, the downloaded page no longer matches
> the original: some symbols or pictures on the page are missing. I was
> wondering if there is anything I can do to avoid this problem.

You probably need to process the HTML to pick up image tags (IMG) and
their source (SRC) attributes, then download those files too, or at
least convert relative URLs to absolute URLs so that your browser can
fetch them from the original site.
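
As a rough illustration of that suggestion, here is a minimal Python
sketch, not something curl does for you. It scans a page for IMG, SCRIPT
and LINK tags and resolves each reference against the URL the page was
fetched from. The class name, the function name, the tag list and the
example.com URLs are all made up for the example; adjust them to the
site you are mirroring.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ResourceCollector(HTMLParser):
    """Collect absolute URLs of files a page references (hypothetical helper)."""

    # Assumed mapping of tag -> attribute that names an external file.
    RESOURCE_ATTRS = {"img": "src", "script": "src", "link": "href"}

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url  # URL the page itself was downloaded from
        self.resources = []

    def handle_starttag(self, tag, attrs):
        wanted = self.RESOURCE_ATTRS.get(tag)
        if wanted is None:
            return
        for name, value in attrs:
            if name == wanted and value:
                # urljoin leaves absolute URLs alone and resolves relative
                # ones against the page's own location.
                self.resources.append(urljoin(self.base_url, value))


def find_resources(html, base_url):
    """Return the absolute URLs referenced by the given HTML text."""
    parser = ResourceCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.resources


if __name__ == "__main__":
    page = '<img src="../images/logo.png"><script src="/js/app.js"></script>'
    for url in find_resources(page, "http://www.example.com/docs/deep/page.html"):
        print(url)
```

Each URL printed can then be fetched with a separate curl call. Relative
references such as ../images/logo.png are resolved against the directory
of the page that contains them, so they only work from a local copy if
the files are saved in matching locations.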

Ralph Mitchell
Received on 2007-02-21