Re: simple/weird question

From: Von Hawkins via curl-users <curl-users_at_cool.haxx.se>
Date: Mon, 30 Nov 2015 15:04:09 +0000 (UTC)

Bruce,

> Trying to fetch a simple page.
>
> The target is:
> http://www.foothill.edu/schedule/schedule.php
>
> In different browsers, without javascript or referer, results come
> back as expected.
>
> Using different curls settings, can't seem to get the complete
> returned content. The content seems to be missing the select for the
> terms!!
> Almost as though the returned data is corrupted.. But it happens
> consistently, regardless of trying different headers, etc..
>
> my test curl is:
> curl -vvv -k -A  "User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0)
> Gecko/20100101 Firefox/38.0"  -H 'Accept:
> text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8'  -H
> 'Accept-Language: en-US,en;q=0.5'  -H 'Accept-Encoding: gzip, deflate'
>  --cookie-jar aa.lwp --cookie aa.lwp    -L
> "http://www.foothill.edu/schedule/schedule.php"
>

I cannot replicate your issue. Using curl version:

$ curl -V
curl 7.29.0 (x86_64-redhat-linux-gnu) libcurl/7.29.0 NSS/3.15.4 zlib/1.2.7 libidn/1.28 libssh2/1.4.3
Protocols: dict file ftp ftps gopher http https imap imaps ldap ldaps pop3 pop3s rtsp scp sftp smtp smtps telnet tftp
Features: AsynchDNS GSS-Negotiate IDN IPv6 Largefile NTLM NTLM_WB SSL libz

Using either the simple:

$ curl http://www.foothill.edu/schedule/schedule.php > junk.php

or the exact command you provided above always results in a text file that contains the same contents I get from Chrome when I "view page source". In other words, the file ends with:
<script type="text/javascript">_uacct = "UA-1477111-1";urchinTracker();</script></body></html>
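
If it helps to repeat that check on your side, here is a minimal sketch of a test for whether a saved page reaches its closing tag. The function name page_looks_complete and the 200-byte window are my own assumptions for illustration, not anything curl provides:

```shell
# Hypothetical helper: succeed if the saved file ends with a closing </html> tag.
# $1 is the path to the file written by curl (for example junk.php).
page_looks_complete() {
    # Look only at the last 200 bytes (an arbitrary window); a complete
    # page should have its closing tag there. -i ignores case, -q is silent.
    tail -c 200 "$1" | grep -qi '</html>'
}
```

After saving the page with the simple curl command above, running "page_looks_complete junk.php && echo complete" should print "complete" if the download was not cut short. It only checks the tail of the file, so it says nothing about content missing from the middle of the page.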

What version and environment are you running?  I'm just using a CentOS 7 VM with the stock curl version.
--obivon
 

-------------------------------------------------------------------
List admin: http://cool.haxx.se/list/listinfo/curl-users
FAQ: http://curl.haxx.se/docs/faq.html
Etiquette: http://curl.haxx.se/mail/etiquette.html
Received on 2015-11-30