The problem with that command is that you have to download the image in its entirety to get its size.
Also poal uses python and JS only.
The code poal uses downloads the header only.
No no no no no!
HEAD
The HEAD method asks for a response identical to that of a GET request, but without the response body. This is useful for retrieving meta-information written in response headers, without having to transport the entire content.
Nothing's downloaded. The entire response (Wireshark, "Length" field) is 903 bytes and only contains the headers. You can test the same URL with http://
to see the unencrypted data in Wireshark (and as I suspected the capitalization of letters changed.)
I don't know what library you are using, but on the inside, all that's needed is to replace "GET /di-CF3ZOA.png" in the HTTP request with "HEAD /di-CF3ZOA.png" to get the headers.
https://docs.python.org/3/library/http.client.html#http.client.HTTPConnection.request
or https://stackoverflow.com/questions/107405/how-do-you-send-a-head-http-request-in-python-2
PS/Edit: Maybe I should have specified, curl -I
is curl --head
aka -I, --head Show document info only
, the HTTP HEAD request
Maybe I should have specified, curl -I
Ok didn't know about -I
I use requests library to get headers.
Everything works perfectly fine except for some corrupt jpeg files.
I uploaded one, you can try it with your code if you want and see if you get the same results.
Oh and also if you are not convinced yet ;) that it really doesn't transfer the whole file:
# a download, normal GET request
curl -L https://pic8.co/sh/m2v8Y4.jpg -o "/dev/null"
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
0 0 0 0 0 0 0 0 --:--:-- 0:00:01 --:--:-- 0
100 33382 100 33382 0 0 13315 0 0:00:02 0:00:02 --:--:-- 25193
# HEAD request, received=0
curl -LI https://pic8.co/sh/m2v8Y4.jpg -o "/dev/null"
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
0 0 0 0 0 0 0 0 --:--:-- 0:00:01 --:--:-- 0
0 33382 0 0 0 0 0 0 --:--:-- 0:00:02 --:--:-- 0
Some very stupid servers may not show content-length, and that's the ONLY time you'd want to download either the file header or the whole file.
One of the 'I keep forgetting everytime' with curl: it doesn't follow redirects automatically, needs the -L
parameter:
curl -I https://pic8.co/sh/m2v8Y4.jpg
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- 0:00:03 --:--:-- 0
HTTP/1.1 302 Found
Server: nginx
Date: Sat, 05 Dec 2020 00:03:26 GMT
Content-Type: text/html; charset=utf-8
Content-Length: 0
Connection: keep-alive
Location: /d/0c747999-9bdc-419e-ada4-f61f3fdad56b/
X-Frame-Options: ALLOWALL
X-Robots-Tag: noindex, nofollow, nosnippet, noarchive, noimageindex
X-Content-Type-Options: nosniff
X-XSS-Protection: 1; mode=block
but with -L it follows through the chain (oh God, I now know why pictures take so long to load):
curl -IL https://pic8.co/sh/m2v8Y4.jpg
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
0 0 0 0 0 0 0 0 --:--:-- 0:00:01 --:--:-- 0
0 33382 0 0 0 0 0 0 --:--:-- 0:00:03 --:--:-- 0
HTTP/1.1 302 Found
Server: nginx
Date: Sat, 05 Dec 2020 00:03:49 GMT
Content-Type: text/html; charset=utf-8
Content-Length: 0
Connection: keep-alive
Location: /d/0c747999-9bdc-419e-ada4-f61f3fdad56b/
X-Frame-Options: ALLOWALL
X-Robots-Tag: noindex, nofollow, nosnippet, noarchive, noimageindex
X-Content-Type-Options: nosniff
X-XSS-Protection: 1; mode=block
HTTP/1.1 302 Found
Server: nginx
Date: Sat, 05 Dec 2020 00:03:49 GMT
Content-Type: image
Content-Length: 0
Connection: keep-alive
Location: https://ist6-3.filesor.com/pimpandhost.com/1/_/_/_/1/a/g/E/F/agEFH/di-SSGKY8.jpeg
Referer: https://pimpandhost.com/static/html/wide_iframe_dynamic.php
X-Frame-Options: ALLOWALL
X-Robots-Tag: noindex, nofollow, nosnippet, noarchive, noimageindex
X-Content-Type-Options: nosniff
X-XSS-Protection: 1; mode=block
HTTP/2 200
date: Sat, 05 Dec 2020 00:03:52 GMT
content-type: image/jpeg
content-length: 33382
last-modified: Fri, 04 Dec 2020 16:46:45 GMT
etag: "5fca67f5-8266"
expires: Thu, 31 Dec 2037 23:55:55 GMT
cache-control: max-age=315360000
cache-control: public
x-frame-options: ALLOW-FROM *
access-control-allow-origin: *
access-control-allow-credentials: true
access-control-allow-methods: GET, POST, OPTIONS
access-control-allow-headers: DNT,X-CustomHeader,Keep-Alive,User-Agent,X-Requested-With,If-Modified-Since,Cache-Control,Content-Type
server: CDNCENSORED
x-CENSORED-nzt: CENSORED
x-CENSORED-nzt-ray: CENSORED
x-edge-ip: XXX.xxx.xxx.xxX
x-edge-pop: CENSORED
x-cache: MISS
accept-ranges: bytes
2 redirects, hits the image on 3rd. content-length = 33382 (just read the thing: HTTP/2 headers, while supposed to be case-insensitive, are always lower-case for better compression.
Depending on what your library does or does not do for you, you'd need to follow by the redirections, if the response mandates so.
EDIT by me: removed server IPs/nodes
(post is archived)