WelcomeUser Guide
ToSPrivacyCanary
DonateBugsLicense

©2024 Poal.co

1.3K

I know this is a pretty smart crowd over here, but I didn't know this until just recently, so I'm sharing it here with you:

If you come across a gem of a website that you fear might get taken down soon, yes, go ahead and archive it to archive.org or the waybackmachine. But I found something else you can do too. Scroll all the way from top to the bottom of the website to make sure that everything gets uploaded and refreshed, then right-click on the page and click print (or just hit print screen on your keyboard). Know you're given the option to save as a pdf. Now you've got your own personal backup of that webpage that you can print out on paper at anytime you want or copy onto a thumb drive for easy access and portability. Even though archive.org is supposed to be like a digital ark or sanctuary for wayward webpages, you're still uploading that webpage to someones very hackable server farm somewhere. It's a lot more difficult for them to come thru a backdoor to your computer (or your house lol) and error 404 your thumb drive or your physical print out. I think we should all start adopting this practice in retrospect and from here on out. Internet Download Manager is also a supremely amazing program to have installed, it allows you to very easily download any video, music file, pdf, or anything from any website, in just 1 click. I've been using it for years to download every video I ever watch.

I know this is a pretty smart crowd over here, but I didn't know this until just recently, so I'm sharing it here with you: If you come across a gem of a website that you fear might get taken down soon, yes, go ahead and archive it to archive.org or the waybackmachine. But I found something else you can do too. Scroll all the way from top to the bottom of the website to make sure that everything gets uploaded and refreshed, then right-click on the page and click print (or just hit print screen on your keyboard). Know you're given the option to save as a pdf. Now you've got your own personal backup of that webpage that you can print out on paper at anytime you want or copy onto a thumb drive for easy access and portability. Even though archive.org is supposed to be like a digital ark or sanctuary for wayward webpages, you're still uploading that webpage to someones very hackable server farm somewhere. It's a lot more difficult for them to come thru a backdoor to your computer (or your house lol) and error 404 your thumb drive or your physical print out. I think we should all start adopting this practice in retrospect and from here on out. Internet Download Manager is also a supremely amazing program to have installed, it allows you to very easily download any video, music file, pdf, or anything from any website, in just 1 click. I've been using it for years to download every video I ever watch.

(post is archived)

[–] 0 pt (edited )

Also for offline archiving tools: SingleFile browser extension for saving pages as single HTML files. (saved html files can also be converted to PDF in the method you noted or in other software)

I've found SingleFile browser extension to be very useful for quickly saving web pages to single html files. Plenty of customization and control options and preferences. With my selected preferences, a single saved page for me normally ranges in size from just a couple hundred kilobytes to a couple megabytes.

Some extremely long pages with many images can be much larger in size (5, 10 or 20 megabytes). It depends on the page being saved and options selected. In my "jewflu" folder I have ~1000 pages I've saved over the past 1-2 years that are zipped up in 7z with ultra compression and it sits at about 900mb due to a few of the pages being quite large as well as including some PDFs related to the studies. Most pages are of the size I noted above; hundreds of kilobytes to just a couple megabytes before compression.

As an example for the higher file size pages, the Holocaust Deprogramming Course page's single HTML file is about 35mb for me with all of its images included in the file, and that's a huge page. https://web.archive.org/web/20210420201049/https://holocaustdeprogrammingcourse.com/

https://github.com/gildas-lormeau/SingleFile https://web.archive.org/web/20210909230324/https://github.com/gildas-lormeau/SingleFile https://archive.ph/1d75K

SingleFile is a Web Extension (and a CLI tool) compatible with Chrome, Firefox (Desktop and Mobile), Microsoft Edge, Vivaldi, Brave, Waterfox, Yandex browser, and Opera. It helps you to save a complete web page into a single HTML file.

I usually archive a page and then save the archived page, but some instances of archiving leaves the page not displaying the same as it does normally with some stuff missing, so I sometimes need to save the direct page.

[–] 1 pt

If you don't know about it already, there's few chrome/firefox/edge extensions you can install and use to archive a page in just one click. The 2 I use are called "Wayback Machine" and "Save to the Wayback Machine," and of the two, I find the later to be more helpful and quicker.

[–] 1 pt (edited )

Thank you for noting that also. I do already use extensions for both archive.org and archive.today for archiving pages and they are also very helpful for saving a few seconds and speeding up archiving.

For several weeks now archive.today switched to wanting me to do a captcha every single time I try to archive a page instead of only giving me that check if I quickly tried to archive 4 or more pages at the same time. I don't use them at all anymore because of that, but I do check now and then to see if they're still doing that so I know if I can start using it again or not.