Also for offline archiving tools: SingleFile browser extension for saving pages as single HTML files. (saved html files can also be converted to PDF in the method you noted or in other software)
I've found SingleFile browser extension to be very useful for quickly saving web pages to single html files. Plenty of customization and control options and preferences. With my selected preferences, a single saved page for me normally ranges in size from just a couple hundred kilobytes to a couple megabytes.
Some extremely long pages with many images can be much larger in size (5, 10 or 20 megabytes). It depends on the page being saved and options selected. In my "jewflu" folder I have ~1000 pages I've saved over the past 1-2 years that are zipped up in 7z with ultra compression and it sits at about 900mb due to a few of the pages being quite large as well as including some PDFs related to the studies. Most pages are of the size I noted above; hundreds of kilobytes to just a couple megabytes before compression.
As an example for the higher file size pages, the Holocaust Deprogramming Course page's single HTML file is about 35mb for me with all of its images included in the file, and that's a huge page. https://web.archive.org/web/20210420201049/https://holocaustdeprogrammingcourse.com/
https://github.com/gildas-lormeau/SingleFile https://web.archive.org/web/20210909230324/https://github.com/gildas-lormeau/SingleFile https://archive.ph/1d75K
SingleFile is a Web Extension (and a CLI tool) compatible with Chrome, Firefox (Desktop and Mobile), Microsoft Edge, Vivaldi, Brave, Waterfox, Yandex browser, and Opera. It helps you to save a complete web page into a single HTML file.
I usually archive a page and then save the archived page, but some instances of archiving leaves the page not displaying the same as it does normally with some stuff missing, so I sometimes need to save the direct page.
If you don't know about it already, there's few chrome/firefox/edge extensions you can install and use to archive a page in just one click. The 2 I use are called "Wayback Machine" and "Save to the Wayback Machine," and of the two, I find the later to be more helpful and quicker.
Thank you for noting that also. I do already use extensions for both archive.org and archive.today for archiving pages and they are also very helpful for saving a few seconds and speeding up archiving.
- For archive.org: Send to Internet Archive https://addons.mozilla.org/en-US/firefox/addon/send-to-internet-archive/
- For archive.today: Archive Page https://addons.mozilla.org/en-US/firefox/addon/archive-page/
For several weeks now archive.today switched to wanting me to do a captcha every single time I try to archive a page instead of only giving me that check if I quickly tried to archive 4 or more pages at the same time. I don't use them at all anymore because of that, but I do check now and then to see if they're still doing that so I know if I can start using it again or not.
Right on.
(post is archived)