哎,基础设施还是得折腾一遍,记录一下,如何下载网页的可能方案
注意:这些方案我都没有仔细研究,因为simpread简悦应该能解决我的大部分问题,这个话题暂时搁置
- ArchiveBox/ArchiveBox: 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more…
- gildas-lormeau/SingleFile: Web Extension for saving a faithful copy of a complete web page in a single HTML file
- scrapy/scrapy: Scrapy, a fast high-level web crawling & scraping framework for Python.
- Y2Z/monolith: ⬛️ CLI tool for saving complete web pages as a single HTML file
回想过去,firefox大版本升级之前,scrapy可以完美的解决问题的时代一去不复返了…
22:48