percollate
percollate copied to clipboard
A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
Investigate the issues pointed out in @XiangRongLin's [comment here](https://github.com/danburzo/percollate/pull/139#issue-1237441545) related to naive usage of URLs / file paths in some areas. We should be using [`URL.fileURLtoPath()`](https://nodejs.org/api/url.html#urlfileurltopathurl).
Docker image
**Please create a docker image (amd64) of percollate.** This is such an amazing tool. I spent 3+ days to install nodejs, npm and percollate, puppeteer and chrome headless; still percollate...
I tried and found percollate a very useful tool. However, I would love to use it within html pages for on-demand creation of html page to pdf. How can I...
### Feature description NPM is a messy way for a user to install an app. Would you be interested in supporting a modern packaging format for your Linux users? Snaps,...
A PDF generated from many web pages would benefit from a Table of Contents, implemented as PDF bookmarks. We'll probably need to post-process the PDF with something like [HummusJS](https://github.com/galkahana/HummusJS) to...
On some pages, e.g. ```bash percollate pdf pdf https://de.wikipedia.org/wiki/JavaScript ``` The font size in the resulting PDF is much smaller than usual, although the page size remains the same.
The idea of the `imagesAtFullSize` enhancement is to get the largest available image from blogs using Blogspot, WordPress, and the like: https://github.com/danburzo/percollate/blob/3506b370fc1d54b9039a1f104c20defda7859eb8/src/enhancements.js#L1-L20 However, Wikipedia images are an exception: ```html ```...
It would be great to have an option to feed not just a plain list of URLs, but a tabbed, spaced, or somehow formatted (see below) file with captions and...
### Feature description When converting Wikipedia pages to Markdown, some constructs may trip up different Markdown parsers. See, for example, the output of: ```bash percollate md https://en.wikipedia.org/wiki/The_National_(band) ``` GitHub seems...