Dave Mateer
Dave Mateer
I've sponsored using GitHub https://github.com/aristocratos
Hi @loganwilliams and @borismw - agreed that should be easier to setup and run. I've mostly automated my server build and cron job setup: https://github.com/djhmateer/auto-archiver/tree/main/infra ... will PR this when...
Good thoughts @borismw I've got a PR in just now which reduces the memory overhead of Firefox by reloading the driver on each row, and I'm noticing only about 2GB...
Demo Sheet Created. Ping me an email [email protected] if you'd like access to it and the Google Drive. 
File hashes would be a good feature. In the meantime, I'm incrementing the directory eg ./assets23 every time I do a regenerate, to bust the cache. ```bash # generate transparent...
Dear @ikreymer - would you like me to submit a PR for this? I can give it a shot.
Hi https://myanmar-now.org/mm/ is a good website to test on. I ran the following command, then viewed the `screenshots.warc.gz` in https://replayweb.page/ ```bash docker run --rm -v /mnt/c/dev/v6-auto-archiver/tempx3:/crawls/ webrecorder/browsertrix-crawler crawl --url https://myanmar-now.org/mm/...
I've made a fb archiver on my fork of this codebase: https://github.com/djhmateer/auto-archiver/blob/main/auto_archive_fb.py Have been running in production well (with caveats!).. it's pretty specialised and has to run on its own...
Thank you @RafaPolit I've got a working test build using - Ubuntu 20.04 - Node 20 using nvm latest - ElasticSearch 7.17.6 (am guessing 7.17.17 will be okay) - ICU...
@Coedice - this does sound interesting. https://ethereum.org/en/developers/docs/smart-contracts/ something around this area? Not something I know about!