Dave Mateer

Results 15 comments of Dave Mateer

I've sponsored using GitHub https://github.com/aristocratos

Hi @loganwilliams and @borismw - agreed that should be easier to setup and run. I've mostly automated my server build and cron job setup: https://github.com/djhmateer/auto-archiver/tree/main/infra ... will PR this when...

Good thoughts @borismw I've got a PR in just now which reduces the memory overhead of Firefox by reloading the driver on each row, and I'm noticing only about 2GB...

Demo Sheet Created. Ping me an email [email protected] if you'd like access to it and the Google Drive. ![image](https://user-images.githubusercontent.com/111231/178253429-f3ec4b5d-df2a-44b3-b561-074ba5ee040f.png)

File hashes would be a good feature. In the meantime, I'm incrementing the directory eg ./assets23 every time I do a regenerate, to bust the cache. ```bash # generate transparent...

Dear @ikreymer - would you like me to submit a PR for this? I can give it a shot.

Hi https://myanmar-now.org/mm/ is a good website to test on. I ran the following command, then viewed the `screenshots.warc.gz` in https://replayweb.page/ ```bash docker run --rm -v /mnt/c/dev/v6-auto-archiver/tempx3:/crawls/ webrecorder/browsertrix-crawler crawl --url https://myanmar-now.org/mm/...

I've made a fb archiver on my fork of this codebase: https://github.com/djhmateer/auto-archiver/blob/main/auto_archive_fb.py Have been running in production well (with caveats!).. it's pretty specialised and has to run on its own...

Thank you @RafaPolit I've got a working test build using - Ubuntu 20.04 - Node 20 using nvm latest - ElasticSearch 7.17.6 (am guessing 7.17.17 will be okay) - ICU...

@Coedice - this does sound interesting. https://ethereum.org/en/developers/docs/smart-contracts/ something around this area? Not something I know about!