tuehlarsen

Results 23 comments of tuehlarsen

Here is the correct image of the video from my Fireshot rendering during the crawl:

I’m using the beta.browsertrix GUI v. 1,8* with no blocking of ads and I can’t change crawling browser. To me the ads replay seems much better than for a year...

If you download https://beta.browsertrix.cloud/orgs/kb/items/crawl/manual-20240323083932-bb9b135d-357?workflowId=bb9b135d-3573-4901-bdef-a80d35a15741#files:~:text=20240323084140064%2Dbb9b135d%2D357%2D0.wacz and load the wacz file offline with replay webpage 2.00.beta it replays the ads which are harvested. But if you unzip the file and only load...

At Royal Danish Library we need to crawl e.g newssites frontpages many times during a day because they change very often and specially by breaking news. We have today Heritrix...

Are above 3 new warc fields mandatory for modern browserbased replay and are they defacto used in other tools today?

I hope it will be more explicitly - as it is of great importance for large older web archives what the new warc fields are for and what they will...

I checked this morning again and only replay of berlingske.dk can't show the ads. tv2.dk and politiken.dk are replaying some of the ads. Any hints to what could be wrong...

I tried with the previous crawler version with berlingske.dk - it just ignores the browser profile totally and the accept of cookies. With the default crawler it crashes again and...

now it runs but berlingske.dk with no ads or no ads traces in replay - i saw the ads during the crawl and no cookies accept popup, so it should...

I can see all adds in a brave browser from a danish ip without shields activated, so perhaps a browsertrix replay issue?