matrix-static icon indicating copy to clipboard operation
matrix-static copied to clipboard

Do we set URLs, Etags & caching headers to ensure search engines don't respider the same content?

Open ara4n opened this issue 8 years ago • 5 comments

Right now google is hammering matrix.org by indexing view.matrix.org. Do we ensure that the historical pages have nice unique URLs, Etags and have a suitably massive caching header to ensure that Google doesn't go and try to reindex the whole thing again a few days later?

ara4n avatar Oct 23 '17 22:10 ara4n

ooi have we also considered having matrix-static blat historical stuff out to disk as, well, static HTML, rather than regenerating it each and every time?

ara4n avatar Oct 23 '17 22:10 ara4n

The issue is that each event is it's own anchor, so we'd get a html bundle for each and every event in every room

t3chguy avatar Oct 23 '17 23:10 t3chguy

i'm not sure I follow. i'm suggesting that the URL for "page 250 of matrix.org" remains the same (without any nasty cache-busting question marks etc), and we could even pregen the html of that page in future.

ara4n avatar Oct 23 '17 23:10 ara4n

What would page 250 be though, one new message comes in and the whole page gets shifted by 1 event

t3chguy avatar Oct 24 '17 06:10 t3chguy

Misclick, sorry

t3chguy avatar Oct 24 '17 06:10 t3chguy