localGoogoo icon indicating copy to clipboard operation
localGoogoo copied to clipboard

pdf

Open xuze1993 opened this issue 6 years ago • 3 comments

I've pulled a website from webhttrack which is mixed of pdf and html,it seems that localgoogle can only index html files,is there anyway to solve the problem?

xuze1993 avatar Jul 02 '19 06:07 xuze1993

The code can be modified to read pdf documents (with a pdf library) while crawling and index it, but a copy of the file would need to be kept so the user can open it in the search result page. thats not good i think.

kodejuice avatar Jul 02 '19 23:07 kodejuice

The code can be modified to read pdf documents (with a pdf library) while crawling and index it, but a copy of the file would need to be kept so the user can open it in the search result page. thats not good i think.

or we could just use the original pdf link in the search results, but if the original file is longer available, you wont be able to open it

kodejuice avatar Jul 02 '19 23:07 kodejuice

gocha,nice work anyway. Sadly that fewer static sites are left on the web.

xuze1993 avatar Jul 03 '19 01:07 xuze1993