GoBooDo-Linux icon indicating copy to clipboard operation
GoBooDo-Linux copied to clipboard

Script interruption

Open gitnewbiee opened this issue 3 years ago • 2 comments

Hi,

First of all, thanks to you and vaibhavk97 for the herculean effort in putting this tool together.

I'm trying to pull a book (approx 1000 pages), and things seemed to be going well until the 400th page or so. Now it can't connect to the proxy. From my very limited coding knowledge, it's probably due to what pages Google allows to be visible. Since the script has been running for a long while, what happens if I kill it? I don't want to lose the successfully created pages. Will it create a pdf using the pages that were successfully retrieved so far, or will everything be lost if I stop the python script?

Also, how would you recommend I handle the proxy errors in a case like this where each unsuccessful page attempt can take a minute or so as the code cycles through the proxies? It's now at PA457, and still getting proxy connection errors. I've made zero changes to the files except for adding my tesseract Windows directory to the settings.json file. An example of the proxy connection error is shown below,

Fetched link for PA391. Using proxy 85.90.215.111:3128 for the url of page PA392 Could not connect with this proxy

gitnewbiee avatar Oct 01 '22 01:10 gitnewbiee

I'm having the same problem.

pintovillamar avatar Oct 18 '22 20:10 pintovillamar

Hello there , unfortnuately, the script will will have to be begin from a fresh run and will not create any pdf using the pages retrieved so far and you will lose the gathered links if killed during the link gathering process. i may also unfortunately take the decision to abandon the fork in the near future.

memerememe avatar Oct 27 '22 13:10 memerememe