sitediff icon indicating copy to clipboard operation
sitediff copied to clipboard

Sitediff init not creating Paths.txt

Open Saibot97 opened this issue 3 years ago • 5 comments

Hello,

I just installed Sitediff on a fresh Ubuntu VM with Ruby 2.6.6. When i run the Sitediff init command, tried with a few webpages, it just creates the sitediff.yaml, no paths.txt at all. Unbenannt

Saibot97 avatar Apr 06 '22 09:04 Saibot97

sitediff init https://4nes.com will create the initial file. sitediff crawl will crawl the site and create the paths.txt file.

kdborg avatar Apr 10 '22 20:04 kdborg

Okay, i got it now. The page im accessing has an htaccess and a normal login area. The .htaccess got managed in url like: username:password@url But is there a way to give sitediff user credentials for a login area ?

Saibot97 avatar Apr 12 '22 09:04 Saibot97

In your sitediff.yml file you can add credentials to the curl options:

settings:
  curl_opts:
    userpwd: "username:password"

kirk-brown-ew avatar Apr 12 '22 14:04 kirk-brown-ew

Putted the credentials for the login Area in the settings like you said, but still get the same error when try to crawl.

The Error: Unbenannt

And this is the Login-Area i need to pass

Unbenannt2

Saibot97 avatar Apr 13 '22 09:04 Saibot97

Those credential settings are for Basic HTTP Auth.

There isn't a way to log in via a form just yet.

kirk-brown-ew avatar Apr 13 '22 13:04 kirk-brown-ew