Link without domain.

Open warmspringwinds opened this issue 10 years ago • 2 comments

Sometimes when I fetch an image from an article's page, article.top_image.src gives me a link without the domain, like '/IMG/someimage.jpg'.

Should it work this way?

Apr 08 '15 14:04 warmspringwinds

I came here with the same problem. I think an absolute URL would be more useful, since it can be used without knowing which page the image came from.

[Edit] I had a look at the code in the repo, and it seems fine. I am using an old Goose version, which I need to update first to confirm the behavior.

May 13 '15 10:05 jice-lavocat

I confirm the bug for version 1.0.25. I'm trying to find a way to fix it. At the moment, it seems that, when ImageExtractor is called, the object

is empty. Hence, there is no self.target_url that can be used to rebuild the absolute URL.

[Edit] The URL I am using to test this is: http://www.suwa.fr/news/terre/trail-randonnee/trail-blanc-de-la-restonica-2014-549
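
In the meantime, one possible workaround is to rebuild the absolute URL on the caller's side by resolving the relative path against the article URL. This is only a sketch, assuming Python 3 and that you still have the page URL you passed to Goose. absolutize is a hypothetical helper name, not part of Goose:

```python
from urllib.parse import urljoin


def absolutize(page_url, image_src):
    """Resolve image_src against page_url.

    Hypothetical helper, not part of Goose. Relative paths such as
    '/IMG/someimage.jpg' get the scheme and domain of page_url;
    sources that are already absolute are returned unchanged.
    """
    return urljoin(page_url, image_src)


page_url = "http://www.suwa.fr/news/terre/trail-randonnee/trail-blanc-de-la-restonica-2014-549"

# A root-relative src, like the one reported above.
print(absolutize(page_url, "/IMG/someimage.jpg"))
# http://www.suwa.fr/IMG/someimage.jpg
```

With this, the value from article.top_image.src can be passed through absolutize together with the URL that was extracted, without having to check first whether it is relative.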

May 13 '15 11:05 jice-lavocat