Some image links are not extracted

Open NicolasSandoval opened this issue 4 years ago • 0 comments

Description

Some image links, such as map images in PNG format, are left out during extraction.

2021-04-17 17:35:45,509  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:48,371  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:49,997  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:53,665  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,40.4308,-3.7031,266x250.png?lang=es'
2021-04-17 17:35:54,902  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,9,40.67,-73.94,275x250.png?lang=es'
2021-04-17 17:35:55,537  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,10,-34.599722222222,-58.381944444444,300x300.png?lang=es&domain=es.wikipedia.org&title=Buenos+Aires&groups=_1f2405ce1888c1041823dcda962c8595b0609749'
2021-04-17 17:35:55,890  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:56,701  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:56,702  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:57,699  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:59,303  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,48.8565,2.345,266x266.png?lang=es'
2021-04-17 17:36:02,625  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,35.6879633,139.7690085,275x250.png?lang=es'
2021-04-17 17:36:05,072  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:36:06,057  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,41.388,2.168,266x250.png?lang=es'
2021-04-17 17:36:09,118  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,10,-33.44889,-70.669265,275x250.png?lang=es'
2021-04-17 17:36:11,183  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,12,16.753056,-93.115556,255x255.png?lang=es'
2021-04-17 17:36:14,544  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,25.6715,-100.3085,200x200.png?lang=es'

and

2021-04-20 14:04:50,462  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/wikihiero/img/hiero_G1.png?4d556

Possible Solution

Links that do not start with maps are excluded by ImageParser.replace in the extract.py module; perhaps such links could be included.

Another point to consider is why some links are rejected when they contain a ? symbol that is not followed by lang.
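
As a rough illustration of one way to handle both cases, the sketch below decides the image type from the URL path only, so that a query string such as ?15600 or ?lang=es does not affect the check. This is not the current code in extract.py: the function names and the SUPPORTED_EXTENSIONS set are hypothetical, and only the standard library is used.

```python
import posixpath
from urllib.parse import urlsplit

# Hypothetical set of accepted extensions; the real list in CDPedia may differ.
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".gif", ".svg"}


def image_extension(url):
    """Return the lowercase extension of the URL path, ignoring query and fragment."""
    path = urlsplit(url).path
    return posixpath.splitext(path)[1].lower()


def is_supported_image(url):
    """Tell whether the URL points to a supported image type, with or without GET args."""
    return image_extension(url) in SUPPORTED_EXTENSIONS
```

With this approach, the links from the logs above (the ImageMap and wikihiero resources, and the maps.wikimedia.org tiles) would all be recognized as PNG images. Whether they should then be downloaded and included is a separate decision.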

NicolasSandoval avatar Apr 21 '21 02:04 NicolasSandoval