
Parser: Formula Type: Embedded

Open bbappserver opened this issue 4 years ago • 2 comments

Very occasionally a website thinks it is a good idea to encode important data in two separate formats, both of which you need to parse to extract a file URL.

For example, the Tumblr API produces this JSON body:

{
"...":"other keys we don't care about",
"regular-body":"some encoded html containing an <img src=\"https://some.url/a.jpg\">"
}

The src of that <img> is the actual CDN URL for the data I'm interested in. I can awkwardly work around this by cutting parts out of the value until only the URL is left, but it would be nice if there were a recursive formula type, where the found output is passed to another parser and all of the results are collected.
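To make the request concrete, here is a minimal Python sketch of the two-stage extraction I have in mind. It is a standalone illustration using only the standard library, not hydrus's parser code; the names `ImgSrcCollector` and `extract_embedded_img_urls` are made up for this example, and the sample body mirrors the JSON above.

```python
import json
from html.parser import HTMLParser


class ImgSrcCollector(HTMLParser):
    """Collects the src attribute of every <img> tag it sees."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            for name, value in attrs:
                if name == "src" and value:
                    self.urls.append(value)


def extract_embedded_img_urls(json_text, key="regular-body"):
    """Hypothetical helper: parse JSON, then parse the HTML held in one value."""
    # Stage 1 (JSON): pull out the string value that contains encoded HTML.
    html_fragment = json.loads(json_text).get(key, "")
    # Stage 2 (HTML): feed the output of stage 1 to a second parser.
    collector = ImgSrcCollector()
    collector.feed(html_fragment)
    collector.close()
    return collector.urls


# Sample body shaped like the example above (assumed, not real API output).
body = '{"other": "ignored", "regular-body": "some html <img src=\\"https://some.url/a.jpg\\">"}'
print(extract_embedded_img_urls(body))
```

The point is only the shape: the first parser's output becomes the second parser's input, and every match from the second stage is collected.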

bbappserver, May 17 '21 09:05

Isn't this exactly what sub page parsers do?

floogulinc, May 17 '21 10:05

If it is, either it doesn't work when going from JSON to HTML, or there is a lack of documentation on how to use it properly.

bbappserver, Jul 03 '22 21:07