python-readability
python-readability copied to clipboard
"comment" in unlikely candidates mutilates formatted code blocks
I've seen several cases of code blocks using color schemes where "comment" becomes a style and starts getting trimmed. At best this loses all the comments, and may further mutilate the structure.
I wonder if the unlikely candidates trimming should ignore elements inside of pre/code blocks?
Then it wouldn't cut ads and different "related" blocks that didn't get below the cut threshold. Could you please provide an (or several) example page(s)?