mf2py
mf2py copied to clipboard
Microformats2 parser written in Python
`mf2py` is not really a modern Python project anymore. This probably needs fixing, because the further behind current best practice a project gets, the less motivation people have to fix...
Are we ready to go Python 3 only? In 2020, Python 2.7 is going end-of-life. A bunch of popular Python projects including Django and Twisted have already announced Python 3...
combined with setup.py since it relies on features new to BS4.
Looks like there's a problem where relative path resolution doesn't kick in in some cases. Example: ``` Jane Doe ``` is parsed as ``` "items": [ { "properties": { "name":...
https://github.com/microformats/microformats2-parsing/issues/3 is the current proposal for extracting and surfacing the `lang` attribute in parsed mf2 for a page's or item's language. that proposal isn't official yet, but it's been stable...
Currently mf2py due to using BeautifulSoup closes empty HTML tags. e.g. `` gets converted to `` and `` gets converted into ``. This makes the `e-content[html]` different from the authored...
Given that for many use cases HTML is going to be sanitized after being extracted from a page (e.g. to be displayed as a comment), it could make sense to...
(based on https://github.com/microformats/php-mf2/issues/195) With [this test](https://github.com/microformats/tests/blob/master/tests/microformats-v1/hcard/single.html), when the `geo` property is backcompat parsed as `p-geo h-geo`, the parser is incorrectly adding an implied `p-name` for the `h-geo`. The `u-url` also...
something changed since 1.0.4 that hurt performance pretty significantly, at least for granary and bridgy. they both saw a slowdown of at least 30-50% (probably more) when they upgraded to...