Refael Ackermann
> It's sort of questionable to me to treat `None` specially here. I think we've had someone before trying this (specifically for `dict.get`). IMHO only `.get` and `.pop` should be...
A very quick look around suggests that CCleaner changed ownership from Piriform to Avast/NortonLifeLock. It seems they are 1. not keeping the portable version updated; the "portable" link goes...
For posterity: Gemini (2.5 Pro) and I had the same idea for a workaround: just delete this gene and CDS from a local version of `GCF_000629465.1_genomic.gbff`.
The [RefSeq of GCF_000629465.1](https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000629465.1/) has this annotation.

In the gbff:
```
/transl_except=(pos:complement(746613..746615),aa:Sec)
```
In the gff:
```
NZ_KK213203.1 Protein Homology CDS 744123 747203 . - 0 ID=cds-WP_010895684.1;Parent=gene-V557_RS21465;Dbxref=GenBank:WP_010895684.1;Name=WP_010895684.1;Ontology_term=GO:0015942,GO:0008863,GO:0009055,GO:0043546,GO:0047111,GO:0009326;gbkey=CDS;gene=fdnG;go_component=formate dehydrogenase complex|0009326||IEA;go_function=formate dehydrogenase...
```
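To make the `/transl_except` qualifier quoted above easier to inspect, here is a minimal sketch in plain Python that pulls out its coordinates and checks where the selenocysteine codon sits inside the CDS. `parse_transl_except` and `codon_index` are hypothetical helper names, not functions from Biopython or any other library, and the sketch assumes an unspliced CDS with phase 0 and the simple `pos:START..END` form only.

```python
import re

# Hypothetical helper, not part of any library. Handles only the simple
# "(pos:START..END,aa:XXX)" form, optionally wrapped in complement(...).
_TRANSL_EXCEPT = re.compile(
    r"^\(pos:(?P<comp>complement\()?(?P<start>\d+)\.\.(?P<end>\d+)\)?"
    r",aa:(?P<aa>\w+)\)$"
)


def parse_transl_except(value):
    """Return start/end (1-based, inclusive), strand and amino acid."""
    match = _TRANSL_EXCEPT.match(value.strip())
    if match is None:
        raise ValueError(f"unsupported transl_except value: {value!r}")
    return {
        "start": int(match["start"]),
        "end": int(match["end"]),
        "strand": "-" if match["comp"] else "+",
        "aa": match["aa"],
    }


def codon_index(cds_start, cds_end, strand, codon_start, codon_end):
    """0-based codon number of the exception in an unspliced, phase-0 CDS."""
    if not (cds_start <= codon_start and codon_end <= cds_end):
        raise ValueError("exception lies outside the CDS")
    # On the minus strand the CDS is read from cds_end downwards.
    offset = cds_end - codon_end if strand == "-" else codon_start - cds_start
    if offset % 3 != 0:
        raise ValueError("exception is not in frame with the CDS")
    return offset // 3


exc = parse_transl_except("(pos:complement(746613..746615),aa:Sec)")
# CDS coordinates are taken from the gff line quoted above.
index = codon_index(744123, 747203, exc["strand"], exc["start"], exc["end"])
```

With these inputs the exception is in frame and falls inside the CDS, so the annotation itself looks internally consistent.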
BTW: have you considered a GFF3 parser ([scikit.bio](https://scikit.bio/docs/dev/generated/skbio.io.format.gff3.html) is OKish), or parsing .gbff files with biopython?
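For a sense of how little is needed to read a single GFF3 record, here is a rough standard-library sketch. It is not the scikit-bio or Biopython API; `parse_gff3_line` is a hypothetical name, and the sample line is a shortened, tab-separated version of the CDS record from the previous comment. A real parser also has to deal with directives, comments, and embedded FASTA, which this ignores.

```python
from urllib.parse import unquote

GFF3_COLUMNS = (
    "seqid", "source", "type", "start", "end",
    "score", "strand", "phase", "attributes",
)


def parse_gff3_line(line):
    """Parse one tab-separated GFF3 feature line into a dict (sketch only)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(GFF3_COLUMNS):
        raise ValueError(f"expected 9 columns, got {len(fields)}")
    record = dict(zip(GFF3_COLUMNS, fields))
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    attributes = {}
    for pair in record["attributes"].split(";"):
        if not pair:
            continue
        key, _, value = pair.partition("=")
        # Attribute values may hold several comma-separated, URL-escaped items.
        attributes[unquote(key)] = [unquote(v) for v in value.split(",")]
    record["attributes"] = attributes
    return record


# Shortened version of the CDS line quoted earlier (attributes trimmed).
sample = "\t".join([
    "NZ_KK213203.1", "Protein Homology", "CDS", "744123", "747203",
    ".", "-", "0",
    "ID=cds-WP_010895684.1;Parent=gene-V557_RS21465;gene=fdnG",
])
record = parse_gff3_line(sample)
```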
> A PR would be most welcome :)

On it 😊
Left is the content of the released tarball. Right is the output after this patch.
> Please also change the occurrences in...

👍

> It would also be great if you could add a test to

I'm out of the flow, so I'm not sure...
I added a second commit that also opens the files explicitly as UTF-8: https://github.com/PeterFeicht/cppreference-doc/blob/3c821dbde01f4a09dbb4d72348c899f0786bcda9/commands/preprocess_cssless.py#L32
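For context, a minimal sketch of what "explicitly as UTF-8" means in Python. Without an `encoding` argument, `open` falls back to the locale's preferred encoding, so the same script can behave differently across machines. The helper names and the sample file below are made up for illustration and are not taken from the linked commit.

```python
import os
import tempfile


def write_text(path, text):
    """Write text with an explicit encoding instead of the locale default."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(text)


def read_text(path):
    """Read text back with the same explicit encoding."""
    with open(path, encoding="utf-8") as handle:
        return handle.read()


# Round-trip some non-ASCII content through a temporary file.
with tempfile.TemporaryDirectory() as tmp:
    sample_path = os.path.join(tmp, "page.html")
    write_text(sample_path, "<p>naïve ≤ café</p>")
    round_tripped = read_text(sample_path)
```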
https://github.com/PeterFeicht/cppreference-doc/blob/be3ce3c82f280fc5bfc09d29ceffc2a236bd90e6/commands/preprocess.py#L389 `HTMLParser(encoding='utf-8')` I'll post a PR shortly. I got sucked into a slightly bigger change using BeautifulSoup, which makes things simpler and more robust. Strangely, MediaWiki renders wonky HTML mixing...