Ignore unsupported UTF-8 characters
If a file contains an unsupported UTF-8 character, it will break the full runner.py script.
Ignoring the unwanted characters seems to be the best solution.
what is the error?
I don't remember the exact byte value and position but it was:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x84 in position 747: invalid start byte
Do you have some minimal code with which I can reproduce the issue?
Unfortunately no, but I was using the codebrowser on Chromium source code.
Just putting invalid UTF-8 value in one of the generated file will reproduce the issue in fact.
It was in the /refs directory.