Karl Williamson
Karl Williamson
This should fix GH #19983. Some of the macros that extract a PV from an SV also will set a 'len' parameter to how many bytes long it is. The...
At the beginning of the development cycle, the project should look ahead at what to do about experimental/deprecated features. Some are coming due this cycle, so email should be sent...
This is to see how many CPAN modules call these with a differently sized 'len' than the specified 'STRLEN'. See https://github.com/Perl/perl5/issues/19983
This commit changes utf8_length to read the input a word at a time. The current method of looking per character is retained for shorter strings. The per-word method yields significant...
Previously, that was the case for backwards hops, but if a forward hop started at a continuation byte, each such byte in the current character consumed one hop count.
This changes utf8_to_bytes() to do a per-word initial scan to see if the source is actually downgradable, before starting the conversion. This is significantly faster than the current per-character scan....
My mental model of how the C preprocessor did things was flawed. This commit makes things clearer. It needs some polishing