joni

Java port of Oniguruma regexp library

Results 18 joni issues

Hi, I am struggling to properly configure the `Option` passed to the `search` method with `Syntax.ECMAScript`. I would expect that with `Option.DEFAULT` / `Option.NONE`, a regex using `^`, `$`...

Joni's look-ahead/look-behind features can get caught in large recursive loops while evaluating regex matches, causing problems like https://github.com/elastic/elasticsearch/issues/28731. It would be nice to be able to enable [`Config.CEC`](https://github.com/jruby/joni/blob/c1911991387b9c898d09706fa2a878094598c2f9/src/org/joni/Config.java#L46)...

Calling `thread.interrupt()` has no effect. How can the hanging thread be stopped? ``` Charset _charset = Charset.forName("GB18030"); /* text containing irregular binary data will make thread hang */ Thread thread...

In MRI 2.6: ``` % ruby -v ruby 2.6.0p0 (2018-12-25 revision 66547) [x86_64-darwin18] % ruby -e 'puts "foo========:bar baz================================================bingo".scan(/(?:=+=+)+:/)' ========: ``` With Latest JRuby snapshot: ``` /tmp/jruby-9.2.8.0-SNAPSHOT % java -version...

Steps to reproduce 1. [onig4j-v003-src.zip](https://github.com/jruby/joni/files/3382279/onig4j-v003-src.zip) 2. Update jni/Makefile with proper `JAVA_HOME` and then call `make` 3. Update lib location in `src/onig4j/OnigRegex.java` 4. Run `OnigPerformanceTest` We got the following results: java: 4261ms...

Hi, can you give me some feedback on this issue? I'm trying to match named capture groups in a multiline byte[] content and getting a -1 -1 index range for...

Not sure if I should report this here or to JRuby but the error seems to come from joni: ```ruby $ bin/jruby -e 'p [0xA4].pack("C").force_encoding("UTF-8").grapheme_clusters' Unhandled Java exception: java.lang.ArrayIndexOutOfBoundsException: -1...

Related to #17. Fixes the infinite loops. - Before the fix, `A\uD800` caused an infinite loop. Now, it fails loudly, as expected. - Before the fix, `\uD800\uDC00` caused an infinite loop. Now, it...
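
For context on the two inputs named in that excerpt: `A\uD800` ends in a high surrogate with no low surrogate after it, while `\uD800\uDC00` is a well-formed surrogate pair. The sketch below uses only the Java standard library to tell the two cases apart. It is not joni's code and does not reflect how the fix works; the class name `SurrogateCheck` and the method `hasUnpairedSurrogate` are hypothetical names chosen for illustration.

```java
public final class SurrogateCheck {
    private SurrogateCheck() {}

    /**
     * Hypothetical helper: returns true if the string contains a high or low
     * surrogate code unit that is not part of a valid high+low pair.
     */
    public static boolean hasUnpairedSurrogate(String s) {
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (Character.isHighSurrogate(c)) {
                if (i + 1 < s.length() && Character.isLowSurrogate(s.charAt(i + 1))) {
                    i++; // skip the low half of a well-formed pair
                } else {
                    return true; // high surrogate with nothing valid after it
                }
            } else if (Character.isLowSurrogate(c)) {
                return true; // low surrogate without a preceding high surrogate
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // "A" followed by a lone high surrogate
        System.out.println(hasUnpairedSurrogate("A\uD800"));
        // A well-formed pair that encodes a single supplementary code point
        System.out.println(hasUnpairedSurrogate("\uD800\uDC00"));
        System.out.println(Integer.toHexString("\uD800\uDC00".codePointAt(0)));
    }
}
```

A check like this could be run on input before it is encoded to bytes, so that malformed text is rejected early instead of reaching the regex engine.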

When a character is less than or equal to single byte size (0xff), yet it takes more than 1 byte in the current encoding, the case folding code incorrectly put...