Chris Emerson
2014-02-13 22:20:30 UTC
Hi,
I noticed that case-insensitive regular expressions stop working at around
60 characters. I believe this is because in the "NFA", each character turns
into a character set (for the two cases), which takes 33 bytes or so (256
bits plus opcode); the overall NFA limit is 2048 bytes.
It looks like it wouldn't be too hard to add a new NFA opcode to represent
a sparse character set, say with one length byte followed by N bytes, which
could be used whenever that's smaller than the bitset (or < some limit).
Would that kind of change be accepted? I'm aware of the recommendation to
use a separate regexp engine for serious use.
Regards,
Chris Emerson
I noticed that case-insensitive regular expressions stop working at around
60 characters. I believe this is because in the "NFA", each character turns
into a character set (for the two cases), which takes 33 bytes or so (256
bits plus opcode); the overall NFA limit is 2048 bytes.
It looks like it wouldn't be too hard to add a new NFA opcode to represent
a sparse character set, say with one length byte followed by N bytes, which
could be used whenever that's smaller than the bitset (or < some limit).
Would that kind of change be accepted? I'm aware of the recommendation to
use a separate regexp engine for serious use.
Regards,
Chris Emerson
--
You received this message because you are subscribed to the Google Groups "scintilla-interest" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scintilla-interest+***@googlegroups.com.
To post to this group, send email to scintilla-***@googlegroups.com.
Visit this group at http://groups.google.com/group/scintilla-interest.
For more options, visit https://groups.google.com/groups/opt_out.
You received this message because you are subscribed to the Google Groups "scintilla-interest" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scintilla-interest+***@googlegroups.com.
To post to this group, send email to scintilla-***@googlegroups.com.
Visit this group at http://groups.google.com/group/scintilla-interest.
For more options, visit https://groups.google.com/groups/opt_out.