Discussion:
regex, multibyte locales, and word boundaries
Yuri Pankov
2018-11-23 16:02:40 UTC
Hi,

We have the following note in the BUGS section of regcomp(3):

----------------------------------------------------------------------
Word-boundary matching does not work properly in multibyte locales.
----------------------------------------------------------------------

It was added ages ago, along with multibyte support in our regex
implementation, but I can't come up with a test case that actually
demonstrates the problem. I'd need one to confirm the bug is real and,
eventually, to fix it.

Does anyone have a real-life example that demonstrates the bug?
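
A probe along the following lines might serve as a starting point for such a test case. It is only a sketch under stated assumptions: `probe()`, `run_probes()` and the three cases are hypothetical, chosen on the guess that a byte-oriented boundary check would mishandle a multibyte letter such as "é" (0xC3 0xA9) next to an ASCII letter. The patterns use the `[[:<:]]` and `[[:>:]]` boundary syntax from re_format(7), and a UTF-8 locale name such as `en_US.UTF-8` is assumed to be installed. Nothing here claims that any case actually fails.

```c
#include <locale.h>
#include <regex.h>
#include <stdio.h>

/*
 * Hypothetical helper, not part of any library.
 * Returns 1 on match, 0 on no match, -1 if the pattern does not compile.
 * On a match, *where holds the byte offsets of the whole match.
 */
int
probe(const char *pattern, const char *text, regmatch_t *where)
{
	regex_t re;
	int rc;

	if (regcomp(&re, pattern, REG_EXTENDED) != 0)
		return (-1);
	rc = regexec(&re, text, 1, where, 0);
	regfree(&re);
	return (rc == 0 ? 1 : 0);
}

/* Illustrative cases only; "want" assumes "é" is treated as a word character. */
struct probe_case {
	const char *pattern;
	const char *text;
	int want;
};

static const struct probe_case cases[] = {
	/* A word ending in "é" should match as a whole word. */
	{ "[[:<:]]caf\xc3\xa9[[:>:]]", "un caf\xc3\xa9 noir", 1 },
	/* "a" right after "é" is mid-word, so no word starts there. */
	{ "[[:<:]]a", "\xc3\xa9" "a", 0 },
	/* "a" right before "é" is mid-word, so no word ends there. */
	{ "a[[:>:]]", "a\xc3\xa9", 0 },
};

/*
 * Runs every case in the given locale and prints each mismatch.
 * Returns the number of mismatches, or -1 if the locale is unavailable.
 */
int
run_probes(const char *locale_name)
{
	regmatch_t m;
	size_t i;
	int surprises = 0;

	if (setlocale(LC_ALL, locale_name) == NULL)
		return (-1);
	for (i = 0; i < sizeof(cases) / sizeof(cases[0]); i++) {
		int got = probe(cases[i].pattern, cases[i].text, &m);

		if (got != cases[i].want) {
			printf("case %zu: pattern '%s' got %d, want %d\n",
			    i, cases[i].pattern, got, cases[i].want);
			surprises++;
		}
	}
	return (surprises);
}
```

Calling `run_probes("en_US.UTF-8")` from a small `main()` and comparing the result against `run_probes("C")` would show whether the behaviour depends on the locale. A result of -1 from `probe()` means the regex library in use does not accept the `[[:<:]]` syntax at all.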
