[Haskell-beginners] regex and Unicode

Brian Sammon haskell-beginners at brisammon.fastmail.fm
Wed Aug 31 03:16:57 UTC 2016


On Sun, 28 Aug 2016 15:11:24 -0400
Brian Sammon <haskell-beginners at brisammon.fastmail.fm> wrote:

> I tried to write a program using Text.Regex.PCRE to search through a UTF8-
> encoded document.  It appears that the presence of non-breaking-space 
> characters (Charpoint 160) triggers some weird behavior in my program.

Well switching my code to use Text.RegexPR-based searches rather than Text.Regex.PCRE made the problem go away.

Text.Regex.PCRE seems to be unmaintained, so I guess I shouldn't be surprised that I had problems with it.


More information about the Beginners mailing list