[Haskell-cafe] [ANN] unicode-transforms-0.2.0 pure Haskell unicode normalization

William Yager will.yager at gmail.com
Tue Oct 25 17:06:15 UTC 2016


Interesting! What would you say allowed you to get better decompose
performance than the C library?

Will

On Tue, Oct 25, 2016 at 11:59 AM, Harendra Kumar <harendra.kumar at gmail.com>
wrote:

> Hi,
>
> I released unicode-transforms some time back as bindings to a C library
> (utf8proc). Since then I have rewritten it completely in Haskell. The Haskell
> data structures are generated automatically from the Unicode database, so it
> can be kept up to date with the standard, unlike the C implementation, which
> was stuck at Unicode 5. The implementation comes with a test suite
> providing 100% code coverage.
>
> After a number of algorithmic and implementation efficiency optimizations,
> I was able to get decompose performance several times better than that of
> the C implementation. I have not yet had a chance to fully optimize the
> compose operations, but they are still as fast as utf8proc.
>
> I would like to thank Antonio Nikishaev for the Unicode character database
> parsing code, which I borrowed from the prose library.
>
> https://github.com/harendra-kumar/unicode-transforms
> https://hackage.haskell.org/package/unicode-transforms
>
> -harendra